Documente Academic
Documente Profesional
Documente Cultură
5.
5.5
Moderation
5.5.1 Moderation procedure
Moderation is the principal tool for ensuring marking reliability, though not the only one (see
section 5.4.4). Every examiner, except the principal examiner for each component who sets the
standard, sends a sample of their marking to an experienced and reliable examiner, who acts as team
leader. This sample is re-marked by the team leader and a statistical comparison of the paired set of
marks determines whether the original examiners marking is acceptable, perhaps with some slight
adjustment, or unacceptable. Moderation is a hierarchical process and the different levels of the
hierarchy for a typical large entry examination component are illustrated in Figure 2.
Principal examiner
Team leader
Asst ex
Asst ex
Asst ex
Team leader
Asst ex
Asst ex
Asst ex
Asst ex
Team leader
Asst ex
Asst ex
Asst ex
Asst ex
Asst ex
Figure 2: Moderation hierarchy for a typical large entry, externally assessed component. The
diagram has been simplified for ease of representationmost teams actually consist of about
10 assistant examiners.
The principal examiner for a component is often the chief examiner or deputy chief examiner,
but can also be a highly experienced and reliable former team leader. Generally, the principal
examiner is also the author of the examination paper or was greatly involved in setting that
paper. Team leaders are experienced examiners who have shown that they can mark
consistently and accurately over a number of examination sessions. Each team leader will
generally oversee up to 10 assistant examiners. For examination paper components, the
moderation sample size is 15% of each examiners total allocation of scripts, with a minimum
of 10 and a maximum of 20 scripts. Examiners are requested to submit samples covering a
number of schools and as full a range of marks as possible.
80
Team leader mark (y)
70
60
50
Correlation coefficient
r = 0.99
40
30
y = 0.95x + 3.13
20
10
0
0
10
20
30
40
50
60
70
80
to apply some other kind of moderation adjustment, perhaps different linear adjustments
for different parts of the mark range
2.
3.
In all such cases, a recommendation will be made by the SAM/CAM on whether to retain the
services of the examiner for future sessions.
Moderated mark
To overcome this problem, tailing is applied to marks in the top 20% and bottom 20% of the
available mark range. At these extremes, the calculated regression line is modified and
substituted by new tailed lines that link from the regression line to the maximum coordinates
and minimum coordinates, as shown in Figure 4.
Regression line
Tailed
lines
Senior moderator
Assistant
moderator
School
School
Assistant
moderator
School
Senior moderator
Assistant
moderator
School
School
Assistant
moderator
School
School
Assistant
moderator
School
Assistant
moderator
School
School
Figure 5: The hierarchy of moderation applied to internally assessed components. The diagram
is simplified. In reality, each assistant moderator is likely to receive internal assessment
samples from about 10 schools, and each senior moderator is likely to oversee about 10
assistant moderators.
The slightly different approach to the role of internal assessment moderation is indicated by the
different titles given to the examiners who carry it outthat is, principal moderator, senior
moderator and assistant moderator. The great majority of these are experienced DP teachers. All
internally assessed components are marked by applying assessment criteria, and in the majority
of cases the teacher has access to considerably more information about the context and process
underlying the candidate work than the moderator can have. Because of this, moderators for
most internal assessment components, except for language orals, are asked to judge whether the
teachers marking seems appropriate, rather than simply to re-mark the work disregarding the
marks awarded by the teacher. Teachers marks should be altered only when the moderator is
sure they are inappropriate.
Despite this more flexible approach, the great majority of all moderation failures arise from
internal assessment. The IBO does not accredit teachers in regard to their competence in
conducting internal assessment, and is not in a position to control which teachers conduct
internal assessment in the same way that it can with assistant examiners. Also, it can be difficult
for a teacher to submit a satisfactory moderation sample covering a wide range of marks or
types of work, given the restricted and sometimes non-representative or very homogenous
nature of student groups within a class at a given school. If the sample is unavoidably restricted,
it becomes more difficult for that sample to meet the statistical requirements for processing
through the automatic moderation system.
Because of such factors, and because internally assessed work is inherently more open-ended
and more open to subjectivity in marking, the criteria for passing automatically through