Documente Academic
Documente Profesional
Documente Cultură
2010
CHAPTER I
INTRODUCTION
A. Background
In the first chapter we’ve already known about achievement, assessment and
instruction. Assessment is used as a broad category that includes all of the methods used to
determine the extent to which students are achieving the intended learning competences of
instruction including both testing and performance assessment. Both are important. The
knowledge test tells how well the students know what to do and the performance assessment
tells how skillfully the students can do it.
Nowadays, most teachers use selection-type tests because the questions can be
answered in relatively short time, they are easy to scores and the results can be expressed in
numbers that are easily recorded, compared and reported to others. Various studies have
shown that between 80 – 90 % of teacher made tests focus on knowledge outcomes.
However, education is best served in paper-and-pencil testing and the assessment of actual
performance with both focusing on complex learning tasks.
B. Problems
To make the teaching learning process run well and succeed there are some important
things which have to be considered. Here, we’re going to discuss about:
C. Purposes
Based on the problems above, the purposes of this discussion are to:
4. Describe the meaning of validity and reliability and the role play in preparing
assessment procedures
Teacher must choose a right assessment method to assess students. There are many
kinds of assessment methods but they can be summarized in four main categories. They are:
1. Selected-response Tests, it requires the student to choose the correct or best answer, as
in multiple-choice, true-false and matching tests.
2. Supply-response tests, they require the student to respond with a word, short phrase or
completely essay answer.
Those methods require students to apply their knowledge and skills to performance
tasks in realistic setting. They can also make a review the task before submitting it to the
teacher and make it more realistic.
Based on the major types of assessment methods mentioned above, we can determine
the characteristics of each. There’s no the worst or the best methods among them. Each
method plays an important role in assessing students. Those methods can be compared by
their characteristics that are typical of it. They are the realism of tasks, complexity of tasks,
assessment time needed and judgment in scoring. The more difficult the task is, the higher the
characteristic is.
a. Realism of tasks
It means how far the tasks contribute or simulate performance in the real
world. Selected-response tests have the lowest realism of tasks among others
because they emphasize in selecting a response from a given list of possible
answer. In fact, highly structured problems seldom occur in the real world. The
extended performance assessments have the highest realism of tasks among others
because they involve simulating performance in the real world. For examples do
lab work, give a speech and operate a machine require comprehensive of response
that often occurs in the real world. In between those two extreme differences,
there are the supply-tests and the restricted-response performance. They provide
amount of structured problem but having greater freedom of response that are
more realistic problems than selection-type tests.
However, there has been also a trend to make paper-and-pencil tests more
authentic or having more realism. The tasks are designed based on the fact in the
real world so students can select the most realistic answers to solve it from some
alternative answers provided. In some cases, it’s the combination of both
selection-type test and supply-response test by giving opinion why the selected
answer was chosen.
b. Complexity of Tasks
d. Judgment in scoring
To assess students, teacher must specify the learning outcomes so the students
will be more focus on what they’re going to perform.
There must be some criteria in assessing student so we can classify them based
on their intelligence. If students’ performance is surpassed of that of others it’s
included to an excellent performance. If the performance was lower than those of
others it’s included into poor performance. Establishing criteria for assessing
performance is very difficult. But it can be easier if the intended learning
outcomes is clearly stated first.
Should be positive and provide a guide for improving both performance and
self assessment.
The main characteristics of a good assessment procedure are validity and reliability.
Validity means the appropriateness and meaningfulness of the inference we make from
assessment result. Performance assessment is sometimes viewed more valid assessment than
the paper-and-pencil tests because performance assessment more focuses on the performance
taught, simply said it focuses directly on practice. For example, if we want to determine
whether the students can read, we ask them to read. If we want to determine whether the
students can speech, we ask them to speech. Those tasks have the appearance of being valid.
However, each requires its own specifications and scoring criteria. The time consuming of
performance assessment also makes the sampling becomes limited.
Reliability refers to the consistency of the assessment result. It means that in any kind
of situation and condition the students are expected to get the same result. For example, if
students get 70 in chemistry test, they will get at least the same score when we tested them at
a different time or with different sample of equivalent items. This consistency of results
indicates that they are relatively free from errors.
However, we can’t expect that the result would be perfectly consistent in any different
occasions or different samples. Many factors can influence the result. Such as ambiguities,
various kind of samples, fluctuation o motivation and attention of students and also luck can
cause the errors occur to the assessment result. So the point is, we have to minimize the errors
that the result of the assessment can be as reliable as possible. The relation between validity
and reliability is that reliability provides the consistency of result that makes valid inferences
possible. Both validity and reliability of assessment result can be provided for during the
preparation of assessment procedures. When we include enough number of tests, and we use
not an ambiguity procedure, irrelevant source of difficulty, we are providing both reliability
and validity.
There are some important features that can enhance the validity and reliability of
assessment result.
Both methods are very useful in determining the capability of students. The first
method shows an individual’s performance compared with others and the second one tells
whether individual can do a performance. Norm referenced and criterion referenced only
interprets the result of the performance of students. Norm referenced mostly uses score to
make a rank of the students. While criterion referenced is based on details of student
performance. There are some differences between norm-referenced and criterion-referenced.
It can be seen as follow:
Norm-referenced tests are typically used to survey an achievement over a large range
of learning outcomes. While criterion-referenced tests are typically used for mastery testing,
and we can help students to improve learning by determining what tasks they can’t do. Norm-
referenced interpretation is most useful when we concern about relative ranking of students
and to classify students. It’s the best way of combining both two types of test to get a
maximum result.
CHAPTER III
CONCLUSION
2. Selected-response tests are lowest in realism and complexity of tasks assessed but
require little time to administer and can be quickly scored.
Supply-response tests are higher in realism and complexity of tasks than selected-response
tests, but they are more consuming time and more difficult to score.