Documente Academic
Documente Profesional
Documente Cultură
Why is it necessary?
1. Validity
- “How valid is the test for the decision that I need to make?” or
- “How valid is the interpretation I propose for the test?”
A. Face validity
B. Content validity
C. Criterion validity (Concurrent and Predictive)
A) Face Validity:
- refers to the extent to which the physical appearance of the test corresponds to what it
is claimed to measure
B) Content Validity:
- means that extent to which the content of the test is truly a representative of the
content of the course.
- means that extent to which the content of the test is truly a representative of the
content of the course. A well constructed achievement test should cover the
objectives of instruction, not just its subject matter. Three domains of behavior are
included: cognitive, affective and psychomotor.
C) Criterion-Related Validity
• Concurrent validity is established when the test and the criterion are
administered at about the same time.
• Predictive validity concerns with the degree to which a test can predict
candidates’ future performance.
1. Face Validity ascertains that the measure appears to be assessing the intended
construct under study. The stakeholders can easily assess face validity. Although this is
not a very “scientific” type of validity, it may be an essential component in enlisting
motivation of stakeholders. If the stakeholders do not believe the measure is an accurate
assessment of the ability, they may become disengaged with the task.
Type of
Definition Example/Non-Example
Validity
A semester or quarter exam that only includes
The extent to which the content
content covered during the last six weeks is
Content of the test matches the
not a valid measure of the course's overall
instructional objectives.
objectives -- it has very low content validity.
The extent to which scores on the
test are in agreement with If the end-of-year math tests in 4th grade
Criterion (concurrent validity) or predict correlate highly with the statewide math tests,
(predictive validity) an external they would have high concurrent validity.
criterion.
If you can correctly hypothesize that ESOL
The extent to which an
students will perform differently on a reading
assessment corresponds to other
Construct test than English-speaking students (because
variables, as predicted by some
of theory), the assessment may have construct
rationale or theory.
validity.
Therefore, reliability is “the extent to which a test produces consistent scores." This
means that the higher the extent, the more reliable the test. Valid test is also a reliable
test, but a reliable test may not be a valid one. Stability of tests scores tested at different
time.
Discriminating Power
• Discriminating power of the test is its power to discriminate between the upper and
lower groups who took the test.
Discrimination Index
• The power of the item to discriminate the students between those who scored
high and those who scored low in the overall test. In other words, it is the power
of the item to discriminate the students who know the lesson and those who do
not know the lesson.
1. Positive Discrimination
2. Negative Discrimination
3. Zero Discrimination
The item of the test is answered correctly or know the answer by all the
examinee’s
An item is not answered correctly any of the examinee