Documente Academic
Documente Profesional
Documente Cultură
INTRODUCTION
In this paper, the writers propose a model of usefulness that includes five test
qualities. They are validity, reability, practically, impact, and authenticity.
Moreover, the writers also propose about language use in language test.
1
CHAPTER II
THEORETICAL FRAMEWORK
2
Principle 2 : The individual test qualities cannot be evaluated independently,
but must be evaluated in terms of their combined effect on the overall
usefulness of the test.
1. Validity
a. Definition of Validity
A test is valid if tests what it is supposed to tests. In other words, a test is said
to be valid if it measures accurately what it is intended to measure. Thus it is
not valid, for instance, to test writing ability with an essay question that
requires specialist knowledge of history or biology – unless it is known that all
students share this knowledge before they do the test. Validity is a concept
designating an ideal state – to be pursued, but not to be attained. 1 As the
roots of the words imply, validity has to do with truth, strength, and value.
Rather, validity is like integrity, character to be assessed.
b. Types of Validity
1. Construct Validity
3
research activity, the means by which theories are put to the test and are
confirmed, modified, or abandoned.
2. Content Validity
3. Face Validity
4. Criterion-Related Validity
2
Arthur Hughes, Testing for Language Teachers, ( New York: Cambridge University Press, 1989 ), p.
26
3
ibid p. 22
4
Another approach to test validity is to see how far results on the test agree
with those provided by some independent and highly dependaple assessment
of the candidate’s ability. This independent assessment is thus the criterion
measure agains which the test is validated. There are essentially two kinds of
criterion-related validity, they are concurrent validity and predictive validity.
a. Concurrent Validity
Concurrent validity is established when the test and the criterion are
administered at about the same time.
b. Predictive Validity
Predictive validity concerns the degree to which a test can predict
candidates’ future performance.
2. Reliability
4
Lyle F. Bachman, Adrian S. Palmer, Language Testing in Practice: Designing and
Developing Useful Language Tests, ( Oxford: Oxford University Press, 1996), p. 19.
5
Scores on test tasks with Reliability Scores on test tasks with
characteristic A characteristics A’
3. Practicality
6
In order to assess practicality, we need to resources which classify into three
types , they are as follows:
a. Human Resources
They include test writers, scorers or raters, and test administrators.
b. Material Resources
Rooms for test development and test administration, equipment such
as papers, pictures, typewriters.
c. Time
It consists of development time ( time from beginning of the test
development process to the reporting of scores from the first
operational administration.
4. Impact
a. Washback
7
An aspect of impact that has been particular interest to both language
testing researches and practitioners is what is referred to as
‘washback” and most discussions of this have focused on processes
( learning and instructions). Washback as the direct impact of testing
on individuals , and it is widely assumed to exist.
b. Impact on individuals
A variety of individuals will be affected by and thus have an interest, or
hold a “ stake”, in the use of a given test in any particular situation.
1. Impact on test takers
Test takers can be affected by three aspects of the testing
procedure:
- The experience of taking and, in some cases of preparing for
the test,
- The feedback they receive about their performance on the test,
and
- The decisions that may be made out about them on the basis of
their test scores.
2. Impact on teachers
The second group of individuals who are directly affected by tests
are test users, and in an instructional program the test users that
are most directly affected by test use are teachers. As noted above,
impact on the program of instruction, as implemented by classroom
teachers, has been referred to by language testers as washback.
One way to minimize the potential for negative impact on
instruction is to change the way we test so that the characteristics
of the test and test tasks correspond more closely to the
characteristics of the instructional program.
8
A test developers and test users must always consider the societal and
educational value systems that inform our test use. The consideration
of values and goals is particularly complex in the contect of second or
foreign language testing, since the this situation inevitably leads us to
the realization that the values and goals that inform test use may vary
from one culture to the next. For example, one culture may place great
value on individual effort and achievement, while in another culture
group cooperation and respect for authority may be highly valued.
Values and goals also chane over time, so that issues such as secrecy
and access to information, privacy and confidentially, which are now
considered by many to be basic rights of test takers, were at one time
not even a matter of consideration. The other examples, the potential
impact on the language teaching practice and language in a given
country of using a particular type of test task, such as the multiple-
choice item or a specific type of oral interview, in widely used high-
stakes tests on a national level.
The impact of the test use needs to be considered within the values and
goals of society and the educational program in which it takes place, and
according to the potential consequences of such use.
5. Authenticity
9
Authenticity
Characteristics of the Characteristics of of
TLU task the test task
The characteristics of tasks are of interest for several reasons, they are :
10
A language use task is an activity that involves individuals in using
language for the purpose of achieving a particular goal or objective in a
particular situations.
11
5. A Framework of language task characteristics
The framework of task characteristics consist of a set features for
describing five aspects of task : setting, test rubric, input, expected
response, and relationship between input and respone. Framework is
to provide a basis for language test development and use. This
involves three activities :
a. Describing TLU tasks as a basis for designing language test tasks,
b. Describing different test tasks in order to insure their comparability
and as a means for assessing reliability,
c. Comparing the characterictics of TLU and test tasks to assess
authenticity.
12
a. Format
b. Language of input
c. Topical characteristics
13
whether the test taker or language user must also rely on the
information in the context or in his own topical knowledge.
- Direct : the response includes primarily information supplied in
the input.
- Indirect : the response includes information not supplied in the
input.
CHAPTER III
CONCLUSION
There are five test qualities : validity, reliability, authenticity, impact, and
practically. These five test qualities all contribute to tets usefulness, so that
14
they cannot be evaluated independently of each other. Furthermore, the
relative importance of these different qualities will vary from one testing
situation to another. So that, test usefulness can only be evaluated for
specific testing situation.
REFERENCES
15
David Brinberg, Joseph E. McGrath. Validity and the Research Process. California:
Sage Publications, 1984.
16