Documente Academic
Documente Profesional
Documente Cultură
Revisions of clinical psychological tests are occurring at more frequent intervals than in the past. These
revisions involve such practical issues as the goals of the revision, the economic considerations relating
to the revision, the methodology of the revision, and the degree to which the revision meets the available
scientific and ethical standards governing the use of these tests in individual assessment. A tension
between practical and ethical issues in the test revision process is inevitable and demands the best of
psychologists in their decision making. Test developers, psychologists, patients, and consumers of the
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
test interpretations and recommendations have legitimate interests and a stake in seeing that test revisions
This document is copyrighted by the American Psychological Association or one of its allied publishers.
are applied with maximal effectiveness and fairness in the broadest sense.
Revisions of clinical psychological tests used in individual are taking tests, and those who are consuming test results and
assessment appear to have become more frequent, not only in findings based on them. There are both practical and ethical
terms of actual numbers of original versions of tests or tests being considerations involved, which have diverse influences on the
updated but also in terms of a net reduction in the amount of time process of test revision or modification. The goal of the present
between test revisions. These test revisions are most noticeable to article is to examine some of the considerations that clinicians,
clinicians and others using general purpose psychological tests educators, and researchers incorporate in their thinking regarding
(e.g., Wechsler Adult Intelligence Scale [WAIS], Minnesota Mul- the ongoing acceleration of test development. The article also is
tiphasic Personality Inventory [MMPI]). However, the trend to- meant to suggest appropriate questions or considerations when test
ward revision also encompasses tests used for special purposes revisions appear. The article is cast in general terms and does not
(e.g., Wide Range Achievement Test, Beck Depression Inventory). apply to any particular type of test or maker of tests; issues specific
Even checklist-type self-rating scales require informed updating to group testing or nonclinical populations are not presented here.
despite the fact that the central idea of the test is unchanged (Lubin
& Zuckerman, 1998). Where full-scale revisions of tests are not Test Guidelines and Ethical Canons:
undertaken, there are often new methods of scoring, new scales, or
When Must a Test Be Revised?
derivative (e.g., brief) versions of tests introduced for psycholo-
gists' consideration and use. In addition, measures introduced There are no absolute practical or ethical guidelines suggesting
originally for one kind of use in certain developmental assessment when a test requires revision, although there are consensus scien-
contexts (e.g., Vineland Scales) sometimes find expanded uses in tific (American Psychological Association [APA], 1966; APA,
their revision as measures of activities of daily living applied to American Educational Research Association, & National Council
new populations (e.g., geriatric trauma and chronic disease pa- on Measurement in Education, 1974, 1985) and ethical (APA,
tients) as well as to patients with congenital skill limitations. 1992) codes touching on issues that should drive or structure
Indeed, if the introductory, promotional, or sales materials offered decisions about the need for test revision. The current published
by direct mail to psychologists are any indicator, the extent and version of Standards for Educational and Psychological Testing
rate of individual test development has never been greater. (APA et al., 1985) states in Standard 3.18 that:
These revisions represent a trend demanding greater attention
from those who are selecting and administering tests, those who A test should be amended or revised when new research data, signif-
icant changes in the domain represented, or new conditions of test use
and interpretation make the test inappropriate for its intended uses. An
Kenneth M. Adams, Psychology Service, Ann Arbor Veterans Affairs apparently old test that remains useful need not be withdrawn or
Health System and Departments of Psychiatry and Psychology, The Uni- revised simply because of the passage of time. But it is the respon-
versity of Michigan. sibility of test developers and test publishers to monitor changing
This article was informed by discussions with colleagues in a number of conditions and to amend, revise or withdraw the test as indicated,
roles related to the process of test revisions. Whether corporate officials, (p. 29)
psychologist consumers, test critics, or attorneys knowledgeable about
tests, they deserve a note of appreciation for their candor and cooperation. This standard can be contrasted with the previous version, Stan-
I especially want to note the contributions of Cathy Zepelin from a lifetime dards for Educational and Psychological Tests (APA et al., 1974),
of personal experience with test triumphs and foibles and Robert Ivnik, to
which makes more specific recommendations concerning the in-
whom I turned to for some visionary thoughts about this issue and came
formation that is desirable for psychologists to know regarding a
away rewarded as I suspected that I might.
Correspondence concerning this article should be addressed to Kenneth test's revision.
M. Adams, Psychology Service (116B), Ann Arbor Veterans Affairs The ethical standard (APA, 1992) comes to the same recom-
Health System, 2215 Fuller Road, Ann Arbor, Michigan 48105-2300. mendation from the behavioral perspective of proscription in Stan-
Electronic mail may be sent to kmadams@umich.edu. dard 2.07, Obsolete Tests and Outdated Test Results:
281
282 ADAMS
Psychologists do not base their assessment or intervention decisions above represent consensus agreements to the current state of the art
or recommendations on data or test results that are outdated for the to which professionals expect to be held. To point out that these
current purpose. Similarly, psychologists do not base such decisions professional community standards are in some senses aspirational,
or recommendations on tests or measures that are obsolete and not or that individual standards should not be interpreted or seized
useful for the current purpose, (p. 1603)
upon in isolation to evaluate certain actions of psychologists
For example, the ease of use of a particular test, its acceptance by should not lessen the importance or salience of these standards.
test takers, or its familiarity to assessment consumers is easily However, guidelines and principles are inherently abstract and
outweighed by the fact that changes in the psychologist's practice cannot encompass the experience and practical considerations
demographics make the available norms for the test's interpreta- likely to shape the behaviors of all those involved with and
tion tenuous for the new kinds of patients being assessed. This affected by the test revision process. In this regard, one excellent
occurs even though the particular application of the test is well resource (Eyde et al., 1993) uses an empirical test use methodol-
within the range of intended uses and there is an absence of ogy and case vignettes to create a superior syllabus in responsible
and ethical test use.
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
acteristics. Even collecting new local norms on the old test is What blend of practical and principled considerations reason-
neither feasible nor a complete solution to the problem. A more ably enters the decision to revise a test or adopt the revision? The
recent complication affecting older tests and their revisions may answers obviously are influenced by the particular details of a
also occur in instances of public disclosure of test stimuli whereby given situation, but a representative set of considerations is pro-
coaching of patients on test taking takes place. This seems to be the vided here. This is not intended to reflect an exclusive or ordinal
case in some forensic situations. Other examples of professional set of priorities, but rather to indicate the interrelatedness of
practices not in compliance with the code regarding test obsoles- principled guidance and practical realities of test revision.
cence are discussed in commentary or case volumes published
since the APA (1992) code (e.g., Canter, Bennett, Jones, & Nagy, Cost
1994).
Many other conditions may also dictate a need for test revision. No guidelines of any kind are available suggesting the cost that
The more obvious indications are the recognition of test materials producers of individual clinical psychological tests should bear for
or stimuli as outdated (e.g., the picture of a 1950s automobile in test revision or improvement. Absolute cost may be an impossible
one intelligence test that has since been revised). Language usage parameter to specify, and in this regard, the issue is probably best
also changes in dramatic ways, rendering certain items ineffective described as cost-effectiveness.
or neglecting to include new ideas or concepts (e.g., the Internet) When major revision was an infrequent event, it was possible
that relate more to adaptive behavior in modern society. Item for test publishers to contemplate a revision over a long period of
performance characteristics (cf. Knowles & Condon, 2000) may time and to make fairly good estimates of how a revision would be
also change in complex ways and are not invariant across revi- accepted and how to fund the cost of the revision process. In
sions. The demographic characteristics of the groups used in the addition, the publisher or owner would benefit from a large amount
standardization of one revision of a test might change, as is the of independent scientific evidence concerning the use of a test and
case with multinational Hispanic immigration patterns in the de- data indicating just what features of a test were strong or required
mography of the United States over the past 10 years. Finally, attention for improvement in the revision. This typically extended
there are changes in the definitions of problems that tests are to the subtests, items, norms, and even the format of the test. This
intended to identify, as was the case for post traumatic stress type of revision scenario permitted deliberate planning and some
disorder in relation to personality tests when the Diagnostic and notion of likely economic parameters of budgeting, expenditures,
Statistical Manual of Mental Disorders (4th ed., American Psy- and sales. It is a scenario that is now gone forever, except for the
chiatric Association, 1994) was introduced (Adams, 1996). unusual small test owner or publisher with a single test, a known
A wider appreciation of all of the potential implications of the or familiar customer constituency, and little competition.
test revision process can be generated readily from a review of the Instead, test revision in a contemporary context occurs when a
General Principles section of the APA Ethical Principles (APA, test is owned by or receives exclusive distribution through a test
1992, p. 1599). No psychologist could fail to generate inductively publishing firm. Test publishing firms, in turn, are often part of
a list of desired actions and attributes of a clinical test revision larger corporations with business concerns extending far beyond
process, including multiple entries under each of the realms of psychology and publishing. This new environment has several
competence, integrity, professional and scientific responsibility, noteworthy features.
respect for people's rights and dignity, concern for others' welfare, First, the decision to revise a test itself may be made in an
and social responsibility. The psychologist could organize these as environment wherein die economic return of a test becomes the
mandatory, discretionary, or enhancing to structure such a list, but salient factor in decision making about the test. This is a complex
that would not shorten it given individual views and devotion to calculus in that a firm doing business in psychological tests and
these professional lodestars. materials views a test revision as an investment needed to maintain
product competitiveness and to meet societal or legal obligations
Experience and Practical Considerations: Considerations for test viability. The process also needs to ensure that the revision
of a particular test is successful to help support the wider portfolio
Leavening Ethical Issues in Test Revision
of test products offered or to fund new development projects. In a
Guidelines for psychological tests and ethical principles gov- corporate environment, this ownership structure means that the
erning their use in the practice of psychology such as those cited psychological test revision project must do well financially to
SPECIAL SECTION: ETHICAL AND PRACTICAL ISSUES 283
support the viability of the entire psychological test subsidiary occur now at about 10-year intervals, increased costs of newer test
in the wider organizational context of the corporation. For materials and the introduction of modified or knock-off products
example, a test publishing subsidiary of a large corporation may (e.g., brief versions, screening versions, or special versions of tests
have other subsidiaries selling books, software, etc., that have for particular research needs) have increased the amount that
an economic balance that a financially unsound test subsidiary psychologists can expect to budget and spend on test materials.
might undermine. Even essential test reference resources are sometimes extra-cost
Second, the number of people involved in the test revision options (The Psychological Corporation, 1997).
process has expanded greatly. Test and measurement experts and One cost-containment strategy to counter this difficulty involves
clinical application specialists previously predominated in the test partial revisions of tests designed to fix specific problems identi-
revision process, with consultants employed as needed. In this fied in the field or on research studies done after test deployment.
environment, psychologists held sway with decisions influenced in However, many major test revisions are also partial revisions
the main by clinical and measurement issues. However, the pro- because certain items, a particular format, or a method of scoring
cess is now augmented by the participation of corporate officials, is retained (e.g., MMPI-2). Psychologists can expect to see such
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Goals and Mission of the Test Revision tests with social concerns is ethically defensible—and, coinciden-
tally, the best economic strategy for test revisions.
The task of revising a test should represent an opportunity to Thus, the goals for the revision of a test would need to incor-
revisit the basic assumptions that guided its original development porate (a) accumulated experience with the previous version, (b)
and previous revisions, if any. The original theory or field guiding advances in the theory and method for the assessment of the
a personality or cognitive test may have grown or been superseded construct in question, (c) an appreciation of new ways in which the
by new research developments, such as item response theory test will be challenged, (d) consideration of financial and market-
(Carroll, 1990; Rasch, 1980). Revisions of tests may encompass place relationships between test sellers and psychologists and
new concepts and developments (Thorndike, Hagen, & Saltier, between psychologists and their clients, and (e) technological
1986) and create a test that represents a new paradigm and leads changes likely to impact the uses of the test. It is a given that
the field but may also be unfamiliar to psychologists using the decisions about the revised test will reflect some trade-offs be-
previous version. tween the desirable and the possible.
Every test revision project must address the question of how
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
success of the process. Adherents to the latter viewpoint are would be the introduction of mandated warnings about adverse
usually slower to see flaws or interpretative pitfalls that arise from outcomes or side effects on psychological test protocols printed in
procedural or methodological causes. the manner of package inserts for drugs as required by the Food
It is at this juncture that most test publishers fail to avail and Drug Administration.
themselves of the observations and suggestions of career psychom- Finally, support by test makers for revised test versions is in
etrists, whose ongoing practical experience in the test environment great need of improvement. Existing training materials are rudi-
typically far exceeds that of most psychologists. It is these spe- mentary in their teaching about the revised test as a rule. Compe-
cialists in actual test situations who can best predict likely points tent telephone support, newsletters, World Wide Web pages, and
of patient acceptance or irritation. They are also invaluable in other methods of solving existing problems and anticipating new
pinpointing expected administration and scoring deficiencies be- ones seem to be conspicuously absent resources once a psychol-
fore the test is offered for sale, and these issues are identified as ogist is sitting with the new test kit on the table. Future efforts at
flaws. creating an ongoing relationship between test publishers, custom-
Revalidation of a test rarely satisfies all aspects of the Standards ers, and consumers of test revisions makes good basic business
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
(APA, 1966; APA et al., 1974, 1985) in every respect, and this sense.
This document is copyrighted by the American Psychological Association or one of its allied publishers.
cannot be expected to change in the future. Reflection suggests that The ethical revision of clinical psychological tests for individual
no single test-publishing entity could encompass all the sources of assessment, thus, is not a process with easy or expected features or
skill and patient material needed to create a flawless revision. processes. Test makers try to anticipate the changing needs of
However, attention to the letter and spirit of aspirational technical psychologists and their patients, but few psychologists are aware
standards is certain to improve the revised test product. of the context and mechanics of the test revision process or think
that they have the kind of input or ongoing relationship that would
Revision as a Continuous Process improve the outcome of test revision. Test publishers may do well
to actually issue requests for user comments in trade publications
The process of test revision once had a phase in which the test such as the APA Monitor. This would not only alert test users to the
revision was complete and the revision was at least symbolically existence of the test revision process but would also invite con-
placed in the hands of the provider community. This hand-off may structive comment at no great cost to the publisher. It is also firmly
not remain feasible or desirable either now or in the future. in the spirit of seeing the development of psychological tests as a
Experience in the use of a revision is a precious commodity early public trust as well as a commercial enterprise.
in the life of the revised test, as the validation data in the test The tension between the practical and ethical issues is inherent
manual serve as the only known standard of comparison. Accord- in the test revision process and is unlikely to be legislated in a way
ingly, the debut of a revised test is often accompanied by a frenzied satisfactory to every involved party. The best informed and doc-
accumulation of cases regarding the performance of the revised umented decision-making approach appears to be the only way for
test in practice by researchers wanting to publish. This process psychologists to respond to the challenge.
gives way to the growth of a more deliberate and experience-based
literature, eventually leading to some body of opinion in the
professional community regarding the revision. References
Test makers may decide to lead this process rather than to Adams, K. M. (1996, August). Non-trauma traumata: The hits just keep on
follow it in considering the process of test revision a continuous coming. Paper presented at the meeting of the American Psychological
one. I suggest that psychological test publishers deal with their Association, Toronto, Ontario, Canada.
customers in the way that software producers do in issuing updates American Psychiatric Association. (1994). Diagnostic and statistical man-
and upgrades large and small. There are obvious normative prob- ual of mental disorders (4th ed.). Washington, DC: Author.
lems with continuous modifications of tests, but problems need not American Psychological Association. (1966). Standards for educational
accumulate to a critical mass of obsolescence before being and psychological tests and manuals. Washington, DC: Author.
American Psychological Association. (1992). Ethical principles of psy-
addressed.
chologists and code of conduct. American Psychologist, 47, 1597-1611.
In a similar vein, actual computer software for test administra-
American Psychological Association, American Educational Research As-
tion, scoring, and interpretation badly needs improved support and sociation, and National Council on Measurement in Education. (1974).
forward development planning to serve contemporary needs. Many Standards for education and psychological tests. Washington, DC:
psychologists either retain outdated computers solely because they American Psychological Association.
will run the only available test software he or she needs or because American Psychological Association, American Educational Research As-
its unlimited uses feature has been eliminated in the next revision, sociation, and National Council on Measurement in Education. (1985).
and their perception of the per use cost is that it is too high. Standards for education and psychological testing. Washington, DC:
Test revision as a continuous process also carries with it the idea American Psychological Association.
that product liability conceptions and law apply or are considered Canter, M. B., Bennett, B. E., Jones, S. E., & Nagy, T. F. (1994). Ethics for
psychologists: A commentary on the APA ethics code. Washington, DC:
to apply in ways not very often seen in psychological practice
American Psychological Association.
today. Errors are made in decisions that are based on revised Carroll, J. B. (1990). Estimating item and ability parameters in heteroge-
psychological tests, but there are few, if any, comparable profes- neous tests with the person characteristic function. Applied Psycholog-
sions where error plays no role in practice. Even the most severe ical Measurement, 12, 109-125.
test critic would be unlikely to suggest that product liability Eyde, L. D., Robertson, G. J., Krug, S. E., Moreland, K. L., Robertson,
standards used in manufacturing or the idea of recalls of revisions A. G., Shewan, C. M., Harrison, P. L., Porch, B. E., Hammer, A. L., &
of psychological tests would make great sense. Even less enticing Primoff, E. S. (1993). Responsible test use: Case studies for assessing
286 ADAMS
human behavior. Washington, DC: American Psychological Associa- Strauss, E., Spreen, O., & Hunter, M. (2000). Implications of test revisions
tion. for research. Psychological Assessment, 12, 237-244.
Knowles, E. S., & Condon, C. A. (2000). Does the rose still smell as sweet? Thorndike, R. L., Hagen, E. P., & Saltier, J. M. (1986). Stanford-Binet
Item variability across test forms and revisions. Psychological Assess- Intelligence Scale, (4th ed.). Itasca, IL: Riverside Publishing Company.
ment, 12, 245-252. Wechsler, D. (1955). Manual for the Wechsler Adult Intelligence Scale.
Lubin, B., & Zuckerman, M. (1998). Manual for the Multiple Affect New York: The Psychological Corporation.
Adjective Checklist—Revised. San Diego, CA: Educational and Indus- Wechsler, D. (1981). WAIS-R Manual: Wechsler Adult Intelligence
trial Testing Service.
Scale—Revised. San Antonio, TX: The Psychological Corporation.
Pope, K. S., Butcher, J. N., & Seelen, J. (1993). The MMPI, MMPI-2, and
MMPI-A in court. Washington, DC: American Psychological Association.
The Psychological Corporation. (1997). WAIS-III—WMS-III Technical
Manual. San Antonio, TX: Author. Received May 24, 1999
Rasch, G. (1980). Probabilistic models for some intelligence and attain- Revision received December 7, 1999
ment tests. Chicago: University of Chicago Press. Accepted December 9, 1999
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
This document is copyrighted by the American Psychological Association or one of its allied publishers.
The Publications and Communications Board has opened nominations for the
editorships of Journal of Applied Psychology, Journal of Consulting and Clinical
Psychology, Journal of Educational Psychology, Psychological Bulletin, and Journal
of Personality and Social Psychology: Interpersonal Relations and Group Processes
for the years 2003-2008. Kevin R. Murphy, PhD, Philip C. Kendall, PhD, Michael
Pressley, PhD, Nancy Eisenberg, PhD, and Chester A. Insko, PhD, respectively, are the
incumbent editors.