
VITA

NAME: Ronald K. Hambleton

HOME ADDRESS: 268 Iduna Lane, Amherst, MA 01002, (413) 253-5344

OFFICE ADDRESS: Center for Educational Assessment, Hills South/Room 154, University of Massachusetts, Amherst, MA 01003, (413) 545-0262, FAX: (413) 545-4181, e-mail: rkh@educ.umass.edu

MARITAL STATUS: Married, two sons

BIRTH DATE: June 27, 1943

BIRTHPLACE: Hamilton, Ontario, Canada

EDUCATION:

B.A. Honors, University of Waterloo, 1966. Major: Mathematics; Minor: Psychology

M.A. University of Toronto, 1967. Major: Psychometric Methods; Minor: Statistics

Ph.D. University of Toronto, 1969. Major: Psychometric Methods; Minor: Computer Science, Statistics

AWARDS AND HONORS:

Graduate Fellowship, University of Toronto, 1966-1969. American College Testing Summer Postdoctoral Fellowship, 1971. Research Fellowship, Educational Research Institute of British Columbia, Vancouver, Canada, 1982. President, National Council on Measurement in Education, 1989-1990. President, International Test Commission, 1990-1994. Psychometric Fellowship, University of Twente, The Netherlands, 1991. National Council on Measurement in Education Career Achievement Award, 1993. University of Massachusetts Chancellor's Medal, 1994. Honorary Doctorate, University of Umea, Faculty of Social Sciences, 1994. President, Division II, International Association of Applied Psychology, 1998-2002. President, Division 5, American Psychological Association, 1996-1997. College Outstanding Teacher Award, University of Massachusetts, 1996-1997. Appointed Distinguished University Professor, University of Massachusetts, 1998. Association of Test Publishers Career Achievement Award, 2003.

Honorary Doctorate, University of Oviedo, Oviedo, Spain, 2003. International Test Commission Award for Distinguished Service, 2003. E. F. Lindquist Award for Outstanding Research in Assessment (AERA and ACT), 2005. University of Massachusetts Award for Outstanding Accomplishments in Research and Creative Activity, 2005. Samuel J. Messick Award for Scientific Contributions to the Field of Measurement, Division 5 of APA, 2006.

PROFESSIONAL EXPERIENCE: Appointments

Lecturer, Ontario College of Education, University of Toronto, Summers 1968-1972. Graduate Assistant, Department of Measurement and Evaluation, The Ontario Institute for Studies in Education, 1966-1969. Assistant Professor (1969-1973), Associate Professor (1973-1980), Professor (1980-1998), and Distinguished University Professor (1998-present), University of Massachusetts at Amherst. Visiting Professor, School of Business Administration, United States International University, Summer 1976. Adjunct Professor, Graduate School of Applied Behavioral Sciences, California American University, 1976-1980. Chairperson, Laboratory of Psychometric and Evaluative Research, University of Massachusetts at Amherst, 1973-present. Lecturer, George Washington University, Summer 1980. Visiting Professor, University of Leiden, The Netherlands, Fall 1981. Visiting Scholar, UCLA, Fall 1982. Visiting Professor, Technical Teachers' Training Institute, Bhopal, India, Summer 1987. Member, National Faculty, Center for the Study of Evaluation, UCLA, 1987-1991. Visiting Professor, University of Umea, Sweden, September 1990 and June 2004. Visiting Professor, University of Ottawa, Spring 1992. Executive Director, Center for Educational Assessment, University of Massachusetts, 2004-present.

National/International Committee Work

Joint AERA-NCME-APA Committee on Test Standards, 1977-1978. AERA Publications Committee, 1979-1981. APA Psychological Tests and Assessment Committee, 1980-1982. APA Division 5 Public Affairs Committee, 1982-1984. APA representative to the International Test Commission, 1982-1986. NCME Board of Directors, 1983-1986. NCME representative to the Joint Committee on Standards for Educational Evaluation, 1984-1987. NCME Publications Committee, 1984-1986. ETS Blue Ribbon Committee to Evaluate the Mantel-Haenszel Statistic, Spring 1986. ETS Advisory Panel on Design of Assessment Services Relating to the Educational Equality Project, Member, 1985.

International Test Commission, Vice-President, 1986-1990; President, 1990-1994; Past-President, 1994-1998. New Jersey High School Proficiency Test Technical Advisory Committee, Chairperson, 1986-present. NCME Committee on the Recruitment of Measurement Professionals, Member, 1987. NCME Vice-President, 1988-1989; President, 1989-1990; Past-President, 1990-1991. NCME Awards Committee, Chairperson, 1989. NCME Membership Committee, Chairperson, 1989. National Research Advisory Committee to the National Board of Medical Examiners, Member, 1989-1991. Technical Review Committee for the National Adult Literacy Project, Member, 1990-1993. Division 5, APA Workshops Committee, Member, 1990-1991. National Assessment of Educational Progress (NAEP) Technical Advisory Committee, Member, 1990-1994. NAGB-ACT Technical Advisory Committee to the NAEP Achievement-Level Setting in Mathematics, Reading, and Writing, Member, 1991-2000. European Conference on Educational Research, Research Methodology, and Evaluation Research, Program Co-Chairperson, 1992. National Board for Professional Teaching Standards, Technical Analysis Group, Member, 1992-1996. International Association of Applied Psychology, Division 2, Executive Committee, Member, 1992-1996. NCME International Measurement Issues Committee, Member, 1992-1994. National Board of Medical Examiners John P. Hubbard Award Committee, Member, 1993, 1994. International Committee to Develop Guidelines for Adapting Instruments and Establishing Score Equivalence, Chairperson, 1992-2000. Professional Examination Service, Board of Directors, 1994-1999. NCME Instructional Modules Committee, Member, 1994-1998. Massachusetts Assessment Advisory Committee, Member, 1994-1997. European Association of Psychological Assessment Awards Committee, Member, 1994-1995. KIRIS National Technical Review Committee, Chairperson, 1994-1995. Technical Advisory Committee, Graduate Record Examinations Program, Member, 1995-1997. 
Board on International Comparative Studies in Education, National Research Council, Member, 1995-1998. Technical Advisory Panel, Department of Defense Education Activity, Member, 1995-1996. NAEP Design and Feasibility Committee, National Assessment Governing Board, Member, 1996. National Council on Measurement in Education Student Dissertation Awards Committee, Chair, 1997-1998. National Council on Measurement in Education Nominations Committee, Member, 1997. International Advisory Committee to the Swedish Scholastic Aptitude Testing Program, Member, 1992-present. Technical Advisory Committee on Computer-Based Exams, British Columbia Department of Education, Member, 1996-1998.

Technical Advisory Committee to the Early Childhood Longitudinal Study, U.S. Department of Education, Member, 1996-1999. Committee to Develop International Guidelines on Core Standards for Test Use, International Test Commission, Member, 1996-1999. Technical Review Panel for the Computerization of the USMLE, National Board of Medical Examiners, Member, 1996-2000. Scientific Advisory Board to the National Institute for Testing and Evaluation, Israel, Member, 1996-present. IAAP Division 2 1998 Program Committee, Member, 1996-1998. Technical Review Panel for the Standardized Patient Project, National Board of Medical Examiners, Member, 1996-2001. AIR Technical Advisory Committee for the Volunteer National Test, Member, 1997-2000. National Research Council Committee on Embedding Common Test Items in State and District Assessments, Member, 1999. Massachusetts Department of Education Technical Advisory Committee, 1997-2003. Virginia Department of Education Technical Advisory Committee, Chairperson, 1999-present. Florida Department of Education Technical Advisory Committee, 1998-1999. Wisconsin Department of Education Technical Advisory Committee, 1998-2002. New York Department of Education Blue Ribbon Committee on English Language Arts, Member, 1999. Delaware Department of Education Technical Advisory Committee, Member, 2001-present. Graduate Management Admissions Council, Technical Advisory Committee, Member, 2002-2005. Program Committee of the Joint European Conference of the IACCP and the ITC, Graz, Austria, 1996-1999. Cultural Review Panel, OECD/PISA 2000 Project to Assess School Achievement in 30 Countries, Chairperson, 1999. GMAT Research Policy Task Force, Member, 1999-2000. New York State Career and Technical Education Advisory Group, Member, 1999-present. NIMH Project to Develop and Validate a Consumer Mental Health Outcome Measure, Consultant, 1999-present. Virginia Technical Advisory Committee, Chairperson, 1999-present. 
Technical Review Committee for the Maryland Testing Program, Chairperson, 1999-2000. National Technical Analysis Group (TAG-2), National Board for Professional Teaching Standards, Member, 1996-2003. Psychometric Oversight Committee, American Institute of Certified Public Accountants, Chairperson, 1999-present. Assessment Advisory Committee, South Africa, Member, 2000-present. National Research Council Committee on Embedding Items in Assessments, Member, 1999. Pennsylvania Department of Education Technical Advisory Committee, Member, 1996-present. Selection Committee for the Medical College of Canada's Outstanding Achievement in the Evaluation of Clinical Competence Award, Member, 2001-2003.

National Cancer Institute, Cancer Outcomes Measurement Working Group, Member, 2001-2002. Delaware Department of Education, Technical Advisory Committee, Member, 2001-2003. Advisory Committee to the West Virginia Department of Education, Member, 2000-2001. Advisor to the Connecticut Department of Education on Standard Setting, 2001. AERA International Relations Committee, Member, 2002-2005. Department of Health and Human Services Project to Develop a Consumer Mental Health Outcomes Measure, Consultant, 1997-2003. SRI International Project to Evaluate the Performance Standards in Washington State, Consultant, 2002. Graduate Management Admission Council Technical Advisory Committee, Member, 2003-2005. National Council on Measurement in Education Career Award Committee, 2002-2004, 2005-present. HEM National Technical Advisory Committee, Member, 2003-2007. SHL Scientific Advisory Board, Member, 2003-present. Educational Quality and Accountability Office, Ontario Department of Education Technical Advisory Committee Member, 2003-2004. Alaska Department of Education, Technical Advisory Committee Member, 2004-present. National Board of Medical Examiners, Center for Innovation Advisory Committee Member, 2005-2007. Medical Council of Canada Award for Outstanding Achievement Committee, Member, 2003-2005. Center for Applied Linguistics Test Design Committee Member, 2005-2006. 9th European Congress of Psychology, International Advisory Board, 2004-2005. Center on Outcomes, Research and Education, Northwestern University, Project to Refine and Standardize Health Literacy Assessment, Consultant, 2005-2008. Technical Advisory Committee, PISA, Chairperson, 2005. National Board of Osteopathic Medical Examiners, Consultant, 2004-2005. NIH Statistical Co-ordinating Center for PROMIS, Consultant, 2005-present. Ordinate Corporation, Consultant, 2005. Harcourt Education Measurement Project with EQAO, Ontario, Consultant, 2004-2005. 
NCEO/University of Minnesota Technical Work Group, Member, 2006-2010. APA Divisions 5 and 52 Task Force to Improve Quantitative Skills Training in Cross-Cultural Psychology, 2006-present. Medical Council of Canada's Examination Development Advisory Committee, Member, 2006-present. Assessment Strategies Inc., Consultant, 2006-present. IAAP Division 2, Secretary-Treasurer, 2007-present. Institute of Education Sciences Statistics and Modeling Scientific Review Panel, Member, 2007-2009. Pearson Advisory Board, 2007-present. American Psychological Association Psychological Tests and Assessment Committee, Member, 2008-2010. Washington Advisory Group on Assessment of English Language Learners, 2007. Puerto Rico NAEP Technical Panel, 2008-present.

Consulting Activities

School Districts

Cincinnati, Cleveland, OH; Amherst, Barre, Billerica, Concord, Holyoke, Lowell, Westfield, Worcester, MA; Providence, RI; Baltimore, Hagerstown, Montgomery County, MD; Kamehameha Schools, Honolulu, HI; Manhasset, New York, Rochester, Port Washington, NY; Houston, Dallas, TX; Glendale, AZ; Newark, DE; New York City; Warren Hills, NJ; Los Angeles, CA; Atlanta, GA; Baton Rouge, LA; Suffield, CT; Hampton, ME; Charleston, SC; Philadelphia, PA; Washington, DC; Tulsa, OK

State and Provincial Departments of Education

Alabama, Alaska, California, Connecticut, Delaware, Florida, Georgia, Hawaii, Kentucky, Louisiana, Maryland, Massachusetts, Michigan, New Jersey, New Mexico, New York, Pennsylvania, Rhode Island, Texas, Virginia, West Virginia, Wisconsin, British Columbia, Ontario, Quebec, Alberta

International

Australia, Canada, England, France, Germany, India, Indonesia, Israel, Italy, Japan, The Netherlands, Saudi Arabia, Scotland, Singapore, Spain, Swaziland, Sweden, Taiwan

Professional Exams

Federation of State Boards of Physical Therapy; Institute of Banking, Saudi Arabia; Municipal Securities Rulemaking Board; National Association of Security Dealers; New York Stock Exchange; American Institute of Certified Public Accountants; National Board of Medical Examiners; American Board of Family Practice; American Board of Internal Medicine; Law School Admissions Council; National Association of Purchasing Management; National Center for Health Education; Canadian Nursing Association; National Commission for Health Certifying Agencies; Educational Services for the Professions; American Dental Association; Professional Examination Service; Certified Systems Professionals; IOX Associates; Graduate Management Admission Council; The Medical Council of Canada; Educational Commission for Foreign Medical Graduates; National Board of Chiropractic Examiners

Industry

Xerox; Polaroid Corporation; American Telephone & Telegraph; GM/UAW; Hewlett-Packard; Hoffman-Roche; Microsoft; RAND; Simplex Time Recorder; Westat

Other

Abt Associates; American College Testing Program; American Council of Learned Societies; American Council on Education; American Institutes for Research; Antioch University; Brown University; Buros Institute; Educational Collaborative for Greater Boston, Inc.; Educational Testing Service; Educational Development Corporation; Educational Quality and Accountability Office, Province of Ontario; Erlbaum Publishers; Foreign Service Institute; Harcourt Educational Measurement; Harper and Row; HumRRO; Institute for International Research; International Education Associates; Kluwer Academic Publishers; Manpower Demonstration Research Corporation; Mathematica Policy Research; Mediax; National Assessment Governing Board; National Center for Education Statistics; National Institute of Education; National Opinion Research Center; New England Research Institute; Northwest Regional Educational Laboratory; Nuclear Power; Office of Educational Research and Improvement, U.S. Dept. of Education; Office of Technology Assessment - U.S. Congress; Pelavin Associates; Riverside Publishing Company; RMC; Sage Publications; SHL Group, Inc.; Springer-Verlag Publishers; SRI International; Teaching Resources; UNESCO; University of Indiana Medical School; U.S. Army; U.S. Air Force; WICAT Systems

Reviewing Activities

Reviewer to the AERA Division D Program Committee. (1972, 1975, 1979-present) Reviewer to the APA Division 5 Program Committee. (1991-present)
Occasional Reviewer for Psychometrika, Review of Educational Research; Curriculum Theory Network; Educational Psychologist; American Educational Research Journal; Canadian Journal of Education; Psychological Bulletin; Social Science Research; Educational Researcher; Educational Evaluation and Policy Analysis; Journal of Applied Psychology; Journal of Cross-Cultural Psychology; American Psychologist; Journal of Experimental Psychology; Educational Measurement: Issues and Practice; Research Quarterly for Exercise and Sport; Linguistics and Education; European Journal of Psychological Assessment, Educational Assessment; Archives of Clinical Neuropsychology. Advisory Editor to the Journal of Educational Measurement. (1972-1980) Co-Chairperson of the NERA-NCME Program Committee. (1972, 1973) Editorial Consultant to Review of Research in Education. (1982) Advisory Editor to Applied Psychological Measurement. (November 1976-present) Associate Editor to Journal of Educational Statistics. (1981-1989) Book Review Editor to Journal of Educational Measurement. (1984-1986) Advisory Editor to Evaluation and the Health Professions. (1987-1997)

Advisory Editor to Educational and Psychological Measurement. (1988-present) Advisory Editor to Revista Portuguesa Educacao. (1986-1998) Advisory Editor to Psicothema. (1989-present) Editorial Consultant to Educational Measurement. (1989, 3rd edition) Advisory Editor to Sage's Measurement Methods for the Social Sciences. (1988-2002) Advisory Editor to the Journal of Educational Measurement. (1988-1992) APA Division 15, National Advisory Committee to the Handbook of Educational Psychology. (1989-1994) Editor to Instructional Topics in Educational Measurement Series, NCME. (1990-1991) Consulting Editor to Multivariate Behavioral Research. (1990-present) Advisory Editor to Applied Measurement in Education. (1990-present) Associate Editor to European Journal of Psychological Assessment. (1993-present) Advisory Editor to Educational Research Quarterly. (1993-present) Advisory Editor to Instructional Topics in Educational Measurement Series. (1997-1999) Advisory Editor to Current Issues in Education. (1999-present) Consulting Editor to the International Journal of Testing. (1999-present) Advisory Editor to Indian Journal of Vocational Education. (2001-present) Advisory Editor to Metodología de las Ciencias del Comportamiento. (2002-present) Advisory Editor to European Journal of Methodology. (2004-present) Advisory Editor to Psychology Science. (2006-present)

Miscellaneous Professional Activities

Invited speaker at Educational Testing Service, University of Alberta, University of Delaware, National Institute of Education, University of Stirling, University of Montreal, North Texas State University, Tulsa Reading Council, University of Connecticut, University of Giessen, University of Ottawa, Miami-Dade Community College, Michigan Educational Research Association, Ontario Institute for Studies in Education, Scottish Council for Educational Technology, University of Leiden, UCLA, Scottish Council of Educational Research, London University, University of Maryland School of Nursing, U.S. Army (20 workshops), Congressional Hearings on Uses of Achievement Scores, Plymouth University, British Post Office, University of Wisconsin, National Board of Medical Examiners, University of Hawaii, University of Amsterdam, University of Twente, Free University of Amsterdam, Florida Educational Research Association.

Instructor, 1977, 1978, 1979, 1980, and 1981 Two-Day AERA Training Programs entitled, "Introduction to Criterion-Referenced Testing and Measurement." Member, Advisory Board for the Johns Hopkins University Symposium on Educational Research, 1977-1982. Instructor, Invitational Seminar on Methods of Mental Measurement, Plymouth, England, September, 1987. Instructor, UNESCO sponsored psychometric methods course, Bhopal, India, July, 1987. Instructor, Invitational Seminar on Advanced Psychometric Methods, National Institute for Testing and Evaluation, Jerusalem, Israel, January, 1989. Participant and reviewer, U.S. Department of Education's Assessment of Student Learning in Post-Secondary Education Workshop, November 15-17, 1991. Consultant to the Cross-European Longitudinal Study of Aging, 1995-2000. Instructor, Invitational Seminar on Item Response Theory, London, England, August, 2004.

UNIVERSITY SERVICE:

University Human Subjects Review Committee, 1972-1974. University Research Council, 1972-1974. School of Education Personnel Committee, Co-Chairperson, 1974. School of Education Dean Search Committee, 1975-1976. EPRA Division Personnel Committee, Chairperson, 1983. University Committee to Evaluate Teaching, 1984. University Graduate Fellowship Awards Committee, 1987, 1988. School of Education Dean Search Committee, 1987. School of Education Task Force on Governance, 1989. Laboratory of Psychometric and Evaluative Research Program, Chairperson, 1973-present. School of Education Dean Search Committee, 1994. School of Education Dean Evaluation Committee, 1998. Provost's Distinguished Professor Committee, Chairperson, 1999-2003. EPRA Department Academic Matters Committee, Chairperson, September 2002-present. Center for Educational Assessment, Co-Director, 2000-2004. Center for Educational Assessment, Executive Director, 2004-present.


RESEARCH AND EVALUATION CONTRACT AND GRANT AWARDS:

University of Massachusetts Faculty Research Grant (Comparative Study of Test Administration Procedures and Scoring Methods with Achievement Tests), 1970. Massachusetts Division of Special Education Grant (An Evaluative Study of In-Service Teacher Training), 1976. National Institute of Education Basic Skills Research Grant (Psychometric and Statistical Contributions to the Theory and Practice of Criterion-Referenced Testing), 1976-1977. Air Force Contract (Applications of Latent Trait Theory to the Development of Norm-Referenced and Criterion-Referenced Tests), 1977-1978. Air Force Contract (Latent Trait Model Contributions to Criterion-Referenced Testing Technology), 1979-1980. National Assessment of Educational Progress (Utilization of Latent Trait Models with NAEP Exercise Results), 1982. Air Force Contract (Construction and Validation of Air Force Specialty Diagnostic Achievement Tests), 1984-1988. Massachusetts Department of Education Contract (Programs to Assist School Districts in Collecting and Using Achievement Test Data), 1987-1988. Chapter 636 Program Evaluation Contract (Evaluation of the Worcester Chapter 636 Programs), 1988. Chapter 636 Program Evaluation Contract (Evaluation of the Worcester Chapter 636 Programs), 1988-1989. NY DOE Contract (Mantel-Haenszel Item Bias and IRT Analyses), 1989. Institute for International Research (Development of Criterion-Referenced Tests in Swaziland), 1990-1994. Graduate Management Admission Council (Solving GMAT Technical Problems with IRT Models), 1990-1994. Indonesian Ministry of Education (Four-Month Psychometric Training Program for Educators), 1991. National Science Foundation (Methods of Setting Standards on Performance Assessments in State Wide Assessment Contexts), 1995-1998. Law School Admissions Council (Assessing Item Difficulty with Anchor-Based Methods and Bayesian Statistics), 1996-1998. National Assessment of Educational Progress (Enhancing Score Reporting), 1996-1997. 
Massachusetts Department of Education (Psychometric Analyses of the MCAS), 1998-1999. Microsoft, Inc. (Computer-Based Test Examinations), 1998-present. Harcourt Educational Measurement (Psychometric Analyses on State Assessment Data), 2000-2003. Massachusetts Department of Education (MCAS Validity Studies), 2002-2004. Measured Progress (MCAS Research and Validity Studies), 2004-present. College Board (Enhancements in Score Reporting), 2006-2008. Pearson Educational Measurement (Validity Studies), 2007-present.


COMPUTER PROGRAMMING EXPERIENCE:

Many years of experience writing computer programs. Programs written include: Hambleton, R. K. Computation of Swineford's tendency to gamble scores, Fortran IV program for the IBM 7094 Computer. Department of Measurement and Evaluation, The Ontario Institute for Studies in Education, 1969. Hambleton, R. K. Computation of information curves and efficiency of three logistic test models, Fortran IV program for the CDC 3600 Computer. Center for Educational Research, School of Education, University of Massachusetts at Amherst, 1970. Hambleton, R. K. Estimating observed-score distributions using logistic test models, Fortran IV program for the CDC 3600 Computer. Center for Educational Research, School of Education, University of Massachusetts at Amherst, 1970. Hambleton, R. K., & Barbuto, P. F. (1971). A computer program for optimal scaling. Behavioral Science, 16, 413. Hambleton, R. K., & Rovinelli, R. (1973). A Fortran IV program for generating examinee response data from logistic test models. Behavioral Science, 17, 73-74. (Revised, September 1990) Hambleton, R. K., & Rovinelli, R. A computer simulation program for item-examinee sampling. Center for Educational Research, School of Education, University of Massachusetts at Amherst, 1971. Hambleton, R. K., & Traub, R. E. An individual differences model for multi-dimensional scaling, Fortran IV program for the IBM 7094 Computer. Department of Measurement and Evaluation, The Ontario Institute for Studies in Education, 1969. Liang, T., Han, K. T., & Hambleton, R. K. (in press). ResidPlots-2: Computer software for IRT graphical residual analyses. Applied Psychological Measurement. Murray, L., Hambleton, R. K., & Simon, R. A Fortran IV program to carry out residual analyses for logistic test models. Laboratory of Psychometric and Evaluative Research, School of Education, University of Massachusetts at Amherst, 1982. (Revised, June 1988) Rogers, H. J., & Hambleton, R. K. A program to conduct IRT item bias investigations. 
Laboratory of Psychometric and Evaluative Research, School of Education, University of Massachusetts at Amherst, 1987. Rogers, H. J., & Hambleton, R. K. (1994). MH: A Fortran V program to compute the Mantel-Haenszel statistic for detecting differential item functioning. Educational and Psychological Measurement, 54(1), 101-104. Rovinelli, R., & Hambleton, R. K. (1972). A general Fortran IV program for the analysis of semantic differential data. Behavioral Science, 17, 74.


Sheehan, D. S., & Hambleton, R. K. (1974). A general Fortran IV test-scoring program. Educational and Psychological Measurement, 34, 169-171.

TEACHING INTERESTS:

Principles of Educational and Psychological Testing, Modern Assessment Practices, Classical Test Theory and Practices, Item Response Theory and Applications, Educational Research Methods, Advanced Measurement Seminar.

PROFESSIONAL AFFILIATIONS:

American Educational Research Association American Psychological Association (Fellow of Divisions 5 and 15) International Association of Applied Psychology National Council on Measurement in Education Northeastern Educational Research Association Psychometric Society Canadian Educational Research Association British Psychological Society

COMPLETED STUDIES:

(a) Dissertations

The effects of item order and anxiety on test performance and stress. Unpublished master's thesis, University of Toronto, 1968. Empirical investigation of the Rasch test-theory model. Unpublished doctoral dissertation, University of Toronto, 1969.

(b) Publications

Allalouf, A., Hambleton, R. K., & Sireci, S. (1999). Identifying the causes of DIF in translated verbal items. Journal of Educational Measurement, 36(3), 185-198. Avis, N. E., Smith, K. W., Hambleton, R. K., et al. (1996). Development of the multidimensional index of life quality: A quality of life measure for cardiovascular disease. Medical Care, 34(11), 1102-1120. Avis, N. E., Smith, K. W., Mayer, K. H., Swislow, L., & Hambleton, R. K. (2001). The multidimensional quality of life questionnaire for persons with HIV/AIDS: Development and evaluation (Final Report). Newton, MA: NERI. Bartram, D., & Hambleton, R. K. (Eds.). (2006). Computer-based testing and the internet: Issues and advances. New York: Wiley.


Boulet, J., Friedman, M., Hambleton, R. K., Burdick, W., & Ziv, A. (1997). Assessing the adequacy of the post-encounter written scores in standardized patient exams. In A. Scherpbier, C. van der Vleuten, & J. Rethans (Eds.), Proceedings of the Seventh Ottawa Conference on Medical Education (pp. 410-412). Dordrecht, The Netherlands: Kluwer Academic Publishers. Boulet, J. R., Friedman Ben-David, M., Hambleton, R. K., Burdick, W., Ziv, A., & Gary, N. E. (1998). An investigation of the sources of measurement error in the post-encounter written scores from standardized patient examinations. Advances in Health Science Education, 3, 89-100. Boulet, J. R., McKinley, D. W., Whelan, G. P., & Hambleton, R. K. (2003). Quality assurance methods for performance-based assessments. Advances in Health Sciences Education, 8, 27-47. Boulet, J. R., McKinley, D. W., Whelan, G. P., & Hambleton, R. K. (2003). The effect of task exposure on repeat candidate scores in a high stakes performance assessment. Teaching and Learning in Medicine, 15, 227-232. Boulet, J. R., McKinley, D. W., Whelan, G. P., van Zanten, M., & Hambleton, R. K. (2002). Clinical skills deficiencies among first-year residents: Utility of the ECFMG clinical skills assessment. Academic Medicine, 77, S33-S35. Bourque, M. L., & Hambleton, R. K. (1993). Measurement issues in setting standards on NAEP. Measurement and Evaluation in Counselling and Development, 26(1), 41-47. Caban, J. P., Hambleton, R. K., Coffing, D. G., Conway, M. T., & Swaminathan, H. (1978). Mental imagery as an approach to spelling instruction. Journal of Experimental Education, 46, 15-21. Clauser, B., Mazor, K., & Hambleton, R. K. (1993). The effects of purification of the matching criterion on the identification of DIF using the Mantel-Haenszel procedure. Applied Measurement in Education, 6, 269-280. Clauser, B., Mazor, K. M., & Hambleton, R. K. (1994). The effects of score group width on the Mantel-Haenszel procedure. 
Journal of Educational Measurement, 31(1), 67-78. Clauser, B. E., Mazor, K., & Hambleton, R. K. (1991). The influence of test homogeneity on the identification of DIF test items using the Mantel-Haenszel procedure. Applied Psychological Measurement, 15(4), 353-359. de Gruijter, D. N. M., & Hambleton, R. K. (1983). Using logistic test models in criterion-referenced test item selection. In R. K. Hambleton (Ed.), Applications of item response theory. Vancouver, BC: Educational Research Institute of British Columbia. de Gruijter, D. N. M., & Hambleton, R. K. (1984). On problems encountered using decision theory to set cut-off scores. Applied Psychological Measurement, 8, 1-8.


de Gruijter, D. N. M., & Hambleton, R. K. (1984). Reply to van der Linden's "Thoughts on the Use of Decision Theory to Set Cut-off Scores." Applied Psychological Measurement, 8, 19-20. Fernandez-Ballesteros, R., Hambleton, R. K., & van de Vijver, F. (1999). EXCELSA protocol adaptation procedures. In J. J. F. Schroots, R. Fernandez-Ballesteros, & G. Rudinger (Eds.), Aging in Europe (pp. 169-184). Amsterdam: IOS Press. Friedman, M., Boulet, J. R., Burdick, W. P., Ziv, A., Hambleton, R. K., & Gary, N. E. (1997). Issues of validity and reliability concerning who scores the post-encounter patient progress note. Academic Medicine, 72(10), 579-581. Gifford, J. A., & Hambleton, R. K. (1981). Construction and use of criterion-referenced tests in program evaluation studies. Academic Psychology Bulletin, 3, 411-436. Goodman, D., & Hambleton, R. K. (2004). Student test score reports and interpretive guides: Review of current practices for future research. Applied Measurement in Education, 17, 145-220. Goodman, D., & Hambleton, R. K. (2005). Some misconceptions about large-scale educational assessments. In R. Phelps (Ed.), Defending standardized testing (pp. 91-110). Mahwah, NJ: Erlbaum. Gorth, W. P., & Hambleton, R. K. (1972). Measurement considerations for criterion-referenced testing and special education. Journal of Special Education, 6, 303-314. Green, L. W., Cook, T., Doster, M. E., Fors, S. W., Hambleton, R. K., Smith, A., & Walberg, H. J. (1985). Thoughts from the School Health Education Evaluation Advisory Panel. Journal of School Health, 55, 300. Gumpert, R., & Hambleton, R. K. (1979). Situational leadership: How Xerox managers fine tune managerial styles to employee maturity and task needs. Management Review, 6, 303-314. Haley, S. M., Ni, P., Hambleton, R. K., Slavin, M. D., & Jette, A. M. (2006). Computer-adaptive testing improves accuracy and precision of scores over random item selection in a physical functioning item bank. 
Journal of Clinical Epidemiology, 59, 1174-1182. Hambleton, R. K. (1973). Collection of various psychometric and technological area bibliographies. JSAS Catalog of Selected Documents in Psychology, 3, 93. (240 pages) Hambleton, R. K. (1974). Assessing student progress: A criterion-referenced measurement approach. In D. W. Allen & J. Hecht (Eds.), Controversies in education (pp. 370-376). New York: Saunders. Hambleton, R. K. (1977). Some comments on Aikenhead's "New Methodology for Test Construction." Journal of Research in Science Teaching, 14, 473-474.


Hambleton, R. K. (1978). Development and validation of criterion-referenced tests and using and reporting of test score information for classroom teachers. Proceedings of the Fifth Annual Conference on Measurement and Evaluation. Los Angeles: Los Angeles County Public Schools. Hambleton, R. K. (1978). On the use of cut-off scores with criterion-referenced tests in instructional settings. Journal of Educational Measurement, 25, 277-290. Hambleton, R. K. (1979). Latent trait models and applications. In R. E. Traub (Ed.), New directions for testing and measurement: Analysis of test data (pp. 13-32). San Francisco: Jossey-Bass. Hambleton, R. K. (1980). Test score validity and standard-setting. In R. Berk (Ed.), Criterion-referenced testing: State of the art. Baltimore: Johns Hopkins University Press. Hambleton, R. K. (1980). Latent ability scales: Interpretations and uses. In S. Mayo (Ed.), New directions for testing and measurement: Interpreting test scores (pp. 73-97). San Francisco: Jossey-Bass. Hambleton, R. K. (Ed.). (1980). Contributions to criterion-referenced testing technology. Applied Psychological Measurement, 4, 421-581. (Special Issue) Hambleton, R. K. (1982). Latent trait model contributions to criterion-referenced testing technology (Final Report F33615-79-C-0020). Lowry AFB: Air Force Human Resources Laboratory. Hambleton, R. K. (1982). Utilization of item response models with NAEP exercise results (Final Report). Washington, DC: National Institute of Education. Hambleton, R. K. (1982). Competency-based education. The World Book Encyclopedia. Chicago: World Book-Childcraft International, Inc. Hambleton, R. K. (1982). Advances in criterion-referenced testing technology. In C. Reynolds & T. Gutkin (Eds.), Handbook of school psychology. New York: Wiley. Hambleton, R. K. (1983). Application of item response models to criterion-referenced assessment. Applied Psychological Measurement, 7, 33-44. Hambleton, R. K. (Ed.). (1983). 
Applications of item response theory. Vancouver, BC: Educational Research Institute of British Columbia. Hambleton, R. K. (1984). Criterion-referenced measurement. In T. Husen & T. N. Postlethwaite (Eds.), International encyclopedia of education: Research and studies. New York: Pergamon Press. (Reprinted in M. Eraut [Ed.], The international encyclopedia of educational technology. New York: Pergamon Press. Reprinted in J. P. Keeves [Ed.], Educational research, methodology, & measurement: An international handbook. New York: Pergamon Press, 1988.)


Hambleton, R. K. (1984). Validating the test scores. In R. Berk (Ed.), A guide to criterion-referenced test construction (pp. 199-230). Baltimore, MD: The Johns Hopkins University Press. Hambleton, R. K. (1984). Determining suitable test lengths. In R. Berk (Ed.), A guide to criterion-referenced test construction (pp. 144-168). Baltimore, MD: The Johns Hopkins University Press. Hambleton, R. K. (1984). Using microcomputers to develop tests. In M. Hiscox & E. Brzezinski (Eds.), Educational measurement: Issues and practice, 3, 10-14. Hambleton, R. K. (1984). Item response theory. Professional Examination Service Quarterly Newsletter. New York: Professional Examination Service. Hambleton, R. K. (1984). Commentary. Professions Education Researcher Notes, 6, 9-10. Hambleton, R. K. (1985). New technical advances in measurement for certification exams. In Proceedings of the National Conference on Continuing Competence Assurance in the Health Professions (pp. 102-110). Washington, DC: The National Commission for Health Certifying Agencies. Hambleton, R. K. (1985). A review of the Nelson-Denny Reading Test. In R. C. Sweetland & D. N. Keyser (Eds.), Test critiques: Volume III. Kansas City: Test Corporation of America. (Reprinted in R. C. Sweetland and D. N. Keyser [Eds.], Test Critiques Applied Topics. Kansas City: Test Corporation of America, 1988.) Hambleton, R. K. (1985). Criterion-referenced assessment of individual differences. In C. Reynolds & V. L. Willson (Eds.), Methodological and statistical advances in the study of individual differences (pp. 393-424). New York: Plenum Press. Hambleton, R. K. (1986). The validity of NAPM's Certified Purchasing Management process. Journal of Purchasing and Materials Management, 2-10. Hambleton, R. K. (1986). The changing conception of measurement: A commentary. Applied Psychological Measurement, 10, 415-421. Hambleton, R. K. (Ed.). (1986). Standards for educational and psychological testing: Six reviews.
Journal of Educational Measurement, 23(1), 83-98. Hambleton, R. K. (1987). Computerized adaptive testing: Theory, applications, and standards. Bulletin of the International Test Commission, 14, 5-18. Hambleton, R. K. (1987). The three-parameter logistic model. In D. L. McArthur (Ed.), Alternative approaches to the assessment of achievement (pp. 129-158). Boston: Kluwer Academic Publishers. Hambleton, R. K. (1987). Evaluating criterion-referenced tests. ERIC Digest Series. Princeton, NJ: ERIC Clearinghouse of Tests, Measurement, and Evaluation.


Hambleton, R. K. (1987). Determining optimal test lengths with a fixed total testing time. Educational and Psychological Measurement, 47, 339-347. Hambleton, R. K. (1988). A review of Iowa Tests of Basic Skills, Forms G and H. In D. J. Keyser & R. C. Sweetland (Eds.), Test critiques: Volume VI. Kansas City: Test Corporation of America. (Reprinted in D. J. Keyser and R. C. Sweetland [Eds.], Test Critiques Applied Topics. Kansas City: Test Corporation of America, 1988.) Hambleton, R. K. (1989). Principles and applications of item response theory. In R. L. Linn (Ed.), Educational measurement (3rd edition, pp. 147-200). New York: Macmillan. Hambleton, R. K. (Ed.). (1989). Applications of item response theory. International Journal of Educational Research, 13, 121-220. Hambleton, R. K. (1991). Issues to be considered in the content validity portions of RFPs for large-scale assessment programs. In P. Aschbacher & E. L. Baker (Eds.), Improving large-scale assessment. Los Angeles, CA: Center for Research on Evaluation, Standards and Student Testing, UCLA. Hambleton, R. K. (1989). Item response theory models and methods for measurement in exercise science and sport. In M. J. Safrit (Ed.), Measurement theory and practice in exercise science and sport (pp. 1-29). Madison, WI: University of Wisconsin Press. Hambleton, R. K. (1989). Constructing tests with item response models: A discussion of methods and two problems. Bulletin of the International Test Commission, 16, 96-106. Hambleton, R. K. (1989). Preparation of exam items for the Uniform CPA Examination (Final Report). New York: American Institute of Certified Public Accountants. Hambleton, R. K. (1989). Portrait, notice biographique et bibliographique. Revue de Psychologie Appliquée, 39(4), 309-323. Hambleton, R. K. (1990). Other objective formats. In AICPA, Uniform CPA examination item writer's guide (Chapter 3, pp. 22-43). New York: American Institute of Certified Public Accountants. Hambleton, R. K. (1990).
Setting achievement levels for the 1990 NAEP mathematics assessment: Handbook for judges. Washington, DC: National Assessment Governing Board. Hambleton, R. K. (1990). Criterion-referenced testing methods and practices. In T. Gutkin & C. Reynolds (Eds.), Handbook of school psychology (2nd ed.; pp. 388-414). New York: Wiley. Hambleton, R. K. (1990). Item response theory: Introduction and bibliography. Psicothema, 2(1), 97-107.


Hambleton, R. K. (1990). Criterion-referenced measurement in student and curriculum evaluation. In A. Lewy (Ed.), International Encyclopedia of Curriculum. New York: Pergamon Press. Hambleton, R. K. (1990). Criterion-referenced assessment in evaluation. In H. J. Walberg and G. D. Haertel (Eds.), The International Encyclopedia of Educational Evaluation. New York: Pergamon Press. Hambleton, R. K. (Ed.). (1991). Test translations for cross-cultural studies. Bulletin of the International Test Commission, 18, 1-101. Hambleton, R. K. (1991). Individualized criterion-referenced testing (Technical Manual). Tulsa, OK: Educational Development Corporation. Hambleton, R. K. (1992). What skills do teachers need in educational testing? In D. Bateson (Ed.), Classroom testing in Canada, Proceedings of the Second Invitational Conference on Classroom Testing (pp. 91-96). Vancouver, BC: University of British Columbia. Hambleton, R. K. (1992). Measurement advances to address educational policy questions. In T. J. Plomp, J. M. Pieters, & A. Feteris (Eds.), Book of summaries: European Conference on Educational Research (pp. 681-684). Enschede, The Netherlands: University of Twente. Hambleton, R. K. (1992). Setting standards on national tests. International Journal of Psychology, 27, 570. (Abstract). Hambleton, R. K. (1992). Test translations for cross-cultural studies. In B. Wilpert, H. Motoaki, & J. Misumi (Eds.), Proceedings of the 22nd International Congress of Applied Psychology (pp. 271-275). Hillsdale, NJ: Erlbaum. Hambleton, R. K. (1992). The uses of international data in setting achievement levels (Final Report). Washington, DC: National Center for Educational Statistics. Hambleton, R. K. (1992). Item response theory: Measurement for the 1990s. CLEAR Exam Review, Winter, 18-20. Hambleton, R. K. (1992). Fitting item response models to the Series 7 Examination and equating test scores. Amherst, MA: Psychometric and Evaluative Research Services, Inc. Hambleton, R. K. (1993). 
International Test Commission: Organization, goals, and current projects. European Journal of Psychological Assessment, 9(1), 54-56. Hambleton, R. K. (1993). Translating achievement tests for use in cross-national studies. European Journal of Psychological Assessment, 9(1), 57-68. Hambleton, R. K. (1993). Summary of conference on test use with children and youth. European Review of Applied Psychology, 43, 261-262.


Hambleton, R. K. (1994). Municipal Securities Rulemaking Board guide to item writing and review. Washington, DC: MSRB. (65 pages.) Hambleton, R. K. (1994). Rise and fall of criterion-referenced measurement? Educational Measurement: Issues and Practice, 13(4), 21-26. Hambleton, R. K. (1994). Item response theory: A broad psychometric framework for measurement advances. Psicothema, 6(3), 535-556. Hambleton, R. K. (1994). Guidelines for adapting educational and psychological tests: A progress report. European Journal of Psychological Assessment, 10(3), 229-244. Hambleton, R. K. (1995). Meeting the measurement challenges of the 1990s and beyond: New assessment models and methods. In T. Oakland & R. K. Hambleton (Eds.), International perspectives on academic assessment (pp. 83-104). Boston, MA: Kluwer Academic Publishers. Hambleton, R. K. (1995). Criterion-referenced measurement. In T. Husen & T. N. Postlethwaite (Eds.), International Encyclopedia of Education (2nd ed.; pp. 1183-1189). New York: Pergamon Press. Hambleton, R. K. (1995). Setting standards on criterion-referenced tests. In T. Husen & T. N. Postlethwaite (Eds.), International Encyclopedia of Education (2nd ed.; pp. 5721-5726). New York: Pergamon Press. Hambleton, R. K. (1996). Adapting psychological tests: technical guidelines for improving practices. International Journal of Psychology, 31(3), 439. (Abstract) Hambleton, R. K. (1996). Advances in assessment models, methods, and practices. In D. Berliner & R. Calfee (Eds.), Handbook of educational psychology (pp. 899-925). New York: Macmillan. Hambleton, R. K. (1996). New models and methods for psychological tests. Contemporary Group Care Practice Research and Evaluation, 6(1), 34-41. Hambleton, R. K. (1996). Adapting tests for use in multiple languages and cultures. In J. Muñiz (Ed.), Psicometria (pp. 207-238). Madrid: Editorial Universitas, S.A. Hambleton, R. K. (1997). The future of educational assessment: likely directions and technical problems to overcome.
NERA Researcher, 35(3), 6-9. Hambleton, R. K. (1997). Measurement quality of the Kentucky Instructional Results Information System (KIRIS), 1991-1994. In J. Millman (Ed.), Grading teachers, grading schools (pp. 210-218). Newbury Park, CA: Corwin Press. Hambleton, R. K. (1998). Future directions in item response modeling and applications. In J. Muñiz (Ed.), Introducción a la Teoría de respuesta a los ítems. Madrid: Ediciones Pirámide, S.A.


Hambleton, R. K. (1998). Setting performance standards on achievement tests. In L. H. Hansche (Ed.), Handbook for the development of performance standards: Meeting the requirements of Title I. Washington, DC: U.S. Department of Education. Netherlands: IEA. Hambleton, R. K. (1998). Criterion-referenced testing principles, technical advances, and evaluation guidelines. In C. Reynolds & T. Gutkin (Eds.), Handbook of school psychology (3rd ed., pp. 409-434). New York: Wiley. Hambleton, R. K. (1998). Enhancing the validity of NAEP achievement level score reporting. In M. L. Bourque (Ed.), Proceedings of the Achievement Level Workshop (pp. 77-98). Washington, DC: National Assessment Governing Board. Hambleton, R. K. (1999). Politicians fail, not the teachers. Education Connection, Winter Issue, 19-22. Hambleton, R. K. (2000). International Test Commission. In A. E. Kazdin (Ed.), Encyclopedia of Psychology. New York: Oxford University Press. Hambleton, R. K. (2000). Emergence of item response modeling in instrument development and data analysis. Medical Care, 38(9), II 60-65. Hambleton, R. K. (Ed.). (2000). Advances in performance assessment methodology. Applied Psychological Measurement, 24(4), 291-378. Hambleton, R. K. (2001). Growing problems in applied psychology: Limited training in assessment. IAAP Newsletter, 13(1), 11-12. Hambleton, R. K. (2001). Setting performance standards on educational assessments and criteria for evaluating the process. In G. Cizek (Ed.), Setting performance standards: Concepts, methods, and perspectives (pp. 89-116). Hillsdale, NJ: Lawrence Erlbaum Associates. Hambleton, R. K. (2001). The next generation of the ITC test translation and adaptation guidelines. European Journal of Psychological Assessment, 17(3), 164-172. Hambleton, R. K. (2002). How will we understand and use test score information? In R. W. Lissitz & W. D. Schafer (Eds.), Assessments in Educational Reform (pp. 192-205). Boston: Allyn and Bacon. Hambleton, R. K. (2002).
New computer-based technical issues: Developing items, pretesting, test security, and item exposure. In C. Mills et al. (Eds.), Computer-based testing: Building the foundation for future assessments (pp. 193-203). Mahwah, NJ: Lawrence Erlbaum Publishers. Hambleton, R. K. (2002). Adapting achievement tests into multiple languages for international assessments. In A. Porter & A. Gamoran (Eds.), Methodological advances in large-scale cross-national education surveys (pp. 58-79). Washington: National Academy of Sciences.


Hambleton, R. K. (2003). Criterion-referenced testing: Methods and procedures. In R. Fernandez-Ballesteros (Ed.), Encyclopedia of psychological assessment (pp. 280-283). London: Sage. Hambleton, R. K. (2003). Setting passing scores on tests . . . not too high . . . not too low . . . but just about right. Education Connection, pp. 11-14. Hambleton, R. K. (2004). Theory, methods, and practices in testing for the 21st century. Psicothema, 16, 696-701. Hambleton, R. K. (2005). Issues, designs, and technical guidelines for adapting tests in multiple languages. In R. K. Hambleton, P. Merenda, & C. Spielberger (Eds.), Adapting educational and psychological tests for cross-cultural assessment (pp. 3-38). Hillsdale, NJ: Lawrence Erlbaum Associates. Hambleton, R. K. (2005). Applications of item response theory. In J. Lipscomb, C. C. Gotay, & C. Snyder (Eds.), Outcomes of assessment in cancer (pp. 445-464). Cambridge, UK: Cambridge University Press. Hambleton, R. K. (2005). Foreword. In W. J. van der Linden, Models for optimal test design (pp. i-v). New York: Springer-Verlag. Hambleton, R. K. (2005). Biography of Frederic Lord. In B. Everitt & D. Howell (Eds.), Encyclopedia of Statistics in Behavioral Science (pp. 1104-1106). West Sussex, UK: John Wiley & Sons. Hambleton, R. K. (2006). Psychometric models, test designs and item types for the next generation of educational and psychological tests. In D. Bartram & R. K. Hambleton (Eds.), Computer-based testing and the internet: Issues and advances (pp. 77-90). New York: Wiley. Hambleton, R. K. (2006). Good practices for identifying differential item functioning. Medical Care, 44(11), 182-188. Hambleton, R. K. (2006, winter). An interview with Ronald Hambleton. People and Organizations@Work, 1-2, 13. Hambleton, R. K., Anderson, G. E., & Murray, L. (1983). Applying micro-computers to classroom testing practices. In W. Hathaway (Ed.), New directions for testing and measurement: Testing in the schools. San Francisco: Jossey-Bass.
Hambleton, R. K., & Bollwark, J. (1991). Adapting tests for use in different cultures: Technical issues and methods. Bulletin of the International Test Commission, 18, 3-32. Hambleton, R. K., Bollwark, J., & Traub, R. E. (1990). NCME Publication Survey Results. Educational Measurement: Issues and Practice, 9(1), 17-18. Hambleton, R. K., & Bourque, M. L. (1991). Initial performance standards for the 1990 NAEP Mathematics Assessment (Technical Report). Washington, DC: National Assessment Governing Board. (403 pages)


Hambleton, R. K., Brennan, R. L., Brown, W., Dodd, B., Forsythe, R. A., Mehrens, W. A., Nellhaus, J., Reckase, M., Rindone, D., van der Linden, W. J., & Zwick, R. (2000). A response to Setting Reasonable and Useful Performance Standards in the National Academy of Sciences Grading the Nation's Report Card. Educational Measurement: Issues and Practice, 19, 5-13. Hambleton, R. K., Clauser, B. E., Mazor, K. M., & Jones, R. W. (1993). Advances in the detection of differentially functioning test items. European Journal of Psychological Assessment, 9(1), 1-18. Hambleton, R. K., & Cook, L. L. (1977). Latent trait models and their use in analyzing educational test data. Journal of Educational Measurement, 14, 75-96. Hambleton, R. K., & Cook, L. L. (1983). The robustness of item response models and effects of test length and sample size on the precision of ability estimates. In D. Weiss (Ed.), New horizons in testing (pp. 33-49). New York: Academic Press. Hambleton, R. K., & Cook, L. L. (1984). The robustness of latent trait models. In D. Weiss (Ed.), Proceedings of the 1979 Computerized Adaptive Testing Conference. Minneapolis, MN: University of Minnesota. Hambleton, R. K., & de Gruijter, D. N. M. (1983). Application of item response models to criterion-referenced test item selection. Journal of Educational Measurement, 20, 355-367. Hambleton, R. K., & de Jong, J. (Eds.). (2003). Advances in translating and adapting educational and psychological tests: A special issue. Language Testing, 20(2), 127-134. Hambleton, R. K., & Dirir, M. (2003). Classical and modern item analysis. In R. Fernandez-Ballesteros (Ed.), Encyclopedia of psychological assessment (pp. 188-192). London: Sage. Hambleton, R. K., Dirir, M., & De Brisay, M. (1993). New measurement models and methods for constructing language tests. Carlton Papers in Applied Language Studies, 10, 63-81. Hambleton, R. K., & Eignor, D. R. (1977). Adaptive testing applied to hierarchically structured objectives-based curricula.
In D. Weiss (Ed.), Proceedings of the Second Computerized Adaptive Testing Conference. Minneapolis, MN: University of Minnesota. Hambleton, R. K., & Eignor, D. R. (1978). Guidelines for evaluating criterion-referenced tests and test manuals. Journal of Educational Measurement, 15, 321-327. Hambleton, R. K., & Eignor, D. R. (1979). Competency test development, validation, and standard-setting. In R. M. Jaeger & C. Tittle (Eds.), Minimum competency achievement testing. Berkeley, CA: McCutchan Publishing Co.


Hambleton, R. K., Eignor, D. R., & Rovinelli, R. (1979). Toward better achievement tests and test score interpretations in PSI courses. Journal of Personalized Instruction, 3, 180-186. Hambleton, R. K., & Fennessy, L. (1994). Progrès techniques dans le développement d'examens d'accréditation. Mesure et évaluation en éducation, 17(2), 83-106. Hambleton, R. K., & Fennessy, L. M. (1995). Technical advances in credentialing examination development. In D. Laveault, B. D. Zumbo, M. E. Gessaroli, & M. W. Boss (Eds.), Modern theories of measurement: Problems and issues (pp. 279-303). Ottawa, Canada: University of Ottawa Press. Hambleton, R. K., Gorth, W. P., & O'Reilly, R. P. (1973). An application of an evaluation model for classroom instruction. Journal of Educational Systems, 2, 117-131. (In T. T. Liao & D. C. Miller [Eds.], [1978]. Systems approach to instructional design. Farmingdale, NY: Baywood Publishing Co.) Hambleton, R. K., Gower, C., & Bollwark, J. (1988). Assessing higher order thinking skills. Proceedings of the 29th Annual Conference of the Military Testing Association (pp. 628-633). Ottawa, Canada. Hambleton, R. K., & Gumpert, R. (1982). Validity of Hersey-Blanchard's theory of leader effectiveness. Group and Organizational Studies, 7, 225-242. Hambleton, R. K., & Han, N. (2005). Assessing the fit of IRT models to educational and psychological test data: A five step plan and several graphical displays. In W. R. Lenderking & D. Revicki (Eds.), Advances in health outcomes research methods, measurement, statistical analysis, and clinical applications (pp. 57-78). Washington: Degnon Associates. Hambleton, R. K., Hutten, L., & Swaminathan, H. (1976). A comparison of several methods for assessing student mastery in objectives-based instructional programs. Journal of Experimental Education, 45, 57-64. Hambleton, R. K., Impara, J., Mehrens, W., Plake, B. S., Pitoniak, M. J., Zenisky, A. L., & Smith, L. F. (2000).
Psychometric review of the Maryland School Performance Assessment Program (Final Report). Baltimore, MD: Abell Foundation. Hambleton, R. K., Jaeger, J., Koretz, D., Linn, R. L., Millman, J., & Phillips, S. (1995, June). A review of the measurement quality of the Kentucky Instructional Results Information System (Final Report). Frankfort, KY: Office of Educational Accountability. Hambleton, R. K., Jaeger, R., Plake, B. S., & Mills, C. N. (2000). Setting performance standards on complex educational assessments. Applied Psychological Measurement, 24(4), 355-366. Hambleton, R. K., & Jirka, S. (2004). How to do your best on standardized tests: Some suggestions for adult learners. Adventures in Assessment, 16, 5-12.


Hambleton, R. K., & Jirka, S. (2006). Anchor-based methods for judgmentally estimating item statistics. In S. Downing & T. Haladyna (Eds.), Handbook of test development (pp. 399-420). Mahwah, NJ: Lawrence Erlbaum Publishers. Hambleton, R. K., & Jodoin, M. (2003). Item response theory: Models and features. In R. Fernandez-Ballesteros (Ed.), Encyclopedia of psychological assessment (pp. 509-514). London: Sage. Hambleton, R. K., & Jones, R. W. (1992). International impact of IRT models on testing practices. (Abstract). International Journal of Psychology, 27, 371. Hambleton, R. K., & Jones, R. W. (1993). Comparison of classical test theory and item response theory and their applications to test development. Educational Measurement: Issues and Practice, 12(3), 38-47. Hambleton, R. K., & Jones, R. W. (1994). Item parameter estimation errors and their influence on test information functions. Applied Measurement in Education, 7(3), 171-186. Hambleton, R. K., & Jones, R. W. (1994). Comparison of empirical and judgmental methods for detecting differential item functioning. Educational Research Quarterly, 18(1), 21-36. Hambleton, R. K., Jones, R. W., & Rogers, H. J. (1993). Influence of item parameter estimation errors in test development. Journal of Educational Measurement, 30(2), 143-155. Hambleton, R. K., & Jurgensen, C. (1990). Criterion-referenced assessment of school achievement. In C. R. Reynolds & T. W. Kamphaus (Eds.), Handbook of psychological and educational assessment of children: Volume 1, intelligence and achievement (pp. 456-476). New York: The Guilford Press. Hambleton, R. K., & Kanjee, A. (1995). Translating tests and attitude scales. In T. Husen & T. N. Postlethwaite (Eds.), International Encyclopedia of Education (2nd ed.; pp. 6328-6334). New York: Pergamon Press. Hambleton, R. K., & Kanjee, A. (1995). Increasing the validity of cross-cultural assessments: use of improved methods for test adaptations. 
European Journal of Psychological Assessment, 11(3), 147-157. Hambleton, R. K., & Li, S. (2005). Statistical audit of the ABCTE professional teaching knowledge, elementary education, English/language arts and secondary mathematics tests. Leesburg, VA: Mid-Atlantic Psychometric Services. Hambleton, R. K., & Li, S. (2005). Translation and adaptation issues and methods for educational and psychological tests. In C. Frisby & C. Reynolds (Eds.), Handbook of multicultural school psychology (pp. 881-903). New York: Wiley.


Hambleton, R. K., & Li, S. (2005). Criterion-referenced testing: Purposes, technical issues and advances. In B. Everitt & D. Howell (Eds.), Encyclopedia of Statistics in Behavioral Science (pp. 435-440). West Sussex, UK: John Wiley & Sons. Hambleton, R. K., & Ma, X. (2003). Investigation of IRT model fit and equating for the National Board of Chiropractic Examiners (Final Report). Greeley, CO: NBCE. Hambleton, R. K., Malaka, M., & Jones, R. W. (1994). Teachers' handbook on achievement testing. Arlington, VA: Institute for International Research. Hambleton, R. K., & Martois, J. (1983). Evaluation of a test score prediction system based upon item response model principles and procedures. In R. K. Hambleton (Ed.), Applications of item response theory (pp. 196-211). Vancouver, BC: Educational Research Institute of British Columbia. Hambleton, R. K., & Meara, K. (2000). Newspaper coverage of NAEP results - 1990 to 1998. In M. L. Bourque & S. Byrd (Eds.), Student performance standards on the National Assessment of Educational Progress (pp. 133-155). Washington, DC: National Assessment Governing Board. Hambleton, R. K., Merenda, P., & Spielberger, C. (Eds.). (2005). Adapting educational and psychological tests for cross-cultural assessment. Mahwah, NJ: Lawrence Erlbaum. Hambleton, R. K., Mills, C. N., & Simon, R. (1983). Determining the lengths for criterion-referenced tests. Journal of Educational Measurement, 20, 27-38. Hambleton, R. K., & Murphy, E. (1991). Changes in educational testing practices. The Kamehameha Journal of Education, 2(2), 17-26. Hambleton, R. K., & Murphy, E. (1992). A psychometric perspective on authentic measurement. Applied Measurement in Education, 5(1), 1-16. Hambleton, R. K., & Murray, L. N. (1983). Goodness-of-fit investigations with item response models. In R. K. Hambleton (Ed.), Applications of item response theory (pp. 71-94). Vancouver, BC: Educational Research Institute of British Columbia. Hambleton, R. K., & Murray, L. N. (1984).
Testing in the United States with microcomputers. Bulletin of the International Test Commission, 11, 17-24. Hambleton, R. K., & Novick, M. R. (1973). Toward an integration of theory and method for criterion-referenced tests. Journal of Educational Measurement, 10, 159-170. (Also published as ACT Research Report No. 53. Iowa City, IA: American College Testing Program, 1972.) Hambleton, R. K., & Oakland, T. (1993). International Test Commission: Goals, activities, and membership. Psychology International, 4(2), 8-9. Hambleton, R. K., & Oakland, T. (Eds.). (2004). Advances in assessment testing and practices. Applied Psychology: International Review, 53(2), 155-259.


Hambleton, R. K., & Patsula, L. (1996). Test adaptations: review of methods and suggestions for additional research. International Journal of Psychology, 31(3), 84. (Abstract) Hambleton, R. K., & Patsula, L. (1998). Adapting tests and questionnaires for use in multiple languages and cultures. Social Indicators Research, 45, 153-171. Hambleton, R. K., & Patsula, L. (1999). Increasing the validity of adapted tests: Myths to be avoided and guidelines for improving test adaptation practices. Journal of Applied Testing Technology, 1, 1-16. Hambleton, R. K., Peele, H. A., Swaminathan, H., & Sawyer, J. (1973). The Jencks-saw puzzle: Sorting out relationships among schooling, cognitive skills, and income. Meforum, 1, 23-33. Hambleton, R. K., & Pitoniak, M. J. (2002). Testing and measurement. In J. Wixted (Ed.), Stevens' handbook of experimental psychology (3rd ed., pp. 517-561). New York: John Wiley and Sons. Hambleton, R. K., & Pitoniak, M. J. (2006). Setting performance standards. In R. L. Brennan (Ed.), Educational measurement (4th ed.). Westport, CT: American Council on Education/Praeger. Hambleton, R. K., & Plake, B. (1995). Using an extended Angoff procedure to set standards on complex performance assessments. Applied Measurement in Education, 8(1), 41-55. Hambleton, R. K., & Powell, S. (1983). A framework for viewing the process of standard-setting. Evaluation and the Health Professions, 6, 3-24. Hambleton, R. K., Roberts, D. M., & Traub, R. E. (1970). A comparison of the reliability and validity of two methods for assessing partial knowledge of a multiple-choice test. Journal of Educational Measurement, 7, 75-82. Hambleton, R. K., Robin, R., & Xing, D. (2000). Item response models for the analysis of educational and psychological data. In H. E. A. Tinsley & S. Brown (Eds.), Handbook of applied multivariate statistics and mathematical modeling (pp. 553-581). New York: Academic Press. Hambleton, R. K., & Rogers, H. J. (1986).
Advances in preparing certification and licensure examinations. Evaluation and the Health Professions, 9, 205-229. Hambleton, R. K., & Rogers, H. J. (1989). Design of an item bias review form: Issues and questions (Final Report). Albany, NY: Department of Education. (ERIC Clearinghouse on Tests, Measurements, and Evaluation: TM012649) Hambleton, R. K., & Rogers, H. J. (1989). Detecting biased test items: Comparison of the IRT area and Mantel-Haenszel methods. Applied Measurement in Education, 2, 313-334.


Hambleton, R. K., & Rogers, H. J. (1989). Solving criterion-referenced measurement problems with item response models. International Journal of Educational Research, 13, 145-160. Hambleton, R. K., & Rogers, H. J. (1989). Die anwendung von item-response-modellen in nationalen lernerfolgsmessungen. In J. K. Ingekamp & W. H. Schreiber (Eds.), Was wissen unsere Schüler? (pp. 267-310). Weinheim: Deutscher Studien Verlag. Hambleton, R. K., & Rogers, H. J. (1990). Using item response models in educational assessments. In W. H. Schreiber & K. Ingekamp (Eds.), International developments in large-scale assessment (pp. 155-184). Windsor, UK: NFER-Nelson. Hambleton, R. K., & Rogers, H. J. (1990). Approaches for identifying and understanding bias in test items. (Abstract). In S. E. Newstead, S. H. Irvine, & P. D. Dann (Eds.), Cognition and motivation: Lectures and seminars. Dordrecht, The Netherlands: Kluwer Academic Publishers. Hambleton, R. K., & Rogers, H. J. (1991). Evaluation of the plot method for identifying potentially biased test items. In P. L. Dann, S. H. Irvine, & J. M. Collis (Eds.), Computer-based human assessment (pp. 307-330). Boston, MA: Kluwer Academic Publishers. Hambleton, R. K., & Rogers, H. J. (1991). Advances in criterion-referenced measurement. In R. K. Hambleton & J. Zaal (Eds.), Advances in educational and psychological testing: Theory and applications (pp. 3-41). Boston: Kluwer Academic Publishers. Hambleton, R. K., & Rogers, H. J. (1995). Item bias review (EDO-TM-95-9). Washington, DC: ERIC. Hambleton, R. K., & Rogers, H. J. (2002). A differential item functioning analysis of the National Health Survey (Laboratory of Psychometric and Evaluative Research Report No. 418). Amherst, MA: University of Massachusetts, School of Education. Hambleton, R. K., & Rovinelli, R. (1975). Toward better college grading practices: A framework for research and development. In D. W. Allen, M. A. Melnick, & C. C.
Peelle (Eds.), Reform, renewal, and reward: Improving university teaching. Amherst, MA: Clinic to Improve University Teaching, University of Massachusetts. Hambleton, R. K., & Rovinelli, R. (1986). Assessing the dimensionality of a set of test items. Applied Psychological Measurement, 10, 287-302. Hambleton, R. K., Rovinelli, R., & Gorth, W. P. (1971). Efficiency of various itemexaminee sampling designs for estimating test parameters. Proceedings of the 79th Annual Convention of the American Psychological Association, 5, 121-122. (Summary)

Hambleton, R. K., Rovinelli, R., Sheehan, D., & Newby, J. (1975). A comparative study of middle school students in different instructional programs. JSAS Catalog of Selected Documents in Psychology, 5, 199-200. (130 pages) Hambleton, R. K., & Scarpati, S. (2002). Reform of vocational education and new testing practices in the United States. Indian Journal of Vocational Education, 4, 1-10. Hambleton, R. K., & Sheehan, D. S. (1971). On the evaluation of higher-order science objectives. Science Education, 61, 307-315. Hambleton, R. K., & Simon, R. (1980). National Assessment of Educational Progress social studies and citizenship exercises and their usefulness for improving instruction. In P. L. Williams & J. R. Moore (Eds.), Criterion-referenced testing for the social studies (Bulletin 64). Washington, DC: National Council for the Social Studies. Hambleton, R. K., & Sireci, S. G. (1997). Future directions for norm-referenced and criterion-referenced achievement testing. International Journal of Educational Research, 21, 379-393. Hambleton, R. K., Sireci, S. G., & Robin, F. (1999). Adapting credentialing exams for use in multiple languages. CLEAR Exam Review, 10(1), 24-28. Hambleton, R. K., & Slater, S. C. (1994). NAEP state reports in mathematics: Valuable information for policy-makers. New England Journal of Public Policy, 10(1), 209-222. Hambleton, R. K., & Slater, S. C. (1995, October). Are NAEP executive summary reports understandable to policy-makers and educators? Los Angeles, CA: CRESST, UCLA. Hambleton, R. K., & Slater, S. C. (1997). Item response theory models and testing practices: Current international status and future directions. European Journal of Psychological Assessment, 13(1), 21-28. Hambleton, R. K., & Slater, S. C. (1997). Reliability of credentialing examinations and the impact of scoring models and standard-setting policies. Applied Measurement in Education, 10(1), 19-38. Hambleton, R. K., Slater, S. C., Narayanan, P., & Setiadi, H. (1996). 
Automated test construction: concepts, technical advances, and applications. In J. Muñiz (Ed.), Psicometría (pp. 705-728). Madrid: Editorial Universitas, S. A. Hambleton, R. K., & Stetz, F. P. (1979). The development of objectives-based instructional programs in career education. Journal of Career Education, 5, 220-225. Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: Principles and applications. Boston, MA: Kluwer Academic Publishers.

Hambleton, R. K., & Swaminathan, H. (1985). A look at psychometrics in the Netherlands. Dutch Journal of Psychology, 40, 446-451. Hambleton, R. K., Swaminathan, H., & Algina, J. (1976). Some contributions to the theory and practice of criterion-referenced testing. In D. N. M. de Gruijter & L. J. Th. van der Kamp (Eds.), Advances in psychological and educational measurement (pp. 51-62). New York: Wiley. Hambleton, R. K., Swaminathan, H., Algina, J., & Coulson, D. (1978). Criterion-referenced testing and measurement: A review of technical issues and developments. Review of Educational Research, 48, 1-47. Hambleton, R. K., et al. (1976). Evaluation of student progress and school environment in the Anisa early childhood educational program. Research Relating to Children Bulletin 36 (Abstract). Urbana-Champaign, IL: Educational Resources Information Center/Early Childhood Education, University of Illinois. Hambleton, R. K., Swaminathan, H., & Cook, L. L. (1981). Program evaluation methods and techniques for day care and early childhood program personnel. In D. Streets (Ed.), Administrative handbook for day care and preschool administration. Boston: Allyn and Bacon, Inc. Hambleton, R. K., Swaminathan, H., Cook, L. L., Eignor, D., & Gifford, J. A. (1978). Developments in latent trait theory: A review of models, technical issues, and applications. Review of Educational Research, 48, 467-510. Hambleton, R. K., Swaminathan, H., Gifford, J. A., & Mills, C. (1981). Individualized criterion-referenced testing technical manual. Tulsa, OK: Educational Development Corporation. Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory. Newbury Park, CA: Sage Publications, Inc. Hambleton, R. K., & Traub, R. E. (1971). Information curves and efficiency of three logistic test models. British Journal of Mathematical and Statistical Psychology, 24, 273-281. 
(Summary published in the Proceedings of the 78th Annual Convention of the American Psychological Association, 1970, 4, 121-122.) Hambleton, R. K., & Traub, R. E. (1973). Analysis of empirical data using two logistic latent trait models. British Journal of Mathematical and Statistical Psychology, 26, 195-211. Hambleton, R. K., & Traub, R. E. (1974). The effects of item order on test performance and stress. Journal of Experimental Education, 43, 40-46. Hambleton, R. K., & van der Linden, W. (Eds.). (1982). Technical contributions to item response theory [special issue]. Applied Psychological Measurement, 6, 373-492. Hambleton, R. K., & Wedman, I. (Eds.). (1997). Advances in assessment practices [special issue]. European Journal of Psychological Assessment, 13(1), 1-58.

Hambleton, R. K., & Xing, D. (2006). Optimal and Nonoptimal computer-based test designs for making pass-fail decisions. Applied Measurement in Education, 19(3), 221-239. Hambleton, R. K., Yu, J., & Slater, S. C. (1999). Field test of the ITC guidelines for adapting educational and psychological tests. European Journal of Psychological Assessment, 15(3), 270-276. Hambleton, R. K., & Zaal, J. (Eds.). (1991). Advances in educational and psychological testing: Theory and applications. Boston, MA: Kluwer Academic Publishers. Hambleton, R. K., Zaal, J., & Pieters, J. P. M. (1991). Computerized adaptive testing: Theory, applications, and standards. In R. K. Hambleton & J. Zaal (Eds.), Advances in educational and psychological testing: Theory and applications (pp. 341-366). Boston: Kluwer Academic Publishers. Hambleton, R. K., & Zenisky, A. (2003). Issues and practices of performance assessment. In C. R. Reynolds & T. W. Kamphaus (Eds.), Handbook of psychological and educational assessment of children (2nd ed., pp. 377-404). New York: The Guilford Press. Hambleton, R. K., & Zhao, Y. (2005). Item response theory models for the analysis of dichotomously scored data. In B. Everitt & D. Howell (Eds.), Encyclopedia of Statistics in Behavioral Science (pp. 982-990). West Sussex, UK: John Wiley & Sons. Hersey, P., Blanchard, K. H., & Hambleton, R. K. (1978). Contracting for leadership style: A process and instrumentation for building effective work relationships. In W. W. Burke (Ed.), The cutting edge: Current theory and practice in organization development. La Jolla, CA: University Associates. Jodoin, M., Zenisky, A., & Hambleton, R. K. (2006). Comparison of the psychometric properties of several computer-based test designs for credentialing exams with multiple purposes. Applied Measurement in Education, 19(3), 203-220. Jones, R. W., & Hambleton, R. K. (1992). Recent advances in psychometric methods. Revista Portuguesa de Educacao, 5(2), 1-13. Linn, R. 
L., Drasgow, F., Camara, W., Crocker, L., Hambleton, R. K., Plake, B. S., Stout, W., & van der Linden, W. J. (2002). Computer-based testing: A research agenda. In C. N. Mills, M. T. Potenza, J. J. Fremer, & W. C. Ward (Eds.), Computer-based testing: Building the foundation for future assessments (pp. 289-300). Mahwah, NJ: Lawrence Erlbaum Publishers. Linn, R. L., & Hambleton, R. K. (1991). Customized tests and customized test norms. Applied Measurement in Education, 4(3), 185-207. Lu, Y., & Hambleton, R. K. (2004). Statistics for detecting disclosed items in a CAT environment. Metodología de las Ciencias del Comportamiento, 5(2), 225-242.

Madaus, G., Airasian, P., & Hambleton, R. K. (1982). Development and application of criteria for screening commercial standardized tests. Educational Evaluation and Policy Analysis, 4, 401-415. Mazor, K., Clauser, B., & Hambleton, R. K. (1992). The effect of sample size on the functioning of the Mantel-Haenszel statistic. Educational and Psychological Measurement, 52, 443-451. Mazor, K., Clauser, B., & Hambleton, R. K. (1994). Identification of non-uniform differential item functioning using a variation of the Mantel-Haenszel procedure. Educational and Psychological Measurement, 54(2), 284-291. Mazor, K., Hambleton, R. K., & Clauser, B. (1998). Effects of conditioning on two internally derived ability estimates in multi-dimensional DIF analyses. Applied Psychological Measurement, 22, 357-368. McKinley, D. W., Boulet, J. R., & Hambleton, R. K. (2005). A work-centered approach for setting passing scores on performance-based assessments. Evaluation and the Health Professions, 28(3), 349-369. Meara, K., Hambleton, R. K., & Sireci, S. G. (2001). Setting and validating standards on professional licensure and certification exams: A survey of current practices. CLEAR Exam Review, 12(2), 17-23. Mislevy, R., Forsyth, R., Hambleton, R. K., Linn, R. L., & Yen, W. (1996, June). NAEP design/feasibility report. Washington, DC: National Assessment Governing Board. Muñiz, J., & Hambleton, R. K. (1992). Medio siglo de teoría de respuesta a los ítems. Anuario de Psicología, 52, 41-66. Muñiz, J., & Hambleton, R. K. (1997). Directions for the translation and adaptation of tests. Papeles del Psicólogo, August, 63-70. Muñiz, J., & Hambleton, R. K. (1999). Psychometric issues in computer-based testing. In J. Olea, V. Ponsoda, & G. Prieto (Eds.), Computerized testing: Fundamentals, strategies, and applications (pp. 23-52). Madrid: Pirámide. 
Muñiz, J., & Hambleton, R. K. (2000). Adaptación de los tests de unas culturas a otras. Metodología de las Ciencias del Comportamiento, 2(2), 129-149. Muñiz, J., Hambleton, R. K., & Xing, D. (2001). Small sample studies to detect flaws in item translations. International Journal of Testing, 1(2), 115-135. Oakland, T., & Hambleton, R. K. (Eds.). (1995). International perspectives on academic assessment. Boston, MA: Kluwer Academic Publishers.

Oakland, T., Poortinga, Y., Schlegel, J., & Hambleton, R. K. (2001). International Test Commission: Its history, current status, and future directions. International Journal of Testing, 1(1), 3-32. Olsen, L. K., Hambleton, R. K., & others. (1985). Development and application of the student test used in the School Health Education Evaluation. Journal of School Health, 55, 309-315. Phillips, G. W., Mullis, I. V. S., Bourque, M. L., Williams, P. L., Hambleton, R. K., Owen, E. H., & Barton, P. E. (1993). Interpreting NAEP scales. Washington, DC: National Center for Education Statistics. Pitoniak, M. J., Hambleton, R. K., & Biskin, B. H. (2003). Setting standards on tests containing computerized performance tasks (Center for Educational Assessment Research Report No. 488). Amherst, MA: University of Massachusetts, School of Education. Plake, B. S., & Hambleton, R. K. (2000). A standard-setting method designed for complex performance assessments: Categorical assignments of student work. Educational Assessment, 6(3), 197-215. Plake, B. S., & Hambleton, R. K. (2001). The analytic judgment method for setting standards on complex performance assessments. In G. Cizek (Ed.), Setting performance standards: Concepts, methods, and perspectives. Hillsdale, NJ: Lawrence Erlbaum Associates. Plake, B. S., Hambleton, R. K., & Jaeger, R. M. (1997). A new standard-setting method for performance assessments: The dominant profile judgment method and some field-test results. Educational and Psychological Measurement, 57(3), 400-411. Popham, W. J., & Hambleton, R. K. (1990). Can you pass the test on testing? Principal, 38-39. Ranney, P., & Hambleton, R. K. (2006). It's time to consider a new test model in clinical licensure programs. Journal of the American Dental Association, 137, 30-42. Robin, F., Sireci, S. G., & Hambleton, R. K. (2003). Evaluating the equivalence of different language versions of a credentialing exam. International Journal of Testing, 3(1), 1-20. 
Robin, R., Xing, D., & Hambleton, R. K. (1999). Review of the software package, Rasch Scaling Program (R.S.P.). Applied Psychological Measurement, 23(1), 90-94. Rogers, H. J., & Hambleton, R. K. (1989). Evaluating computer-simulated baseline statistics for interpreting item bias statistics. Educational and Psychological Measurement, 49, 355-369.

Rovinelli, R., & Hambleton, R. K. (1977). On the use of content specialists in the assessments of criterion-referenced test item validity. Dutch Journal of Educational Research, 2, 49-60. Royer, M., Hambleton, R. K., & Cadorette, L. (1978). Individual differences in memory: Theory, data and educational implications. Contemporary Educational Psychology, 3, 182-203. Royer, J. M., Lynch, D. J., Hambleton, R. K., & Bulgareli, C. (1984). Using the sentence verification technique to assess the comprehension of technical text. American Educational Research Journal, 21, 839-870. Sheehan, D. S., & Hambleton, R. K. (1977). A predictive study of success in an individualized science program. Journal of School Science and Mathematics, 77, 13-20. Sheehan, D. S., & Hambleton, R. K. (1977). Adapting instruction to student differences in an individualized science program. Journal of Research in Science Teaching, 14, 27-32. Sireci, S. G., Hambleton, R. K., Huff, K. L., & Jodoin, M. G. (2000). Setting standards on licensure exams using direct consensus (Laboratory of Psychometric and Evaluative Research Report No. 395). Amherst, MA: University of Massachusetts, School of Education. Sireci, S. G., Hambleton, R. K., & Pitoniak, M. J. (2004). Setting passing scores on licensure exams using direct consensus. CLEAR Exam Review, 15, 21-25. Sireci, S. G., Patsula, L., & Hambleton, R. K. (2005). Statistical methods for identifying flawed items in the test adaptation process. In R. K. Hambleton, P. Merenda, & C. Spielberger (Eds.), Adapting educational and psychological tests for cross-cultural assessment (pp. 93-115). Hillsdale, NJ: Lawrence Erlbaum Associates. Skorupski, W., & Hambleton, R. K. (2005). What are panelists thinking when they participate in standard-setting studies? Applied Measurement in Education, 18(3), 233-255. Smith, I. L., & Hambleton, R. K. (1991). Content validity studies of licensing examinations. Educational Measurement: Issues and Practice, 9, 7-10. Smith, I. 
L., Hambleton, R. K., & Rosen, G. A. (1988). Content validity studies of the Examination for Professional Practice in Psychology. Professional Practice of Psychology, 9(1), 43-80. Spineti, J., & Hambleton, R. K. (1977). A computer simulation study of tailored testing strategies for objectives-based instructional programs. Educational and Psychological Measurement, 37, 139-158. Stufflebeam, D. L., & Hambleton, R. K. (1988). Improving personnel evaluations through professional standards. Bulletin of the International Test Commission, 15, 3-24.

Stufflebeam, D. L., Hambleton, R. K., & others. (1989). Professional standards for educational evaluation systems. Beverly Hills, CA: Sage Publications. Swaminathan, H., Hambleton, R. K., & Algina, J. (1974). Reliability of criterion-referenced tests: A decision-theoretic formulation. Journal of Educational Measurement, 11, 263-267. Swaminathan, H., Hambleton, R. K., & Algina, J. (1975). A Bayesian decision-theoretic procedure for use with criterion-referenced tests. Journal of Educational Measurement, 12, 87-98. Swaminathan, H., Hambleton, R. K., Sireci, S., Xing, D., & Rizavi, S. (2003). Small sample estimation in dichotomous item response models: Effects of priors based on judgmental information on the accuracy of item parameter estimates. Applied Psychological Measurement, 27, 27-51. Traub, R. E., & Hambleton, R. K. (1972). The effect of scoring instructions and degree of speededness on the validity and reliability of multiple-choice tests. Educational and Psychological Measurement, 32, 737-758. Traub, R. E., & Hambleton, R. K. (1972). The effect of instruction on the cognitive structure of statistical and psychometric concepts. Canadian Journal of Behavioral Science, 6, 30-44. Traub, R. E., Hambleton, R. K., & Singh, B. (1969). Effects of promised reward and threatened penalty on performance of a multiple-choice vocabulary test. Educational and Psychological Measurement, 29, 847-861. van der Linden, W. J., & Hambleton, R. K. (1997). Item response theory: brief history, common models, and extensions. In W. J. van der Linden & R. K. Hambleton (Eds.), Handbook of modern item response theory (pp. 1-28). New York: Springer-Verlag. van der Linden, W. J., & Hambleton, R. K. (Eds.). (1997). Handbook of modern item response theory. New York: Springer-Verlag Publishers. van de Vijver, F., & Hambleton, R. K. (1996). Translating tests: some practical guidelines. European Psychologist, 1, 89-99. Wainer, H., Hambleton, R. K., & Meara, K. (1999). 
Alternative displays for communicating NAEP results: A redesign and validity study. Journal of Educational Measurement, 36(4), 301-335. Watts, J., Brown, W., Hambleton, R. K., & Mora, L. (2001). West Virginia accountability study (Final Report). Atlanta, GA: Southern Regional Education Board. Welsh, W., & Hambleton, R. K. (1976). On the use of goals in evaluation: A review of selected issues. Phi Delta Kappa's CEDR Quarterly, 9, 11-15.

Whelan, G. P., Boulet, J. R., McKinley, D. W., Norcini, J. J., van Zanten, M., Hambleton, R. K., Burdick, W. P., & Peitzman, M. D. (2005). Scoring standardized patient examinations: Lessons learned from the development and administration of the ECFMG Clinical Skills Assessment. Medical Teacher, 27, 200-206. Xing, D., & Hambleton, R. K. (2004). Impact of test design, item quality, and item bank size on the psychometric properties of computer-based credentialing examinations. Educational and Psychological Measurement, 64(1), 5-21. Yu, J., & Hambleton, R. K. (1996). Field test of the ITC guidelines for adapting psychological tests. International Journal of Psychology, 31(3), 439. (Abstract) Zenisky, A. L., & Hambleton, R. K. (2003). Formats for assessments. In R. Fernandez-Ballesteros (Ed.), Encyclopedia of psychological assessment (pp. 420-424). London: Sage. Zenisky, A. L., Hambleton, R. K., & Robin, F. (2003). Detection of differential item functioning in large-scale assessments: A study of evaluating a two-stage approach. Educational and Psychological Measurement, 63, 51-64. Zenisky, A. L., Hambleton, R. K., & Robin, F. (2004). DIF detection and interpretation in large-scale science assessments: Informing item writing practices. Educational Assessment, 9, 61-78. Zenisky, A. L., Hambleton, R. K., & Sireci, S. G. (2002). Identification and evaluation of local item dependencies in the Medical College Admissions Test. Journal of Educational Measurement, 39(4), 291-309. (c) Reviews Clauser, B., & Hambleton, R. K. (1994). A review of Holland and Wainer's Differential Item Functioning. Journal of Educational Measurement, 31(1), 88-92. Eignor, D. E., & Hambleton, R. K. (1977). A review of H. W. Collins, J. H. Johansen, & J. A. 
Johnson's Educational Measurement and Evaluation. Educational and Psychological Measurement, 37, 273-276. Eignor, D. E., & Hambleton, R. K. (1979). A review of Gronlund's Constructing Achievement Tests. Educational and Psychological Measurement, 39, 246-249. Fitzpatrick, A., & Hambleton, R. K. (1979). A review of Thorndike and Hagen's Measurement and Evaluation in Psychology and Education. Educational and Psychological Measurement, 39, 249-251.

Hambleton, R. K. (1972). A review of the new forms S and T of the Bennett Mechanical Comprehension Test. Journal of Educational Measurement, 1971, 8, 55-56. Reprinted in Buros, O. (Ed.), The Seventh Mental Measurements Yearbook. Highland Park, NJ: Gryphon Press, pp. 1486-1487. Hambleton, R. K. (1978). A review of the CGP Self-Scoring Placement Tests in English and Mathematics. In O. Buros (Ed.), The Eighth Mental Measurements Yearbook. Highland Park, NJ: Gryphon Press. Hambleton, R. K. (1978). A review of the Everyday Skills Tests. In O. Buros (Ed.), The Eighth Mental Measurements Yearbook. Highland Park, NJ: Gryphon Press. Hambleton, R. K. (1985). A review of the Differential Aptitude Test. In J. Mitchell (Ed.), The Ninth Mental Measurements Yearbook (pp. 504-505). Lincoln, NE: Buros Institute. Hambleton, R. K. (1985). A review of the Steenburgen Diagnostic-Prescriptive Program. In J. Mitchell (Ed.), The Ninth Mental Measurements Yearbook (pp. 1477-1478). Lincoln, NE: Buros Institute. Hambleton, R. K. (1992). A review of Hudson Education Skills Inventory. In J. C. Conoley & J. J. Kramer (Eds.), The Eleventh Mental Measurements Yearbook (pp. 390-392). Lincoln, NE: Buros Institute of Mental Measurements, University of Nebraska. Hambleton, R. K. (1992). A review of Survey of Problem-Solving and Educational Skills. In J. C. Conoley & J. J. Kramer (Eds.), The Eleventh Mental Measurements Yearbook (pp. 908-910). Lincoln, NE: Buros Institute of Mental Measurements, University of Nebraska. Hambleton, R. K. (1995). A review of The Seventh Edition of the Metropolitan Achievement Tests. In J. C. Conoley & J. Impara (Eds.), The Twelfth Mental Measurements Yearbook (pp. 606-610). Lincoln, NE: The Buros Institute. Hambleton, R. K. (2003). Tribute to Ross E. Traub. Alberta Journal of Educational Research, 49(3), 208-210. Hambleton, R. K. (2005). Review of the Iowa Tests of Basic Skills, Forms, K, L, M. In D. J. Keyser & R. C. Sweetland (Eds.), Test critiques (volume 11) (pp. 
138-150). Kansas City: Test Corporation of America. Hambleton, R. K. (2005). A review of the Academic Competence Evaluation Scales. In R. A. Spies, & B. S. Plake (Eds.), The 16th Mental Measurements Yearbook (pp. 1-4). Lincoln, NE: Buros Institute of Mental Measurements, University of Nebraska. Hambleton, R. K. (2005). A review of the Wechsler Memory Tests. In R. A. Spies, & B. S. Plake (Eds.), The 16th Mental Measurements Yearbook (pp. 1097-1099). Lincoln, NE: Buros Institute of Mental Measurements, University of Nebraska.

Hambleton, R. K. (2006). National Council on Measurement in Education. In N. Salkind (Ed.), Encyclopedia of Measurement and Statistics. Newbury Park, CA: Sage. Hambleton, R. K., & Carter, W. (1977). A review of D. P. Warwick & C. A. Lininger's, The Sample Survey: Theory and Practice. Educational and Psychological Measurement, 37, 568-569. Hambleton, R. K., & Cook, L. L. (1977). A review of D. G. Lewis' Assessment in Education. Educational and Psychological Measurement, 37, 559-560. Hambleton, R. K., & Kaplan-deVries, D. (1985). A review of the Basic Achievement Skills Individual Screener (BASIS). Journal of Counseling and Development, 63, 383-384. Hambleton, R. K., & Murray, L. (1983). A review of Thorndike's Applied Psychometrics. Applied Psychological Measurement, 7, 243-245. Hambleton, R. K., & Narayanan, P. (1992). Review of RASCAL. Rasch Measurement, 6(3), 236. Hambleton, R. K., & Powers, T. (1973). A review of G. H. Bracht, K. D. Hopkins, and J. C. Stanley's Perspectives in Educational and Psychological Measurement. Educational and Psychological Measurement, 33, 512-513. Hambleton, R. K., & Rovinelli, R. (1972). A review of W. Clemans' Educational Uses of the Computer: An Introduction. Educational and Psychological Measurement, 32, 526-529. Hambleton, R. K., & Swaminathan, H. (1981). A review of Lord's Applications of Item Response Theory to Practical Testing Problems. Journal of Educational Measurement, 18, 178-180. Jones, R. W., & Hambleton, R. K. (1992). A review of Osterlind's Constructing Test Items. Journal of Educational Measurement, 29, 195-197. Sheehan, D. S., & Hambleton, R. K. (1975). A review of D. M. Shoemaker's Principles and Procedures of Multiple Matrix Sampling. Educational and Psychological Measurement, 35, 1059-1061. Swaminathan, H., & Hambleton, R. K. (1972). A review of Van der Geer's Introduction to Multivariate Analysis for the Social Sciences. Educational and Psychological Measurement, 32, 1152-1156. 
(d) Technical Reports (Reports Published in Books or Journals Are Not Included) Algina, J., Bourque, M. L., Hambleton, R. K., & Larrivee, B. An evaluative study of selected outcomes of the Hampton Maine Anisa Program (1973-1974) (Final Report). Hampden, ME: Hampden School Department. (130 pages)

Arrasmith, D., & Hambleton, R. K. (1987). Steps for setting standards with the Angoff method (Final Report). New York: Professional Examination Service. Avis, N. E., Smith, K. W., Mayer, K. H., Swislow, L., & Hambleton, R. K. (1997). The multidimensional quality of life questionnaire for persons with HIV/AIDS: development and evaluation (Final Report). Watertown, MA: New England Research Institute. Bourque, M. L., Goodman, G., Hambleton, R. K., & Han, N. (2004). Reliability estimates for the ABTE tests in elementary education, professional teaching knowledge, secondary mathematics and English/language arts (Final Report). Leesburg, VA: Mid-Atlantic Psychometric Services. Clauser, B., Mazor, K., & Hambleton, R. K. (1991). Examination of various influences on the Mantel-Haenszel statistic (Laboratory of Psychometric and Evaluative Research Report No. 210). Amherst, MA: School of Education, University of Massachusetts. Cook, L. L., Eignor, D., Fitzpatrick, A., Gifford, J. A., Hambleton, R. K., Swaminathan, H., & Wroble, L. An evaluative study of the Social Literacy Project, 1977. (120 pages) Coulson, D., & Hambleton, R. K. (1974). Some validation methods for domain-referenced tests (Laboratory of Psychometric and Evaluative Research Report No. 7). Amherst, MA: School of Education, University of Massachusetts. Eignor, D. R., & Hambleton, R. K. (1979). Effects of test length and advancement score on several criterion-referenced test reliability and validity indices (Laboratory of Psychometric and Evaluative Research Report No. 86). Amherst, MA: School of Education, University of Massachusetts. Eignor, D. R., Hambleton, R. K., & Blanchard, K. (1976). Improving leadership effectiveness: Situational leadership theory, instrumentation, and applications (Laboratory of Psychometric and Evaluative Research Report No. 41). Amherst, MA: School of Education, University of Massachusetts. Ertel, K., Hambleton, R. K., & Schiff, R. (1973). 
Career education potential and alternatives in the Southern Berkshire Region: A study of schools with limited resources (Final Report). Boston: Massachusetts Commission for Occupational Education. (158 pages) Fitzpatrick, A. R., & Hambleton, R. K. (1983). Similarity between the skills covered by the Louisiana Basic Skills Tests and the skills covered by commonly used standardized achievement tests (Grades 2, 3, 4) (Final Report). Amherst, MA: Psychometric and Evaluative Research Services, Inc. Friedman, M., van Zanten, M., White, D., Hambleton, R. K., & Whelan, G. P. A survey of clinical skills of foreign medical graduates in their first year of residency (Research Report). Philadelphia, PA: Educational Commission for Foreign Medical Graduates.

Gifford, J. A., Cook, L. L., & Hambleton, R. K. (1976). Alternative schools: Rationale, descriptions, and problems of evaluation (Laboratory of Psychometric and Evaluative Research Report No. 32). Amherst, MA: School of Education, University of Massachusetts. Gimpel, J. R., Boulet, J. R., Weidner, Al., Dowling, D. J., Hambleton, R. K., Kerns, L., Solomon, M., & LaMarra, D. (2005). Standard setting summary report: COMLEX-USA Level 2-PE (Final Report). Philadelphia: National Board of Osteopathic Medical Examiners. Hambleton, R. K. (1970). Evaluation and research model for METEP (Final Report). Washington: Office of Education. Hambleton, R. K. (1971). A report on the research and evaluation activities in the Jamesville-Dewitt individualized instruction program in ninth grade science (Final Report). Albany, NY: Bureau of School and Cultural Research, New York State Education Department (122 pages). Hambleton, R. K. (1972). An evaluative study of the Educational Project to Implement Conservation (Final Report). Westfield, MA: Westfield Public Schools. (80 pages) Hambleton, R. K. (1974). A comment on Crehan's techniques for validating criterion-referenced testing (Laboratory of Psychometric and Evaluative Research Report No. 14). Amherst, MA: School of Education, University of Massachusetts. Hambleton, R. K. (1976). An assessment of School of Education grading practices and preferences (Laboratory of Psychometric and Evaluative Research Report No. 21). Amherst, MA: School of Education, University of Massachusetts. Hambleton, R. K. (1977). What classroom teachers need to know about criterion-referenced testing (Laboratory of Psychometric and Evaluative Research Report No. 50). Amherst, MA: School of Education, University of Massachusetts. Hambleton, R. K. (1977). Contributions to criterion-referenced test theory: On the uses of item characteristic curves and related concepts (Laboratory of Psychometric and Evaluative Research Report No. 51). 
Amherst, MA: School of Education, University of Massachusetts. Hambleton, R. K. (1977). Worcester Title I reading program evaluation (1976-1977) (Final Report). Providence, RI: International Educational Associates. Hambleton, R. K. (1978). An evaluative study of Project Support (1977-1978) (Final Report). Billerica, MA: Billerica School Department. (75 pages) Hambleton, R. K. (1978). Assessment of second level manager competence (Final Report). Basking Ridge, NJ: American Telephone and Telegraph. (62 pages) Hambleton, R. K. (1979). A field study of the validity of Hersey-Blanchard's model of leadership effectiveness (Final Report). Rochester, NY: Xerox Corporation.

Hambleton, R. K. (1984). Standard-setting: State of the art, future prospectus (Laboratory of Psychometric and Evaluative Research Report No. 142). Amherst, MA: School of Education, University of Massachusetts. Hambleton, R. K. (1985). Validity investigation for the certification examination of the National Association of Purchasing Management (Final Report). Amherst, MA: Psychometric and Evaluative Research Services, Inc. (93 pages) Hambleton, R. K. (1991). Follow-up evaluation study of the 1989 to 1991 workshops of the Consortium for the Improvement of Math and Science Teaching (Final Report). North Adams, MA: North Adams State College. Hambleton, R. K. (1995). Setting achievement levels on the NAEP mathematics assessment: Response to technical criticisms (Laboratory of Psychometric and Evaluative Research Report No. 250). Amherst, MA: University of Massachusetts, School of Education. Hambleton, R. K. (2004). 2002-2003 MCAS research and validity studies (Final Report). Amherst, MA: University of Massachusetts, Centr for Educational Assessment. Hambleton, R. K. (2004). Review of the translation/adaptation process for the Child Assessment Battery for the Head Start National Reporting System (Final Report). Washington: Government Accounting Office. Hambleton, R. K., & Berberoglu, G. (1997, May). TIMSS instruments adaptation process: a formative evaluation (Final Report). Amsterdam, The Netherlands. Hambleton, R. K., & Bourque, M. L. (1975). An evaluation of the Providence Title I Mathematics Remediation Laboratory Program (Final Report). Providence, RI: Providence School Department. Hambleton, R. K., & Eignor, D. (1978). Comments on selected questions raised in connection with the home environment study (Final Report). Princeton, NJ: Mathematica Policy Research. Hambleton, R. K., & Eignor, D. (1979). Comments on the Alaska instructional diagnostic system (Final Report). Portland, OR: Northwest Regional Educational Laboratory. Hambleton, R. K., & Eignor, D. (1979). 
A practitioner's guidebook to criterion-referenced test development, validation, and test score usage (Laboratory of Psychometric and Evaluative Research Report No. 70). Amherst, MA: School of Education, University of Massachusetts. (2nd ed.) Hambleton, R. K., & Gifford, J. A. (1977). An evaluative study of the CIP Screening Device and related instruments in Project CHILD FIND (Final Report). Providence, RI: Providence School Department.


Hambleton, R. K., & Gorth, W. P. (1971). Criterion-referenced testing: Issues and applications (Center for Educational Research Technical Report No. 13). Amherst, MA: School of Education, University of Massachusetts. (ERIC: ED 060 025) Hambleton, R. K., Gower, C., Bollwark, J., Mazor, K., & Donovan, C. (1989). Evaluation of the 1988-1989 Worcester Chapter 636 Magnet School Program (Final Report). Amherst, MA: School of Education, University of Massachusetts. (215 pages) Hambleton, R. K., Jones, R. W., & Cadman, S. (1993). Innovations in testing and evaluation of student competencies in technical and vocational education (Final Report). Paris: UNESCO. Hambleton, R. K., & Meara, K. (2000). Newspaper coverage of NAEP results - 1990 to 1998 (Laboratory of Psychometric and Evaluative Research Report No. 366). Amherst, MA: University of Massachusetts, School of Education. Hambleton, R. K., & Murray, J. (1977). A comparative study of faculty and student attitudes toward a variety of college grading purposes and practices (Laboratory of Psychometric and Evaluative Research Report No. 48). Amherst, MA: University of Massachusetts, School of Education. Hambleton, R. K., Murray, L., & Anderson, J. (1983). Uses of item statistics in item evaluation and test development (Laboratory of Psychometric and Evaluative Research Report No. 131). Amherst, MA: University of Massachusetts, School of Education. Hambleton, R. K., Murray, L., & Williams, P. (1983). Fitting item response models to the Maryland Functional Reading Test results (Laboratory of Psychometric and Evaluative Research Report No. 139). Amherst, MA: University of Massachusetts, School of Education. Hambleton, R. K., & Olszewski, F. (1972). Woodworking objective and test item bank (Final ESCOE Report). Boston, MA: Massachusetts Department of Education. Hambleton, R. K., & Pauker, R. (1976). Coordination and delivery of in-service education in Massachusetts project: Year one evaluation report (Final Report). 
Boston, MA: Department of Education. Hambleton, R. K., & Pauker, R. (1976). An evaluation plan for the project to coordinate and deliver in-service education in Massachusetts (Final Report). Boston, MA: Department of Education. Hambleton, R. K., & Rovinelli, R. (1971). Efficiency of various item-examinee sampling designs for estimating test parameters (Center for Educational Research Technical Report No. 12). Amherst, MA: School of Education, University of Massachusetts.


Hambleton, R. K., Sireci, S. G., Swaminathan, H., Xing, D., & Rizavi, S. (2003, October). Anchor-based methods for judgmentally estimating item difficulty parameters (Law School Admission Council Computerized Testing Report 9805). Newtown, NJ: LSAC. Hambleton, R. K., & Smith, I. L. (1988). Content validity and fairness review of the 1987 forms of the Examination for Professional Practice of Psychology (Final Report). Washington, DC: American Association of State Psychology Boards, Inc. (132 pages) Hambleton, R. K., & Smith, T. (1999). An evaluation of the general/public 1996 NAEP Science Reports (Laboratory of Psychometric and Evaluative Research Report No. 361). Amherst, MA: University of Massachusetts, School of Education. Hambleton, R. K., Stetz, F. P., & Newby, J. F. (1973). An assessment of selected components of the Baltimore Model Cities Project (Final Report). Baltimore, MD: Baltimore Model Cities Staff. (88 pages) Hambleton, R. K., Swaminathan, H., Arrasmith, D., Gower, C., & Rogers, H. J. (1986). Proposed steps for constructing and validating Air Force Specialty Diagnostic Achievement Tests (Laboratory of Psychometric and Evaluative Research Report No. 164). Amherst, MA: School of Education, University of Massachusetts. Hambleton, R. K., Swaminathan, H., Arrasmith, D., Gower, C., Rogers, H. J., & Zhou, A. (1986). Development of an integrated system to assess and enhance basic job skills: Research plan, personnel measurement subsystem (Laboratory of Psychometric and Evaluative Research Report No. 163). Amherst, MA: School of Education, University of Massachusetts. Hambleton, R. K., Swaminathan, H., Bollwark, J., Gower, C., Reshetar, R., Rogers, H. J., & Zhou, A. (1986). Program to assist school districts in collecting and using achievement test data (Final Report). Holyoke and Lowell, MA: Holyoke and Lowell Public School Systems. (39 pages) Hambleton, R. K., Swaminathan, H., & Eignor, D. (1976).
An evaluative study of the leadership development and team building laboratory for administrative personnel of the Baltimore City Public School System (Final Report). Baltimore, MD: Baltimore Public Schools. Hambleton, R. K., et al. (1976). An evaluative study of the third year of the Anisa program in the Hampden, Maine School System (Final Report). Hampden, ME: Hampden School Department. Hambleton, R. K., & Zhao, Y. (2004). Alignment of MCAS grade 10 English Language Arts and Mathematics Assessments with the curricula frameworks and the test specifications (Center for Educational Assessment Research Report No. 538). Amherst, MA: University of Massachusetts, Center for Educational Assessment.


MacCormack, J., Miller, C., Hambleton, R. K., & Eignor, D. (1976). Goal setting ability in young children: Theory, instrumentation, and measurement (Laboratory of Psychometric and Evaluative Research Report No. 25). Amherst, MA: School of Education, University of Massachusetts. Madaus, G., Airasian, P., & Hambleton, R. K. (1979). Development and application of criteria for screening commercial standardized tests (Final Report). Boston, MA: Massachusetts Department of Education. Malaka, M., & Hambleton, R. K. (1991). Formative evaluation of the first two criterion-referenced testing workshops for Swaziland teachers (Final Report). Amherst, MA: School of Education, University of Massachusetts. (37 pages) Mazor, K., Miller, T., & Hambleton, R. K. (1992). Predicting the academic success of minority students (Laboratory of Psychometric and Evaluative Research Report No. 248). Amherst, MA: University of Massachusetts, School of Education. Meara, K., Hambleton, R. K., & Sireci, S. G. (2000). A survey of standard-setting practices in the credentialing/licensing field (Laboratory of Psychometric and Evaluative Research Report No. 387). Amherst, MA: University of Massachusetts, School of Education. Mills, C. N., & Hambleton, R. K. (1980). Guidelines for reporting criterion-referenced test score information (Laboratory of Psychometric and Evaluative Research Report No. 100). Amherst, MA: School of Education, University of Massachusetts. Mills, C. N., Hambleton, R. K., Biskin, B., Kobrin, J., Evans, J., & Pfeffer, M. (2000). A comparison of the standard-setting methods for the Uniform CPA Examination (Technical Report). Jersey City, NJ: American Institute of Certified Public Accountants. Newby, J., Hambleton, R. K., Rovinelli, R., & Sheehan, D. (1972). A comparative study of creative behavior of middle school students in different instructional programs (Supplemental Report No. 1). Concord, MA: Concord School Department. Olsen, J., Hambleton, R. K., & Reckase, M. D. (1998).
Tekcheck psychometric review (Final Report). Orem, UT: Alpine Media. O'Reilly, R. P., & Hambleton, R. K. (1971). A CMI model for an individualized learning program in ninth grade science (Center for Educational Research Technical Report No. 14). Amherst, MA: School of Education, University of Massachusetts. Patsula, L., & Hambleton, R. K. (1999). A comparative study of ability estimates obtained from computer-adaptive and multi-stage testing (Laboratory of Psychometric and Evaluative Research Report No. 348). Amherst, MA: University of Massachusetts, School of Education.


Pauker, R., & Hambleton, R. K. (1976). Matching students and teachers to maximize learning: What do students think? (Laboratory of Psychometric and Evaluative Research Report No. 46). Amherst, MA: School of Education, University of Massachusetts. Rollins, L., & Hambleton, R. K. (1997). Job analysis study of municipal securities sales representatives, public finance professionals, and traders and underwriters (Final Report). Washington, DC: Municipal Securities Rulemaking Board. Rollins, L., & Hambleton, R. K. (2000). Job analysis study for the Series 53 Examination (Final Report). Washington, DC: Municipal Securities Rulemaking Board. Roman, J., & Hambleton, R. K. (1979). Screening tests for primary school children (Laboratory of Psychometric and Evaluative Research Report No. 101). Amherst, MA: School of Education, University of Massachusetts. Rovinelli, R., & Hambleton, R. K. (1973). Some procedures for the validation of criterion-referenced test items (Final Report). Albany, NY: Bureau of School and Cultural Research, New York State Education Department. (96 pages) Setiadi, H., & Hambleton, R. K. (1996, June). Item banks to improve assessment practices (Final Report). Jakarta: Indonesian Department of Education. Setiadi, H., & Hambleton, R. K. (1996, June). Item selection using IRT models (Final Report). Jakarta: Indonesian Department of Education. Sheehan, D. S., & Hambleton, R. K. (1972). An evaluative study of the Jamesville-DeWitt individualized science program (1971-1972) (Final Report). Albany, NY: Bureau of School and Cultural Research, New York State Education Department. (191 pages) Sheehan, D. S., & Hambleton, R. K. (1972). An evaluative study of the Jamesville-DeWitt individualized science program (1971-1972) (Supplemental Report No. 1). Albany, NY: Bureau of School and Cultural Research, New York State Education Department. (228 pages) Sheehan, D. S., & Hambleton, R. K. (1976).
A review of selected factors affecting questionnaire and interview results (Laboratory of Psychometric and Evaluative Research Report No. 29). Amherst, MA: School of Education, University of Massachusetts. Stetz, F. P., & Hambleton, R. K. (1973). An assessment of the Berkshire Hills Schools readiness program (Final Report). Pittsfield, MA: Berkshire Hills School System. Swaminathan, H., Hambleton, R. K., & Pauker, R. (1976). An evaluative study of Project Self (Final Report). Rocky Hill, CT: Rocky Hill Board of Education.


Traub, R. E., Gundlack, L., Wolfe, C., Hambleton, R. K., & Winslow, I. (1968). Technical Report for the Canadian Scholastic Aptitude Test Pretest: May-June 1968. Toronto: Ontario Institute for Studies in Education. Traub, R. E., Tuppen, C. J., & Hambleton, R. K. (1966). Validity and reliability of the Dominion Group Tests of Learning Capacity (Test Development Papers). Toronto: Ontario Institute for Studies in Education. Xing, D., & Hambleton, R. K. (1998). Documentation for running Bilog 3.11 in Windows 95 (Laboratory of Psychometric and Evaluative Research Report No. 342). Amherst, MA: University of Massachusetts, School of Education. Zenisky, A. L., Hambleton, R. K., & Sireci, S. G. (2000). Effects of local item dependencies on the validity of IRT item, test, and ability statistics (Laboratory of Psychometric and Evaluative Research Report No. 363). Amherst, MA: University of Massachusetts, School of Education.

(e) Published Tests

Blanchard, K. H., Hambleton, R. K., Zigmari, D., & Forsyth, D. (1981). Leader Behavior Analysis, Self and Other (Form A). Escondido, CA: Blanchard Training and Development. Hambleton, R. K. (1974). Diagnostic tests of selected reading skills. Providence, RI: International Educational Associates. Hambleton, R. K. (1975). Reading skills inventory: A criterion-referenced assessment (three editions). Materials produced included:
(1) Reading skills inventory description and technical manual.
(2) Indicators of prereading skills test. (Two forms)
(3) Indicators of word-attack skills test. (Two forms)
(4) Indicators of dictionary skills test. (Two forms)
(5) Indicators of reading comprehension test. (Nine levels, two forms)

Providence, RI: International Educational Associates. Hambleton, R. K. (1983). Blueprint for Learning. A comprehensive K-12 criterion-referenced reading and mathematics testing system. Tulsa, OK: Educational Development Corporation. Hambleton, R. K., Blanchard, K. H., & Hersey, P. (1977). Professional Maturity Scale. La Jolla, CA: University Associates. Hersey, P., Blanchard, K. H., & Hambleton, R. K. (1980). Leadership Scale. La Jolla, CA: University Associates.


PAPERS PRESENTED AT PROFESSIONAL MEETINGS:

Allalouf, A., Bastari, Sireci, S., & Hambleton, R. K. (1997, October). Comparing the dimensionality of a test administered in two languages. Paper presented at the meeting of NERA, Ellenville, NY. Allalouf, A., Hambleton, R. K., & Sireci, S. (1998, April). Detecting the causes of differential item functioning in translated verbal items. Paper presented at the meeting of NCME, San Diego. Avis, N. E., Smith, K. W., Hambleton, R. K., Feldman, H. A., Selwyn, A., & Jacobs, A. (1994, October). Development of the multidimensional index of life quality: A quality of life measure for cardiovascular disease. Paper presented at the Drug Information Association Second Symposium on Contributed Papers in Quality of Life Evaluation, Charleston, SC. Baldwin, P., Keller, L. A., & Hambleton, R. K. (2004, April). Using auxiliary information for small sample estimation with the Medical College Admission Test. Paper presented at the meeting of NCME, San Diego. Berberoglu, G., & Hambleton, R. K. (2004, July). Translating tests across languages for different uses: Issues, problems, and possible solutions. Paper presented at the JURE Conference, Istanbul. Berberoglu, G., Sireci, S. G., & Hambleton, R. K. (1997, March). Comparing translated items using bilingual and monolingual items. Paper presented at the meeting of NCME, Chicago. Berberoglu, G., & Hambleton, R. K. (2005, July). Test translation for intra-cultural and cross-cultural purposes: Issues, problems, techniques, and solutions. Paper presented at the 9th European Congress of Psychology, Granada, Spain. Berberoglu, G., Sireci, S. G., & Hambleton, R. K. (1997, July). A comparison of the graded response model and the Mantel-Haenszel method for detecting DIF across different language groups. Paper presented at the Fifth European Congress of Psychology, Dublin, Ireland. Bollwark, J., & Hambleton, R. K. (1990, May). Using the Mantel-Haenszel method in item bias studies.
Paper presented at the meeting of the New England Educational Research Organization, Rockport, Maine. Boulet, J., Friedman, M., Hambleton, R. K., Burdick, R., & Ziv, A. (1996, June). Assessing the adequacy of the post-encounter written scores in simulated patient exams. Paper presented at the 7th Ottawa Medical Testing Conference, Maastricht, The Netherlands. Boulet, J., Hambleton, R. K., Burdick, W. B., & Friedman, M. (1998, September). The use of case performance data to improve the technical quality of standardized patient examinations. Paper presented at the meeting of the Association of Medical Educators in Europe, Prague.


Boulet, J., Hambleton, R. K., Friedman, M., & Whelan, G. (1998, April). A comprehensive holistic approach for setting standards on performance assessments. Paper presented at the meeting of NCME, San Diego. Boulet, J., McKinley, D., Hambleton, R. K., & Whelan, G. P. (1999, September). Quality control measures to monitor the accuracy and consistency of scores from standardized patient assessments. Paper presented at the meeting of the AMEE, Linkoping, Sweden. Boulet, J. R., McKinley, D., Whelan, G. P., van Zanten, M., & Hambleton, R. K. (2002, November). Clinical skills deficiencies among first-year residents. Paper presented at the annual meeting of the Association of American Medical Colleges, San Francisco. Boulet, J. R., McKinley, D. W., Whelan, G., & Hambleton, R. K. (2002, April). The effect of task exposure on repeat candidate scores in a high-stakes performance assessment. Paper presented at the meeting of AERA, New Orleans. Clauser, B., Mazor, K., & Hambleton, R. K. (1990, April). The influence of test homogeneity on item bias results using the Mantel-Haenszel procedure. Paper presented at the meeting of AERA, Boston. Clauser, B., Mazor, K., & Hambleton, R. K. (1991, April). Examination of various influences on the Mantel-Haenszel statistic. Paper presented at the meeting of AERA, Chicago. Clauser, B., Mazor, K., & Hambleton, R. K. (1992, April). Effects of score group width on DIF with the MH procedure. Paper presented at the meeting of AERA, San Francisco. Cook, L. L., & Hambleton, R. K. (1978, April). Application of latent trait theory to the development of norm-referenced and criterion-referenced tests. Paper presented at the meeting of NCME, Toronto. Cook, L. L., & Hambleton, R. K. (1979, April). Effects of test length and sample size on the estimates of precision of latent ability scores. Paper presented at the meeting of AERA, San Francisco. Cook, L. L., & Hambleton, R. K. (1979, April). 
A comparative study of item selection methods utilizing latent trait theoretic models and concepts. Paper presented at the meeting of AERA, San Francisco. Coulson, D., & Hambleton, R. K. (1974, August). On the validation of criterion-referenced tests designed to measure individual mastery. Paper presented at the meeting of APA, New Orleans. Eignor, D. R., & Hambleton, R. K. (1974, April). Effects of test length and advancement score on several criterion-referenced test reliability and validity indices. Paper presented at the meeting of AERA, San Francisco. Elosua, P., Hambleton, R. K., & Zenisky, A. (2006, July). Improving the methodology for detecting biased test items. Paper presented at the 5th ITC Conference on Adapting Tests, Brussels.


Fernandos-Ballesteros, R., Hambleton, R. K., & O'Neil, T. (2001, July). The European Survey on Aging Protocol (ESAP): Translation and adaptation to seven European countries. Paper presented at the International Congress of Gerontology, Vancouver, BC. Friedman, M., Boulet, J., Burdick, B., Ziv, A., Hambleton, R. K., & Gary, N. (1997, October). Who should score the post-encounter patient progress note? Paper presented at the annual meeting of the American Association of Medical Colleges, Washington, DC. Friedman, M., Hambleton, R. K., Boulet, J., Ziv, A., Peitzman, S., Burdick, W. B., & Whelan, G. (1998, September). The learning curve in implementing standard-setting procedures in the health profession. Paper presented at the meeting of the Association of Medical Educators in Europe, Prague. Gifford, J. A., & Hambleton, R. K. (1979, October). Construction and use of criterion-referenced tests in program evaluation studies. Paper presented at the meeting of NERA, Ellenville, New York. Gifford, J. A., & Hambleton, R. K. (1980, April). Construction and use of criterion-referenced tests in program evaluation studies. Paper presented at the meeting of AERA, Boston. Goodman, D., & Hambleton, R. K. (2003, April). Reporting student results on state assessments: Current practice, problems, and possibilities. Invited paper presented at the meeting of NCME, Chicago. Hambleton, R. K. (1968, April). The effects of item order and anxiety on test performance and stress. Paper presented at the meeting of AERA, Chicago. Hambleton, R. K. (1969, May). The role of computers in education. An invited address at the meeting of the Ontario Vocational Educational Association, London, Ontario. Hambleton, R. K. (1972, March). Applications of Bayesian statistical methods to individually prescribed instruction programs. Paper presented at the meeting of NCME, Chicago. Hambleton, R. K. (1973, April). A decision-theoretic approach to criterion-referenced testing and measurement.
Paper presented at the meeting of AERA, New Orleans. Hambleton, R. K. (1973, April). A review of several testing models for individualized instruction. Paper presented at the meeting of AERA, New Orleans. Hambleton, R. K. (1973, October). Objectives-based instruction, testing, and measurement. Paper presented at the meeting of NERA, Ellenville, New York. Hambleton, R. K. (1974, August). Recent developments in criterion-referenced assessment. Paper presented at the meeting of APA, New Orleans. Hambleton, R. K. (1974, August). Criterion-referenced testing: A review of recent developments. Invited paper presented at the meeting of NERA, Ellenville, New York. Hambleton, R. K. (1974). College grading practices: A review of the issues. Paper presented at the First International Conference on Improving University Teaching, University of Massachusetts at Amherst.


Hambleton, R. K. (1975, April). Toward a theory and practice of criterion-referenced testing. Paper presented at an invited symposium at the meeting of AERA, Washington. Hambleton, R. K. (1976, October). A survey of evaluative methods and program results of the three-year Anisa field project. Paper presented at the meeting of NERA, Ellenville, New York. Hambleton, R. K. (1977, April). Contributions to criterion-referenced test theory: On the uses of item characteristic curves and related concepts. Paper presented at the meeting of AERA, New York. Hambleton, R. K. (1977, May). Guidelines for more effective objectives-based reading programs. Paper presented at the meeting of the International Reading Association, Miami Beach. Hambleton, R. K. (1977, June). The validity of criterion-referenced tests. Paper presented at the Third International Symposium on Educational Testing, University of Leyden, The Netherlands. Hambleton, R. K. (1978, April). Standards for educational and psychological tests. Paper presented at the meeting of AERA, Toronto. Hambleton, R. K. (1978, May). Constructing criterion-referenced reading tests: What are the steps? Paper presented at the International Reading Association, Houston. Hambleton, R. K. (1978, October). Validation of criterion-referenced test score interpretations and standard setting methods. Invited paper presented at the First Annual Johns Hopkins University National Symposium on Educational Research, Washington. Hambleton, R. K. (1979, March). Advances in testing technology. Presentation at the Learning Tomorrow for Today's Generations Conference at the University of Massachusetts at Amherst. Hambleton, R. K. (1979, April). Testing assumptions and determining the goodness of fit of latent trait models. Paper presented at the meeting of AERA, San Francisco. Hambleton, R. K. (1979, April). Applications of latent trait theory to the development and use of criterion-referenced tests.
Paper presented at the meeting of AERA, San Francisco. Hambleton, R. K. (1979, May). Setting standards on criterion-referenced reading tests: What are the steps? Paper presented at the meeting of the International Reading Association, Atlanta. Hambleton, R. K. (1979, June). Competency testing: Setting educational performance standards for the individual. Invited paper presented at the 9th Annual Conference on Large-Scale Assessment, Denver. Hambleton, R. K. (1979, June). Determining the validity of competency tests. Invited paper presented at the 9th Annual Conference on Large-Scale Assessment, Denver.


Hambleton, R. K. (1979, October). Will the real competency test please stand up? Keynote address at the meeting of NERA, Ellenville, New York. Hambleton, R. K. (1980, April). Review methods for criterion-referenced test items. Paper presented at the meeting of AERA, Boston. Hambleton, R. K. (1980, May). Guidelines for selecting criterion-referenced tests. Invited paper at the meeting of the International Reading Association, St. Louis. Hambleton, R. K. (1980, June). Ability estimation with three logistic test models. Paper presented at the Fourth International Symposium of Educational Testing, Antwerp, Belgium. Hambleton, R. K. (1980, June). Putting the Rasch model into perspective: Its advantages and disadvantages for district and state assessment applications. Invited paper presented at the 10th Annual Conference on Large-Scale Assessment, Denver. Hambleton, R. K. (1981, April). Latent ability scales, interpretations, and uses. Paper presented at the meeting of AERA, Los Angeles. Hambleton, R. K. (1981, April). Advances in criterion-referenced measurement in reading. Invited presentation at the meeting of the International Reading Association, New Orleans. Hambleton, R. K. (1981, June). Goodness of fit studies for latent trait models. Invited paper presented at the 11th Annual Conference on Large-Scale Assessment, Boulder, Colorado. Hambleton, R. K. (1981, December). Measures of goodness of fit for item response models. Invited paper presented at the meeting of the Netherlands Psychometric Society, Amsterdam. Hambleton, R. K. (1982, March). Recent advances in competency test development, standard-setting, and validity assessment. Invited presentation at the Fourth Annual Northern New England Educational Tests, Measurement, and Evaluation Conference, Plymouth, New Hampshire. Hambleton, R. K. (1982, June). The utilization of item response models with NAEP mathematics exercises.
Invited presentation at the 12th Annual Large-Scale Assessment Conference, Boulder, Colorado. Hambleton, R. K. (1982, August). Some pitfalls in applying item response models. Paper presented at the meeting of APA, Washington, DC. Hambleton, R. K. (1983, April). Standard-setting: State of the art, future prospectus. Paper presented at the meeting of AERA, Montreal. Hambleton, R. K. (1983, June). Applications of item response theory. Invited presentation at the meeting of the Canadian Society for the Study of Education, Vancouver. Hambleton, R. K. (1984, April). Promising solutions to several problems that arise in applying IRT. Paper presented at the meeting of AERA, New Orleans.


Hambleton, R. K. (1984, July). Applications of item response theory. Invited paper presented at the 23rd International Congress of Psychology, Acapulco. Hambleton, R. K. (1984, December). New technical advances in measurement for certification and licensure exams. Invited address at the NCHCA National Conference on Continuing Competence Assurance, Miami Beach. Hambleton, R. K. (1985, April). A competency test program evaluation from a psychometrician's viewpoint. Paper presented at the meeting of AERA, Chicago. Hambleton, R. K. (1986, March). Objectives-based testing. Invited presentation at the Orlando Conference, Lake Buena Vista, Florida. Hambleton, R. K. (1987, May). Uses of computers in school testing programs. Invited presentation at the Conference on Measurement and Evaluation, Los Angeles. Hambleton, R. K. (1987, May). Future of item response theory. Invited presentation at the Conference on Measurement and Evaluation, Los Angeles. Hambleton, R. K. (1988, August). Some pitfalls in current educational testing practices. Invited paper presented at the 24th International Congress of Psychology, Sydney, Australia. Hambleton, R. K. (1989, June). Educational testing practices: Trends, problems, and future directions. President's invited address at the meeting of the Canadian Educational Research Association, Quebec City. Hambleton, R. K. (1989, October). Item response models in physical education. Keynote address at the Sixth Measurement and Evaluation Symposium, University of Wisconsin, Madison. Hambleton, R. K. (1990, April). Future directions for educational assessment. President's address presented at the meeting of NCME, Boston. Hambleton, R. K. (1990, June). What do teachers need to know about testing? Invited presentation at a national conference on classroom testing practices, Victoria, BC. Hambleton, R. K. (1990, November). Future directions for educational assessment. 
Keynote address at the meeting of the Florida Educational Research Association, Deerfield Beach, FL. Hambleton, R. K. (1991, August). Meeting the measurement challenges of the 1990s: New psychometric models, methods, and tests. Invited address presented at the meeting of APA, San Francisco. Hambleton, R. K. (1991, September). Advances in item bias research. Invited presentation at the First European Congress on Psychological Assessment, Barcelona, Spain. Hambleton, R. K. (1991, November). Setting standards and choosing testing methods for national and international assessments. Invited presentation at the Assessing Learning and Educational Achievement Conference, Johnson Foundation Conference Center, Racine, Wisconsin.


Hambleton, R. K. (1992, April). Item response theory: A broad psychometric framework for measurement advances. Invited presentation at the meeting of NCME, San Francisco. Hambleton, R. K. (1992, April). The case for item response theory. Invited presentation at the meeting of AERA, San Francisco. Hambleton, R. K. (1992, April). Uses of international data in setting American educational standards. Invited presentation at a joint meeting of NCES/NAGB, Washington, DC. Hambleton, R. K. (1992, June). Measurement advances to address educational policy questions. Keynote address at the European Conference of Educational Research, Enschede, The Netherlands. Hambleton, R. K. (1992, June). Translating tests and establishing test score equivalence. Invited paper at the meeting of the Canadian Educational Research Association, Charlottetown, Prince Edward Island. Hambleton, R. K. (1992, July). Setting standards on national tests. Paper presented at the 25th International Congress of Psychology, Brussels, Belgium. Hambleton, R. K. (1993, April). Rise and fall of criterion-referenced measurement? Invited paper presented at the meetings of AERA and NCME, Atlanta. Hambleton, R. K. (1993, June). New measurement models, methods, and tests for the 1990s and beyond. Paper presented at the meeting of CERA, Ottawa. Hambleton, R. K. (1993, August). Guidelines for translating tests. Presentation at the meeting of APA, Toronto. Hambleton, R. K. (1994, February). Methodological issues arising in cross-national comparative studies. Invited paper presented at the American Association for the Advancement of Science, San Francisco. Hambleton, R. K. (1994, April). Setting performance standards: Essential research studies. Paper presented at the meeting of NCME, New Orleans. Hambleton, R. K. (1994, April). Scales, scores, and reporting forms to enhance the utility of educational testing. Invited paper presented at the meeting of NCME, New Orleans. Hambleton, R. K. (1994, April).
International perspectives on assessment: International Test Commission. Paper presented at the meeting of NCME, New Orleans. Hambleton, R. K. (1994, June). Setting performance standards: New methods and essential research studies. Invited presentation at the Medical Council of Canada's "Post Ottawa Conference," Toronto. Hambleton, R. K. (1994, July). Developing guidelines for adapting instruments. Invited paper presented at the 23rd Congress of Applied Psychology, Madrid.


Hambleton, R. K. (1994, November). Standard-setting methods for performance assessments in clinical problem-solving. Invited presentation at the meeting of the Research in Medical Education Conference, Boston. Hambleton, R. K. (1994, December). Translating tests: Issues and methods. Invited presentation at the NCES Limited English Proficiency Conference, Washington. Hambleton, R. K. (1995, January). Standard-setting in state assessments: current status and future research directions. Invited presentation at the CCSSO-SCASS meeting, New Orleans. Hambleton, R. K. (1995, May). New directions for college admissions testing and research in the United States. Invited presentation at the Third International SweSAT Conference, Umea, Sweden. Hambleton, R. K. (1995, June). Psychological testing in the 21st century. Keynote address at the Congress on Psychometrics, Pretoria, South Africa. Hambleton, R. K. (1995, June). The detection of item bias: methods, research findings, and applications. Invited presentation at the Congress on Psychometrics, Pretoria, South Africa. Hambleton, R. K. (1995, June). Adapting tests for use in multiple languages and cultures: issues, methods, and guidelines. Invited presentation at the Congress on Psychometrics, Pretoria, South Africa. Hambleton, R. K. (1995, July). Guidelines for adapting psychological tests for use in multiple languages and cultures. Paper presented at the Fourth European Congress of Psychology, Athens. Hambleton, R. K. (1995, August). Setting standards on performance assessments: technical issues and promising methods. Paper presented at the meeting of APA, New York. Hambleton, R. K. (1995, August). Psychological assessment advances for the 21st century: New psychometric models, methods, and technology. Keynote address presented at the Third European Congress of Psychological Assessment, Trier, Germany. Hambleton, R. K. (1995, October).
Translating psychological tests and medical examinations: Main issues, methods, and technical guidelines. Invited paper presented at the Medical Selection Conference, Fribourg, Switzerland. Hambleton, R. K. (1995, December). Assessing student progress in Massachusetts: Radical changes for the 21st century. Invited presentation at the Academy for Legislators: An Educational Forum, University of Massachusetts Amherst. Hambleton, R. K. (1996, February). Reactions to "Domain scores: A new concept in reporting NAEP results". Presentation at the NAGB Work Group on Planning Meeting, Washington, DC. Hambleton, R. K. (1996, February). Producing comparable scores on non-equivalent examinations. Presentation at a meeting of the NASBA Users' Panel, Orlando, FL.

Hambleton, R. K. (1996, April). Guidelines for adapting educational and psychological tests. Paper presented at the meeting of NCME, New York. Hambleton, R. K. (1996, April). Assessing medical competence: some promising solutions. Keynote address presented at the annual meeting of the Northeast Group on Educational Affairs in Medicine, Philadelphia. Hambleton, R. K. (1996, May). Reporting of state assessment results: issues, methods, and essential research. Presentation at the CCSSO State Collaborative on Assessment and Student Standards Meeting, St. Louis. Hambleton, R. K. (1996, May). Setting standards on performance assessments: progress report. Presentation at the CCSSO State Collaborative on Assessment and Student Standards Meeting, St. Louis. Hambleton, R. K. (1996, June). Innovations in large scale assessment: psychometric lessons learned from Kentucky. Paper presented at the National Conference on Large Scale Assessment, Phoenix, Arizona. Hambleton, R. K. (1996, August). Adapting psychological tests: technical guidelines for improving practices. Paper presented at the 26th International Congress of Psychology, Montreal. Hambleton, R. K. (1996, August). Development of guidelines for adapting psychological and educational tests for use in multiple languages and cultures. Invited paper presented at the 13th Congress of the International Association for Cross-Cultural Psychology, Montreal, Canada. Hambleton, R. K. (1996, August). Application of the Joint Committee's Program Evaluation Standards to education. Paper presented at the meeting of APA, Toronto. Hambleton, R. K. (1996, October). The future of educational assessment: Likely directions and technical problems to overcome. Keynote address presented at the annual meeting of NERA, Ellenville, NY. Hambleton, R. K. (1996, December). Setting performance standards on achievement tests in Title I. Presentation at the meeting of SCASS, Washington. Hambleton, R. K. (1997, March). 
Issues and methods in setting standards on performance assessments. Invited presentation at the meeting of the Northeast Group on Educational Affairs, Washington, DC. Hambleton, R. K. (1997, March). NAEP redesign: technical committee report and some personal observations. Invited paper presented at the meeting of AERA, Chicago. Hambleton, R. K. (1997, March). Some notes on item response theory. Invited graduate student seminar at the AERA meeting, Chicago. Hambleton, R. K. (1997, May). Judgmental estimates of item difficulty. Presentation at the Annual Swedish Scholastic Aptitude Conference, Umea, Sweden.

Hambleton, R. K. (1997, July). Issues, methods, and guidelines for adapting tests from one language and culture to another. Paper presented at the Fifth European Congress of Psychology, Dublin, Ireland. Hambleton, R. K. (1997, July). Establishing cross-cultural validity: a discussion. Paper presented at the Fifth European Congress of Psychology, Dublin. Hambleton, R. K. (1997, July). Future directions in educational assessment. Invited presentation at the Scientific Council of the National Institute for Testing and Evaluation, Jerusalem. Hambleton, R. K. (1997, August). Increasing the validity of NAEP scores and score reporting with achievement levels. Invited paper presented at the NAEP Achievement Levels Workshop, Boulder, Colorado. Hambleton, R. K. (1997, August). Changing measurement models and methods for the 21st century. Invited Division 5 Presidential Address at the meeting of the American Psychological Association, Chicago. Hambleton, R. K. (1997, October). Promising GMAT item formats for the 21st century. Invited presentation at the international workshop on the GMAT, Paris, France. Hambleton, R. K. (1997, December). Setting performance standards on national and state educational assessments. Invited presentation at the Title I-CCSSO Conference, Washington. Hambleton, R. K. (1998, April). Setting standards on multi-format assessments: a review of methods and a program of research. Paper presented at the meetings of AERA and NCME, San Diego. Hambleton, R. K. (1998, May). Computer-based testing: The promises and the problems to overcome. Paper presented at the 26th annual meeting of the Canadian Society for the Study of Education. Hambleton, R. K. (1998, June). Setting standards on complex performance assessments. Paper presented at the Large-Scale Assessment Conference, Colorado Springs, CO. Hambleton, R. K. (1998, August). Translation and adaptation of psychological tests: Issues, research designs, statistical approaches, and practical steps.
Invited paper presented at the 24th International Congress of Applied Psychology, San Francisco. Hambleton, R. K. (1998, September). Translating and adapting credentialing exams into multiple languages: Issues, steps, and guidelines. Invited paper at the 18th annual meeting of CLEAR, Denver. Hambleton, R. K. (1998, October). Advances in standard-setting methodology. Invited presentation at the Measurement and Evaluation: Current and Future Research Directions Conference, Banff, Alberta, Canada. Hambleton, R. K. (1998, October). Educational assessment for the 21st century. Keynote address at the 3rd National Forum on Educational Evaluation, Veracruz, Mexico.

Hambleton, R. K. (1998, December). Are the Massachusetts teacher tests valid? Invited presentation at Westfield State College, Westfield, MA. Hambleton, R. K. (1999, April). Guidelines for adapting and translating educational and psychological tests. Invited paper presented at the meeting of NCME, Montreal. Hambleton, R. K. (1999, April). Performance assessment: A synthesis of current research and future directions. Invited paper presented at the meeting of NCME, Montreal. Hambleton, R. K. (1999, May). Issues, designs and technical guidelines for adapting tests in multiple languages and cultures. Invited address at the International Conference on Adapting Tests for Use in Multiple Languages and Cultures. Washington, DC. Hambleton, R. K. (1999, June). Setting standards on complex performance assessments. Invited paper presented at the 19th annual National Conference on Large-Scale Assessment, Snowbird, Utah. Hambleton, R. K. (1999, July). Issues, designs, and guidelines for adapting tests. Invited address at the Joint European Conference of the IACCP and the ITC, Graz, Austria. Hambleton, R. K. (1999, July). Advances in test adaptation methodology. Invited presenter in a symposium at the Joint European Conference of the IACCP and the ITC, Graz, Austria. Hambleton, R. K. (1999, August). Advances in testing methods. Invited presentation at the Sweden Department of Education, Stockholm. Hambleton, R. K. (1999, September). Advances in item response modeling of educational and psychological test data. Invited presentation at the 6th Congress of Social Science Methodology, University of Oviedo, Oviedo, Spain. Hambleton, R. K. (1999, September). Computer-based testing: Ten promises, ten problems to overcome. Keynote address at the 6th Congress of Social Science Methodology, University of Oviedo, Oviedo, Spain. Hambleton, R. K. (1999, October). Evaluative criteria and methods for setting performance standards. Invited presentation at the Edward F. 
Reidy, Jr., First Interactive Lecture Series. Dover, NH: The National Center for the Improvement of Educational Assessment. Hambleton, R. K. (2000, February). Computer-enhanced assessment: Great promise and problems to overcome. Keynote address at the American Test Publishers Conference, Carmel, CA. Hambleton, R. K. (2000, April). Test and scoring models for the new generation of assessments. Invited paper presented at the meeting of NCME, New Orleans. Hambleton, R. K. (2000, April). Evaluation of NAEP standard-setting: Let's see both sides. Paper presented at the meeting of NCME, New Orleans, LA. Hambleton, R. K. (2000, April). Enhancing the validity of the test adaptation process: Improving the judgmental process. Paper presented at the meeting of NCME, New Orleans, LA.

Hambleton, R. K. (2000, April). Setting standards on complex performance assessments: A summary of an NSF-CCSSO-NCME project. Paper presented at the meeting of NCME, New Orleans, 2000. Hambleton, R. K. (2000, April). Advances in standard-setting methods. Paper presented at the NCME meeting, New Orleans, LA. Hambleton, R. K. (2000, June). Improving the ways we report test scores to policy-makers and the public. Invited presentation at the University of Maryland Invitational Conference on Measurement, College Park, MD. Hambleton, R. K. (2000, June). Possible methods for setting performance standards on NAEP. Invited presentation to the National Assessment Governing Board Achievement Levels Committee, Snowbird, Utah. Hambleton, R. K. (2000, June). A look at NAEP score reporting: Progress, the press, and Popham's proposals. Invited presentation to the National Assessment Governing Board Achievement Levels Committee, Snowbird, Utah. Hambleton, R. K. (2000, July). Computer-based exams: Current issues, advances, and essential research. Invited paper presented at the 27th International Congress of Psychology, Stockholm. Hambleton, R. K. (2000, September). Translation of NAEP achievement levels to the Voluntary National Tests. Invited paper presented to a meeting of AIR and NAGB, Washington. Hambleton, R. K. (2000, November). New advances in assessment practices. Keynote address presented at the meeting of the Association for Educational Assessment, Prague. Hambleton, R. K. (2001, April). What we know about standards-based score reporting. Paper presented at the meeting of AERA, Seattle. Hambleton, R. K. (2001, July). New approaches for improving the ways test scores are reported. Invited paper presented at the 7th European Congress of Psychology, London. Hambleton, R. K. (2001, December). Future directions for adult education assessment.
Presentation at the National Academies Board on Testing and Assessment Meeting on Performance Assessments for Adult Education, Washington, DC. Hambleton, R. K. (2002, February). A new challenge: Making results from large-scale assessments understandable and useful. Invited presentation at the Provincial Testing in Canadian Schools: Research, Policy, and Practice Conference, Victoria, British Columbia. Hambleton, R. K. (2002, February). Adapting credentialing exams for use in multiple languages. Invited presentation at ATP's Conference on Computer-Based Testing, Carlsbad, CA. Hambleton, R. K. (2002, February). A non-technical introduction to item response theory for credentialing exams: Models, applications, and issues. Invited presentation at ATP's Conference on Computer-Based Testing, Carlsbad, CA.

Hambleton, R. K. (2002, April). Test designs for the next generation of large-scale assessments. Invited presentation at the NCME meeting, New Orleans. Hambleton, R. K. (2002, April). Misconceptions about the technical aspects of large scale state assessments. Key-note address at the meeting of the New England Educational Research Organization, Northampton, Massachusetts. Hambleton, R. K. (2002, June). Test designs and item formats for the next generation of assessments. Invited discussant remarks at the International Conference on Computer-Based Testing and the Internet, Winchester, England. Hambleton, R. K. (2002, June). Testing in the 21st century: What's new and what measurement problems need to be solved? Keynote address at the GITP Conference, Psychological Research: Luxury or Necessity, Amsterdam, the Netherlands. Hambleton, R. K. (2002, June). Adding meaning to test scores, finally! Presentation at the 32nd Annual National Conference on Large-Scale Assessment, Palm Desert, California. Hambleton, R. K. (2002, July). Progress in large-scale medical testing: Methodological advances and new challenges. Keynote address at the Tenth Ottawa Conference for Medical Education, Ottawa. Hambleton, R. K. (2002, July). The promises and challenges of computer-based testing [Abstract]. Proceedings of the 25th International Congress of Applied Psychology, Singapore. Hambleton, R. K. (2002, November). Setting performance standards on state assessments. Invited presentation at the Harcourt Midwest Assessment Forum, Chicago. Hambleton, R. K. (2002, December). Psychometric developments, 1966 to 2002, and challenges for the future. Invited presentation at the International Conference on Measurement for the Social Sciences (Festschrift to Honour Ross Traub), Toronto. Hambleton, R. K. (2003, January). Theory, methods, and practices in testing for the 21st century. Presentation at the Honoris Causa Ceremony, University of Oviedo, Spain. Hambleton, R. K. (2003, February).
Advances in testing practices in the 21st century . . . not so fast. Keynote address at the annual meeting of the Association of Test Publishers, Amelia Island, Florida. Hambleton, R. K. (2003, April). Evaluation of new computer-based test designs for credentialing exams. Paper presented at meeting of NCME, Chicago. Hambleton, R. K. (2003, July). Computer-based testing: Great concept but many statistical problems to overcome. Invited address at the IX Seminar on Applied Statistics, Rio de Janeiro, Brazil. Hambleton, R. K. (2003, July). Applying item response theory models in educational testing. Keynote address at the IX Seminar on Applied Statistics, Rio de Janeiro, Brazil.

Hambleton, R. K. (2004, February). ITC guidelines for adapting exams into multiple languages and cultures. Invited presentation at the ATP Conference on Computer-Based Testing, Palm Springs, 2004. Hambleton, R. K. (2004, February). Setting AICPA passing scores: So how much is good enough? Invited presentation at the ATP Conference on Computer-Based Testing, Palm Springs, 2004. Hambleton, R. K. (2004, June). Comparing IRT models for the analysis of quality of life research data. Invited address at the 2004 International Society for Quality of Life Research Symposium, Boston. Hambleton, R. K. (2004, June). Consistency of performance standards over grades and subjects. Presentation at the annual CCSSO Conference, Boston. Hambleton, R. K. (2004, June). Traditional and modern approaches to outcomes measurement. Invited presentation at the Advances in Health Outcomes Measurement Conference, Bethesda, MD. Hambleton, R. K. (2004, October). Guidelines and methodology for adapting educational and psychological tests. An invited presentation at the 4th International Test Commission Conference on Equitable Assessment Practices, Williamsburg, VA. Hambleton, R. K. (2005, February). A new challenge in testing: Making test scores more understandable. An invited presentation at ATP's Innovations in Testing Conference, Scottsdale, AZ. Hambleton, R. K. (2005, May). Educational assessment in the 21st century: Two stories to tell so far. Keynote presentation at the CERA Meeting, London, Ontario. Hambleton, R. K. (2005, July). Item response theory: Recent advances and technical challenges. Invited presentation at the 9th European Congress of Psychology, Granada, Spain. Hambleton, R. K. (2005, November). Advances in assessment for the 21st century. Invited presentation at the meeting of the Center for Innovation, National Board of Medical Examiners, Philadelphia. Hambleton, R. K. (2006, February). Making diagnostic score reports more clear and meaningful for candidates.
An invited presentation at the ATP Conference, Orlando, FL. Hambleton, R. K. (2006, February). Using item response theory (IRT) models to equate test scores. An invited presentation at the ATP Conference, Orlando, FL. Hambleton, R. K. (2006, March). Six big problems to overcome in educational and psychological measurement. An invited presentation at the University of Oviedo, Spain. Hambleton, R. K. (2006, May). Applying IRT models to health science data. An invited presentation at Northwestern University, Evanston. Hambleton, R. K. (2006, June). Automated test assembly with item response theory. An invited presentation at the CCSSO meeting, San Francisco.

Hambleton, R. K. (2006, June). Multiple languages in large-scale assessments. An invited presentation at the CCSSO meeting, San Francisco. Hambleton, R. K. (2006, July). Recent developments in educational assessment. Invited presentation at the 26th International Congress of Applied Psychology, Athens, Greece. Hambleton, R. K. (2006, August). Issues in test adaptation methodology. Invited paper presented at the meeting of APA, New Orleans. Hambleton, R. K. (2006, August). Five big challenges in educational and psychological assessment. Invited presentation at the meeting of APA, New Orleans. Hambleton, R. K. (2006, October). Item response theory and models for the next generation of educational and psychological tests. An invited presentation at the Winemiller 2006 Conference on Methodological Development of Statistics in the Social Sciences, Columbia, Missouri. Hambleton, R. K. (2006, October). Applications of item response theory to improve health outcomes assessment. An invited presentation at the Conference on New Methods for the Analysis of Family and Dyadic Processes, University of Massachusetts, Amherst. Hambleton, R. K., Arrasmith, D., & Smith, I. L. (1986, April). Optimal selection of test items. Paper presented at the meeting of NCME, Washington, DC. Hambleton, R. K., Arrasmith, D., & Smith, I. L. (1986, June). Optimal item selection for credentialing examinations. Paper presented at the meeting of the Psychometric Society, Toronto. Hambleton, R. K., & Artes-Ferragud, M. (1990, June). New directions in item response theory: Applications of multichotomous response models. Paper presented at the meeting of the Canadian Educational Research Association, Victoria, BC. Hambleton, R. K., & Berberoglu, G. (1997, March). Third International Mathematics and Science Study: test adaptation methods and results. Paper presented at the meeting of NCME, Chicago. Hambleton, R. K., Blanchard, K. H., & Hersey, P. (1978, June).
Validity of situational leadership theory and applications. Paper presented at the 19th International Congress of Applied Psychology, Munich. Hambleton, R. K., & Bollwark, J. (1990, July). Test translations in cross-cultural studies. Invited paper presented at the meeting of the International Congress of Applied Psychology, Kyoto, Japan. Hambleton, R. K., Bollwark, J., & Rogers, H. J. (1990, April). Detecting potentially biased test items. Paper presented at the meeting of AERA, Boston. Hambleton, R. K., & Boulet, J. (1996, September). Psychometric methods for medical examinations. Presentation at the annual meeting of the Association for Medical Education in Europe, Copenhagen.

Hambleton, R. K., & Bourque, M. L. (1992, April). Methodological considerations in setting standards on national examinations. Invited paper presented at the meeting of AERA, San Francisco. Hambleton, R. K., & Cadman, S. (1994, July). Item response theory models and applications: Current status and future directions. Invited paper presented at the 23rd Congress of Applied Psychology, Madrid. Hambleton, R. K., & Cook, L. L. (1976, April). Introduction to latent trait models and their use in analyzing educational test data. Paper presented at the meeting of NCME, San Francisco. Hambleton, R. K., & Cook, L. L. (1978, April). Robustness of latent trait models. Paper presented at the meeting of AERA, Toronto. Hambleton, R. K., Dirir, M., & Lam, P. (1992, April). Effects of optimal test designs on measurement precision and decision accuracy. Paper presented at the meeting of AERA, San Francisco. Hambleton, R. K., & Eignor, D. R. (1977, July). Adaptive testing applied to hierarchically structured objectives-based curricula. Invited paper presented at the Second Conference on Computerized Adaptive Testing, University of Minnesota. Hambleton, R. K., & Eignor, D. R. (1978, April). Criteria for evaluating criterion-referenced tests and test manuals. Paper presented at the meeting of NCME, Toronto. Hambleton, R. K., & Eignor, D. R. (1978, February). Minimum competency level identification: A review of selected issues, methods, and implementation strategies. Paper presented at the AERA Conference on Minimum Competency Testing, Washington. Hambleton, R. K., & Eignor, D. R. (1978, April). Allocating testing time in objectives-based instructional programs. Paper presented at the meeting of APA, Toronto. Hambleton, R. K., & Fennessy, L. (1991, November). Advances in credentialing examination methods. Invited paper presented at the International Symposium on Modern Theories in Measurement: Problems and Issues. Chateau Montebello, Montebello, Quebec, Canada. Hambleton, R. 
K., & Friedman, M. (1996, September). Advances in assessment using standardized patient methodology: a psychometrician's perspective. Keynote address presented at the annual meeting of the Association for Medical Education in Europe, Copenhagen. Hambleton, R. K., & Gifford, J. A. (1979, July). Robustness of latent trait models. Invited paper presented at the 1979 Computerized Adaptive Testing Conference, Minneapolis. Hambleton, R. K., & Gorth, W. P. (1970, October). Item Analysis for criterion-referenced tests. Paper presented at the meeting of NERA, Liberty, New York. Hambleton, R. K., Gorth, W. P., & O'Reilly, R. P. (1971, October). A formative evaluative model for classroom instruction. Paper presented at the meeting of NERA, Liberty, New York.

Hambleton, R. K., Gower, C., & Bollwark, J. (1987, October). Assessing problem-solving ability with computer-adaptive testing procedures. Paper presented at the 29th meeting of the Military Testing Association, Ottawa, Canada. Hambleton, R. K., Gower, C., & Bollwark, J. (1988, April). New testing methods to assess technical problem solving. Paper presented at the meeting of AERA, New Orleans. Hambleton, R. K., Gower, C., & Bollwark, J. (1988, August). Computer-administered tests to assess troubleshooting skills. Paper presented at the meeting of APA, Atlanta. Hambleton, R. K., Gower, C., & Rogers, H. J. (1989, April). Customized testing: Review of issues and methods. Paper presented at the meeting of NCME, San Francisco. Hambleton, R. K., & Han, N. (2004, April). Assessing the fit of IRT models. Paper presented at the meeting of NCME, San Diego. Hambleton, R. K., & Han, N. (2006, April). Have my test items been stolen? Item statistics to find out. Invited paper presented at the meeting of NCME, San Francisco. Hambleton, R. K., Han, N., & Ying, L. (2004, February). Detecting disclosed test items in a computer-based testing environment. Invited presentation at the ATP Conference on Computer-Based Testing, Palm Springs, CA. Hambleton, R. K., Hutten, L., & Swaminathan, H. (1974, August). A comparison of several methods for assessing student mastery in objectives-based instructional programs. Paper presented at the meeting of APA, New Orleans. Hambleton, R. K., Jaeger, R., & Plake, B. (1994, October). Performance standard setting on the EAG assessment package: What was done? What was learned? Presentation at the first NBPTS-ADL-TAG colloquium on measurement and methodology, Washington. Hambleton, R. K., Jaeger, R. M., Plake, B. S., & Mills, C. (1997, March). Issues and methods for setting standards on performance assessments. Paper presented at the meeting of AERA, Chicago. Hambleton, R. K., & Jodoin, M. (2001, February). 
Applying item response models to credentialing exams: Answers to the 10 most important questions. Invited presentation at the ATP Conference on Computer-Based Testing, Tucson, Arizona. Hambleton, R. K., & Jones, R. W. (1991, April). Influence of various factors on the accuracy of test information functions. Paper presented at the meeting of NCME, Chicago. Hambleton, R. K., & Jones, R. W. (1992, April). Comparison of statistical and judgmental methods for assessing DIF. Paper presented at the meeting of NCME, San Francisco. Hambleton, R. K., & Jones, R. W. (1992, July). International impact of item response theory on testing practices. Invited paper presented at the 25th International Congress of Psychology, Brussels, Belgium.

Hambleton, R. K., & Jones, R. W. (1993, April). Item parameter estimation errors and their influence on test information functions. Paper presented at the meeting of NCME, Atlanta. Hambleton, R. K., Jones, R. W., & Rogers, H. J. (1990, May). Comparison of empirical and judgmental methods for detecting potentially biased test items. Paper presented at the meeting of the New England Educational Research Organization, Rockport, Maine. Hambleton, R. K., & Kanjee, A. (1992, October). Methodological issues in large scale assessment. Invited paper presented at the International Symposium in China's Higher Education Examinations, Nanjing, China. Hambleton, R. K., & Kanjee, A. (1993, April). Enhancing the validity of cross-national validity studies: Solving the test translation problem. Paper presented at the meeting of AERA, Atlanta. Hambleton, R. K., & Kanjee, A. (1994, July). Enhancing the validity of cross-cultural testing issues, research designs, and psychometric methods. Paper presented at the 23rd Congress of Applied Psychology, Madrid. Hambleton, R. K., & Li, S. (2004, August). Effective implementation of the International Test Commission Guidelines for Adapting Tests. Invited presentation at the 28th International Congress of Psychology, Beijing, China. Hambleton, R. K., Li, S., & Sireci, S. G. (2003, April). Identifying common problems in item translation: A meta analysis. Paper presented at the meeting of NCME, Chicago. Hambleton, R. K., & Martois, J. S. (1982, April). Validity of a derived score prediction system based on item response theory principles and procedures. Paper presented at the meeting of AERA, New York. Hambleton, R. K., Martois, J. S., & Williams, C. (1983, April). Detection of biased items with item response models. Paper presented at the meeting of AERA, Montreal. Hambleton, R. K., & Meara, K. (1998, August). The Graduate Record Examination: What is the validity evidence? 
Invited paper presented at the meeting of the American Psychological Association, San Francisco. Hambleton, R. K., & Meara, K. (1999, November). Newspaper coverage of NAEP results: 1990-1998. Presentation at the meeting of the National Assessment Governing Board, Washington, DC. Hambleton, R. K., & Mills, C. N. (1981, April). Ability estimation with three logistic test models. Paper presented at the meeting of NCME, Los Angeles. Hambleton, R. K., Mills, C. N., & Simon, R. (1981, April). Determining the optimal length of a criterion-referenced test. Paper presented at the meeting of NCME, Los Angeles. Hambleton, R. K., & Murray, J. (1977, April). A comparative study of faculty and student attitudes toward a variety of college grading purposes and practices. Paper presented at the meeting of NCME, New York.

Hambleton, R. K., & Murray, L. N. (1984, April). Assessing the dimensionality of NAEP reading items: A look at several approaches. Paper presented at the meeting of AERA, New Orleans. Hambleton, R. K., Murray, L. N., & Williams, P. (1983, April). Fitting item response models to test data: Approaches and examples. Paper presented at the meeting of AERA, New York. Hambleton, R. K., & Patsula, L. (1996, August). Adaptation/translation of tests: issues, technical advances, and practical steps. Paper presented at the meeting of APA, Toronto. Hambleton, R. K., & Patsula, L. (1996, August). Test adaptations: review of methods and suggestions for additional research. Paper presented at the 26th International Congress of Psychology, Montreal. Hambleton, R. K., & Patsula, L. (1997, September). Adapting tests for use in multiple languages and cultures: sources of error, possible solutions, and practical guidelines. Invited paper presented at the Fourth European Conference on Psychological Testing, Lisbon. Hambleton, R. K., & Patsula, L. (1998, April). Increasing the validity of adapted tests: Problems to overcome and guidelines to follow for improving test adaptation practices. Paper presented at the meeting of AERA, San Diego. Hambleton, R. K., & Plake, B. S. (1994, April). Using an extended Angoff procedure to set standards on complex performance assessments. Paper presented at a joint meeting of AERA and NCME, New Orleans. Hambleton, R. K., & Plake, B. S. (1997, March). An anchor-based approach to setting standards on complex performance assessments. Paper presented at the meeting of AERA, Chicago. Hambleton, R. K., Plake, B. S., & Engelhard, G. (2001, April). Richard M. Jaeger's contributions to standard-setting methods. Invited symposium at the meeting of AERA, Seattle. Hambleton, R. K., & Powell, S. (1978, May). Future directions in testing. Paper presented at the National Future Studies Conference, University of Massachusetts at Amherst. Hambleton, R.
K., Powell, S., & Eignor, D. R. (1979, April). Issues and methods for standard-setting. Paper presented at the meeting of NCME, San Francisco. Hambleton, R. K., Powers, T., & Rovinelli, R. (1972, April). An investigation of the effects of test administration procedures and scoring on the reliability and validity of achievement tests. Paper presented at the meeting of AERA, Chicago. Hambleton, R. K., Roberts, D. M., & Traub, R. E. (1969, February). Comparison of two methods for assessing partial knowledge. Paper presented at the meeting of the Canadian Conference for Research in Education, Victoria, British Columbia. Hambleton, R. K., & Rogers, H. J. (1985, April). Evaluation of the plot method for identifying biased test items. Paper presented at the meeting of AERA, Chicago.

Hambleton, R. K., & Rogers, H. J. (1985, April). Advances in developing certification and licensure tests. Paper presented at the meeting of AERA, Chicago. Hambleton, R. K., & Rogers, H. J. (1986, April). Promising advances in assessing the fit of item response models. Paper presented at the meetings of AERA and NCME, San Francisco. Hambleton, R. K., & Rogers, H. J. (1987, June). Solving criterion-referenced testing problems with item response models. Paper presented at the biannual meeting of the European Psychometric Society, Enschede, The Netherlands. Hambleton, R. K., & Rogers, H. J. (1988, April). Applications of IRT models to criterion-referenced measurement problems. Invited paper presented at the meetings of AERA and NCME, New Orleans. Hambleton, R. K., & Rogers, H. J. (1988, April). Detecting biased test items: Comparison of the IRT area and Mantel-Haenszel methods. Paper presented at the meeting of AERA, New Orleans. Hambleton, R. K., & Rogers, H. J. (1988, June). Applying IRT models to large-scale assessment data. Invited paper presented at the International Symposium on Large-Scale Assessments in an International Perspective, Deidesheim, Federal Republic of Germany. Hambleton, R. K., & Rogers, H. J. (1989, April). Detecting potentially biased test items: Comparison of empirical and judgmental methods. Paper presented at the meeting of AERA, San Francisco. Hambleton, R. K., & Rogers, H. J. (1990, April). Solving some practical problems that arise in using IRT models. Invited one-day training session at the meeting of NCME, Boston. Hambleton, R. K., Rogers, H. J., & Arrasmith, D. (1986, April). A comparison of the Mantel-Haenszel statistic and item response methods of identifying differential item performance. Paper presented at the meeting of AERA, San Francisco. Hambleton, R. K., Rogers, H. J., & Arrasmith, D. (1986, August). Identifying potentially biased test items: A comparison of Mantel-Haenszel statistic and several item response theory methods.
Paper presented at the meeting of APA, Washington, DC. Hambleton, R. K., Rogers, H. J., & Jones, R. W. (1990, August). Influence of item parameter estimation errors in test development. Paper presented at the meeting of APA, Boston. Hambleton, R. K., & Rovinelli, R. J. (1983, April). Assessing the dimensionality of a set of test items. Paper presented at the meeting of AERA, Montreal. Hambleton, R. K., Rovinelli, R. J., & Gorth, W. P. (1971, April). Efficiency of various item-examinee sampling designs for estimating test parameters. Paper presented at the meeting of APA, Washington, DC. Hambleton, R. K., & Simon, R. (1979, October). A comprehensive model for building criterion-referenced tests. Paper presented at the meeting of NERA, Ellenville, New York.


Hambleton, R. K., & Simon, R. (1980, April). Steps for constructing criterion-referenced tests. Paper presented at the meeting of AERA, Boston. Hambleton, R. K., & Slater, S. (1994, October). Using performance standards to report national and state assessment data: Are the reports understandable and how can they be improved? Invited paper presented at the Joint Conference on Standard-Setting for Large-Scale Assessments, Washington. Hambleton, R. K., & Slater, S. (1995, April). Reliability issues and methods for credentialing exams. Paper presented at the meetings of AERA and NCME, San Francisco. Hambleton, R. K., & Slater, S. (1995, July). Item response theory: Models and applications. Paper presented at the Fourth European Congress of Psychology, Athens. Hambleton, R. K., & Slater, S. C. (1996, April). Are NAEP executive summary reports understandable to policy-makers and educators? Invited paper presented at the meeting of NCME, New York. Hambleton, R. K., Stetz, R., & Rios, A. (1983, April). The development of objectives-based programs in occupational education. Paper presented at the meeting of NERA, Ellenville, New York. Hambleton, R. K., Sutnick, A. I., & Friedman, M. (1995, September). New methods for setting standards on performance assessments. Paper presented at the meeting of the Association for Medical Education in Europe, Zaragoza, Spain. Hambleton, R. K., Swaminathan, H., & Algina, J. (1975, June). Toward a theory and practice of criterion-referenced testing. Paper presented at the Second International Symposium of Educational Testing, Montreux, Switzerland. Hambleton, R. K., Swaminathan, H., Sireci, S., Xing, D., & Rizavi, S. (1998, April). Estimating item statistics with judgmental data and Bayesian statistical procedures. Paper presented at the meeting of AERA, San Diego. Hambleton, R. K., & Traub, R. E. (1970, February). Analysis of empirical data using the Rasch model and two- and three-parameter logistic models.
Paper presented at the meeting of AERA, Minneapolis. Hambleton, R. K., & Traub, R. E. (1970, May). Some preliminary results on the robustness of the Rasch test theory model. Paper presented at the meeting of the New England Educational Research Organization (NEERO), Boston. Hambleton, R. K., & Traub, R. E. (1970, August). Information curves and efficiency of three logistic test models. Paper presented at the meeting of the American Psychological Association, Miami. Hambleton, R. K., & Traub, R. E. (1971, April). Some results on the robustness of the Rasch test theory model. Paper presented at the meeting of AERA, New York.


Hambleton, R. K., et al. (1977, April). Measurement models for the future: A review of latent trait models, technical developments, and applications. Symposium presented at the meeting of AERA and NCME, New York. Hambleton, R. K., & van der Linden, W. (1993, June). Advances in measurement models, methods, and practices. Invited paper presented at the ITC Conference on Test Use with Children and Youth, Oxford, England. Hambleton, R. K., & Xing, D. (2002, January). Maximizing the usefulness of computer-based test designs for making pass-fail decisions. Paper presented at the meeting of the Canadian Educational Research Association, Toronto. Hambleton, R. K., & Yu, J. (1991, December). Impact of item response theory models on testing practices. Invited paper presented at the International Symposium on Psychological Measurement, Nanjing, P.R.C. Hambleton, R. K., & Zaal, J. (1986, July). Computerized adaptive testing: Theory, applications, and standards. Paper presented at the 21st meeting of the International Congress of Applied Psychology, Jerusalem. Hambleton, R. K., & Zenisky, A. (2001, April). Increasing the meaningfulness of score scales and reports. Paper presented at the meeting of NCME, Seattle. Hambleton, R. K., Zenisky, A., & Jodoin, M. (2001, July). Computer-based test designs and item formats for the next generation of tests. Invited paper presented at the 7th European Congress on Psychology, London. Han, N., & Hambleton, R. K. (2004, April). Detecting exposed test items in a computer-based testing environment. Paper presented at the NCME meeting, San Diego. Han, N., Li, S., & Hambleton, R. K. (2005, April). Kernel versus IRT equating. Paper presented at the meeting of NCME, Montreal. Jaeger, R. M., Hambleton, R. K., & Plake, B. S. (1995, April). Eliciting configural performance standards through a sequenced application of complementary methods. Paper presented at the meetings of AERA and NCME, San Francisco. Jaeger, R. M., Plake, B., & Hambleton, R. K. 
(1993, January). Designs for setting standards on multidimensional performance assessments. Paper presented at the meeting of the North Carolina Association for Research in Education, Greensboro, NC. Jaeger, R., Plake, B. S., & Hambleton, R. K. (1993, April). Integrating multi-dimensional performances and setting standards. Paper presented at the meeting of NCME, Atlanta. Jirka, S. J., Baldwin, S. G., Karantonis, A. M., Wells, C. S., & Hambleton, R. K. (2006, October). Population invariance: Comparison of converted scores for a national testing program. Paper presented at the Northeastern Educational Research Association, Kerhonkson, New York.


Jodoin, M., Zenisky, A., & Hambleton, R. K. (2002, April). Comparison of the psychometric properties of several computer-based test designs for credentialing exams. Paper presented at the meeting of NCME, New Orleans. Jones, R. W., & Hambleton, R. K. (1991, April). Fitting IRT models to the Graduate Management Admissions Test. Paper presented at the meeting of NEERO, Portsmouth, NH. Karantonis, A. M., Baldwin, S. G., Jirka, S. J., Wells, C. S., & Hambleton, R. K. (2006, October). Item parameter invariance across states in a national assessment program. Paper presented at the Northeastern Educational Research Association, Kerhonkson, New York. Karantonis, A. M., Wells, C., & Hambleton, R. K. (2007, April). Defining performance categories: Using an IRT-based approach to identify exemplar items. Paper presented at the NCME meeting, Chicago. Lam, P., Swaminathan, H., & Hambleton, R. K. (1992, April). Use of binary programming in test designs to address content balancing in adaptive tests. Paper presented at the meeting of AERA, San Francisco. Ma, X., Klauck, S., Ying, L., & Hambleton, R. K. (2001, October). DIF analyses on a state assessment. Paper presented at the meeting of NERA, Ellenville, NY. Mazor, K., Clauser, B., & Hambleton, R. K. (1991, April). The effect of sample size on the functioning of the Mantel-Haenszel statistic. Paper presented at the meeting of NCME, Chicago. Mazor, K., Clauser, B., & Hambleton, R. K. (1992, April). Detection methods for non-uniform bias. Paper presented at the meeting of NCME, San Francisco. Mazor, K., Hambleton, R. K., & Clauser, B. (1994, April). The effects of conditioning on two internally derived ability estimates in multidimensional DIF analysis. Paper presented at the meeting of AERA, New Orleans. McCormack, J., Miller, C., Hambleton, R. K., & Eignor, D. R. (1976, May). Goal-setting ability in young children: Theory, instrumentation, and measurement. 
Paper presented at the annual meeting of NEERO, Provincetown, Massachusetts. McKinley, D. W., Boulet, J. R., & Hambleton, R. K. (2000, April). Standard setting for performance based assessment: A pilot study using an empirically defined, multi-faceted approach. Paper presented at the meeting of AERA, New Orleans. McKinley, D. W., Boulet, J. R., Hambleton, R. K., & Burdick, W. P. (1999, September). Statistical procedures for improving standardized patient assessments. Paper presented at the meeting of the AMEE, Linkoping, Sweden. McKinley, D. W., Boulet, J., & Hambleton, R. K. (2003, September). Psychometric challenges associated with standardized patient assessments. Paper presented at the meeting of the Association for Medical Education in Europe, Bern, Switzerland.


McKinley, D. W., Boulet, J. R., & Hambleton, R. K. (2004, July). An examinee-centered approach to setting passing scores for standardized patient examinations. Paper presented at the Ottawa Conference for Medical Education, Barcelona, Spain. Melican, G., Breithaupt, K., Mills, C. N., & Hambleton, R. K. (2005, April). Multi-stage testing and case studies in a functioning licensing examination. Paper presented at the meeting of NCME, Montreal. Mills, C. N., & Hambleton, R. K. (1979, October). Issues and methods of reporting criterion-referenced test scores. Paper presented at the meeting of NERA, Ellenville, New York. Mills, C. N., & Hambleton, R. K. (1980, April). Guidelines for reporting criterion-referenced test score information. Paper presented at the meeting of AERA, Boston. Mills, C. N., & Hambleton, R. K. (1982, April). Developing norms for a vertically equated item bank. Paper presented at the meeting of AERA, New York. Mills, C., Jaeger, R. M., Plake, B. S., & Hambleton, R. K. (1998, April). An investigation of several new methods for establishing standards on complex performance assessments. Paper presented at the meeting of AERA, San Diego. Mills, C. N., Plake, B. S., Jaeger, R. M., & Hambleton, R. K. (1997, March). Lessons learned: a comparison of two methods for establishing performance standards on complex performance assessments. Paper presented at the meeting of AERA, Chicago. Monahan, P. O., Stump, T. E., Finch, H., & Hambleton, R. K. (2005, April). Bias of exploratory and cross-validated DETECT index under null hypothesis of unidimensionality. Paper presented at the meeting of NCME, Montreal. Muñiz, J., & Hambleton, R. K. (1991, April). Medio siglo de teoria de respuesta a los items. Invited paper presented at the Second Congress of Behavioral Sciences Methodology, Canary Islands, Spain. Muñiz, J., Hambleton, R. K., & Xing, D. (1997, July). Small sample empirical procedures for detecting poorly translated or adapted test items.
Paper presented at the Fifth European Congress of Psychology, Dublin, Ireland. Muñiz, J., Hambleton, R. K., & Xing, D. (1997, September). Evaluation of differential item functioning in small samples. Paper presented at the Congress of Methodology for the Social Sciences, Seville, Spain. Muñiz, J., Hambleton, R. K., & Xing, D. (1998, April). Small sample studies to detect flaws in test translation. Paper presented at the meeting of NCME, San Diego. Muñiz, J., Hambleton, R. K., & Xing, D. (1998, August). Small sample statistical approaches for identifying poorly adapted test items. Invited paper presented at the 24th International Congress of Applied Psychology, San Francisco. Muñiz, J., Hambleton, R. K., & Xing, D. (1999, May). Small sample detection of poorly translated test items. Paper presented at the International Conference on Adapting Tests for Use in Multiple Languages and Cultures, Washington, DC.


Murray, L. N., & Hambleton, R. K. (1981, April). Building item banks. Paper presented at the meeting of NEERO, Lenox, Massachusetts. Murray, L. N., & Hambleton, R. K. (1983, April). Compiling evidence to address item response model-test data fit. Paper presented at the meeting of AERA, Montreal. Narayanan, P., Hambleton, R. K., & Plake, B.S. (1994, April). Two-stage testing as an approximation to computerized adaptive testing. Paper presented at the meeting of AERA, New Orleans. Oakland, T., & Hambleton, R. K. (1999, April). Improving testing practices around the world. Invited paper presented at the meeting of NCME, Montreal. O'Reilly, R. P., & Hambleton, R. K. (1981, April). A CMI model for an individualized learning program in ninth grade science. Paper presented at the meeting of AERA, New York. O'Reilly, R. P., & Hambleton, R. K. (1971, April). Applied CMI models for groups and individually prescribed instruction in New York State. Paper presented at the meeting of NCME, New York. Patsula, L., & Hambleton, R. K. (1999, April). Accuracy of ability estimates obtained from computerized adaptive, paper and pencil, and multi-stage tests. Paper presented at the meeting of NCME, Montreal. Pauker, R., & Hambleton, R. K. (1976, April). Matching students and teachers to maximize learning: What do students think? Paper presented at the meeting of the International Congress for Individualized Instruction, Boston. Pitoniak, M. J., Hambleton, R. K., & Biskin, B. H. (2003, April). Setting standards on tests containing computerized performance tasks. Paper presented at the meeting of NCME, Chicago. Pitoniak, M., Hambleton, R. K., & Sireci, S. (2002, April). Comparative analysis of two methods for setting standards. Paper presented at the meeting of NCME, New Orleans. Plake, B. S., Hambleton, R. K., & Jaeger, R. M. (1995, April). Score profile method for setting standards for complex performance assessments. Paper presented at the meeting of AERA, San Francisco. Plake, B. 
S., & Hambleton, R. K. (1998, April). Categorical assignments of student work: an analytical standard-setting method designed for complex performance assessments with multiple performance categories. Paper presented at the meetings of AERA and NCME, San Diego. Rogers, H. J., & Hambleton, R. K. (1987, April). Evaluation of computer-simulated baseline statistics for use in item bias studies. Paper presented at the meeting of AERA, Washington, DC. Rovinelli, R., & Hambleton, R. K. (1973, October). Some procedures for the validation of criterion-referenced test items. Paper presented at the meeting of NERA, Ellenville, New York.


Rovinelli, R., & Hambleton, R. K. (1976, April). On the use of content specialists in the assessment of criterion-referenced test item validity. Paper presented at the meeting of AERA, San Francisco. Rovinelli, R., & Hambleton, R. K. (1976, May). Improving the quality of achievement tests used in PSI programs. Paper presented at the Third National Conference on Personalized Instruction, Washington, DC. Royer, M., Hambleton, R. K., & Cadorette, L. (1976, April). Individual differences in the long-term retention of meaningful materials. Paper presented at the meeting of AERA, San Francisco. Skorupski, W. P., & Hambleton, R. K. (2003, April). What are panelists really thinking when they set performance standards? Paper presented at the meeting of NCME, Chicago. Sheehan, D. S., & Hambleton, R. K. (1972, October). An application of latent partition analysis to the evaluation of instruction. Paper presented at the joint meeting of NERA-NCME, Boston. Sheehan, D. S., & Hambleton, R. K. (1976, April). A review of selected factors affecting questionnaire and interview results. Paper presented at the meeting of AERA, San Francisco. Slawson, D. A., Novak, J., & Hambleton, R. K. (1988, April). A qualitative approach to the evaluation of expert system shells. Paper presented at the meeting of AERA, New Orleans. Smith, I. L., Hambleton, R. K., & Rosen, G. (1988, August). Content validity studies of the Examination for Professional Practice of Psychology. Paper presented at an invited symposium at the meeting of APA, Atlanta. Spineti, R., & Hambleton, R. K. (1973, October). A computer simulation study of tailored testing strategies for objectives-based instructional programs. Paper presented at the meeting of NERA, Ellenville, New York. Swaminathan, H., Hambleton, R. K., & Algina, J. (1973, October). A decision-theoretic approach to issues in criterion-referenced assessment. Paper presented at the meeting of NERA, Ellenville, New York. Swaminathan, H., Hambleton, R.
K., & Algina, J. (1974, April). Reliability of criterion-referenced tests. Paper presented at the meeting of APA, New Orleans. Traub, R. E., & Hambleton, R. K. (1970, February). Effect of scoring instructions and degree of speededness on validity and reliability of multiple-choice tests. Paper presented at the meeting of AERA, Minneapolis. Traub, R. E., & Hambleton, R. K. (1971, April). The effect of instruction upon the semantic space defined by measurement concepts. Paper presented at the meeting of AERA, New York.


Traub, R. E., Hambleton, R. K., & Singh, B. (1968, February). Effects of promised reward and threatened penalty on performance in a multiple-choice vocabulary test. Paper presented at the meeting of AERA, Chicago. van de Vijver, F. J. R., & Hambleton, R. K. (1996, August). Translating tests: Some practical guidelines. Paper presented at the meeting of APA, Toronto. Wainer, H., Hambleton, R. K., & Meara, K. (1999, April). Alternative displays for communicating NAEP results: A redesign and validity study. Paper presented at the meeting of NCME, Montreal. Welsh, W., & Hambleton, R. K. (1975, April). On the use of goals in evaluation: A review of selected issues. Paper presented at the meeting of AERA, Washington, DC. Xing, D., & Hambleton, R. K. (2002, April). Impact of test design, item quality, and item bank size on the psychometric properties of computer-based credentialing exams. Paper presented at the meeting of NCME, New Orleans. Ying, L., & Hambleton, R. K. (2004, April). Statistics for detecting disclosed items in a CAT environment. Paper presented at the meeting of NCME, San Diego. Yu, J., & Hambleton, R. K. (1996, August). Field test of the ITC guidelines for adapting psychological tests. Paper presented at the 26th International Congress of Psychology, Montreal. Zenisky, A. L., & Hambleton, R. K. (2004, April). Investigating the effects of selected multistage test design alternatives on credentialing outcomes. Paper presented at the NCME meeting, San Diego. Zenisky, A. L., Hambleton, R. K., & Robin, F. (2001, August). Two-stage large sample DIF procedures for state assessments. Paper presented at the meeting of APA, San Francisco. Zenisky, A. L., Hambleton, R. K., & Sireci, S. G. (2000, April). Effects of item dependencies among MCAT items on the validity of IRT item, test, and ability statistics.
Paper presented at the meeting of NCME, New Orleans. Zhao, Y., & Hambleton, R. K. (2006, April). Impact of IRT model misfit on score precision and performance classifications. Paper presented at the meeting of NCME, San Francisco. Zhao, Y., & Hambleton, R. K. (2006, October). Consequences of IRT model fit in equating. Paper presented at the Northeastern Educational Research Association, Kerhonkson, New York. Zumbo, B. D., Sireci, S. G., & Hambleton, R. K. (2003, April). Revisiting exploratory methods for construct comparability: Is there something to be gained for the ways of the old? Paper presented at the meeting of NCME, Chicago.


INVITED DISCUSSANT AT PROFESSIONAL MEETINGS:

Applications of criterion-referencing to the testing of language. Symposium presented at the meeting of the Eastern Psychological Association, Washington, DC, 1973. Criterion-referenced testing. Symposium presented at the meeting of AERA, Chicago, 1974. Perspectives on criterion-referenced testing. Paper-reading session at the meeting of NCME, San Francisco, 1976. Evaluation of student progress and school environment in the Anisa early childhood educational program. Symposium presented at the meeting of NEERO, Provincetown, Massachusetts, 1976. Mastery teaching and mastery testing: The integration of instruction and measurement. Symposium presented at the meeting of AERA, Toronto, 1978. What's happening in measurement? The use of Rasch and other latent trait models. Symposium presented at the meeting of the Eastern Educational Research Association, Williamsburg, Virginia, 1978. Practical uses of item response theory. Symposium presented at the meeting of AERA, San Francisco, 1979. Applications of the Rasch test model. Symposium presented at the meeting of AERA, San Francisco, 1979. Latent trait applications. Symposium presented at the meeting of the NERA, Ellenville, New York, 1979. Issues in setting performance standards. Symposium at the 10th Annual Conference on Large-Scale Assessment, Denver, 1980. Competency testing in Detroit. Symposium presented at the meeting of AERA, Boston, 1980. Comparison and evaluation of standard-setting methods. Symposium presented at the meeting of AERA, Boston, 1980. Local and state competency testing. Symposium presented at the meeting of AERA, Boston, 1980. Methods and issues in setting standards for minimum proficiency tests. Symposium presented at the meeting of NCME, Los Angeles, 1981. Measurement challenges of basic skills assessment programs. Symposium presented at the meeting of AERA, Los Angeles, 1981. A multidisciplinary review of criterion-referenced measurement. 
Symposium presented at the meeting of AERA, Los Angeles, 1981.


Impact of test disclosure legislation on national testing programs. Symposium presented at the 11th Annual Conference on Large-Scale Assessment, Boulder, Colorado, 1981. The use of item response theory for the development of tests and the interpretation of test scores. Symposium presented at the meeting of NCME, New York, 1982. Measurement models for assessment data. Symposium presented at the meeting of AERA, New York, 1982. Using statewide basic skills tests to make promotion decisions: Political and psychometric issues. Symposium presented at the meeting of AERA, New York, 1982. Practically induced expansions in measurement technology. Symposium presented at the meeting of AERA, New York, 1982. Latent trait models: How useful are they to professional education? Symposium presented at the meeting of AERA, New York, 1982. Comparing the one- and three-parameter latent trait models: Point, counterpoint, and discussion. Symposium presented at the meeting of AERA, New York, 1982. State testing programs and testing policies: How they influence schools. Symposium presented at the meeting of AERA, Montreal, 1983. Framework for problem identification in test projects. Symposium presented at the meeting of AERA, Montreal, 1983. Issues and developments in item response theory. Symposium presented at the meeting of AERA, New Orleans, 1984. The criterion problem in professional evaluation: Ministry, medicine, and law. Symposium presented at the meeting of AERA, New Orleans, 1984. Critical measurement issues in learning disabilities. Invited symposium presented at the meeting of APA, Toronto, 1984. Fitting item response models to multidimensional data. Symposium presented at the meeting of AERA, Chicago, 1985. NAEP: An educational indicator. Symposium presented at the meeting of NCME, Chicago, 1985. Setting standards for high-stakes tests. Symposium presented at the meetings of AERA and NCME, San Francisco, 1986. Promising item response model applications. 
Critique session presented at the meetings of AERA and NCME, San Francisco, 1986. Building tests with item response models. Symposium presented at the meeting of APA, Washington, DC, 1986.


Item response theory. Symposium presented at the meeting of AERA, Washington, DC, 1987. Multidimensional item response models: Models and data. Symposium presented at the meeting of AERA, Washington, DC, 1987. Research on differential item functioning. Papers presented at the meeting of NCME, New Orleans, 1988. Customization of a national standardized achievement test. Papers presented at the meeting of NCME, New Orleans, 1988.

Assessing dimensionality of test data. Papers presented at the meeting of AERA, New Orleans, 1988. Techniques for detecting differential item performance. Papers presented at the meeting of AERA, New Orleans, 1988. Criterion-referenced passing points: New applications, adjustments, and alternatives. Papers presented at the meeting of AERA, New Orleans, 1988. Frontiers of assessment in the teaching profession. Papers presented at the meeting of AERA, New Orleans, 1988. Personnel evaluation standards. Symposium presented at the meeting of AERA, San Francisco, 1989. Setting standards of performance. Papers presented at the meeting of NCME, San Francisco, 1989. Assessing the utility of IRT models. Papers presented at the meeting of NCME, Boston, 1990. Strong modeling approaches to problems in measuring learning and change. Symposium presented at the meeting of NCME, Boston, 1990. Research design methodology. Papers presented at the NEERO meeting, Rockport, Maine, 1990. Methodological and practical issues in the normative application of criterion-referenced assessments. Papers presented at the meeting of NCME, Chicago, 1991. Data-based development of licensure tests for teachers. Papers presented at the meeting of NCME, Chicago, 1991. Application of performance-based assessment for a whole literacy program. Symposium presented at the meeting of AERA, San Francisco, 1992. Multidimensional IRT models. Papers presented at the meeting of AERA, Atlanta, 1993.


Equating computer adaptive and paper-and-pencil tests: experiences and lessons learned. Symposium presented at the meeting of AERA, San Francisco, 1995. Applied dimensionality. Symposium presented at the meeting of NCME, San Francisco, 1995. Assessment in Kentucky: Things are going quite nicely, thank you. Symposium presented at the meeting of NCME, San Francisco, 1995. Content validity: An important construct in measurement. Symposium presented at the meeting of NCME, San Francisco, 1995. CATucopia: Measurement issues faced by a large-scale computer adaptive testing program. Symposium presented at the meeting of NCME, New York, April, 1996. Perspectives on reporting scaling results to students and teachers. Symposium presented at the meeting of NCME, New York, April, 1996. Validity considerations for automated scoring of open-ended responses. Symposium presented at the meeting of NCME, Chicago, 1997. The 1997 USMLE Step 1 CBT field-test: Examinee performance, perceptions and pacing. Symposium presented at the meeting of the NCME, San Diego, 1998. Linking complex performance-based assessments: A comparison of novel procedures. Symposium presented at the meeting of the AERA, San Diego, 1998. Test-taker rights and responsibilities: Issues and perspectives. Symposium presented at the meeting of the American Psychological Association, San Francisco, 1998. An international perspective on the development of test standards. Invited symposium at the meeting of the 24th International Congress of Applied Psychology, San Francisco, August, 1998. Methodological advances in test adaptations for cross-cultural and cross-lingual assessment. Invited symposium at the meeting of the 24th International Congress of Applied Psychology, San Francisco, August, 1998. Translations DIF research: Advances and applications. Symposium presented at the meeting of NCME, Montreal, April, 1999. Latent trait and latent class modeling.
Symposium presented at the meeting of the AERA, Montreal, April, 1999. What have we learned about the test accommodation strategies for English language learners? Symposium presented at the meeting of the NCME, Montreal, April, 1999. Understanding fairness in a CAT environment. Symposium presented at the meeting of NCME, Montreal, April, 1999.


Issues in grading essays and passages. Symposium presented at the AERA meeting, New Orleans, April, 2000. Advances in automated scoring of performance assessments. Symposium presented at the NCME meeting, New Orleans, April, 2000. A comparison of methods for setting standards on NAEP. Symposium presented at the CCSSO Large-Scale Assessment Conference, Snowbird, Utah, June, 2000. Technical issues in item response theory. Paper presentation session at the meeting of the AERA, Seattle, April, 2001. Advances in test adaptation methodology. Symposium presented at the meeting of NCME, New Orleans, April, 2002. Advances in measurement: Improving measurement by using IRT and MCMC methods. Paper presentation session at the meeting of NCME, New Orleans, 2002. School assessment and evaluation. Submitted paper session at the meeting of AERA, Chicago, 2003. International perspectives: Issues of achievement and reform. Submitted papers session at the meeting of AERA, Chicago, 2003. Making test results more useful and understandable. Invited symposium at the meeting of NCME, Chicago, 2003. Science and mathematics in an international perspective. Submitted papers session at the meeting of AERA, San Diego, 2004. Standard setting methods: Studying sources of complexity. Invited symposium at the meeting of NCME, Montreal, 2005. Test translation methodology: New approaches, practical examples. Symposium presented at the 9th European Congress of Psychology, Granada, Spain, 2005. Methodological developments in international educational research: Experiences from the OECD PISA study. Symposium presented at the meeting of the AERA, San Francisco, 2006. Administration mode effects in computer-based large-scale assessments. Symposium presented at the meeting of the AERA, San Francisco, 2006. Topics in IRT modeling. Submitted papers session at the meeting of NCME, San Francisco, 2006. Response-time modeling and applications.
Discussant for this invited presentation at the meeting of NCME, San Francisco, April, 2006.


Designing accessible large-scale reading assessments for students with disabilities: Research and practice. Discussant for this session at the meeting of the CCSSO, San Francisco, June, 2006. Setting performance standards under NCLB: Approaches, issues, and implications. Discussant for this session at the meeting of the CCSSO, San Francisco, June, 2006. Is your definition of proficiency limited by the standard setting method you use? Discussant for this session at the meeting of the CCSSO, San Francisco, June, 2006. Theoretical and practical aspects of vertically-articulated standards. Discussant for this session at the meeting of the CCSSO, San Francisco, June, 2006. Exploration of personality across 19 countries. Discussant for this session at the 5th International Test Commission Conference on Test Adaptation, Brussels, July, 2006. Psychometric lessons learned in a large-scale medical licensure performance assessment. Discussant for this invited session at the meeting of NCME, Chicago, 2007. Standard-setters: Stand up and take a stand. Discussant for this invited session at the meeting of NCME, Chicago, 2007. Comparability of adapted versions of multilingual tests: Implications of incomparability on score interpretations in international assessments. Discussant for this session at the meeting of NCME, Chicago, 2007. Innovations in standard setting. Discussant for this session at the meeting of NCME, Chicago, 2007. Making NAEP scores more meaningful. Panel member for this session at the NSSC 2008 Winter Assessment Literacy Workshop, Washington. The role of user-centered design in building better assessments. Discussant for this session at the meeting of AERA, New York, 2008. The big challenges and research opportunities in testing and measurement. Discussant and chairperson for this session at the meeting of AERA, New York, 2008. Dissecting the bookmark standard setting procedure. Discussant for this session at the meeting of NCME, New York, 2008. 
Technical advances in international assessments such as TIMSS and PISA. Discussant for this session at the meeting of NCME, New York, 2008.


Recent Activities (Since September, 2007)

STUDIES IN PROGRESS/NEW COMPLETED STUDIES:

In Preparation

Hambleton, R. K. (in preparation). National Assessment of Educational Progress. In C. C. Clauss-Ehlers (Ed.), Encyclopedia. Heidelberg, Germany: Springer.
Hambleton, R. K. (in preparation). Five big challenges for educational and psychological assessment. Measurement: Interdisciplinary Research and Perspectives. (invited)
Hambleton, R. K., Plake, B. S., & Mills, C. N. (in preparation). Handbook on setting performance standards.
Hambleton, R. K., & Swaminathan, H. (in preparation). Item response theory: Principles and applications (2nd ed.). Boston, MA: Kluwer Academic Publishers.
Hambleton, R. K., & van der Linden, W. J. (in preparation). Polytomous response IRT models: Brief history of model building advances. In M. Nering & R. Ostini (Eds.), Development and applications of polytomous item response theory models. Mahwah, NJ: Lawrence Erlbaum Associates, Inc., Publishers.
Hambleton, R. K., & Zenisky, A. (in preparation). Adapting tests for cross-cultural assessment. In D. Matsumoto & F. van de Vijver (Eds.), Cross-cultural research methods. Oxford, England: Oxford University Press.
Hambleton, R. K., & Zenisky, A. (in preparation). Improving score reporting practices. CLEAR.
Hambleton, R. K., Zumbo, B., & Sireci, S. G. (in preparation). Psychometric methods and practices. Mahwah, NJ: Erlbaum Publishers.
Jette, A. M., McDonough, C. M., Haley, S. M., Ni, P., Olarsch, S., Latham, N., Hambleton, R. K., Felson, D., Kim, Y. J., & Hunter, D. (in press). A computer-adaptive disability instrument for lower extremity osteoarthritis research demonstrated promising breadth, precision, and reliability. Journal of Clinical Epidemiology.
Jette, A. M., McDonough, C. M., Ni, P., Haley, S. M., Hambleton, R. K., Olarsch, S., Hunter, D., Kin, Y., & Felson, D. (in review). A functional difficulty and functional pain instrument for lower extremity.
Lyrén, P.-E., & Hambleton, R. K. (in preparation). Systematic equating error with randomly-equivalent groups designs: An examination of the equal ability distribution assumption.
Ni, P., Haley, S. M., Hambleton, R. K., & Jette, A. M. (in preparation). IRT model selection using Markov Chain Monte Carlo estimation in a functional difficulty item bank for persons with osteoarthritis.

In Press

Byrne, B. M., Oakland, T., Leong, F. T. L., van de Vijver, F. J. R., Hambleton, R. K., Cheung, F. M., & Bartram, D. (in press). A critical analysis of cross-cultural research and testing practices: Implications for improved education and training in psychology. Training and Education in Professional Psychology.
Gregoire, J., & Hambleton, R. K. (Eds.). (in press). Advances in test adaptation research [Special Issue]. International Journal of Testing.
Haley, S. M., Fragala-Pinkham, M. A., Dumas, H. M., Ni, P., Gorton, G., Watson, K., Montpetit, K., Bilodeau, N., Hambleton, R. K., & Tucker, C. A. (in press). Evaluation of an item bank for a computerized adaptive test of activity in children with cerebral palsy. Physical Therapy.
Haley, S. M., Ni, P., Dumas, H. M., Fragala-Pinkham, M. A., Hambleton, R. K., Montpetit, K., Bilodeau, N., Gorton, G. E., Watson, K., & Tucker, C. A. (in press). Measuring global physical health in children with cerebral palsy: Illustration of a multidimensional bifactor model and computerized adaptive testing. Quality of Life Research.
Hambleton, R. K. (in press). Criterion-referenced testing. In E. Anderman (Ed.), Psychology of classroom learning: An encyclopedia. Detroit: Macmillan Reference.
Hambleton, R. K., Sireci, S. G., & Smith, Z. R. (in press). How do other countries measure up to the mathematics achievement levels on the National Assessment of Educational Progress? Applied Measurement in Education.
Han, N., & Hambleton, R. K. (in press). Using moving averages to detect exposed test items in computer-based testing. In S. Sawilowsky (Ed.), Real data analysis. Greenwich, CT: Information Age Publishers.
Tucker, C., Gorton, G., Watson, K., Fragala-Pinkham, M., Dumas, H., Montpetit, K., Bilodeau, N., Ni, P., Hambleton, R., & Haley, S. (in press). Development of a parent-report computer adaptive test to assess physical functioning in children with cerebral palsy lower extremity and mobility skills. Developmental Medicine & Child Neurology.
Tucker, C., Montpetit, K., Bilodeau, N., Dumas, H., Fragala-Pinkham, M., Watson, K., Gorton, G., Ni, P., Hambleton, R., Mulcahey, M., & Haley, S. (in press). Development of a parent-report computer adaptive test to assess physical functioning in children with cerebral palsy II. Developmental Medicine & Child Neurology.
van de Vijver, F. J. R., & Hambleton, R. K. (in press). Adapting educational tests for multicultural assessment. Educational Measurement: Issues and Practice.
Wells, C. S., Baldwin, S., Hambleton, R. K., Sireci, S. G., Karatonis, A., & Jirka, S. (in press). Evaluating score equity assessment for state NAEP. Applied Measurement in Education.
Zenisky, A., Hambleton, R. K., & Luecht, R. (in press). Multi-stage testing. In W. J. van der Linden & C. Glas (Eds.), Computerized adaptive testing. New York: Springer.

Zenisky, A., Hambleton, R. K., & Sireci, S. G. (in press). Getting the message out: An evaluation of NAEP score reporting practices with implications for disseminating test results. Applied Measurement in Education.

Completed

Hambleton, R. K. (2008). Criterion-referenced tests--norm-referenced tests. In G. McCulloch & D. Crook (Eds.), International Encyclopedia of Education. London: Routledge.
Hambleton, R. K. (2008). Measurement specialists look to the future. NCME Newsletter, 16(2), 2-3.
Hambleton, R. K., & Sireci, S. (2008). Development and validation of enhanced SAT score scales using item mapping and performance category descriptions (Final Report). New York: College Board.
Han, N., & Hambleton, R. K. (2008). Detecting the unintended exposure of test items in operational testing programs. In C. L. Wild & R. Ramaswamy (Eds.), Improving testing: Applying quality tools and techniques (pp. 323-348). Mahwah, NJ: Lawrence Erlbaum Associates, Inc., Publishers.
Keller, L. A., Hambleton, R. K., Parker, P., & Copella, J. (2008). MCAS equating research: An investigation of FCIP-1, FCIP-2, and Stocking and Lord equating methods (Center for Educational Assessment Research Report No. 690). Amherst, MA: University of Massachusetts, Center for Educational Assessment.
Liang, T., Han, K., & Hambleton, R. K. (2008). User's guide for ResidPlots-2: Computer software for IRT graphical residual analyses, Version 2.0 (Center for Educational Assessment Research Report No. 688). Amherst, MA: University of Massachusetts, Center for Educational Assessment.

Lyrén, P.-E., & Hambleton, R. K. (2008). Systematic equating error with the randomly-equivalent groups design: An examination of the equal ability distribution assumption (EM Report No. 61). Umeå, Sweden: Umeå University, Department of Educational Measurement.
Monahan, P. O., Stump, T. E., Finch, H., & Hambleton, R. K. (2007). Bias of exploratory and cross-validated DETECT index under null hypothesis of unidimensionality. Applied Psychological Measurement, 31(6), 483-503.
Reeve, B. B., Hays, R. D., Bjorner, J. B., Cook, K. F., Crane, P. K., Teresi, J., Thissen, D., Revicki, D. A., Weiss, D. J., Hambleton, R. K., & others. (2007). Psychometric evaluation and calibration of health-related quality of life item banks. Medical Care, 45(5), 22-31.
Sireci, S. G., & Hambleton, R. K. (2009). Mission--Protect the public: Licensure and certification testing in the 21st century. In R. P. Phelps (Ed.), Correcting fallacies about educational and psychological testing (pp. 199-218). Washington, DC: American Psychological Association.

Swaminathan, H., Hambleton, R. K., & Rogers, H. J. (2007). Assessing the fit of item response theory models. In C. R. Rao & S. Sinharay (Eds.), Handbooks of statistics: Psychometrics (Volume 27; pp. 683-718). Amsterdam: North Holland.

PAPERS PRESENTED/TO BE PRESENTED AT PROFESSIONAL MEETINGS:

Deng, N., & Hambleton, R. K. (2008, March). Assessment dimensionality of multi-stage tests. Paper presented at the meeting of NCME, New York.
Deng, N., Wells, C. S., & Hambleton, R. K. (2008, October). A confirmatory factor analytic study examining the dimensionality of an educational achievement test. A paper presented at the meeting of the NERA, Hartford. (Published in the NERA Proceedings, 2008.)
Elosua, P., & Hambleton, R. K. (2008, July). DIF detection methods and consequences. Presentation at the 6th Conference of the International Test Commission, Liverpool, England.
Elosua, P., & Hambleton, R. K. (2008, July). Test score comparability across language and cultural groups in the presence of item bias. An invited paper presented at the Third European Congress of Methodology, Oviedo, Spain.
Hambleton, R. K. (2007, February). Methods and guidelines for translating and adapting educational and psychological tests into multiple languages and cultures. An invited presentation at the 2007 ATP Innovations in Testing Conference, Palm Springs, CA.
Hambleton, R. K. (2007, June). A new challenge: Making test scores more understandable and useful. A presentation at the annual CCSSO meeting, Nashville.
Hambleton, R. K. (2007, June). Making diagnostic score reports more clear and meaningful for users. A presentation at the annual CCSSO meeting, Nashville.
Hambleton, R. K. (2007, July). What are the psychometric skills needed in cross-cultural psychology today? Invited presentation at the meeting of the 10th European Congress of Psychology, Prague.
Hambleton, R. K. (2007, July). International Test Commission guidelines for adapting educational and psychological tests. Invited presentation at the meeting of the 10th European Congress of Psychology, Prague.
Hambleton, R. K. (2007, August). Major challenges for educational and psychological testing practices. Invited presentation at the National Authority for Measurement and Evaluation in Education Conference, Jerusalem, Israel.
Hambleton, R. K. (2007, October). Cross-cultural instrument translation and instrumentation. An invited presentation at the Cooper Institute Diversity in Physical Activity and Health: Measurement and Research Issues and Challenges Conference, Dallas, TX.
Hambleton, R. K. (2008, January). On-going challenge for NAEP: Making score reports understandable and useful. Keynote address at the NSSC 2008 Winter Assessment Literacy Workshop, Washington.


Hambleton, R. K. (2008, March). A non-technical introduction to item response theory for credentialing exams and achievement tests. An invited presentation at the ATP Innovations in Testing Conference, Dallas, Texas.
Hambleton, R. K. (2008, March). Reporting candidate scores in more understandable and meaningful ways: A review of the recent literature and promising research. An invited presentation at the ATP Innovations in Testing Conference, Dallas, Texas.
Hambleton, R. K. (2008, March). Comparative perspectives on classical psychometrics and item response theory. Invited presentation at the meeting of AERA, New York.
Hambleton, R. K. (2008, March). Guidelines for translating and adapting educational and psychological tests. Paper presented at the meeting of AERA, New York.
Hambleton, R. K. (2008, June). CAT--from an educational testing perspective. A presentation at the Promis Psychometric Summit-2, Northwestern University, Evanston.
Hambleton, R. K. (2008, July). The next great challenges for psychological and educational measurement. Keynote address delivered at the Third European Congress of Methodology, Oviedo, Spain.
Hambleton, R. K. (2008, July). The International Test Commission Guidelines for Adapting Tests, 2nd edition: A progress report. Invited presentation at the 29th International Congress of Psychology, Berlin.
Hambleton, R. K. (2008, September). A personal history of computer-adaptive testing. An invited address at the International Conference on Outcomes Measurement, Bethesda, MD.
Hambleton, R. K. (2009, February). Problems to overcome in globalizing testing. A keynote address at the Association of Test Publishers Conference, Palm Springs, CA.
Hambleton, R. K. (2009, February). Predicting future directions for testing. Invited presentation at the Association of Test Publishers Conference, Palm Springs, CA.
Hambleton, R. K., Deng, N., & Lozano, L. (2009, February). Customized test score norms using item response theory: A new example. Paper presented at the Association of Test Publishers Conference, Palm Springs, CA.
Hambleton, R. K., & Han, N. (2008, July). Detecting exposed test items in a computerized adaptive testing environment. Paper presented at the 6th Conference of the International Test Commission, Liverpool, England.
Hambleton, R. K., & Han, N. (2008, July). Catching exposed test items with IRT-based statistics in computer-based testing. Paper presented at the 29th International Congress of Psychology, Berlin.
Hambleton, R. K., & Lozano, L. (2008, July). Customized test score norms with item response theory. A presentation at the 6th Conference of the International Test Commission, Liverpool, England.


Hambleton, R. K., Sireci, S., & Smith, Z. (2008, March). Are the NAEP achievement levels in mathematics set too high? Paper presented at the meeting of NCME, New York.
Hambleton, R. K., & Wells, C. (2008, July). Using IRT models to construct tests and equate and report scores. A workshop at the 6th Conference of the International Test Commission, Liverpool, England.
Hambleton, R. K., & Zenisky, A. (2008, July). A key for valid uses of tests: Making test score reports more understandable and user-friendly. Keynote address presented at the 6th Conference of the International Test Commission, Liverpool, England.
Hambleton, R. K., & Zenisky, A. (2008, October). Reporting test scores in more meaningful ways: Some new findings, research methods, and guidelines for score report design. A presentation at the NERA meeting, Hartford.
Lozano, L., & Hambleton, R. K. (2008, July). Constructing and evaluating customized test score norms. An invited paper presented at the Third European Congress of Methodology, Oviedo, Spain.
Lyrén, P.-E., & Hambleton, R. K. (2007, April). Systematic equating error with randomly-equivalent groups designs: An examination of the equal ability distribution assumption. Paper presented at the meeting of NCME, Chicago.
Meng, Y., Wells, C. S., & Hambleton, R. K. (2008, October). A comparison of methods for handling missing data when assessing dimensionality via linear factor analysis. Paper presented at the meeting of NERA, Hartford.
Ni, P., Jette, A. M., Haley, S. M., & Hambleton, R. K. (2008, March). IRT model selection using Markov Chain Monte Carlo estimation in a physical functioning item bank. Paper presented at the Patient-Reported Outcomes Measurement Information System meeting, Washington.
Pitoniak, M., & Hambleton, R. K. (2007, April). Setting performance standards. Paper presented at the meeting of NCME, Chicago.
Sireci, S., & Hambleton, R. K. (2008, July). Communicating results of comparisons of international assessments to NAEP. A paper presented at the 6th International Test Commission Conference, Liverpool, England.
Sireci, S., Hambleton, R. K., & Huff, K. (2008, July). Enhancing the meaningfulness of score scales using item response theory. An invited paper presented at the Third European Congress of Methodology, Oviedo, Spain.
Wells, C. S., Hambleton, R. K., & Liang, T. (2008, July). A nonparametric approach for investigating model fit in item response theory. An invited paper presented at the Third European Congress of Methodology, Oviedo, Spain.
Yoo, H., & Hambleton, R. K. (2008, October). Item exposure control for computerized-adaptive testing: A review of methods. Paper presented at the meeting of NERA, Hartford.
Zenisky, A., Hambleton, R. K., & Sireci, S. (2008, July). Communicating the utility of NAEP score reports. A paper presented at the 6th International Test Commission Conference, Liverpool, England.
Zhao, Y., & Hambleton, R. K. (2008, October). Graphical approaches for assessing differential item functioning in polytomously-scored items. Paper presented at the meeting of the NERA, Hartford.

Current Version: March 18, 2009

