2009 DOF Manual 1 PDF

Manual for the ASEBA
Direct Observation Form
Stephanie H. McConaughy
& Thomas M. Achenbach
al for the ASEBA Direct Observation Form McConaughy & Achenbach
the ASEBA Direct Observation Form McConaughy & Achenbach
ibrary of Congress Control Number: xxxxxxx

ISBN 978-1-932975-12-3
Manual for the ASEBA
Direct Observation Form
Stephanie H. McConaughy, University of Vermont

& Thomas M. Achenbach, University of Vermont
Ordering Information
This Manual and other ASEBA materials can be ordered from:
ASEBA Fax: 802-656-5131
1 South Prospect Street E-mail: mail@ASEBA.org
Burlington, VT 05401-3456 Web: www.ASEBA.org
Proper bibliographic citation for this Manual:
McConaughy, S. H., & Achenbach, T. M. (2009). Manual for the ASEBA Direct Observation Form. Burlington,
VT: University of Vermont, Research Center for Children, Youth, & Families.
Related Books
Achenbach, T.M., (2009). The Achenbach System of Empirically Based Assessment (ASEBA): Devel-
opment, Findings, Theory, and Applications. Burlington, VT: University of Vermont, Research Center
for Children, Youth, & Families.
Achenbach, T.M., & McConaughy, S.H. (2009). School-Based Practitioners’ Guide for the Achenbach Sys-
tem of Empirically Based Assessment (ASEBA) (6th ed.). Burlington, VT: University of Vermont, Research
Center for Children, Youth, & Families.
Achenbach, T.M., Pecora, P.J., & Wetherbee, K.M. (2009). Child and Family Service Workers’ Guide for the
Achenbach System of Empirically Based Assessment (ASEBA) (6th ed.). Burlington, VT: University of Ver-
mont, Research Center for Children, Youth, & Families.
Achenbach, T.M., & Rescorla, L.A. (2001). Manual for the ASEBA School-Age Forms & Profiles. Burlington,
VT: University of Vermont, Research Center for Children, Youth, & Families.
Achenbach, T.M., & Rescorla, L.A. (2009). Mental Health Practitioners’ Guide for the Achenbach System of
Empirically Based Assessment (ASEBA) (6th ed.). Burlington, VT: University of Vermont, Research Center
for Children, Youth, & Families.
Achenbach, T.M., & Rescorla, L.A. (2007). Multicultural Guide for the ASEBA School-Age Forms & Pro-
files. Burlington, VT: University of Vermont, Research Center for Children, Youth, and Families.
Achenbach, T.M., & Rescorla, L.A. (2007). Multicultural Understanding of Child and Adolescent Psychopa-
thology: Implications for Mental Health Assessment. New York: Guilford Press.
Achenbach, T.M., & Ruffle, T.M. (2007). Medical Practitioners’ Guide for the Achenbach System of Empiri-
cally Based Assessment (ASEBA) (5th ed.). Burlington, VT: University of Vermont, Research Center for Chil-
dren, Youth, & Families.
McConaughy, S.H. (2005). Clinical Interviews for Children and Adolescents: Assessment to Intervention.
New York: Guilford Press.
McConaughy, S.H., & Achenbach, T.M. (2001). Manual for the Semistructured Clinical Interview for Chil-
dren and Adolescents (2nd ed.). Burlington, VT: University of Vermont, Research Center for Children, Youth,
& Families.
McConaughy, S.H., & Achenbach, T.M. (2004). Manual for the Test Observation Form for Ages 2-18.
Burlington, VT: University of Vermont, Research Center for Children, Youth, & Families.
Copyright 2009 S.H. McConaughy & T.M. Achenbach. All rights reserved.
Unauthorized reproduction prohibited by law.
ISBN 978-1-932975-12-3 Library of Congress xxxxxxxxxxx
Printed in the United States of America 14 13 12 11 10 9 8 7 6 5 4 3 2 1
405
ii
User Qualifications
The Direct Observation Form (DOF) is designed a thorough knowledge of the procedures and cau-
for rating observations of 6-11-year-old children in tions presented in this Manual.
school classrooms, at recess, and in other group set-
Our standards for use are consistent with the Stan-
tings. Observers should have some knowledge of
dards for Educational and Psychological Testing
child behavior and development and of the method-
(1999) prepared and endorsed by the American Edu-
ology of behavioral assessment. Observers may be
cational Research Association (AERA), American
paraprofessionals, such as teachers’ aides, under-
Psychological Association (APA), and National
graduate or graduate students, and research assistants,
Council on Measurement in Education (NCME) and
as well as professionals in education, school psychol-
with the Code of Fair Testing Practices in Educa-
ogy, clinical psychology, and related disciplines.
tion (2004) prepared by the Joint Committee on Test-
Paraprofessionals and students should use the DOF
ing Practices. Users are expected to adhere to the
under the supervision of a qualified professional who
ethical principles of their professional organizations,
has knowledge of the theory and methodology of
such as the American Psychological Association and
standardized assessment.
National Association of School Psychologists.
To make proper interpretations of the DOF, the
The DOF is part of the Achenbach System of Em-
data should be scored on the DOF Profile. The
pirically Based Assessment (ASEBA). Users should
ASEBA ADM software provides instructions for
understand that ASEBA instruments are designed to
computer-scoring the DOF Profile. Interpretation of
provide standardized descriptions of an individual’s
the DOF Profile usually requires training in standard-
functioning. The DOF should not be the sole basis
ized assessment commensurate with at least a
for making diagnoses or other important decisions
Master’s degree in psychology, school psychology,
about children and adolescents. No scores on the
social work, special education, counseling, or a com-
DOF scales should be automatically equated with a
parable field. Trainees, observers, and data process-
particular diagnosis or disorder. Instead, the respon-
ing personnel may also use the computer software to
sible user will compare data obtained from the DOF
score the DOF Profile under the supervision of a
with data from other sources, such as parent reports,
qualified professional. No amount of prior training,
teacher reports, child interviews, and observations
however, can substitute for professional maturity and
during test sessions.
iii
Preface
The Direct Observation Form (DOF) is part of Arnold, Rachel Bérubé, Sarah Cochran, Levent
the Achenbach System of Empirically Based As- Dumenci, Anne Ellis, Patricia Fletcher, Masha
sessment (ASEBA). This Manual provides basic Ivanova, David Jacobowitz, Ramani Sunderaju,
information needed for understanding and using and Dan Walter. We are also grateful to the many
the DOF. It also provides instructions for complet- people who assisted in our data collection and data
ing and scoring the DOF and guidelines for train- management, including Lori Turner at the Univer-
ing DOF observers, plus information on develop- sity of Vermont Research Center for Children,
ment of the DOF, research on reliability and valid- Youth, and Families (RCCYF); Ricardo Eiraldi,
ity, and practical applications with case illustra- Thomas Power, and the staff of the Children’s
tions of how to integrate DOF results with other Hospital of Philadelphia; Kevin Antshel, Michael
assessment information. The DOF can be used to Gordon, and the staff of the Department of Psy-
rate and score multiple 10-minute observations of chiatry at SUNY Upstate Medical University; and
children’s behavioral and emotional problems in Robert Volpe of Northeastern University, who
school classrooms, at recess, and in other group served as a Postdoctoral Fellow at the RCCYF. We
settings. The DOF includes 89 problem items to are also grateful to the many psychology and school
be rated on a 4-point scale, plus on-task ratings for psychology graduate students who acted as observ-
each 10-minute observation session. The DOF Pro- ers, as well as the children, families, and school
file comprises empirically based scales and DSM- staff who cooperated in our research. We have ap-
oriented scales normed separately for classroom preciated the advice of our colleagues James
and recess observations for boys and girls ages 6 Hudziak, Cynthia LaRiviere, Leslie Rescorla,
to 11. James Tallmadge, and Robert Volpe regarding our
observational procedures. We are also grateful to
In developing the DOF over more than 20 years,
the University of Vermont Research Center for
we have benefited from the help and advice of
Children, Youth, and Families, Spencer Founda-
many colleagues. For their assistance with this
tion, W. T. Grant Foundation, National Institute of
Manual, we are particularly grateful to Janet
Child Health and Human Development, National
Institute on Disability and Rehabilitation Research
(U.S. Department of Education), and National In-
stitute of Mental Health for support of research that
has contributed to this effort.
iv
Reader’s Guide
I. Introductory Material Needed by Most Readers
A. Introduction and Rationale for the Direct Observation Form (DOF) ........... Chapter 1
B. Using the DOF and Rating the DOF Items .................................................. Chapter 2
C. Computer-Scored DOF Profile .................................................................... Chapter 3
D. Training DOF Observers and Conducting School Observations ................. Chapter 4
E. Practical Applications and Case Examples .................................................. Chapter 5
II. Constructing the DOF and DOF Profile .......................................................... Chapter 6
III. Statistical Data on Reliability and Validity

A. Reliability of the DOF .................................................................................. Chapter 7
B. Validity of the DOF ..................................................................................... Chapter 8
IV. Answers to Frequently Asked Questions ......................................................... Chapter 9
V. Mean DOF Scale Scores for Normative Samples of

Boys & Girls Ages 6-11 .................................................................................. Appendix A
VI. Mean DOF Scale Scores for Match Referred Children

and Nonreferred Controls Boys 6-11 and Girls 6-11 .................................. Appendix B
VII. Pearson Correlations Among Raw Scores for DOF Scales ......................... Appendix C
VIII. Items Comprising the 2009 DOF and the 1986 DOF ................................... Appendix D
v
Contents
1. Introduction and Rationale for the Direct Observation Form (DOF) ....................................................... 1
ADVANTAGES OF DIRECT OBSERVATIONS ........................................................................................... 1
MULTIAXIAL ASSESSMENT ...................................................................................................................... 2
STRUCTURE OF THIS MANUAL ............................................................................................................... 3
SUMMARY .................................................................................................................................................... 4
2. Using the DOF and Rating the DOF Items ................................................................................................... 5

COMPLETING PAGE 1 INFORMATION ..................................................................................................... 5
WRITING OBSERVATION NOTES ............................................................................................................ 12
RATING ON-TASK BEHAVIOR ................................................................................................................. 12
RATING DOF PROBLEM ITEMS .............................................................................................................. 13
GUIDELINES FOR RATING SPECIFIC DOF PROBLEM ITEMS ........................................................... 15
SUMMARY .................................................................................................................................................. 22
3. Computer-Scored DOF Profile ..................................................................................................................... 23

DOF PROFILE FOR CLASSROOM OBSERVATIONS ............................................................................. 23
DOF PROFILE FOR RECESS OBSERVATIONS ....................................................................................... 33
SUMMARY .................................................................................................................................................. 38
4. Training DOF Observers and Conducting School Observations .............................................................. 41

TRAINING DOF OBSERVERS ................................................................................................................... 41
GUIDELINES FOR OBSERVATIONS IN SCHOOLS ................................................................................ 42
ASSESSING INTER-OBSERVER AGREEMENT ..................................................................................... 46
ASSESSING INTER-RATER RELIABILITY ............................................................................................. 51
SUMMARY .................................................................................................................................................. 55
5. Practical Applications and Case Examples ................................................................................................. 56

SEQUENCE FOR USING THE DOF AND OTHER ASEBA FORMS ...................................................... 56
SCHOOL-BASED ASSESSMENTS............................................................................................................ 59
ASSESSMENT OF ADHD .......................................................................................................................... 61
ASSESSMENT OF EMOTIONAL DISTURBANCE ................................................................................. 61
ASSESSMENT OF LEARNING DISABILITIES ....................................................................................... 62
CASE EXAMPLE OF ASSESSMENT OF ADHD...................................................................................... 65
CASE EXAMPLE OF A SCHOOL-BASED ASSESSMENT OF BEHAVIOR PROBLEMS .................... 67
SUMMARY .................................................................................................................................................. 70
6. Constructing the DOF and DOF Profile...................................................................................................... 71

EARLIER VERSIONS OF THE DOF .......................................................................................................... 71
PSYCHOMETRIC APPROACH TO THE 2009 DOF ................................................................................. 73
STATISTICAL DERIVATION OF DOF SYNDROMES FOR CLASSROOM OBSERVATIONS ............. 73
LOW FREQUENCY ITEMS RETAINED ON THE DOF ........................................................................... 79
AGGRESSIVE BEHAVIOR SYNDROME FOR RECESS OBSERVATIONS ........................................... 79
DSM-ORIENTED ATTENTION DEFICIT/HYPERACTIVITY PROBLEMS AND INATTENTION
AND HYPERACTIVITY-IMPULSIVITY SUBSCALES ........................................................................ 81
NORMATIVE SAMPLE .............................................................................................................................. 82
vi
Contents vii
ASSIGNING NORMALIZED T SCORES TO RAW SCORES ................................................................... 82

MEAN T SCORES ........................................................................................................................................ 88
NORMAL, BORDERLINE, AND CLINICAL RANGES ........................................................................... 88
SUMMARY .................................................................................................................................................. 89
7. Reliability of the DOF ................................................................................................................................... 91

INTER-RATER RELIABILITY ................................................................................................................... 91
TEST-RETEST RELIABILITY .................................................................................................................... 93
INTERNAL CONSISTENCY ...................................................................................................................... 94
SUMMARY .................................................................................................................................................. 96
8. Validity of the DOF ....................................................................................................................................... 97

CONTENT VALIDITY OF DOF ITEMS ..................................................................................................... 97
CRITERION-RELATED VALIDITY ........................................................................................................... 98
SUMMARY ................................................................................................................................................ 107
9. Answers to Frequently Asked Questions ................................................................................................... 108

FEATURES OF THE DOF ......................................................................................................................... 108
APPLICATIONS OF THE DOF .................................................................................................................. 111
RELATIONS TO OTHER ASSESSMENT PROCEDURES ...................................................................... 111
RELATIONS TO DSM AND SPECIAL EDUCATION CLASSIFICATIONS ........................................... 112
References ......................................................................................................................................................... 114
APPENDIX A: Mean DOF Scale Scores for Normative Samples ................................................................ 118
APPENDIX B: Mean DOF Scale Scores for Matched Referred Children and Nonreferred Controls Boys
6-11 .............................................................................................................................................................. 119
APPENDIX B: Mean DOF Scale Scores for Matched Referred Children and Nonreferred Controls Girls
6-11 ............................................................................................................................................................. 120
APPENDIX C: Pearson Correlations among Raw Scores for DOF Scales for Class-
room Observations .................................................................................................................................... 121
APPENDIX D: Items Comprising the 2009 DOF and the 1986 DOF ........................................................ 122
Index ................................................................................................................................................................. 125

Chapter 1
Introduction and Rationale for the
Direct Observation Form (DOF)
The Direct Observation Form (DOF) is a stan- and percentiles for five syndrome scales derived
dardized form for rating observations of children’s from factor analyses of classroom observations,
behavior in classrooms, at recess, and in other plus a DSM-oriented Attention Deficit/Hyperac-
group settings. During a 10-minute period, the tivity Problems scale with Inattention and Hyper-
observer writes a narrative description of the child’s activity-Impulsivity subscales, and a Total Prob-
behavior, affect, and interactions in space provided lems score. The DSM-oriented scale and subscales
on the DOF. The observer also rates the child for include DOF problem items consistent with
being on-task or off-task for 5 seconds at the end symptom criteria for Attention Deficit/Hyperac-
of each 1-minute interval. At the end of the 10- tivity Disorder (ADHD), as defined in the Diag-
minute observation, the observer rates the child on nostic and Statistical Manual of Mental Disorders-
88 specific problem items using a 0-1-2-3 scale. Fourth Edition and Fourth Edition-Text Revision
Item 89 is open-ended for rating other problems (DSM-IV; DSM-IV-TR; American Psychiatric As-
not covered by items 1 through 88. sociation, 1994, 2000). The DOF also has an Ag-
gressive Behavior syndrome scale for scoring ob-
Because children’s behavior may vary consid-
servations during recess and in other non-classroom
erably from one occasion to another, the DOF com-
settings. The DOF Profile has separate norms for
puter-scoring program requires at least two obser-
boys and girls ages 6 to 11. Because of the com-
vations of the target or “identified” child. When-
plexity of averaging scores across multiple obser-
ever possible, we recommend 3 to 6 separate ob-
vation sessions, the DOF scales can only be scored
servations of behavior on at least two different days.
by computer. The DOF computer-scoring program
We also recommend obtaining separate observa-
also provides raw scores for each of the 89 prob-
tions in the morning and afternoon. Observers
lem items and a narrative report that summarizes a
should complete one DOF for each 10-minute ob-
child’s scores on each of the DOF scales.
servation. The DOF computer-scoring program will
then average ratings across observation sessions.
ADVANTAGES OF DIRECT
Because the significance of a child’s behavior OBSERVATIONS
depends partly on how it may deviate from the be-
havior of other children, we recommend observ- Direct observation of children’s behavior is a
ing one or two “control” children in the same set- classic assessment method used by clinical and
ting as the identified child. The control children school psychologists (Sattler & Hoge, 2006;
should be the same age and gender of the identi- Shapiro & Heick, 2004; Wilson & Reschly, 1996).
Numerous coding systems have been developed
fied child, but should be located as far as possible
from the identified child in the group setting. Ob-for scoring direct observations of children’s behav-
servers do not need to know the names of the con- ior in classrooms (Volpe, DiPerna, Hintze, &
trol children. Chapter 2 provides more detailed in-Shapiro, 2005) and playground settings (Leff &
Lakin, 2005). Systematic direct observations share
structions for observing identified and control chil-
dren. the following characteristics: (a) their goal is to
measure specific target behaviors; (b) the target be-
The DOF Profile provides raw scores, T scores haviors are defined in a manner that makes them
readily observable with a minimum of inference;
(c) the observations are conducted according to
1 standardized procedures; (d) the times and places
for observations are specified; and (e) the obser-
2 1. Introduction and Rationale
quantified and/or summarized in a standardized No one assessment method should serve as the
manner that does not vary from one observer to sole basis for evaluating children’s functioning or
another (Volpe et al., 2005). for making important decisions about children. In-
stead, responsible evaluators will compare data ob-
Many systems for coding observations focus on
tained from one source or method with data ob-
a limited set of target behaviors (e.g., academic
tained from other sources. We use the term, “mul-
engaged time, out-of-seat, physical aggression,
tiaxial assessment” to describe the process of gath-
verbal aggression) and rely on continuous record-
ering and integrating information across multiple
ing or time sampling methods. Continuous record-
data sources.
ing methods count the number of times a behavior
(or event) occurs within a given period or record To facilitate multiaxial assessment, we designed
the duration of time in which the behavior (or the DOF as a component of the Achenbach Sys-
event) was observed. Continuous recording is most tem of Empirically Based Assessment (ASEBA).
effective when behaviors have discreet beginnings The ASEBA comprises an integrated set of rating
and ends, low to moderate rates of occurrence, and forms for assessing competencies, adaptive func-
are present only briefly. Time sampling records the tioning, and problems in easy and cost-effective
presence or absence of target behaviors within short ways. The ASEBA forms most relevant for use with
specified time intervals. Time sampling is useful the DOF are the Child Behavior Checklist for Ages
when multiple simultaneous target behaviors 6 to 18 (CBCL/6-18; Achenbach & Rescorla,
hinder continuous recording or when samples of 2001), Teacher’s Report Form (TRF; Achenbach
behavior are observed across different settings. & Rescorla, 2001), Youth Self-Report (YSR;
Achenbach & Rescorla, 2001), Test Observation
The DOF, by contrast, is designed for rating di-
Form (TOF; McConaughy & Achenbach, 2004),
rect observations of multiple specific behaviors
and the Semistructured Clinical Interview for Chil-
over a specific interval (10 minutes). The observer
dren and Adolescents (SCICA; McConaughy &
writes a narrative running log of observations over
Achenbach, 2001). The ASEBA also includes
the 10-minute period, while also rating the child
forms for children 1½ to 5, adults ages 18 to 59,
as being on-task or off-task during the last 5 sec-
and older adults ages 60 to 90. ASEBA data for
onds of each 1-minute interval. At the end of the
ages 6 to 11 can be integrated with standardized
10-minute period, the observer rates the child on
test data, medical data, developmental history, and
each of 89 DOF problem items. The DOF has the
other information obtained from records and in-
following advantages: (a) it provides a structured
terviews, as outlined in Table 1-1. The multiaxial
and efficient method for rating observations of a
assessment model includes the following five axes:
broad range of specific types of problems; (b) in-
dividual problem items are grouped into empiri- Axis I. Parent Data. Standardized ratings of
cally based syndrome scales, a DSM-oriented children’s competencies and problems by par-
ADHP scale and subscales, and Total Problems; ents, using the CBCL/6-18, plus history of the
(c) norms provide a standard for judging the se- child’s development, problems, competencies, and
verity of problems by comparing an individual’s interests as reported by parents in interviews and
DOF scores to large samples of nonreferred chil- questionnaires.
dren of the same gender and age range; and (d)
Axis II. Teacher Data. Standardized ratings of
scores from DOFs for large samples can be tested
the child’s school performance and problems by
for reliability and validity as done for other stan-
teachers, using the TRF, plus history of the child’s
dardized rating scales.
school performance as reported by teachers on re-
port cards, comments in school records, and inter-
MULTIAXIAL ASSESSMENT views.
1. Introduction and Rationale 3
Table 1-1
Examples of Multiaxial Assessment Procedures for Ages 6 to 11
Axis I Axis II Axis III Axis IV Axis V

Parent Teacher Cognitive Physical Direct Assessment
Report Report Assessment Assessment of Child
CBCL/6-18a TRFb TOFc Height, weight DOFd
History School records Ability tests Medical exam SCICAe
Parent Caregiver Achievement tests Neurological YSRf (for age 11)

interview interview exam
Perceptual-motor tests Self-concept mea-
sures
Language tests Personality tests
a
CBCL/6-18 = Child Behavior Checklist/6-18 (Achenbach & Rescorla, 2001).
b
TRF = Teacher’s Report Form (Achenbach & Rescorla, 2001).
c
TOF = Test Observation Form (McConaughy & Achenbach, 2004).
d
DOF = Direct Observation Form (McConaughy & Achenbach, 2009).
e
SCICA = Semistructured Clinical Interview for Children and Adolescents (McConaughy & Achenbach,
2001).
f
YSR = Youth Self-Report (Achenbach & Rescorla, 2001).
Axis III. Cognitive Assessment. Ability tests, cal interviews, using the SCICA; standardized self-
such as the Cognitive Assessment System (CAS; ratings by 11-year-olds, using the YSR; self-con-
Naglieri & Das, 1997), Stanford-Binet Intelligence cept measures, personality tests, and other mea-
Scales-Fifth Edition (SB5; Roid, 2003), Wechsler sures for assessing behavioral and emotional func-
Intelligence Scale for Children-Fourth Edition tioning.
(WISC-IV, Wechsler, 2003), Woodcock-Johnson III
The model in Table 1-1 provides guidelines for
Tests of Cognitive Abilities (WJ III COG; Wood-
multiaxial assessment of 6-to -11-year-old children.
cock, McGrew, & Mather, 2001), and Kaufman As-
However, not all sources of data may be relevant
sessment Battery for Children (KABC; Kaufman
or available for every child. For example, self-rat-
& Kauf- man, 1983); achievement tests; tests of
ings may not be useful for children younger than
perceptual-motor skills; and speech and language
age 11 and children who cannot reflect on their
tests. The TOF can also be used by test examiners
own behavior. Parents’ reports are highly relevant,
to obtain standardized ratings of the child’s test
but may not be available from both parents if the
session behavior.
child lives with only one parent or a parent surro-
Axis IV. Physical Assessment. Height and gate. Teachers’ reports are usually relevant for
weight, physical and/or neurological abnormali- school children if one or more teachers are avail-
ties and disabilities, medical and medication his- able to provide them. Standardized ratings of be-
tory. havioral and emotional characteristics observed
during testing can add important information about
Axis V. Direct Assessment of the Child. Direct
a child’s reactions to structured assessment and can
observations in group settings, using the DOF; clini-
help examiners judge the validity of test scores.
4 1. Introduction and Rationale
Direct observations in classrooms or other group file. Chapter 6 provides background on earlier ver-
settings can be compared with parent and teacher sions of the DOF, development of the DOF item
reports and with test session observations. All rel- set, statistical analyses to derive the five DOF syn-
evant information from the five axes should be in- drome scales for classroom observations and the
tegrated into cohesive formulations of children’s Aggressive Behavior syndrome scale for recess
cognitive and behavioral/emotional functioning in observations, development of the DOF DSM-ori-
order to meet their needs. ented Attention Deficit/Hyperactivity Problems
scale and its Inattention and Hyperactivity-Impul-
STRUCTURE OF THIS MANUAL sivity subscales, assignment of T scores to raw
scores, and borderline and clinical cutpoints for
This Manual provides information for using and the DOF problem scales and On-task. Chapter 7
scoring the DOF, plus details of its development, presents data on reliability, while Chapter 8 pre-
standardization, and psychometric properties. User sents data on validity of the DOF. Chapter 9 an-
qualifications are presented on Page iii. In this swers frequently asked questions about the DOF
chapter, we discussed our rationale for developing and the general approach we have used to develop
the DOF within the context of a multiaxial assess- the DOF and its scoring profile.
ment model. Chapter 2 discusses how to use the
DOF, including how to record observations and rate Appendix A presents mean T scores and raw
the DOF problem items. Chapter 3 describes the scores, standard deviations, and standard errors for
computer-scored DOF Profile and its narrative re- DOF scale scores for the normative sample. Ap-
port. Chapter 4 provides guidelines for conduct- pendix B presents mean T scores, raw scores, and
ing school observations and training DOF observ- standard deviations for matched samples of referred
ers. Chapter 5 discusses practical applications of children and nonreferred controls. Appendix C dis-
the DOF for use in schools and mental health as- plays correlations among the DOF scale scores. Ap-
sessments. Case examples illustrate how the DOF pendix D shows the 89 items on the 2009 version
can be used to assess children’s problems and how of the DOF compared to the 97 items of the 1986
to integrate DOF results with data from other DOF.
sources.
SUMMARY
The remaining chapters present technical de-
tails of our research on the DOF and the DOF Pro- We designed the DOF as a standardized form
for rating direct observations of children’s behav-
ior in classrooms and other group settings. The
DOF Profile for classroom observations displays
five empirically based syndrome scales, a DSM-
oriented Attention Deficit/Hyperactivity Problems
scale and Inattention and Hyperactivity-Impulsiv-
ity subscales, plus Total Problems scores. The DOF
Profile for recess observations has an empirically
based Aggressive Behavior syndrome scale and
Total Problems score. The DOF scales are scored
on norms for boys and girls ages 6 to 11. As part of
the ASEBA, the DOF provides data that can be
easily compared to data obtained from parents,
teachers, youths’ self-ratings, test session obser-
vations, and observations during child clinical in-
terviews.
Chapter 2
Using the DOF and Rating the DOF Items
The 2009 edition of the DOF is a revision of recommend that separate DOFs also be completed
the 1986 version, as explained in detail in Chapter for one or two “control” children in the same set-
6. As shown in Figure 2-1, the first page of the ting as the identified child. Observers should ran-
DOF includes spaces to write demographic infor- domly select control children who are of the same
mation about the identified child and control chil- gender and age as the identified child, but who are
dren, the date and time of observations, and infor- located far enough away so as not to be interacting
mation about the observer and setting. Page 1 also with the identified child. Observers should com-
provides brief instructions for writing notes, rat- plete one DOF for each 10-minute observation of
ing On-task, and rating the DOF problem items each control child, as done for the identified child.
that are discussed in detail in this chapter. Page 2 Ideally, observers should complete one DOF for a
provides space for writing observation notes and control child observed just before the identified
rating the child’s on-task and off-task behavior at child and one DOF for a second control child ob-
the end of each 1-minute interval. Page 3 lists the served just after the identified child. We recom-
89 DOF problem items to be rated at the end of mend obtaining at least two separate 10-minute ob-
each 10-minute observation. Page 4 provides servations for each control child. Page 1 of the DOF
instructions for completing information at the top provides boxes for indicating whether each 10-
of Page 1. minute observation was done for the “Identified
Child,” “Control Child 1,” or “Control Child 2.”
Observers should complete one DOF for each
Chapter 4 discusses procedures for selecting con-
10-minute observation. As indicated in Chapter 1,
trol children.
the DOF computer-scoring program requires at
least two observations of the identified child. The DOF computer-scoring program, described
Whenever possible, we recommend obtaining 3 to in Chapter 3, automatically averages the observer’s
6 separate 10-minute observations of the identi- ratings for a minimum of two and a maximum of
fied child on at least two different days. To obtain six separate DOFs for the identified child for
a stable index of behavior, observations should all each set of observations. It also separately aver-
be conducted within a one- to two-week time ages ratings for 2 to 6 DOFs for each of the two
frame. We also recommend obtaining separate ob- control children. The computer-scored DOF Pro-
servations in the morning and afternoon across dif- file displays mean raw scores and corresponding T
ferent days to provide a broad sampling of the scores and percentiles for each DOF scale for the
child’s behavior. Some observers may choose to identified child, along with mean raw scores, T
obtain several sets of observations over longer time scores, and percentiles for ratings averaged across
frames for purposes such as progress monitoring, two control children. Chapter 3 provides details of
assessing the stability of observed behaviors, and the computer-scored DOF Profile.
evaluating outcomes of interventions.
COMPLETING PAGE 1
Because the significance of a child’s problem INFORMATION
behavior depends partly on its deviance from the
behavior of other children in similar contexts, we On Page 1 of the DOF (see Figure 2-1), observ-
ers record demographic information about the iden-
5
6 2. Using the DOF and Rating the DOF Items
Figure 2-1. Page 1 of the Direct Observation Form.

2. Using the DOF and Rating the DOF Items 7
Figure 2-1 (cont.) Page 2 of the Direct Observation Form.



tified child and the particular child being observed may decide not to record the identified child’s full
(identified or control), the observer, and setting. name on the DOF until after you have left the ob-
Instructions for completing each field are provided servation setting so neither the identified child nor
on Page 4 of the DOF. peers will see the name of the child being observed.
On the DOF for control children, you can write a
The instructions for each field at the top of Page
brief description of the child in the space for the
1 are shown here in smaller font and discussed in
identified child’s name (e.g., boy with dark curly
more detail.
hair; girl with blond hair in front row) to help you
ID# identify multiple DOFs for the same control child.
This space is for an anonymous user-created ID
Or you can use an abbreviation to link the control
number for the identified child. The ID number is child to the identified child. The descriptive infor-
usually assigned by an administrator or other mation for control children can help to answer
appropriate staff member. The same ID number questions that may arise when you are trying to
should be used for control children matched to identify multiple DOFs for a particular control child
the identified child.
linked to the identified child.
The space at the top of the DOF is for a user-
Child’s Gender
defined ID number that is unique for each identi-
fied child. The same ID number should be assigned Check “Boy” or “Girl” for the gender of the child
to each control child who is linked to the identi- being observed. Ideally, the gender of the con-
trol child should match the gender of the identi-
fied child (Control 1, Control 2). The ID number fied child.
may be created by an administrator or other ap-
propriate staff member who is coordinating the Child’s Age
observations. In some cases, the observer may also On DOFs for the identified child, write age in
assign the ID number if the observer is acting as years. On DOFs for control children, write age of
an independent user (e.g., a school psychologist the control child if known, or write age of the iden-
using the DOF to assess a child). For computer- tified child as an estimate of the control child’s
age, or leave blank.
scoring, the ID number will serve as key informa-
tion for linking an identified child to control chil- Administrators, coordinators, or observers
dren. should write the age in years of the identified child.
Observers do not need to know the names and ages
Identified Child’s Name
of control children. On DOFs for control children,
Write the first, middle (if available), and last name you can write the age of the identified child as an
of the identified child (e.g., John Eric Smith). On estimate of the control child’s age or leave this
the DOFs for control children matched to the
identified child, write a brief description of the
space blank.
control child (e.g., boy with dark curly hair) and/ Child’s Ethnic Group or Race
or write an abbreviation of the identified child’s
name to create a link to the control child (e.g., if Write the known or apparent ethnic group or race
the identified child is John Eric Smith, Control 1 of the child being observed (e.g., White, African
might be labeled “JES-C1”). American, Asian).
Whenever possible, write the full name of the In this space, write the known or apparent eth-
identified child. Avoid using initials and writing nic group or race of the child being observed (Iden-
only the first or last name of the identified child tified Child, Control 1, Control 2). You can use
because more than one child may have the same your own terminology for ethnic group or race or
name. However, as discussed in Chapter 4, you choose from a list of terms. The DOF computer-
scoring program provides the following list of
terms for data entry: African American, Asian, Observation Set

Latino/Latina, Native American, Pacific Islander, Assign a label to identify the set or group of DOFs
White (non-Latino), Other. You can also create your for the identified child and control children to be
own terms for this field for data entry. computer-scored on the same DOF Profile. This
might be a time frame for the set of observations
Observer’s Name (e.g., Fall 2009) or a specific setting for the ob-
Write the observer’s first and last name or servations (e.g., math class, library). The com-
initials. puter-scoring program allows a minimum of 2 and
maximum of 18 DOFs as an observation set to
Observation # be scored on one DOF Profile: 2 to 6 DOFs for
the Identified Child, 1 to 6 for Control 1, and 1 to
Write a separate unique number for each 10- 6 for Control 2. DOFs for control children are op-
minute observation for the identified child (e.g., tional.
1, 2, 3, 4, 5, 6) and each 10-minute observation
for each control child. Observation set is a required field for computer-
scoring. When you enter each DOF into the com-
Write a separate unique number for each sepa-
puter-scoring program, you must assign a label to
rate 10-minute observation in sequence for each
identify it as a member of a set of DOFs that will
individual child. For example, if you observe the
be selected as one group to be scored on the same
identified child six times, record observation num-
DOF Profile. As explained in Chapter 3, the com-
bers 1, 2, 3, 4, 5, and 6 for each of the six DOFs in
puter-scoring program averages ratings on DOF
sequence for the identified child. The six observa-
items across multiple DOFs separately for the iden-
tions may span the course of several days. The
tified child and matched control children. You can
observation number, “Today’s Date,” and “Time
use any label that is meaningful to you to identify
of Day” should be consistent with the sequence of
which DOFs will form a set for the averaging pro-
observations. If you observe one control child
cess in computer-scoring. Examples are a label for
(Control Child 1) twice, record observation num-
a time frame for the set of observations or a spe-
bers 1 and 2 for each DOF in sequence for that
cific setting for the observations. You can use the
control child. If you observe a second control child
same observation set label for a minimum of two
(Control Child 2) twice, record observation num-
and maximum of 18 DOFs for computer-scoring.
bers 1 and 2 for each DOF in sequence for that
DOFs for control children are optional. There must
child.
be at least two DOFs for control children when
Grade or Level observations for control children are included in
Write the grade (e.g., Kindergarten, 1st, 4th) or
an observation set, as explained in Chapter 3.
level in school (e.g.,1-2) of the child being ob- Observed Child
served. Ideally, the grade or level of the control
child should match the grade or level of the iden- Check one box to indicate whether the observed
tified child. child for each DOF is the “Identified Child,” “Con-
trol Child 1,” or “Control Child 2.”
Identified Child’s Birthdate
This is a required field for computer-scoring.
Write the identified child’s birthdate.
Check the box, “Identified Child,” to indicate that
The DOF and the DOF computer-scoring pro- the observed child was the selected identified child
gram use month-day-year format for birthdate. On whose name is recorded on the DOF form. Check
DOFs for a control child, write the birthdate of the the box, “Control Child 1,” for the first control child
identified child. In addition to the ID number and in the same setting who is to be matched to the
identified child’s name, the birthdate will provide identified child. Check the box, “Control Child 2,”
another way to link control children to the appro- for the second control child in the same setting who
priate identified child.
is to be matched to the identified child. Whenever complete sentences. Instead, record brief notes and
possible, the gender of the control children should abbreviations that will help you rate the 89 DOF
be the same as the gender of the identified child. problem items listed on Page 3. The numbered
boxes in the left-hand column on Page 2 demar-
Time of Day
cate 1-minute intervals for rating on-task, as ex-
Write the time of the beginning of the 10-minute plained in the next section.
observation in hours and minutes and a.m. or
p.m. (e.g., 9:20 a.m., 12:30 p.m.) By scanning the list of DOF problem items be-
fore each observation session, you can familiarize
Today’s Date
yourself with the types of behaviors to describe.
Write the date of the observation. When appropriate, note the frequency (e.g., by chit
The DOF and the DOF computer-scoring pro- marks), duration (e.g., 20 sec), or intensity of spe-
gram use month-day-year format for the date of cific problems to help you choose between ratings
the observation. of 1, 2, or 3 for each problem item. Sometimes,
you may want to describe events during the 10-
Setting minute observation that affect the child’s behav-
Check one box to indicate whether the observa- ior, such as the teacher’s behavior or behavior of
tion was conducted in the classroom or at re- peers. For example, you may observe that a child
cess. If you conduct an observation in a setting daydreams or is restless during independent seat
other than class or recess, choose the setting work in class, but does not show these problems
option that most closely approximates the activ-
ity of children in that particular setting (e.g., lunch
when the teacher works with him/her directly. Or
= recess; small group instruction = class). You you may observe that a child is teased or hit by
can use the space to write the type of activity for another child, and subsequently teases back or be-
classroom observations (e.g., math, reading, comes involved in a fight. You may consider these
circle group) or recess observations (e.g., inside interactions when rating the child’s behavior on
games, outdoor play).
relevant DOF problem items. However, you should
This is a required field for computer-scoring. avoid making inferences about the child’s motiva-
Choose only one setting (Class or Recess) for each tions when rating specific DOF items, as instructed
DOF. The computer-scoring program uses these in a later section. Remember that DOF items are
two fields to determine whether the ratings from to be rated only for behavior observed in the 10-
that DOF will be scored on a DOF Profile based minute window for the observation period. The 10-
on norms for classroom observations or norms for minute observation window also applies to any
recess observations. There is no option for “Other” events that might affect the child’s behavior.
setting because there are no normative data for scor-
ing observations in settings other than class or re- RATING ON-TASK BEHAVIOR
cess. You also have the option of recording the type
The left-hand side of Page 2 of the DOF (see
of activity for each DOF for classroom observa-
Figure 2-1) contains 10 boxes in 2 columns for
tions (e.g., math, reading, circle group) or recess
observations (e.g., inside games, outdoor play). rating whether the child is on-task (ON TASK) or
not on-task (OFF TASK). These boxes represent
5-second intervals at the end of each minute of ob-
WRITING OBSERVATION NOTES
servation. In the last 5 seconds of each 1-minute
Use the spaces provided on Page 2 (see Figure interval, observe the child’s on-task behavior. If
2-1) to write a narrative description of the child’s the child’s behavior is on-task during the 5-second
behavior, affect, and interaction style over the 10- interval, draw a line through the box for “ON
minute observation period. You do not have to write TASK.” If the child is not on-task, draw a line
through the box for “OFF TASK.” tions.

Consider a child to be on-task if he/she is doing Figure 2-2 illustrates an observer’s notes and
what is expected in that situation (e.g., listening to on-task ratings for the first 10-minute observation
directions, reading a book, working on an assigned of 8-year-old Melinda Brandt (not her real name),
task at his/her desk, listening to others in circle whose computer-scored DOF Profile is presented
time, etc.). The child should be on-task for the main Chapter 3. Melinda is also discussed as a case
jority of the 5-second interval. You can use a stop- example in Chapter 5. The complete set of obser-
watch to indicate each 1-minute interval if you vations included four 10-minute observations of
wish, but this is not required. Another option is to Melinda and two 10-minute observations of each
watch the second hand on a clock or your watch of two control children in the same class.
and start each 1-minute on-task observation at a
specified time (e.g., when the second hand is on RATING DOF PROBLEM ITEMS
the 11).
Immediately after completing each 10-minute
If the child is not on-task for the majority of the observation, rate the child on the 89 DOF problem
5-second interval, rate the child as off- task. The items listed on Page 3. Be sure to complete your
following are examples of when a child is “off- ratings of DOF problem items before you start an-
task”: other 10-minute observation. Problem behaviors
do not have to attract the attention of the school
The child does something that requires the
staff in order to be rated as present. Equally im-
teacher to redirect him/her to get back “on-task.”
portant, your ratings of problem items should not
The child is doodling or drawing or playing with depend on your ratings of whether the child was
a toy or other object when he/she is supposed “on-task” or “off-task.” For example, a child may
to be listening to the teacher or working on an be considered on-task while working on an assign-
assignment. ment, but still be restless, or fidget, or look un-
happy. Some problems, such as 7. Doesn’t concen-
The child is looking around the room or is not
trate or pay attention for long, can suggest the child
looking at the teacher or someone else who is
is off-task. However, it is possible that a child could
speaking to him/her or to the whole class.
have problems concentrating during parts of the
The child is poking another student, talking to observation period, but then be on-task during the
another student, or clowning when he/she is sup- last 5 seconds of a 1-minute interval.
posed to be listening or working quietly.
To rate the DOF problem items, choose the one
At the end of the 10-minute observation period, item that specifically reflects each behavior actu-
count the number of intervals you rated the child ally observed during the 10-minute observation
as off-task and write the sum in the box for SUM period. Review your notes on Page 2 to help re-
OFF TASK. Count the number of intervals you member your observations. As you read the DOF
rated the child as on-task and write the sum in the problem items, you may also remember some be-
box for SUM ON TASK. The total number of in- haviors that may not have been described in your
tervals rated for SUM OFF TASK + SUM ON notes. You can rate such items even if you did not
TASK should not exceed 10. The computer-scor- write the specific behavior in your notes. (As you
ing program averages on-task ratings across mul- become more familiar with the DOF problem
tiple DOFs separately for the identified child and items, your observation notes should become more
for controls. The total number of intervals for on- closely aligned to your item ratings.) You may also
task and off-task on a single DOF must be > 8 for consider interactions with teachers and peers dur-
computer-scoring. On-task ratings are only scored ing the 10-minute observation period to rate spe-
for classroom observations, not recess observa-
Figure 2-2. Observer’s notes and on-task ratings for the first 10-minute observation of Melinda
Brandt.
cific problem items (e.g., 17. Tries to get attention sec). These notes will help you judge the frequency
of staff or 31. Gets teased). or intensity of the behavior for rating an item 1, 2,
or 3. Other problems (e.g., 11. Confused or seems
Rate the child on each DOF problem item ac-
to be in a fog; 16. Difficulty following directions)
cording to the following instructions written at the
will require your judgment for rating frequency or
top of Page 3:
intensity.
For each item that describes the child during the
10-minute observation period, circle: Be sure to rate only the one DOF item that most
0 = no occurrence; specifically describes a particular observation. For
1 = very slight or ambiguous occurrence;
example, several items describe attention problems
or hyperactivity, such as 7. Doesn’t concentrate or
2 = definite occurrence with mild to moderate in-
tensity/frequency and less than 3 minutes total
doesn’t pay attention for long; 9. Doesn’t sit still,
duration; restless, or hyperactive; 13. Fidgets, including with
3 = definite occurrence with severe intensity, high objects; 56. Easily distracted by external stimuli;
frequency, or 3 or more minutes total duration. and 57. Stares blankly. If a child exhibits any such
The intensity of the observed problem and the problems during the 10-minute observation period,
3-minute duration are guidelines for choosing rat- rate the one item that best fits the actual behavior
ings of 1, 2, or 3. If it is unclear whether a particu- observed. You may rate more than one item only if
lar problem occurred or if there was only a slight the child exhibits more than one different kind of
occurrence, rate the relevant item 1. If a particular problem, such as difficulty concentrating at cer-
problem definitely occurred with mild to moder- tain times, being easily distracted at other times,
ate intensity or frequency and less than 3 minutes and being restless. Avoid rating more than one item
total duration over the course of the 10-minute for the same observation. Figure 2-3 shows the
observation period, rate the relevant item 2. Rate observer’s ratings of Melinda Brandt based on
an item 3 if a particular problem occurred with notes for the same 10-minute observation period
severe intensity, or occurred for 3 or more minutes depicted in Figure 2-3.
over the 10-minute observation period, or occurred
intermittently for a total of 3 or more minutes GUIDELINES FOR RATING SPECIFIC
throughout the 10-minute observation period. It is DOF PROBLEM ITEMS
not necessary to actually time your observations This section provides guidelines to help you
of each problem. However, it is helpful to have a choose and rate specific DOF problem items based
clock in view so that you can judge whether a prob- on our research to develop the DOF. (We have not
lem occurred for at least 3 minutes versus less than found it necessary to give guidelines for every
3 minutes. For certain easily observed discreet be- item.) You can refer to these guidelines when ques-
haviors (e.g., fidgets, restless, makes odd noises, tions arise during rating. Several guidelines are in-
interrupts), you can make a note each time you tended to help you differentiate between similar
observe the behavior to help you judge the fre- items. It is not necessary to memorize the guide-
quency of that behavior. Or you can write chit lines for rating the DOF items. However, you
marks next to the initial note of the behavior for should have the guidelines available when you do
each time it occurred during the 10-minute period your ratings.
(e.g., fidgets ////). For certain other discreet behav-
iors, you can record the amount of time for each 1. Acts too young for age. Rate for a child who
instance of their occurrence (e.g., out of seat, 30 acts too young or seems immature for his/her chro-
nological age or has mannerisms of a younger child,
Figure 2-3. Observer’s ratings based on notes for the same observation of Melinda Brandt as in
Figure 2-2.
such as baby talk or acting like a baby, or making stimuli, such as noises or activity in the environ-
gestures typical of a younger child. Rate item 52 ment. The same child could be rated for both items
for showing off, clowning, or acting silly. if he/she fails to concentrate or pay attention at
certain times and is also easily distracted by spe-
2. Makes odd noises. Rate for humming, click-
cific stimuli at other times. Also rate item 7 when
ing, grunting, whistling, muttering, or singing to
a child has difficulty returning to a task or when
self, when these noises are not part of specified
there is no recovery of attention back to the origi-
activity, such as a song or imitation of animals.
nal task once attention has wandered.
This item can be rated even if the noises seem to
indicate a happy state in the child. Rate this item 8. Difficulty waiting turn in activities or tasks.
for vocal tics. Rate when a child has trouble waiting for his/her
turn in group activities or in class discussions.
3. Argues. Rate when a child argues with an
Examples are talking out of turn, cutting in line, or
adult or peer about something, such as requirements
grabbing materials from another child when he/
for an assignment, or rules of a task. Rate item 5 if
she is supposed to wait for a turn. Rate item 32 for
the child sasses, talks back, or is defiant toward a
children who interrupt the teacher or other chil-
teacher or staff member.
dren who are talking. Rate item 33 for children
4. Cheats. Rate for cheating in academic tasks who call out in class when they are expected to
or games. Examples are copying another child’s remain quiet or raise their hand before talking.
answers on assignments or tests when this is not
9. Doesn’t sit still, restless, or hyperactive. Rate
part of a cooperative group activity or breaking
for behaviors such as squirming in seat, frequently
rules of a game in order to win or get ahead.
changing position, swinging feet, or draping body
5. Defiant or talks back to staff. Rate for sassing across seat. Rate item 13 for fidgeting and item
or talking back to teacher or other school staff (e.g., 33 for more general impulsive behavior. Rate only
saying “This is stupid,” “I don’t want to do …” item 28 for out of seat behavior that is not due to
“Try and make me…”). If the child sasses or talks restlessness. If the child is restless in his/her seat
back, then refuses to do something that the teacher and gets out of seat to walk around the room, then
has asked him/her to do, you can also rate item 20 both items 9 and 28 may be rated.
for being disobedient.
11. Confused or seems to be in a fog. Rate for
6. Brags, boasts. Rate for bragging or boasting behaviors that suggest confused thinking or gen-
about accomplishments, skills, appearance, or pos- eral confusion about tasks or conversation. Rate
sessions. An example is a child who says he/she is item 77 for difficulty expressing self clearly. Rate
the smartest kid in the school or the toughest kid item 16 for difficulty following directions.
on the playground or a child who says he/she is
12. Cries. Rate when a child looks tearful or
better than anyone else in a skill or in appearance.
actually sheds tears. Rate 1 for slight or ambigu-
Do not rate if the child is giving a self appraisal in
ous tearfulness or crying, such as looking like about
response to a specific question about his/her per-
to cry with watery eyes. Rate 2 or 3 for obvious
formance on a task, activity or skill.
crying.
7. Doesn’t concentrate or doesn’t pay atten-
13. Fidgets. Rate for non-purposeful activity
tion for long. Rate for problems with concentra-
with hands that includes an object or non-purpose-
tion or short attention span, or intermittent lapses
ful “finger play.” Examples are twirling hair, twirl-
in attention. Item 7 should be used to rate behavior
ing glasses, tapping pencils, picking at paper edges,
that does not involve responses to particular dis-
and twisting the sleeve of a shirt, or tapping fin-
tracting stimuli, whereas item 56 should be used
gers together or playing with fingers. Rate item 38
to rate a child’s distraction by specific observable
for hand wringing or other nervous movements ing or disturbing another child by talking or some
with hands or fingers. activity. Rate item 46 for disrupting the activities
of a group of children. Rate item 32 for interrupt-
15. Daydreams or gets lost in thoughts. Rate if
ing or butting into an ongoing conversation or ac-
a child appears to be daydreaming, such as gazing
tivity of adults or peers.
out the window at nothing or looking off into space.
Rate item 7 when the child doesn’t pay attention 23. Doesn’t seem to listen to what is being said.
to instruction, lessons, or directions, or does not Rate for a child who appears not to be listening to
concentrate on work. Rate item 57 for blank star- a teacher’s instructions or directions or who does
ing or blankness of expression. not listen to other children when expected to lis-
ten, such as in circle time or class discussions. Rate
16. Difficulty following directions. Rate for a
item 7 when a child doesn’t concentrate or pay at-
child who appears to have difficulty carrying out
tention to his/her work or other activities when at-
instructions or who needs clarification or repeti-
tention would be expected.
tion of instructions. Rate also for a child who needs
directions or instructions simplified or rephrased 24. Eats, drinks, chews, or mouths things that
in a different way or who needs demonstrations are not food. Rate only for nonfood items such as
for carrying out tasks. paper, dirt, sand, or crayons, string, parts of cloth-
ing, and some body parts, such as hair. Do not rate
17. Tries to get attention of staff. Rate for de-
for chewing gum, sweets, soda, or junk foods. Rate
liberate attempts to get attention of the teacher or
item 42 for picking or scratching nose, skin or other
other staff in the room or area, such as raising or
parts of body. Rate item 76 for sucking thumb, fin-
waving hand a lot, going over to teacher’s desk, or
gers, hand or arm.
asking for help. Rate also for attempts to get the
observer’s attention if a child continues after an 25. Difficulty organizing activities or tasks.
explanation that observer cannot interact with child. Rate when a child seems disorganized in his/her
Rate item 52 for clowning or making faces. Both approach to assignments or other activities. Ex-
items 17 and 52 can be rated if a child clowns or amples are when a child has difficulty finding the
acts silly and at times directs the clowning toward right page in the book, has difficulty arranging ma-
staff for attention. Do not rate for raising hand in terials for a project, or whose desk is cluttered or
response to a teacher’s direction to raise hands or messy while working on assignments.
raising hand to answer a teacher’s question.
26. Fails to give close attention to details. Rate
18. Destroys own things. Rate for destroying for a child who overlooks details in completing
own things, such as ripping paper or drawings, tasks. Examples are skipping parts of an assign-
breaking pencils, breaking toys, or ripping clothes. ment or failing to notice plus or minus signs for
Rate item 19 for destroying other people’s things. numerical operations in math problems.
20. Disobedient. Rate for acts of disobedience, 27. Forgetful in activities or tasks. Rate for a
such as breaking school rules, or for behaviors that child who forgets materials or routines or who for-
result in punishment for violations of rules, such gets information that he/she would be expected to
as getting time-outs, getting detentions, or being remember. Examples are forgetting to bring pen-
sent to the principal’s office. Also rate when a child cils, papers, or books for working on assignments
refuses to comply with a teacher’s or other staff or a child who forgets to do expected routines, such
member’s request or directive, or when a child is as standing in line to go outside.
reprimanded but continues to do the behavior that
28. Out of seat. Rate when a child is out of his/
led to the reprimand.
her seat during times when he/she should remain
21. Disturbs other children. Rate for bother-
seated. Out of seat is when the child’s bottom is 34. Physically isolates self from others. Rate
off the seat or the child’s body weight is not sup- for a child who physically isolates self from others
ported by the chair (e.g., the child is just resting or a group, such as sitting alone in the corner of a
one leg on the chair). Do not rate for getting out of room. Rate item 75 for a child who generally ap-
seat to change activities or to respond to a teacher’s pears uninvolved, distant, or does not interact with
request, such as joining circle time or moving to a peers or staff, or who appears uninvolved off and
new section of the room for an activity. Do not on throughout the observation period.
rate when being out of seat is required for an ac-
37. Nervous, highstrung, or tense. Rate for ner-
tivity. Do rate for getting up to sharpen pencils,
vous, jumpy, overdriven, or “uptight” behavior or
get materials, or to talk to other students, unless
demeanor or a general feeling of nervous tension
the child was specifically directed to do so. Do not
from a child. Rate item 9 for a child who fails to
rate item 28 for recess observations unless chil-
sit still, is restless, or is overactive. Rate item 13
dren are expected to be seated for an activity.
for a child who fidgets with objects. Rate item 38
30. Gets into physical fights. Rate for physi- for more specific nervous behaviors, such as
cally fighting with peers, or adults, including hit- twitching, eye blinks, or facial tics.
ting, punching, pushing, scratching, kicking, etc.
38. Nervous movements, twitching, or tics, or
Do not rate for physical play that is part of a game
other unusual movements (describe). Rate for spe-
unless the physical play seems excessive. Rate item
cific nervous behaviors, such as twitching, eye
41 if a child initiates a physical attack on another
blinks, or facial tics. Rate item 37 for more gen-
person. Both items 41 and 30 may be rated if a
eral behaviors, such as jumpiness, overdriven or
child initiates a physical attack on someone that
“uptight” behavior or a demeanor or general feel-
then progresses into an ongoing physical fight. Item
ing of nervous tension from a child. Rate item 9
30 may also be rated when a child gets into a physi-
for a child who fails to sit still, is restless, or is
cal fight that was provoked, e.g., by name-calling
overactive and item 13 for fidgeting with objects.
or teasing.
Rate item 2 for vocal tics.
32. Interrupts. Rate for a child who interrupts
40. Too fearful or anxious. Rate for a child
or butts into an ongoing conversation or who in-
who expresses fears during the observation period
terrupts the teacher or other children while they
or who appears fearful. Rate item 53 for shyness
are talking. An example is a child who starts talk-
or timid behavior.
ing about something when a teacher is giving in-
structions or a child who asks a question before 41. Physically attacks people. Rate when a child
the teacher or peer has finished saying something. initiates a physical attack or initiates a fight with
Rate item 33 for a child who calls out in class when peers or adults (e.g., hits a teacher or peer, pushes
expected to remain quiet or expected to raise his/ or shoves a teacher or peer, throws something at
her hand before talking. Rate item 21 for a child another person, tries to physically harm another
who physically disturbs other children’s activities. person, etc.). Item 41 can be rated even when a
child has been provoked to attack, such as having
33. Impulsive or acts without thinking, includ-
been called a name. Do not rate for physical at-
ing calling out in class. Rate for immediate ac-
tacks that are part of a game or play unless the play
tions or responses that seem impulsive, such as
attacks seem excessive. Rate 1 if there is a slight
grabbing things or shifting from one action to an-
or ambiguous occurrence or if it is not clear whether
other, calling out answers without raising hand, or
a physical attack or attempt to harm someone else
careless or hurried approach to a specific task. Rate
was intended. Rate item 30 for ongoing physical
item 32 when a child interrupts a conversation or
fights with peers or adults. Both items 41 and 30
interrupts while the teacher is talking or giving in-
may be rated if a child initiates a physical attack
structions.
that then progresses into a physical fight. Do not or visual stimulus that takes the child off-task. Ex-
rate item 41 for hitting in the course of a physical amples are hearing noises or voices in or outside
fight unless the child clearly initiated the hitting. of the room, hearing or seeing cars, planes, etc.
outside the building, watching other children’s ac-
44. Apathetic, unmotivated, or won’t try. Rate
tivities when a child is supposed to be doing his/
for an “I don’t care attitude” or an apathetic ap-
her own work.
proach to tasks or instructions. Rate item 70 when
a child is underactive, seems tired, or is slow mov- 57. Stares blankly. Rate when a child’s eyes are
ing. not focusing on anything. Rate item 7 for prob-
lems concentrating or item 15 if child appears to
49. Avoids or is reluctant to do tasks that re-
be daydreaming.
quire sustained mental effort. Rate for a child who
tries to avoid doing assignments or other tasks that 58. Speech problem (describe). Rate for articu-
require effortful thinking or prolonged concentra- lation problems and other speech difficulties that
tion. Examples are procrastinating when required make it hard to understand what a child is saying.
to do difficult or lengthy assignments, such as math Examples are mispronouncing certain speech
or writing. sounds (e.g., r, l, th, w), slurred or garbled speech,
halting speech, or unusual grammatical structures.
50. Self-conscious or easily embarrassed. Rate
Rate item 77 for problems in verbal fluency or
for behaviors indicating self-consciousness or em-
when a child has trouble expressing his/her ideas
barrassment, such as blushing, looking apologetic,
or desires clearly. Do not rate item 58 for speech
sheepishness, or unusual sensitivity.
problems due to second language issues (e.g., En-
51. Slow to respond verbally. Rate for a child glish as a second language).
who is slow to answer questions from a teacher or
59. Wants to quit or does quit tasks. Rate when
peers or pauses for an unusual length of time be-
a child expresses a desire to quit a task (e.g., ask-
fore saying something. Item 51 can be rated for a
ing “Can I stop now?”), gives up, or actually does
child who seems to need “time to think” before
quit a task before completing it or quits before time
responding to questions.
limits are up.
52. Shows off, clowns, or acts silly. Rate for
60. Yawns. Rate 1 for one or two definite or
clowning or silly behavior to attract attention of
ambiguous yawns. Rate 2 or 3 for persistent yawn-
peers or adults. Examples are making faces, mak-
ing.
ing silly gestures, giggling, or mimicking others to
cause laughter. Rate item 2 for making odd noises. 61. Strange behavior. Rate for behavior that
Rate item 66 for teasing. seems very unusual or bizarre. Examples are mak-
ing strange comments about other people, rubbing,
53. Shy or timid. Rate for shy demeanor. Do
patting or touching other people inappropriately,
not rate for characteristics that are covered more
or making weird faces that are not intended to be
specifically by other items, such as item 50 for self-
silly or clowning. If the behavior is more specifi-
conscious or easily embarrassed.
cally covered by another item, rate the more spe-
54. Explosive or unpredictable behavior. Rate cific item instead, such as item 2 for making odd
for behavior that seems “explosive” or unpredict- noises or item 52 for showing off, clowning, or
able, such as emotional outbursts. Rate item 67 for acting silly.
temper tantrums, hot temper, or angry appearance.
62. Stubborn, sullen, or irritable. Rate for a
56. Easily distracted by external stimuli. Rate generally stubborn, sullen, or irritable demeanor.
when a child is distracted by a specific object, noise, Rate item 63 for sulking as a reaction to a request
from a teacher, other adult, or peer.
63. Sulks. Rate for sulking when it is a reaction lacks energy. Rate item 44 for a child who is apa-
to something that occurs during the observation thetic or unmotivated. Rate item 60 for yawning.
period. Rate item 62 for a more general demeanor
71. Unhappy, sad, or depressed. Rate for a child
of stubbornness, sullenness, or irritability.
who has an unhappy, sad, or depressed demeanor.
64. Swears or uses obscene language. Rate for Rate item 12 for a child who cries in response to a
words or verbal expressions that generally would specific event or request, but does not seem gener-
be considered swearing or obscene by teachers or ally unhappy. Rate item 71 if the child looks gen-
other adults, including swear words that may have erally unhappy or sad, which can or cannot include
become relatively common in a given culture or in crying. If a child cries and looks unhappy, then both
modern music. Do not include words that approxi- items 12 and 71 can be rated. Rate item 62 for a
mate swear words, such as “darn it.” Do include child who looks sullen or irritable and item 63 for
words referring to god when not used as part of a a child who sulks in response to a question or a
religious activity and include slang words that other request from someone. Do not rate item 71 based
people would consider offensive. only on inferences about a child’s feelings if the
child does not display a sad demeanor or appear-
66. Teases. Rate for physical or verbal teasing.
ance of unhappiness.
Rate 1 for playful teasing, such as making silly
faces at someone or tickling. Rate 2 or 3 for more 75. Withdrawn, doesn’t get involved with oth-
deliberate teasing or harassing, such as name call- ers. Rate for a child who appears uninvolved, dis-
ing or ridiculing other people. tant, or does not interact with peers or staff, or who
withdraws off and on throughout the observation
67. Temper tantrums, hot temper, or seems
period. Rate item 34 for physically isolating self
angry. Rate for overt temper tantrums or for ex-
from others.
pressions of anger or hot temper. Rate item 62 for
sullenness or irritability or grumpy mood. 76. Sucks thumb, fingers, hand, or arm. Rate
for sucking or mouthing thumb, fingers, hand, or
68. Threatens people. Rate for verbal or physi-
arm. Include chewing on thumb, fingers, hand, or
cal threats to other people, including peers and
arm. Rate item 42 for picking or scratching nose,
teachers. This can include when a child tells an
skin, or body parts. Rate item 24 for mouthing
intended victim that he/she is seeking or plotting
things that are not food and not body parts.
revenge, or when a child verbalizes threats to a
third party. The threat can be general, such as “I 77. Fails to express self clearly. Rate for prob-
am going to get you for that,” or more specific lems in verbal fluency or communicating meaning
threats. or using actions or gestures in place of verbal de-
scriptions. Rate item 58 for specific speech defects
69. Too concerned with neatness, cleanliness,
or articulation problems that make speech unclear.
or order. Rate for behaviors such as excessive ti-
Include problems communicating meaning due to
dying of materials or expressed concerns about
second language issues (e.g., English as a second
getting hands or clothing dirty. Do not rate only
language).
for erasures while drawing or writing, unless era-
sures are excessive or clearly due to overconcern 78. Impatient. Rate when a child’s comments
for neatness. or behaviors imply time pressure, such as when a
child wants to know when he/she can move on to
70. Underactive, slow moving, tired, or lacks
another task or asks when a desired activity will
energy. Rate when a child’s physical movements
happen, such as recess or lunch. Rate item 59 when
are slowed down, such as being slow in writing,
a child expresses a desire to quit a task or activity.
drawing, or walking across the room. Also rate
when a child looks physically tired or sleepy, or 79. Tattles. Rate for a child who spontaneously
tells teachers or authority figures about rule-break- 83. Doesn’t get along with peers. Rate for a
ing or wrongful behavior of other children. An child who doesn’t get along with certain children,
example is telling the teacher that another child even if the child may get along with other chil-
hit him/her or hit someone else. Do not rate when dren. Examples are a child who is rejected by peers
a child reports wrongful behavior in direct response when attempting to join a group or game or a child
to an adult’s questions about what happened. who complains of having no friends. Rate other
items for more specific problems getting along with
80. Repeats behavior over & over; compulsions
peers, such as item 3 for arguing and item 30 for
(describe). Rate for repetitive, purposeless behav-
getting into physical fights.
iors, such as touching things over and over, rub-
bing hands or arms on a table, making circles on a 84. Runs out of class (or similar setting). Rate
table with fingers, or repetitively straightening for a child who runs out of the classroom or an-
things on a desk. Do not rate item 80 for repetition other setting (e.g., library, gym, lunch hall) with-
of acts that are more specifically covered by other out permission. Rate 1 if the child leaves the set-
items, such as item 52 for clowning or acting silly ting without permission at a quick pace that might
or item 2 for making odd noises. If it is unclear not be considered “running.” Do rate for a child
whether the intensity or nature of the behavior who runs out of the classroom without permission
qualifies as a repetitive act or compulsion, rate 1. to go to the bathroom. Do not rate for children who
run out of the classroom for recess.
81. Easily led by peers. Rate for a child who
imitates or mimics other children or seems like a 85. Behaves irresponsibly (describe). Rate for
“follower.” Rate item 66 if mimicking other chil- doing physically dangerous things that are not play-
dren is done as teasing. Rate also for a child who ful (e.g., poking pencils in electrical outlets) or for
asks other children about what to do for general getting into an adult’s belongings (e.g., taking ob-
activities. Do not rate for a child who asks peers jects off the teacher’s desk or opening desk draw-
for specific help in assignments. ers without permission). Rate item 20 for being
disobedient or noncompliant and/or breaking rules.
82. Clumsy, poor motor control. Rate for a child
Rate items 18 or 19 for deliberately destroying
who has physical difficulty in motor tasks, such as
things (e.g., ripping up papers, breaking pencils).
looking clumsy for his/her age in walking, running,
Rate item 89 for “other problems” for a child who
or jumping. Rate item 89 for “other problems” for
steals objects from peers or adults.
fine motor problems, such as poor hand writing or
awkward pencil grasp. 86. Bossy. Rate for a child who tells other chil-
dren what to do when not requested or a child who
tries to dominate an activity (e.g., by vehemently
stating rules or making up his/her own rules for an
activity or game.)
87. Complains. Rate for complaints about tasks
or activities and complaints about somatic prob-
lems. Examples are complaining that a task is too
hard or boring, complaining that a task will take
too long, or asking “Do we have to do this?” in a
complaining tone of voice. Also rate item 87 for a
child who has somatic complaints without known
medical cause, such as dizziness, headaches, or
stomachaches, and for a child who expresses so-
matic complaints, such as complaining that his/
her hand hurts while writing. Rate item 74 for
whining tone of voice. Both items 74 and 87 may
be rated if a child whines and expresses a specific
Chapter 3
Computer-Scored DOF Profile
This chapter describes and illustrates the com- tings are described in detail in this chapter.
puter-scored DOF Profile. There is no hand-scored
DOF Profile because of the complexity of averag- DOF PROFILE FOR CLASSROOM
ing scores across multiple observation sessions. OBSERVATIONS
The DOF computer-scoring program is a module
in the ASEBA Assessment Data Manager (ADM) The DOF Profile for classroom observations
software. It can be purchased separately or as part consists of 4 pages. Page 1 displays bar graphs of
of the full ASEBA ADM package, which includes scores on the five syndrome scales: Sluggish Cog-
modules for other ASEBA forms. (For users of the nitive Tempo, Immature/Withdrawn, Attention
DOF, the most relevant other ASEBA forms are Problems, Intrusive, and Oppositional. Page 2 dis-
the CBCL/6-18, TRF, YSR, SCICA, and TOF.) plays bar graphs of Total Problems and On-task
scores, plus a list of items and ratings for Other
With the DOF Module, users enter an observer’s Problems not scored on the syndrome scales. Page
ratings of the 89 problem items and On-task for 3 displays bar graphs of the DSM-oriented Atten-
each 10-minute observation of an identified child tion Deficit/Hyperactivity Problems scale and its
and control children matched to the identified child. Inattention and Hyperactivity-Impulsivity
For computer-scoring the DOF Profile, the DOF subscales. Page 4 summarizes descriptive infor-
Module requires a minimum of two DOFs for the mation about each DOF that was used to create
identified child. DOFs for control children are op- the DOF Profile.
tional. When observations of control children are
included, there must be at least two DOFs from Figure 3-1 shows Page 1 of the DOF Profile
observations of one or both control children. Re- scored from classroom observations of 8-year-old
quiring at least two DOFs for the identified child Melinda Brandt. The DOF Profile was based on
and two DOFs across one or two control children four 10-minute observations of Melinda as the
is intended to guard against interpretation of DOF identified child and two 10-minute observations
scores based on only one 10-minute observation of each of two control children in the same class.
of the identified child and only one 10-minute ob- Figure 2-2 in Chapter 2 showed the observer’s notes
servation of a control child. The DOF Module al- and on-task ratings for the first 10-minute obser-
lows up to 18 DOFs to be scored as an “observa- vation of Melinda, while Figure 2-3 showed the
tion set” for one DOF Profile. Each observation observer’s ratings of the 89 problem items for the
set can include up to six DOFs for the identified same 10-minute observation.
child, up to six DOFs for the first control child, Descriptive Information
and up to six DOFs for the second control child.
The DOF Profile is normed separately for boys and At the top of the DOF Profile in Figure 3-1,
girls at ages 6-11. The DOF Profile is also normed you can see descriptive information about
separately for classroom observations and for re- Melinda and the eight DOFs used to create the
cess observations. Profiles for each of the two set- profile. The ID number assigned to Melinda
(200901), plus her name, gender, age, and birthdate
are printed on the left side of the profile. The middle
23
24
3. Computer-Scored DOF Profile
Figure 3-1. Computer-scored DOF syndrome scales for classroom observations of 8-year-old Melinda Brandt.
3. Computer-Scored DOF Profile 25
column shows the observation period (02/15/07- distributed across DOFs for the identified child and
02/17/07), indicating the length of time between control children. The DOF Module allows up to
the first 10-minute observation and the last 10- two observers in the same observation set, and it
minute observation, which in Melinda’s case prints a warning on the DOF Profile when there
spanned 3 days. are two observers.
“Observation Set” indicates the label (Winter The number of observations used for scoring
2007) which the observer’s supervisor assigned to the DOF Profile is printed on the far right side of
the set of eight DOFs used to create the DOF Pro- the profile. For Melinda, you can see that there were
file for Melinda. A label for the observation set is four observations of the identified child (Melinda),
required for computer-scoring the DOF. As ex- plus two observations of Control Child 1 and two
plained in Chapter 2, users can choose whatever observations of Control Child 2. Although the DOF
label fits their purpose. For example, you might Module requires a minimum of two DOFs for the
choose to obtain one set of observations in the be- identified child for computer-scoring, we recom-
ginning of the school year and another set of ob- mend obtaining three to six observations of the
servations later in the year, after implementing an identified child. Whenever possible, we recom-
intervention for the identified child. The label for mend at least two observations of one or two con-
the observation set might then be the time frame trol children in the same setting, alternating obser-
for each set of observations, as it was in Melinda’s vations of the identified child and control children.
case. Or the label could indicate a specific activity To sample behavior over different time frames, we
for different sets of observations (e.g., reading class recommend observing the identified child and con-
versus math class). trol children in the morning and afternoon of at
least two different days. Observers should com-
The observer’s name is printed to the right of
plete a separate DOF for each 10-minute observa-
the observation period. As best practice, we rec-
tion of each child.
ommend that a single observer conduct all obser-
vations of the identified child and control children Syndrome Scales
to be included in the same observation set. In
Beneath the descriptive information at the top
Melinda’s case, the same observer (Valerie Stone)
of Page 1 in Figure 3-1, you can see bar graphs for
did the four observations of Melinda and two ob-
T scores corresponding to the total raw scores for
servations of each of the two control children. If
the five syndrome scales for the identified child,
two different observers had done the observations,
Melinda Brandt, (dark bar to the left) and the two
then this could introduce rater “bias” into
control children (lighter bar to right). Ratings are
Melinda’s DOF scores. This would be particularly
averaged across the two control children to create
problematic if one observer (e.g., Valerie Stone)
the total raw score for controls. The range of T
had done all four observations of Melinda, but an-
scores is shown to the left of the bar graph display.
other observer had done all four observations of
As explained in Chapter 6, we performed statisti-
the control children. If this had been the case, you
cal analyses of DOF problem items to determine
would not know whether differences between
which items tend to occur together to form syn-
scores for Melinda versus the control children rep-
dromes. The label for each DOF syndrome scale
resented true differences in the children’s behav-
summarizes the types of problems that form that
ior or whether the scores were influenced by how
syndrome.
the two different observers rated the DOFs. If two
different observers are used for observations in the Total Scores, T Scores, and Percentiles for
same observation set, it is important to make sure Syndrome Scales. Beneath the bar graph for each
that each observer rates both the identified child syndrome scale are the total raw scores, T scores,
and control children so that any potential bias is and percentiles for the identified child and the two
26 3. Computer-Scored DOF Profile
control children. The averaged ratings of the prob- syndromes, you can see that Melinda obtained clini-
lem items comprising each scale are printed be- cal range T scores on the Intrusive (T = 71-C) and
low the scores and percentiles, with ratings of the Oppositional (T = 70-C) syndromes, both of which
identified child (ID) in the left column and ratings fell above the 97th percentile. Melinda obtained a
for the controls (CRL) in the right column. Abbre- much lower T score of 58 on the Immature/With-
viated versions of the problem items are listed in drawn syndrome, which fell at the 79th percentile
the middle columns below each scale. for the normative sample. Melinda’s T score on
the Immature/Withdrawn syndrome was the same
On the DOF Profile for Melinda, you can see
as the T score for the two control children. The
that she obtained a total score of 2.5 on the Slug-
two control children obtained much lower T scores
gish Cognitive Tempo syndrome, which corre-
than Melinda on the Intrusive (T = 54) and Oppo-
sponds to a T score of 67. Melinda’s T score of 67
sitional (T = 50) syndromes, which fell at the 65th
falls at the 96th percentile for the DOF normative
and 50th percentiles, respectively. (The DOF Pro-
sample of 6-11-year-old girls. This means that 96%
file shows a T score of 50 and percentile < 50
of the DOF normative sample received a score
for the control children’s total score of 0.0 on the
equal to or lower than Melinda’s score of 2.5 on
Oppositional syndrome, because T scores are trun-
the Sluggish Cognitive Tempo syndrome. The let-
cated at 50 on the syndrome scales.) Chapter 6 de-
ter B printed next to the T score of 67 indicates
scribes our procedures for assigning T scores to
that Melinda’s T score on Sluggish Cognitive
raw scores for the DOF syndrome scales.
Tempo fell within the borderline clinical range for
the normative sample. The two control children Borderline, Clinical, and Normal Ranges for
obtained an averaged total score of 1.0 on Slug- Syndrome Scales. The two broken lines on the bar
gish Cognitive Tempo, which corresponds to a T graph display in Figure 3-1 demarcate borderline
score of 56. The control children’s T score of 56 and clinical ranges for judging the deviance of
fell at the 73rd percentile, which was in the normal scores on the five DOF syndromes, as compared
range for the DOF normative sample. A later sec- to the normative sample. We identified a border-
tion describes borderline, clinical, and normal line clinical range for each scale because categori-
ranges for scores on the DOF syndrome scales. cal distinctions are usually less reliable for indi-
viduals who score close to the border of a category.
In a similar fashion, you can see that Melinda’s
The addition of a borderline range enables practi-
total score of 12.0 on the Attention Problems syn-
tioners to make more differentiated decisions about
drome had a T score of 74, which was above the
children’s functioning than would be possible if
97th percentile for the normative sample of 6-11-
all scores were categorized as normal versus clini-
year-old girls. This means that over 97% of the
cal.
DOF normative sample received a total score equal
to or lower than Melinda’s score of 12.0 on Atten- Table 3-1 summarizes the borderline clinical and
tion Problems. The letter C printed next to clinical ranges for all the DOF scales. The border-
Melinda’s T score of 74 indicates that her score on line clinical range for the DOF syndrome scales
Attention Problems fell within the clinical range spans the 93rd to the 97th percentiles, which corre-
for the normative sample. The two control chil- spond to T scores of 65 to 69. The clinical range is
dren obtained an averaged total score of 5.5 on >97th percentile, which corresponds to T scores >69.
Attention Problems, which corresponds to a T score Scores that fall in the borderline range between the
of 56, falling at the 73rd percentile and in the nor- two broken lines on the DOF Profile are high
mal range for the normative sample. enough to be of concern, but are not as clearly de-
viant as scores that fall in the clinical range above
When you examine the bar graphs, total scores,
the top broken line. Scores above the top broken
T scores, and percentiles for the remaining DOF
line indicate that the observer rated enough prob-
lems as present (ratings of 1, 2 or 3) to be of clini-
cal concern. As indicated in Chapter 2, ratings of 1
are for slight or ambiguous occurrences of a prob-
Table 3-1
Borderline and Clinical Ranges on DOF Scales
DOF Scale Borderline Clinical
Classroom Observations
Empirically Based Syndromes T = 65-69 T >69
Sluggish Cognitive Tempo 93rd-97th percentile >97th percentile
Immature/Withdrawn
Attention Problems
Intrusive
Oppositional
Total Problems-Classroom T = 60-63 T >63

84th-90th percentile >90th percentile
DSM-Oriented Scales
Attention Deficit/Hyperactivity Problems T = 65-69 T >69
Inattention Subscale 93rd-97th percentile >97th percentile
Hyperactivity-Impulsivity Subscale
On-task T = 31-35 T <31

3rd-7th percentile <3rd percentile
Recess Observations
Aggressive Behavior T = 65-69 T >69
93rd-97th percentile >97th percentile
Total Problems-Recess T = 60-63 T >63
84th-90th percentile >90th percentile
Note. On the problem scales, high scores warrant concern. Problem scale scores in the borderline range
are high enough to be of concern, but not so clearly deviant as scores in the clinical range. On the On-task
scale, low scores warrant concern.
As Figure 3-1 shows, Melinda’s scores on the on all five DOF syndrome scales. The T scores and
DOF profile fell in the clinical range above the 97th percentiles for Melinda provide a standard against
percentile on the Attention Problems, Intrusive, and which you can judge deviance in her observed be-
Oppositional syndrome scales, and in the border- havior relative to a large normative sample of 6-
line clinical range between the 93rd and 97th per- to-11-year-old girls. The additional T scores and
centiles on the Sluggish Cognitive Tempo syn- percentiles for the control children provide a stan-
drome scale. The borderline to clinical range scores dard for judging the deviance of Melinda’s ob-
on these four DOF syndromes indicated that served behavior relative to other children in the
Melinda manifested many more problems than same classroom setting.
were typically observed in classrooms for 6-11-
By examining the observer’s ratings on the spe-
year-old girls in the DOF normative sample.
cific problem items, you can see the types of prob-
Melinda’s score on the Immature/Withdrawn syn-
lems that Melinda showed within each syndrome.
drome fell within the normal range. The scores for
That is, you can see that Melinda received aver-
the two control children fell in the normal range
aged scores of 0.5 to 1.0 for three items on the By contrast, the two control children received an
Sluggish Cognitive Tempo syndrome, 0.5 for one averaged total score of 9.5, which corresponds to
item on the Immature/Withdrawn syndrome, 0.5 a T score of 55, falling at the 69th percentile for the
to 3.0 for six items on the Attention Problems syn- normative sample of 6-11-year-old girls.
drome, 0.5 to 3.0 for five items on the Intrusive
Borderline, Clinical, and Normal Ranges for
syndrome, and 0.5 to 1.5 for five items on the Op-
Total Problems. The two broken lines on the bar
positional syndrome. The observer’s ratings of the
graph in Figure 3-2 demarcate borderline and clini-
control children yielded average scores of 0.5 to
cal ranges for judging deviance on Total Problems,
1.5 for 11 items across the five syndrome scales,
as compared to the DOF normative sample. As
with all other items scored 0.0.
shown in Table 3-1, the borderline range for the
Total Problems Total Problems score spans approximately the 84th
to the 90th percentiles, which correspond to T scores
Figure 3-2 shows Page 2 of the computer-scored
of 60 to 63. The clinical range is >90th percentile,
DOF Profile for Melinda Brandt, which includes
which corresponds to T scores >63. The border-
bar graphs, total raw scores, T scores, and percen-
line clinical and clinical ranges for Total Problems
tiles for Total Problems and On-task. The profile
are lower than for the DOF syndrome scales be-
also includes a list of item ratings for Other Prob-
cause the Total Problems score is comprised of all
lems that are not scored on the DOF syndrome
89 problem items (for details, see Chapter 6).
scales. The scores shown on this page of the DOF
Scores that fall in the borderline clinical range
Profile were derived from the same four DOFs for
warrant concern, but are not as clearly deviant as
Melinda and four DOFs for control children that
scores that fall in the clinical range. Scores that
were used to score the syndrome scales.
fall below the borderline clinical range (<84th per-
Total Scores, T Scores, and Percentiles for centile) are considered to be in the normal range.
Total Problems. The bar graph on the left side of As done for the syndrome scales, the letter B is
the DOF Profile in Figure 3-2 shows the Total Prob- printed next to T scores that fall in the borderline
lems scores for Melinda and the two control chil- clinical range for Total Problems, while the letter
dren. The Total Problems score is the sum of the C is printed next to T scores that fall in the clinical
averaged 0-1-2-3 ratings for the 89 problem items range. Chapter 6 describes procedures for deter-
on each DOF. The DOF Module separately aver- mining T scores and borderline and clinical
ages the item ratings for the identified child cutpoints for the Total Problems score.
(Melinda) and for the two control children. Total
Other Problems
raw scores from averaged item ratings, T scores,
and percentiles are printed beneath the bar graphs The middle column on Page 2 of the DOF Pro-
for the identified child and controls. The scale of T file contains a list of items labeled “Other Prob-
scores for Total Problems is printed on the left side lems.” These are abbreviated versions of the 36
of the bar graph. specific problem items, plus open-ended item 89,
which are not included in the five DOF syndrome
As you can see in Figure 3-2, Melinda obtained
scales for classroom observations. Scores for the
a total score of 27.0 for Total Problems, which cor-
37 other problems are included in the Total Prob-
responds to a T score of 79, falling above the 98th
lems score. The averaged ratings for the identified
percentile for the normative sample of 6-11-year-
child (ID) are listed to the left of each item and
old girls. This means that 98% of the normative
averaged ratings for control children (CRL) are
sample received a score lower than Melinda’s score
listed to the right of each item. Although the other
of 27.0 for Total Problems. The letter C printed
problems are not included in the syndrome scale
next to the T score of 79 indicates that Melinda’s T
scores, each of them may be important in its own
score fell in the clinical range for Total Problems.
Figure 3-2. Computer-scored DOF Total Problems, Other Problems, and On-task for 8-year-old Melinda Brandt.
29
right. For example, you can see in Figure 3-2 that mative sample received a score equal to or lower
Melinda obtained a score of 1.0 for 28. Out of seat than the control children’s score of 9.5 for On-task.
and 0.5 for 85. Behaves irresponsibly. She obtained Because On-task raw scores range from 0 to 10,
scores of 0.0 for all other items in the Other Prob- the mean scores can be easily translated into per-
lems list. The two control children obtained a score centages for evaluation reports. Thus, you can see
of 0.5 for 10. Clings to adults or too dependent that across the four 10-minute observation sessions,
and 1.0 for 28. Out of seat. You should examine Melinda was on-task an average of only 55% of
ratings for these Other Problems items, along with the time. By contrast, the two control children were
ratings of items comprising the DOF syndrome on-task an average of 95% of the time.
scales, to formulate interpretations of DOF results.
On-task On-Task. The two broken lines on the bar graph
for On-task in Figure 3-2 demarcate borderline
As explained in Chapter 2, observers rate on-
clinical and clinical ranges for judging deviance
task behavior of the identified child and control
for On-task compared to the DOF normative
children by marking boxes for on-task or off-task
sample. Low scores for On-task warrant concern,
that represent the last 5 seconds of each 1-minute
in contrast to high scores on the problem scales.
interval over the 10-minute observation period.
As shown in Table 3-1, the borderline clinical range
Total On-task scores can thus range from 0 to 10
for On-task scores spans approximately the 3rd to
for each observation. The DOF Module averages
7th percentiles, which correspond to T scores of 31
On-task ratings separately for the identified child
to 35. The clinical range is <3rd percentile, which
and control children across observation sessions.
corresponds to T scores <31. Scores that fall above
Total Scores, T Scores, and Percentiles for On- the borderline clinical range (>7th percentile) are
task. The bar graph on the right side of Page 2 of considered to be in the normal range. Chapter 6
the DOF Profile in Figure 3-2 shows On-task scores describes procedures for assigning T scores and
for Melinda Brandt and the two control children. borderline and clinical cutpoints to On-task scores.
Separate bars correspond to On-task T scores for
DSM-Oriented Attention Deficit/Hyperac-
the identified child (dark bar to the left) and the
tivity Problems and Inattention and Hyper-
two control children (lighter bar to the right). Be-
activity-Impulsivity Subscales
low the bar graph are the mean scores, T scores,
and percentiles obtained by Melinda and the two Children’s problems can also be viewed from
control children. the perspectives of formal diagnostic systems. The
dominant system in the United States is embodied
As you can see in Figure 3-2, Melinda obtained
in the American Psychiatric Association’s DSM-
a mean score of 5.5 for On-task. A score of 5.5
IV and DSM-IV-TR. The DSM’s diagnostic cat-
corresponds to a T score of 33, which fell at the 5th
egories are intended to serve many purposes. Un-
percentile for the normative sample of 6-11-year-
like the syndromes derived statistically from the
old girls. This means that only 5% of the norma-
DOF, DSM diagnostic categories for behavioral
tive sample received a score equal to or lower than
and emotional problems are not derived directly
Melinda’s score of 5.5 for On-task. The letter B
from problem scores obtained from standardized
printed next to the T score of 33 indicates that
assessment. Nevertheless, assessment instruments
Melinda’s T score fell in the borderline clinical
like the DOF, and other ASEBA forms like the
range for On-task. The two control children ob-
CBCL/6-18, TRF, YSR, SCICA, and TOF, are of-
tained a mean score of 9.5 for On-task. A score of
ten used to obtain data on which to base diagnoses.
9.5 corresponds to a T score of 51, which fell at
the 54th percentile for the normative sample of 6- The CBCL/6-18, TRF, YSR, and SCICA pro-
11-year-old girls. This means that 54% of the nor- files include several DSM-oriented scales compris-
ing problem items judged by experienced psychia- scores, and percentiles for the subscales, with rat-
trists and psychologists to be very consistent with ings of the identified child (ID) in the left column
DSM-IV diagnostic categories (for details, see and ratings for the controls (CRL) in the right col-
Achenbach & Rescorla, 2001 and McConaughy & umn.
Achenbach, 2001). The DOF Profile has one DSM-
Figure 3-3 shows that Melinda obtained a total
oriented scale, the Attention Deficit/Hyperactivity
score of 17.5 on the Attention Deficit/Hyperactiv-
Problems scale, which can be scored from class-
ity Problems scale, which corresponds to a T score
room observations.
of 72, falling above the 97th percentile for the DOF
To create the DOF Attention Deficit/Hyperac- normative sample of 6-11-year-old girls. The let-
tivity Problems scale, we selected DOF items that ter C printed next to the T score of 72 indicates
were similar to other ASEBA items judged to be that Melinda’s T score on Attention Deficit/Hyper-
very consistent with DSM-IV symptoms of ADHD. activity Problems fell within the clinical range for
We also added new DOF items that are similar to the normative sample. The two control children
ADHD symptoms that did not have counterparts obtained a total score of 5.0 on Attention Deficit/
among other ASEBA items. Chapter 6 presents Hyperactivity Problems, which corresponds to a T
details of how we constructed the DOF Attention score of 53, falling at the 62nd percentile for the
Deficit/Hyperactivity Problems scale. normative sample of 6-11-year-old girls.
Figure 3-3 shows Page 3 of the computer-scored The two sections of bar graphs to the right of
DOF Profile, scored for 8-year-old Melinda Brandt graphs for Attention Deficit /Hyperactivity Prob-
and two control children in the same classroom. lems show how Melinda and the two control chil-
Total raw scores for the Attention Deficit/Hyper- dren scored on the Inattention and Hyperactivity-
activity Problems scale and Inattention and Hyper- Impulsivity subscales. You can see that Melinda
activity-Impulsivity subscales were derived from obtained a total score of 5.5 on the Inattention
the same four DOFs for Melinda and two DOFs subscale, which corresponds to a T score of 68,
for each control child, as done for the other DOF falling at the 97th percentile for the normative
scales. sample of 6-11-year-old girls. The letter B printed
next to Melinda’s T score of 68 indicates that her
Total Scores, T Scores, and Percentiles for
score on Inattention was within the borderline clini-
DSM-oriented Scales.As you can see in Figure 3-3,
cal range for the normative sample. The two con-
the DOF Module prints separate bar graphs for T
trol children obtained a total score of 2.0 on Inat-
scores on the DSM-oriented Attention Deficit/Hy-
tention, which corresponds to a T score of 55, fall-
peractivity Problems scale and subscales for the
ing at the 69th percentile for the normative sample.
identified child (dark bar to the left) and averaged
Melinda obtained a total score of 12.0 on the Hy-
controls (lighter bar to right). The range of T scores
peractivity-Impulsivity subscale, which corre-
is shown to the left of the first bar graph. Beneath
sponds to a T score of 73, falling above the 97th
the bar graphs, are the total raw scores, T scores,
percentile. The letter C printed next to Melinda’s
and percentiles for the identified child and control
T score of 73 indicates that her score on Hyperac-
children for each scale. The Attention Deficit/Hy-
tivity-Impulsivity was in the clinical range for the
peractivity Problems scale includes 23 items, of
normative sample. The two control children ob-
which 10 comprise the Inattention subscale and 13
tained a total score of 3.0, corres-ponding to a T
comprise the Hyperactivity-Impulsivity subscale.
score of 52, falling at the 58th percentile.
The Attention Deficit/Hyperactivity Problems to-
tal score equals the sum of the Inattention and Hy- Borderline, Clinical, and Normal Ranges for
peractivity-Impulsivity subscale total scores. The Attention Deficit/Hyperactivity Problems and
averaged ratings of items comprising each of the Subscales. The two broken lines on the bar graphs
two subscales are printed below the total scores, T
32
Figure 3-3. Computer-scored DOF Attention Deficit/Hyperactivity Problems and Inattention and Hyperactivity-Impulsivity subscales
for 8-year-old Melinda Brandt.
in Figure 3-3 demarcate borderline clinical and rizes scale scores on the DOF Profile for the iden-
clinical ranges for the DSM-oriented scale and tified child and control children. You can easily
subscales. As shown in Table 3-1, the borderline import the Narrative Report into a word process-
range for the DSM-Oriented Attention Deficit/ ing document when writing evaluation reports. This
Hyperactivity Problems scale and the Inattention not only makes report writing more efficient, but
and Hyperactivity-Impulsivity subscales spans the also guarantees the accuracy of the scores cited for
93rd to 97th percentiles, which correspond to T each DOF scale. Another option is to include the
scores of 65 to 69. The clinical range is >97th per- DOF Narrative Report as an addendum to evalua-
centile, which corresponds to T scores >69. Scores tion reports and/or as a note in a child’s case record.
falling below a T score of 65 and below the 93rd Figure 3-5 shows the DOF Narrative Report for
percentile are considered to be in the normal range. observations of Melinda Brandt.
As indicated in the previous section, Melinda’s
scores on the Attention Deficit/Hyperactivity Prob- DOF PROFILE FOR RECESS
lems scale and the Hyperactivity/Impulsivity OBSERVATIONS
subscale were in the clinical range, while her score
on the Inattention subscale was in the borderline The DOF Profile for recess observations con-
clinical range. Scores for the two control children sists of 2 pages. Page 1 displays bar graphs for the
were in the normal range. Aggressive Behavior syndrome scale and Total
Problems, plus a list of items and ratings for Other
Summary Report for Classroom Problems not scored on the Aggressive Behavior
Observations syndrome. Page 2 is a Summary Report with de-
scriptive information about the DOFs that were
Figure 3-4 shows Page 4 for the DOF Profile
used to score the DOF Profile.
derived from classroom observations of Melinda
Brandt and the two control children. Page 4 is a Figure 3-6 shows Page 1 of the DOF Profile for
Summary Report that provides descriptive infor- recess observations, scored for 9-year-old Ricky
mation about each of the four DOFs for Melinda Johnson (not his real name) and two control chil-
and four DOFs for the control children that were dren. As you can see in the column on the right
used to score the DOF Profile. You can see in the side at the top of the profile, the observer (Harry
Summary Report that the gender of the two con- Provo) obtained six observations of Ricky, four
trol children was the same as for the identified child observations of Control Child 1, and two observa-
(female) and that the same observer (Valerie Stone) tions of Control Child 2. The observations were
completed all eight DOFs. The observations were done over an 8-day observation period from 10/
done in the morning and afternoon of two differ- 09/07 to 10/16/07. The observer labeled the obser-
ent days (02/15/07 and 02/17/07) and during a va- vation set for the twelve DOFs as “Playground Fall
riety of classroom activities (reading, social stud- 2007.”
ies, class meeting, and math). Chapter 5 discusses
Aggressive Behavior Syndrome Scale
Melinda’s case in more detail, including reports
from her mother and teacher about her behavior at The bar graph on the left side of the DOF Pro-
home and at school. file in Figure 3-6 shows scores on the Aggressive
Behavior Syndrome scale for Ricky Johnson, the
Narrative Report for Classroom identified child, (dark bar on the left) and the two
Observations control children (lighter bar on the right). (Chap-
In addition to printing the DOF Profile and Sum- ter 6 describes our factor analyses to derive the
mary Report, the DOF Module gives users the op- Aggressive Behavior syndrome scale.)
tion of printing a Narrative Report that summa- Total Scores, T Scores, and Percentiles for
34
Figure 3-4. Summary Report of DOFs for classroom observations of 8-year-old Melinda Brandt.
Figure 3-5. Narrative Report summarizing DOF results for classroom observations of 8-year-old
Melinda Brandt.
36
Figure 3-6. Computer-scored DOF Profile for recess observations of 9-year-old Ricky Johnson.
Aggressive Behavior. Beneath the bar graph, you lines on the DOF Profile are high enough to be of
can see the total raw scores, T scores, and percen- concern, but are not as clearly deviant as scores
tiles for the identified child and control children. that fall in the clinical range above the top broken
The range of T scores is shown to the left of the line. The DOF Module prints the letter B next to T
bar graph. The averaged ratings of each of the items scores that fall in the borderline clinical range and
comprising the Aggressive Behavior syndrome are the letter C next to T scores that fall in the clinical
printed below the scores and percentiles, with rat- range. Scores that fall below the borderline clini-
ings of the identified child (ID) in the left column cal range (<93rd percentile) are considered to be in
and ratings for the controls (CRL) in the right col- the normal range. Chapter 6 describes procedures
umn. for assigning borderline clinical and clinical
cutpoints to the DOF Aggressive Behavior syn-
On the DOF Profile for Ricky, you can see that
drome scale.
he obtained a total score of 5.5 on the Aggressive
Behavior syndrome, which corresponds to a T score Total Problems
of 74, which was above the 97th percentile for the
The bar graph to the right of Aggressive Be-
DOF normative sample of 6-11-year-old boys. This
havior in Figure 3-6 shows the Total Problems
means that at least 97% of the DOF normative
scores for Ricky and the two control children.
sample received a score lower than Ricky’s score
of 5.5 on Aggressive Behavior. The letter C indi- Total Scores, T Scores, and Percentiles for
cates that Ricky’s T score of 74 fell within the clini- Total Problems. The Total Problems score is the
cal range for the normative sample. The two con- sum of the averaged 0-1-2-3 ratings for 88 prob-
trol children obtained an averaged total score of lem items on each DOF. (Item 28. Out of seat is
1.0 on Aggressive Behavior, which corresponds to not included in the Total Problems score for recess
a T score of 58, falling at the 79th percentile for the observations.) The DOF Module separately aver-
DOF normative sample of 6-11-year-old boys. ages the item ratings for the identified child (Ricky)
and the two control children. Total raw scores from
When you examine the averaged item scores
averaged item ratings, T scores, and percentiles are
listed below the bar graph, you can see that the
printed beneath the bar graph for the identified child
observer rated 6 of 9 problems as present for Ricky:
and controls.
14. Cruel, bullies, or mean to others; 30. Gets into
physical fights; 31. Gets teased; 47. Screams; 66. As you can see in Figure 3-6, Ricky obtained a
Teases; and 86. Bossy. The labels are abbreviated total score of 11.0 for Total Problems, which cor-
versions of the problem items. The control chil- responds to a T score of 77, which was above the
dren, by contrast, were rated 0.0 on all items ex- 98th percentile for the normative sample of 6-11-
cept 47. Screams. year-old boys. The letter C printed next to the T
score of 77 indicates that Ricky’s T score fell in
the clinical range for Total Problems. By contrast,
Aggressive Behavior. The two broken lines on the
the two control children received an averaged To-
bar graph for Aggressive Behavior in Figure 3-6
tal Problems score of 3.5, which corresponds to a
demarcate borderline and clinical ranges for judg-
T score of 62, falling at the 89th percentile for the
ing the deviance of scores, as compared to the nor-
normative sample of 6-11-year-old boys. The let-
mative sample. As shown in Table 3-1, the border-
ter B printed next to the T score of 62 indicates
line clinical range for the DOF Aggressive Behav-
that the control children’s T score fell in the bor-
ior syndrome scale spans the 93rd to the 97th per-
derline clinical range for Total Problems.
centiles, which corresponds to T scores of 65 to
69. The clinical range is >97th percentile, which Borderline, Clinical, and Normal Ranges for
corresponds to T scores >69. Scores that fall in the Total Problems. The two broken lines on the bar
borderline clinical range between the two broken
graph display in Figure 3-6 demarcate borderline motor control; and 87. Complains. You should ex-
and clinical ranges for judging deviance on Total amine ratings for these Other Problems items, along
Problems, as compared to the DOF normative with ratings of items comprising the DOF Aggres-
sample. As shown in Table 3-1, the borderline range sive Behavior syndrome, to formulate interpreta-
for the Total Problems score spans approximately tions of DOF results.
the 84th to the 90th percentiles, which correspond
to T scores of 60 to 63. The clinical range is >90th Summary Report for Recess
percentile, which corresponds to T scores >63. You Observations
can see on the profile that the borderline clinical Figure 3-7 shows Page 2 for the DOF Profile
and clinical ranges for Total Problems are lower derived from recess observations of Ricky Johnson
than for the DOF Aggressive Behavior syndrome and the two control children. Page 2 is a Summary
scale. This is because the Total Problems score for Report that provides descriptive information about
recess observations comprises 88 problem items, each of the six DOFs for Ricky and six DOFs for
in contrast to 9 items for Aggressive Behavior. the control children that were used to score the DOF
Scores that fall below the borderline clinical range Profile. You can see in the Summary Report that
(<84th percentile) are considered to be in the nor- the observations spanned an 8-day period from 10/
mal range. As done for Aggressive Behavior, the 09/07 to 10/16/07. The observer (Harry Provo) did
letter B is printed next to T scores that fall in the two observations of Ricky on each of the three days,
borderline clinical range for Total Problems, while four observations of Control Child 1 on each of
the letter C is printed next to T scores that fall in the first two days, and two observations of Con-
the clinical range. trol Child 2 on the third day. All of the observa-
Other Problems tions were conducted on the playground. Chapter
5 discusses Ricky’s case in more detail, including
A list of items labeled “Other Problems” is reports about his behavior from his mother and
printed on the right side of the DOF Profile for teacher.
recess observations. These are abbreviated versions
of the 78 specific problem items, plus open-ended Narrative Report for Recess
item 89, which are not included in the Aggressive Observations
Behavior syndrome scale. The averaged ratings for Figure 3-8 shows the DOF Narrative Report
the identified child (ID) are listed to the left of each summarizing scale scores on the DOF Profile for
item and averaged ratings for control children recess observations of Ricky Johnson. As indicated
(CRL) are listed to the right of each item. Scores earlier, you can easily import the Narrative Report
for the Other Problems items are included in the into a word processing document for evaluation
Total Problems score. As you can see in Figure 3- reports. Or you can include the Narrative Report
6, Ricky obtained scores of 0.5 to 1.0 for 3. Ar- as an addendum to evaluation reports and/or as a
gues; 8. Difficulty waiting turn in activities or tasks; note in a child’s case record.
20. Disobedient; 22. Doesn’t seem to feel guilty
after misbehaving; 67. Temper tantrums, hot tem- SUMMARY
per, or seems angry; and 83. Doesn’t get along with
peers. He obtained scores of 0.0 for all other items The computer-scored DOF Profile provides a
in the Other Problems list. The two control chil- visual, quantitative picture of children’s problems
dren obtained scores of 0.5 to 1.0 for 8. Difficulty rated by observers in classrooms or at recess. The
waiting turn in activities or tasks; 75. Withdrawn, DOF can be scored only by computer because av-
doesn’t get involved with others; 82. Clumsy, poor eraging scores across multiple observation sessions
would be too complex for hand-scoring. The DOF
Module requires a minimum of two DOFs for the
identified child for computer-scoring. DOFs for
control children are optional, but recommended in
order to provide a standard for evaluating the iden-
Figure 3-7. Summary Report of DOFs for recess observations of 9-year-old Ricky Johnson.
39
Figure 3-8. Narrative Report of summarizing DOF results for recess observations of 9-year-
old Ricky Johnson.
Chapter 4
Training DOF Observers and Conducting
School Observations
As indicated in earlier chapters, the DOF is de- servers to take with them to observation sites. After
signed for rating observations of children’s behav- initial training with the DOF Manual, trainees
ior in school classrooms, at recess, and other group should practice DOF recording and rating proce-
settings. Observers should have some knowledge dures through paired observations of children in
of child behavior and development, as well as school classrooms or comparable group settings.
theory and methodology of behavioral assessment. One observer can be a trainee and the second can
Observers may be paraprofessionals, such as teach- be an experienced DOF observer. Or two trainees
ers’ aides, undergraduate and graduate students, re- can practice together and then meet with an expe-
search assistants, and professionals in education, rienced DOF observer as trainer. Another option
school psychology, clinical psychology, and related is to have pairs of trainees view videotapes of chil-
disciplines. Page iii describes user qualifications dren in group settings and then discuss their ob-
for the DOF. For professionals with training in stan- servations and ratings with the trainer. We used
dardized assessment, a thorough understanding of both approaches in our research to develop the
the procedures described in this Manual is usually DOF. In either approach, responsible trainers must
sufficient for using the DOF and interpreting the adhere to requirements specified by an appropri-
DOF Profile. Paraprofessionals, students, and re- ate institutional review board or school adminis-
search assistants will require supervision and train- trative office to obtain proper permission for di-
ing by a qualified professional. In this chapter, we rect observations and/or videotapes of practice
provide guidelines for training DOF observers and cases for training purposes.
conducting observations in school settings. We also
The two observers should each observe the same
discuss procedures for assessing inter-observer
practice case for the same 10-minute period. Dur-
agreement and inter-rater reliability.
ing the 10 minutes, each observer writes a narra-
tive description of the child’s behavior and rates
TRAINING DOF OBSERVERS the child as being on-task or off-task at the end of
All users should read Chapters 1 through 3 of each 1-minute interval, as instructed on the DOF
this Manual to learn about the DOF and the com- and in Chapter 2. At the end of the 10-minute ob-
puter-scored DOF Profile. Chapter 2 provides in- servation period, without discussing their obser-
structions for rating the DOF problem items and vations, each observer then rates the child on the
the child’s on-task behavior. Users should pay spe- 89 DOF problem items. After completing their
cial attention to the Guidelines for Rating Specific DOFs, the two observers should compare their rat-
DOF Problem Items in Chapter 2. Brief instruc- ings for on-task and their ratings for the problem
tions are also provided on the DOF. items and should discuss any discrepancies be-
tween their observations and ratings. However,
As a first step in training DOF observers, su- observers should make no changes on any of their
pervisors should meet with them to discuss the DOF ratings based on these comparisons. The two
DOF rating procedures described in Chapter 2. It observers should then select a new child for an-
is also good to provide copies of the Guidelines other 10-minute observation, following the same
for Rating Specific DOF Problem Items for ob- procedure as for the first child. We recommend that
41
42 4. Training DOF Observers and Conducting School Observations
pairs of observers select at least five children as investigator must obtain approval from an institu-
practice cases for training on the DOF. Observers tional review board to conduct direct observations
should rate each child on a separate DOF. of children as part of a research protocol. School-
based assessments and research protocols usually
After completing their paired observations of
require letters and forms for obtaining informed
the practice cases, trainees should meet with the
consent from parents.
trainer to discuss discrepancies in their on-task and
item ratings, referring to the instructions and guide- Supervisors and researchers should create a
lines in Chapter 2. Thereafter, pairs of trainees, or standard form that indicates to the child’s teacher
a trainee and an experienced observer, should ob- that the parent of an identified child has given per-
serve and rate additional children until good agree- mission for the child to be observed. This form
ment is reached. We recommend making paired should adhere to policies of the school administra-
observations of at least 5 to 10 practice cases for tive office or an appropriate institutional review
training purposes. When the DOF is used in re- board, depending on the purpose of the observa-
search protocols, we recommend paired observations. The observer can then present this form to
tions on additional cases to assess inter-observer the teacher before the first observation.
agreement and inter-rater reliability, as discussed
Scheduling Observations
in later sections.
Supervisors and observers should develop a
GUIDELINES FOR OBSERVATIONS standard procedure for scheduling observations
IN SCHOOLS with teachers and other relevant school personnel.
Scheduling can be done by the supervisor, a desig-
Because the DOF is designed primarily for ob- nated assistant, or the observer. Supervisors and
serving children in school settings, we offer the observers should determine the best format for
following guidelines for using the DOF in schools. regular communication, (e.g., e-mail, phone voice
Trainers, supervisors, and researchers can adapt mail, cell phone). The supervisor should provide
these guidelines to fit their own procedures and the necessary information for each scheduled ob-
particular schools. servation: child’s name, child’s teacher, name of
Obtaining Permission for Observations the school, and directions to the school. Some su-
pervisors may want to complete all of the demo-
All users of the DOF should comply with school graphic information about the child on Page
policies regarding parental permission for direct 1 of the DOF. Others may depend on the observer
observations of children. Observations of practice to complete the information after the observation.
cases and control children do not require observ- If an observer is sick or otherwise unable to com-
ers to know the name of the child. For these obser- plete a scheduled observation, he/she should con-
vations, passive consent may be all that is required. tact the person scheduling observations as soon as
For example, a principal or other school adminis- possible to reschedule the observation. Observa-
trator may contact parents by letter to inform them tions of the identified child and control children
that children will be observed anonymously at should all be done within 1 to 2 weeks, whenever
school and explain the purpose of the observations. feasible. The next sections present guidelines for
Parents can then contact the school administrator observers in school settings. Supervisors and train-
if they do not want their child observed. If an iden- ers may adapt these guidelines to fit their needs
tified child is observed as part of a formal assess- and setting.
ment, such as a comprehensive special education
evaluation, then the evaluator and appropriate Guidelines for DOF Observers in Schools
school staff must follow procedures for obtaining Prior to Arrival at a School
parental permission. For research, the principal
4. Training DOF Observers and Conducting School Observations 43
 Whenever possible, call the school the morn- “I’m Jane Doe. The family of a child
ing of your scheduled observation to make sure in your classroom has granted permis-
the child is in school that day. If inclement sion for me to observe him/her. My su-
weather is likely, listen to the radio the morn- pervisor [or I] contacted you to arrange
ing of the observation to determine whether the time for this observation.” Show
school has been cancelled. the permission form with the child’s
name to the teacher. This is a good way
 Regardless of the weather, plan to arrive at
to identify the child without stating the
least 15 minutes early on the first observation
child’s name aloud.
day to allow time to visit the school office and
introduce yourself to the teacher. You may not
 Ask the teacher to quietly point out the identi-
need to arrive as early on your second observa-
fied child. Make sure that the child does not see
tion day.
that he/she is being singled out. Tell the teacher
Professional Dress that you are not supposed to know anything
about the child to ensure that the teacher does
 Dress professionally. Do not wear shorts, jeans,
not provide background or other information
t-shirts, shirts that expose the midriff, or other
that may influence your observations.
very casual clothing. Do not wear sleeveless
shirts or tank tops without jackets because these  When you begin your observation, fold the DOF
are sometimes against school dress codes. Wear- so that Page 1 with the identified child’s name
ing earrings is acceptable, but remove other is not visible. If the identified child’s name is
visible piercings for the observation session. not on the DOF, you can wait to write in the
child’s full name until after you have completed
 Avoid clothing or accessories with slogans be-
your observation and left the room so that no
cause some schools do not allow them.
one will see the name of the child being ob-
 Because some children and adults have aller- served.
gies, do not wear perfume or scented lotions
 If the teacher would like to introduce you in
(e.g., cologne, lotion, or after-shave).
the classroom, ask the teacher to avoid indicat-
Beginning Observations ing that you will observe a specific child. For
instance, the teacher could say:
 When you enter the school building, go to the
school’s office. Introduce yourself to the secre- “This is Jane. She is here to learn about
tary and inquire about the procedure for visit- what we do in second grade. So she is
ing the school. Many schools will require you just going to watch our class for a little
to sign in and/or wear a “visitor” nametag. In- while.”
troduce yourself to the principal if he/she is
 Some children may be curious and might ask
readily available.
what you are doing. If this happens, just give a
 Ask at the office for directions to the child’s very general comment about observing the
classroom. You may also want to inquire at the class. For example, you might say:
office if the child’s teacher is aware that you
“I haven’t been in second grade for a
will be observing the child that day.
long time, so I am here to see what
 When you arrive at the classroom, wait for an children do in second grade.”
appropriate moment to introduce yourself pri-
 If a child asks what you are writing, explain
vately to the teacher. For instance, you might
that you are making notes to remind yourself
say,
of what you saw. Do not show children the DOF,
but don’t try to hide it in ways that might raise vation during a very brief non-academic activ-
suspicions. Make sure the identified child’s ity, such as snack time or lining up to get mate-
name cannot be seen. Also make sure that the rials or changing activities (e.g., moving from
permission form with the identified child’s a reading group back to a student’s desk to do
name is not visible. math). You can write the activity in the spaces
on Page 2 where you make your notes of your
 Some children might want to show you things,
observations. If there is more than one activity
such as their schoolwork, drawings or stories.
during a 10-minute observation, write the pre-
You can briefly acknowledge these with a nod
dominant activity in the space next to “Setting”
or smile, but don’t be overly encouraging. You
on Page 1 of the DOF.
won’t be able to properly observe if you are
interacting with children. If necessary, move to  Observe the identified child for a full 10 min-
a place farther away from a child who wants to utes. If the child leaves the classroom for any
interact. If a child is insistent on showing some- reason (e.g., bathroom, drink of water) during
thing, you might say: this time, stop observing, and mark the break
on the DOFwhere you are writing your notes.
“Your drawing is great. Thank you
Begin observing again 30 seconds after the
for showing it to me. I can’t talk to
child returns to the classroom to allow the child
you about it now because my job is
time to settle in. You do not have to start an
to watch what everyone is doing, so
observation session over again if breaks like
that I can really learn about what goes
this occur after the first 2 minutes of observa-
on in the whole class.”
tion. For example, you might obtain 6 minutes
Observing the Identified Child of observations before a child leaves the class-
room, then wait 2 minutes for the child to re-
 Find a place in the classroom where you can
turn to class, allow the child to settle in for 30
observe the identified child unobtrusively, but
seconds, and then finish the additional 4 min-
can clearly see what the child is doing, includ-
utes of the 10-minute observation period.
ing seeing the child’s face. Do not make it ob-
vious which child you are watching. If the child  If you have observed the child for 2 minutes or
moves to another part of the room, you can less and then cannot complete the remaining 8
move to a new spot to see better. When you do minutes of observation, discontinue that obser-
move, try to do so without calling attention to vation session and begin a new 10-minute ob-
yourself. servation with a new DOF. For example, you
may begin an observation in the last minute of
 For classroom observations, observe the iden-
an academic period and then the child leaves
tified child during an academic activity. Aca-
the classroom for a one-hour break for lunch
demic activities include math, reading, social
and recess. In such a case, you should begin a
studies, and science, but may also include in-
new 10-minute observation when the child re-
dependent seatwork, circle discussions for
turns to the classroom after the break.
young children, and other learning activities.
If the class activity is not an academic activ-  After each 10-minute observation, stop observ-
ity—such as snack, free time, an assembly, or ing and complete your ratings of the 89 DOF
a “special” (e.g., gym, music), or a birthday items before beginning a new observation.
party—wait for normal classroom academic ac-
tivities. You may continue an on-going obser-
 As indicated above, try to get a full 10-minute  Write the date of the observation in the box for
observation for each DOF. If you obtain at least “Today’s Date.”
8 to 9 minutes of an observation, but then can-
 Check one box for “Setting: Class or Recess.”
not complete the remaining 1 to 2 minutes in
You can also write in the predominant activity
the same time period (i.e., the morning or af-
during the observation session.
ternoon of the same day), you may count that 8
to 9 minute session as a complete observation.  If not done already, record other information
However, an 8-to-9-minute observation, in place on Page 1 of each DOF for the identified child:
of 10 minutes, should be very rare. child’s gender, age, ethnic group or race, and
grade or level in school, and child’s birthdate
 If the identified child shows extremely unusual
(if known). If you do not know some of this
behavior, proceed with the observation anyway.
information, have your supervisor complete
After all your observations and DOF ratings
those sections. You or your supervisor can en-
are completed, you may ask the teacher pri-
ter an ID # for the identified child on each DOF,
vately if the child’s behavior was very unusual
if not done already. Each identified child should
and note the teacher’s answer on the DOF.
have a unique ID # that is the same for all DOFs
However, consultation with teachers is gener-
for that child and any matched controls. You
ally not encouraged because it may lead to in-
or your supervisor can also assign a label for
formation that will compromise your status as
“Observation Set.” See Page 4 of the DOF for
an independent observer. Consult your super-
instructions for completing information on
visor after the observation if you have concerns
Page 1.
about the child.
 Repeat the observation procedures using a new
Observing Control Children
DOF for the next 10-minute observation of the  For each identified child, select one or two con-
identified child. trol children of the same gender and age in the
 On Page 1 of each DOF, write the identified same classroom. You do not have to know the
child’s full name. As indicated earlier, you can names of the control children. Choose control
wait to write the identified child’s full name children who do not sit near the identified child
until after you leave the room. and do not interact with the identified child.
For example, you can choose a child who sits
 Write your first and last name in the box for at a diagonal across the room from the identi-
“Observer’s Name.” fied child, or who sits in a group of other chil-
 Write the number of each observation in the dren across the room. Try to use the same strat-
box for “Observation #” in the right hand cor- egy each time for choosing control children. If
ner of Page 1 of the DOF. Number observation a control child does interact with the identified
sessions for each identified child consecutively child, indicate this in your notes on Page 2 of
over the observation sessions (e.g., 1, 2, 3, 4, the DOF, but continue your observation of the
5, 6). control child for the full 10 minutes.
 In the section for “Observed Child,” check the  Do not select the control child on the basis of
box for “Identified child” to indicate each DOF particular behaviors that the child displays, be-
that was completed for the identified child. cause the control child should be an anonymous
“random” selection. The only exclusions for
 Write the time you begin each observation in control children are obvious physical disabili-
the box for “Time of Day.” ties (e.g., in a wheel chair or has a broken arm)
or mental disabilities or mental retardation as a double check for indicating controls ver-
(e.g., Down syndrome, seizure disorder). sus identified children.
 Write observations for each control child in the  In the section for “Observed Child,” check the
same way that you wrote observations of the box for “Control Child 1” to indicate each DOF
identified child. Use a separate DOF for each that was completed for the first control child.
10-minute observation of each control child. Check the box for “Control Child 2” to indi-
cate each DOF that was completed for the sec-
 We recommend observing two control children
ond control child.
for each identified child, if possible.
 Write the time you begin each observation in
 Try to alternate observations of the control child
the box for “Time of Day.”
with observations of the identified child. That
is, if you are observing two control children on  Write the date of the observation in the box for
the same day as the identified child: observe “Today’s Date.”
Control Child 1, then observe the Identified
 Check one box for “Setting: Class or Recess.”
Child, then observe Control Child 2; then ob-
You can also write in the predominant activity
serve Control Child 1 again, then observe the
during the observation session.
Identified Child again, then observe Control
Child 2 again. This is the ideal sequence for  Record other information on Page 1 of each
observations, but it may not always be possible DOF for the control child: child’s gender, age,
on the same day. For example, for easier sched- ethnic group or race, and grade or level in
uling, you and your supervisor may decide to school. If you do not know the age of control
obtain several DOFs for one control child on children, write the age of the identified child
the same day and then several DOFs for a sec- as an estimate of age or leave blank. You will
ond control child on a different day. If you do not know the control child’s birthdate, so leave
this, you should still alternate observations of that blank. You or your supervisor can enter an
the control child with observations of the iden- ID # on each DOF, if not done already, to indi-
tified child. cate which identified child is linked to each
control child. Each identified child should have
 In the section for “Identified Child’s Name”
a unique ID # that is the same for all DOFs for
on Page 1 of the DOF, write a brief description
that child and any controls matched to the iden-
of each control child (e.g., girl with short blond
tified child. You or your supervisor can also
hair, boy with dark curly hair). Or write an ab-
assign a label for “Observation Set.” See Page
breviation to show the link between the identi-
4 of the DOF for instructions for completing
fied child’s name and the control child (e.g., if
information on Page 1.
the identified child is John Eric Smith, Con-
trol Child 1 might be labeled “JES-C1”). This Completing Observations
will provide an additional check to be sure
 Always thank the teacher after you complete
which DOFs belong to which control child.
your observations. You do not need to inter-
 Write your first and last name in the box for rupt class to do this. For example, when you
“Observer’s Name.” are ready to leave, you can stand by the door
until you make eye contact with the teacher and
 Write the number of each observation of each
then mouth the words “thank you.”
control child (e.g., 1, 2, 3, 4) in the box for
“Observation #” in the right hand corner of  Check Page 1 to make sure all information has
Page 1 of the DOF. You can add the letter “C” been completed on each DOF.
to each observation number (1C, 2C, 3C, 4C)
 Return the completed DOF forms to your su- this example, the 62.5% IOA would indicate a need
per-visors or as soon as you have completed for more training, and/or perhaps more refinement
all observations for a particular identified child of the definition of out-of-seat behavior.
and any matched control children.
IOA for DOF On-Task
 Return completed DOFs for control children
to your supervisor at the same time as you re- On Page 2 of the DOF, observers rate the ob-
turn DOFs for the identified child. served child as being “on-task” or “off-task” dur-
ing the last 5 seconds of each 1-minute interval
over a 10-minute observation period. Figure 4-1
ASSESSING INTER-OBSERVER
provides a worksheet that you can copy and use to
AGREEMENT
compute inter-observer agreement (IOA) for DOF
As part of their training of DOF observers, su- On-task ratings across five practice cases. Each ob-
pervisors and researchers may want to assess in- server should complete one DOF for the same 10-
ter-observer agreement (IOA) on practice cases. minute observation period for each practice case.
IOA refers to the extent to which two observers When the two observers both rate the child as “on-
agree on the occurrence and nonoccurrence of the task” in the same 1-minute interval, consider this
same behavior over the same observation period. an “Occurrence Agreement.” When the two observ-
We recommend using the “Percent Agreement In- ers both rate the child as “off-task” in the same 1-
dex” to calculate IOA (Hintze, 2005). This method minute interval, consider this a “Nonoccurrence
involves counting the total number of agreements Agreement.” When one observer rates the child as
and dividing that by the total of agreements plus “on-task” and the other observer rates the child as
disagreements. When computing the Percent “off-task” in the same 1-minute interval, consider
Agreement Index, it is important to consider IOA this a “Disagreement.” The five columns for prac-
separately for occurrences of a behavior and for tice cases in the worksheet provide spaces for re-
nonoccurrences of the same behavior in order not cording the number of Occurrence Agreements (O),
to inflate the level of agreement. For example, sup- Nonoccurrence Agreements (N), and Disagree-
pose over ten 1-minute intervals, two observers ments (D) across each 10-minute observation pe-
recorded six occurrences of out-of-seat behavior riod for each case. To do this, follow the instruc-
for the same intervals and two non-occurrences for tions at the top of the worksheet:
the same intervals, but they disagreed on occur- Occurrence Agreements (O): Record the num-
rence versus nonoccurrence for two intervals. IOA ber of 1-minute intervals when both observers
for occurrence agreements would be 6/6+2 = 6/8 = rated the child as “on-task” for each case.
.75 x 100 = 75%. IOA for nonoccurrences would Nonoccurrence Agreements (N): Record the
be 2/2+2 = 2/4 = .50 x 100 = 50%. However, if number of 1-minute intervals when both observ-
both occurrences and nonoccurrences were in- ers rated the child as “off-task” for each case.
cluded in the same calculation, IOA would be 6+2/ Disagreements (D): Record the number of 1-
6+2+2 = 8/10 = .80 x 100 = 80%. minute intervals when one observer rated the
child as “on-task” and the other observer rated
To avoid inflating IOA, we compute IOA sepa- the child as “off-task” for each case.
rately for occurrences and nonoccurrences and then
compute the mean IOA to obtain a single index of To compute IOA for On-task, follow the steps
IOA. In the above example, mean IOA would be shown in Figure 4-2.
(.75 + .50)/2 = .625 x 100 = 62.5%. The generally Figure 4-3 shows an example of a completed
accepted level of IOA for good agreement on dis- worksheet of IOA for DOF On-task based on five
creet behaviors is 80 to 90% (Hintze, 2005). So in practice cases observed by a pair of two observers
Figure 4-1. Worksheet for computing IOA for On-task.

Figure 4-2. Steps for computing IOA for On-task.

Figure 4-3. Example of IOA for On-task.

(Nancy Jones and Valerie Stone). Each observer plete a separate DOF for the same 10-minute ob-
completed one DOF for the same 10-minute observation of each practice case. In the first column
servation of each practice case. The entries in the of the worksheet, list the DOF problem items rated
columns under each practice case show the num- 1, 2, or 3 by at least one observer for each set of 2
ber of Occurrence Agreements (O), Nonoccurrence DOFs per practice case. Then following the instruc-
Agreements (N), and Disagreements (D) for each tions at the top of the worksheet, use the letters O,
case. The last column shows the total O, N, and D N, or D to note occurrences, nonoccurrences, and
across all five cases: O = 28, N= 18 and D = 4. The disagreements between the two observers for each
bottom of the worksheet shows the computations item listed:
of IOA using these data. IOAO for Occurrence Record “O” when the two observers agreed on
Agreements was 87.5%; IOAN for Nonoccurrence the occurrence of an item for each case (i.e.,
Agreements was 81.8%; and Mean IOA was both observers rated the item 1, 2, or 3); the ob-
84.5%, which is within generally accepted criteria servers do not have to agree on their numerical
for agreement. This suggests that no further train- rating.
ing would be necessary for On-task ratings by these Record “N” when the two observers agreed on
two observers. the nonoccurrence of an item for each case (i.e.,
both observers rated the item 0).
IOA for DOF Problem Items
Record “D” when the two observers disagreed
With some modification, the same procedures on the occurrence or nonoccurrence of an item
for each case (i.e., one observer rated the item
can be used to determine IOA for the DOF prob-
0, and the other observer rated the item 1, 2, or
lem items. This is a bit more complicated than for 3).
On-task because the DOF contains multiple target
behaviors rated over a 10-minute observation pe- To compute IOA for the DOF problem items,
riod. However, over such a relatively short period, follow the steps shown in Figure 4-5. In the last
most observers are likely to rate only a few DOF three columns of the worksheet, enter the sum of
items as present. To compute IOA for the DOF the O, N, D across all cases for each of the items
problem items, you must first dichotomize the 0- listed in the first column. At the bottom of the last
1-2-3 DOF ratings into occurrences versus three columns, enter the totals for O, N, and D.
nonoccurrences. To do this, consider ratings of 1, Figure 4-6 shows an example of a completed
2 or 3 as an occurrence of the problem and a rating worksheet for IOA for the DOF problem items for
of 0 as a nonoccurrence. Then, determine the num- the same five practice cases observed by the same
ber of occurrences and nonoccurrences for each pair of observers in Figure 4-3 (Nancy Jones and
problem for the same child rated by the two ob- Valerie Stone). The entries in the columns under
servers. Because many items are likely to be rated each practice case show the Occurrence Agree-
0 by both observers, we recommend counting oc- ments (O), Nonoccurrence Agreements (N), and
currences and nonoccurrences only for those items Disagreements (D) on each item for that case. The
that are scored present (i.e., rated 1, 2, or 3) by at last three columns in the worksheet show the sums
least one observer for at least one case. This will of O, N, and D for each item across all five cases.
avoid inflating IOA by including many Totals for O, N, and D across items and cases are
nonoccurrence agreements. entered at the bottom of each of the three columns:
Figure 4-4 provides a worksheet that you can O = 29, N = 15, and D = 6. The bottom of the
copy and use to compute IOA for the DOF prob- worksheet shows the computations of IOA using
lem items rated present by two paired observers these data. IOAO for Occurrence Agreements was
for five practice cases. Each observer should com- 82.9%; IOAN for Nonoccurrence Agreements was
71.5%; and Mean IOA was 77.2%. The IOAO was
Figure 4-4. Worksheet for computing IOA for problem items.

Figure 4-5. Steps for computing IOA for problem items.

Figure 4-6. Example of IOA for problem items.

within generally accepted criteria for agreement, Each observer should complete one DOF for
but IOAN and Mean IOA were slightly below ac- each case. Or observers can each complete two
ceptable levels. This suggests that additional train- DOFs per case based on two separate 10-minute
ing is necessary to improve agreement between the observations. If two DOFs are completed for each
two observers, especially for Nonoccurrences. For case, then On-task and DOF problem item ratings
example, the trainer could meet with the two ob- should be averaged across the two observation ses-
servers to review the Guidelines for Rating DOF sions. The latter approach would require twice as
Problem Items in Chapter 2 and discuss disagree- much time, but would have the advantage of ob-
ments on specific items, such as items 7, 33, 100, taining more stable measures of On-task and prob-
and 101. lem behavior for each case.
We recommend computing IOA on paired ob- To assess the consistency of DOF ratings across
servations of practice cases as a way to assess the cases, you would first obtain total On-task scores
adequacy of training for DOF observers. Some and raw scores for each of the relevant DOF prob-
trainers and researchers may also want to periodi- lem scales (for classroom observations: five syn-
cally check IOA with additional paired observa- dromes, DSM-oriented Attention Deficit/Hyperac-
tions to determine whether observers continue to tivity Problems scale and Inattention and Hyper-
follow the rating guidelines and thereby guard activity-Impulsivity subscales, and Total Problems-
against “observer drift.” Classroom; for recess observations: Aggressive Be-
havior syndrome, and Total Problems-Recess).
ASSESSING INTER-RATER Pearson correlations (r) can then be computed for
RELIABILITY On-task scores and the DOF problem scale scores
across all non-practice cases. That is, instead of
Chapter 7 presents our research on the reliabil- examining IOA for occurrences or nonoccurrences
ity of the DOF. These data are useful for evaluat- for On-task and problem item ratings, Pearson r
ing the psychometric properties of the DOF. At the assesses the consistency of paired quantitative scale
same time, supervisors and researchers may also scores obtained by two observers for the same set
want to assess the inter-rater reliability of their DOF of multiple cases.
observers, which requires paired observations of
multiple cases after initial training on practice Pearson r ranges from -1.00 to +1.00. A corre-
cases. To do this, we recommend that paired ob- lation of .00 means that the two observers’ DOF
servations be obtained on at least 15 cases, and scores are unrelated. That is, the scores do not go
preferably 20 or more, if time allows. (These cases up or down together at all. A high positive correla-
can be randomly selected anonymous children in tion means that the observers’ scores are consis-
classrooms or other group settings or participants tent within each other. That is, the two observers
in research projects.) tend to score the same cases high and the same
cases low. A high negative correlation means that
the two observers tend to score cases in the oppo-
site direction. That is, if one observer scores cer-
tain cases high, the other observer scores the same
cases low. For assessing inter-rater reliability of
assessment in struments and behavioral observa-
tions, Pearson rs in the .80s and .90s are generally
considered high, rs from .60 to .79 are considered
moderate, and rs below .60 are considered low
(Hintze, 2005; Sattler, 2008).
For rating scales like the DOF, it is important
to consider the nature of the phenomenon being
measured. Certain types of behaviors may produce
higher reliability than other types of behavior. For
Chapter 5
Practical Applications and Case Examples
This chapter discusses practical applications of tegrated with other ASEBA data to evaluate a
the DOF, in conjunction with other ASEBA forms. child’s functioning and to plan interventions.
In dealing with particular cases, skilled practitio-
ners apply their knowledge and procedures derived SEQUENCE FOR USING THE DOF AND
from other cases to obtain a clear picture of a par- OTHER ASEBA FORMS
ticular case. The ASEBA forms are designed to help
practitioners obtain a well-differentiated picture of When adults seek services, they usually express
each case and to relate the findings to other cases. their reasons for wanting help. Children seldom
Responsible practice requires practitioners to con- refer themselves for help. Instead they are referred
tinually test their judgments against various kinds by concerned adults, such as parents, teachers,
of evidence. The ASEBA scoring profiles facili- guidance counselors, school psychologists, and
tate this process by enabling practitioners to com- pediatricians. It is therefore important to obtain
pare data for a particular child with data obtained information from other sources, as well as from
from normative samples of children of the same direct assessment of the child. Figure 5-1 illustrates
gender and age range. The similar structure of the a typical sequence for using the DOF along with
various ASEBA forms and scoring profiles also other ASEBA forms for referral and intake, gath-
makes it easy to compare data from multiple per- ering data, and interpreting data, as well as man-
spectives. aging cases and evaluating outcomes. The next
sections discuss each of these components of the
The DOF provides a systematic way for observ- sequence.
ers to record and rate observations of children in
school classrooms, at recess, and in other group Referral & Intake
settings. The DOF scoring profile provides a quan- Once a referral has been initiated, parents and
titative picture of observations, which practitioners teachers can be asked to complete ASEBA forms
can examine to determine whether a child mani- as part of initial data gathering at intake. For ex-
fested more problems in that setting than other ample, parents or guardians can complete the
children of the same gender and age range. Be- CBCL/6-18, and with parental consent, one or more
cause the DOF Profile is similar to other ASEBA teachers can complete the TRF (for descriptions
profiles, practitioners can easily compare DOF of the ASEBA school-age forms, see Achenbach
scores with scores obtained from parents and teach- & Rescorla, 2001). Whenever possible, relevant
ers, as well as scores based on observations of chil- medical, educational, and background information
dren in test sessions and clinical interviews. should also be obtained during initial data gather-
In this chapter, we discuss practical applications ing. Scoring the ASEBA forms prior to observing
of the DOF for use in school settings and mental and testing the child and before interviewing par-
health services. We also discuss use of the DOF ents can help to identify areas of possible devi-
for assessments of children with ADHD, emotional ance that can be explored further in subsequent data
and behavioral disorders, and learning problems. gathering.
Case examples illustrate how DOF data can be in- Direct Data Gathering
56
5. Practical Applications and Case Examples 57
58 5. Practical Applications and Case Examples
Children can be directly assessed via direct ob- before interviewing or testing the child. Or they
servations, tests, and/or interviews, as appropriate. can ask a trained independent observer to make
Many experts in child assessment emphasize the observations with the DOF.
importance of directly observing children’s behav-
Parents, teachers, and other relevant school staff
ior in natural settings as part of the assessment pro-
should be interviewed to obtain information that
cess (Barkley, 2006; Sattler & Hoge, 2006; Shapiro
is not accessible via rating scales and question-
& Kratochwill, 2000; Volpe & McConaughy,
naires. McConaughy (2005) discusses interview-
2005). The DOF provides a standardized format
ing procedures in detail and provides reproducible
for doing this. School-based practitioners, such as
protocols for parent and teacher interviews.
school psychologists, special educators, and guid-
ance counselors, can use the DOF to observe chil- In many mental health settings, it is customary
dren in their classrooms, at recess, or in other group to make psychiatric diagnoses. The term “diagno-
settings. Teacher aides and other school staff can sis” has a variety of meanings. In its narrow sense,
also be trained as independent DOF observers (see diagnosis is the “medical term for classification”
Chapter 4). In addition, school-based practitioners (Guze, 1978, p. 53). With respect to children’s be-
may use the DOF as part of a functional behav- havioral and emotional problems, diagnosis in this
ioral assessment for problem-solving consultations narrow sense refers to matching a child’s problems
with teachers, as discussed in a later section. to diagnostic categories. The scoring profiles for
the DOF and other ASEBA forms provide DSM-
Observing children in natural settings can be a
oriented scales that include problem items consis-
challenge for practitioners who are not based in
tent with diagnostic categories of the American
school settings. However, most child assessments
Psychiatric Association’s DSM-IV and DSM-IV-
require information from school personnel. If a
TR. High scores on the DSM-oriented scales can
mental health practitioner has an on-going relation-
alert practitioners to possible DSM diagnoses.
ship with particular schools, he/she might train cer-
Structured diagnostic interviews with parents can
tain school personnel, such as teacher aides, to use
also provide diagnostic information (see
the DOF. Or the practitioner can collaborate with
McConaughy, 2005).
the school psychologist or a special educator to
have DOFs completed. The practitioner can also Data Interpretation
ask school staff to provide DOF observations along
The DOF Profile is modeled on similar profiles
with other referral data.
for other ASEBA school-age forms, as described
Many mental health and school-based evalua- in their respective manuals (Achenbach &
tions include interviewing the child and testing the Rescorla, 2001; McConaughy & Achenbach, 2001,
child’s ability and/or academic achievement. The 2004). Chapter 3 provides instructions for com-
SCICA (McConaughy & Achenbach, 2001) pro- puter-scoring the DOF Profile. For classroom ob-
vides a standardized protocol for interviewing chil- servations, the DOF Profile displays raw scores, T
dren and has rating forms for scoring interview- scores, and percentiles for five DOF syndrome
ers’ observations and children’s self reports dur- scales, a DSM-oriented Attention Deficit/Hyper-
ing the interview. When children are administered activity Problems scale and Inattention and Hyper-
ability and/or achievement tests, the TOF activity/Impulsivity subscales, Total Problems, and
(McConaughy & Achenbach, 2004) can be used On-task. For recess observations, the DOF Profile
to rate observations of children’s behavior during displays raw scores, T scores, and percentiles for
testing. To obtain unbiased observations of a child’s an Aggressive Behavior syndrome scale and Total
behavior, practitioners may want to use the DOF Problems.
to observe the child in the classroom or at recess
To interpret the DOF and other ASEBA pro- versus inconsistent patterns of problems across
files, practitioners should examine scale scores to settings.
identify a child’s strengths and problems accord-
Because the DOF is part of the ASEBA, practi-
ing to each data source. The ASEBA profiles pro-
tioners can easily compare DOF item scores, scale
vide visual displays of scores to aid interpretation.
scores, and profile patterns with data from other
As discussed in Chapter 3, borderline and clinical
ASEBA forms relevant for 6-11-year-old children.
ranges indicate whether a child’s scores on the DOF
Comparing DOF scores with TOF scores may be
scales are deviant relative to normative samples of
especially informative, since some children behave
6- 11-year-old boys and girls (see Table 3-1 in
differently in a structured test session versus the
Chapter 3).
more natural setting of the classroom or play-
Choosing Cutpoints. Practitioners can decide ground. After interpreting and integrating data,
whether to use the borderline clinical or clinical practitioners can consult with parents, teachers, and
cutpoints on the ASEBA scales to classify children other relevant persons to decide whether interven-
as “deviant” versus “nondeviant.” Using cutpoints tions are warranted and, if so, what sorts of inter-
at the bottom of the borderline range increases sen- ventions would be appropriate and feasible.
sitivity (i.e., the number of children needing help
or “true positives” classified as deviant), while re-
Case Management & Outcome
ducing the number of “false negatives” (the num- Evaluation
ber of children needing help classified as Several different professionals may be involved
nondeviant). Using cutpoints at the bottom of the in designing and implementing interventions for
clinical range, by contrast, increases specificity (i.e., children. For example, in school settings, a school
the number of children not needing help or “true psychologist may consult with a classroom teacher
negatives” classified as nondeviant), while reduc- to develop interventions for specific problems in
ing the number of “false positives” (the number of the classroom. Or a school multidisciplinary team
children not needing help classified as deviant). (MDT) may consult with teachers and parents to
The borderline and clinical cutpoints provide more develop an Individualized Education Program
flexibility than does a single cutpoint. For example, (IEP) for a child who is eligible for special educa-
when administering rating scales for screening pur- tion services. A mental health practitioner may of-
poses, practitioners may want to maximize sensi- fer recommendations for interventions to parents,
tivity by using borderline cutpoints that will iden- teachers, or a treatment team. Or the same mental
tify more children at risk for problems. These chil- health practitioner may first evaluate the child and
dren can then be referred for further in-depth as- then provide treatment, such as individual therapy.
sessment. When assessing eligibility for special- During the course of treatment, the child should
ized treatment programs or special education ser- be reassessed to monitor progress and evaluate
vices, practitioners may want to maximize speci- changes in behavior.
ficity. They can then use the clinical cutpoints to
identify only the most clearly deviant children as Over short intervals (e.g., every week or 2
eligible for specialized treatment programs or spe- weeks), the DOF can be used to monitor progress
cial educational placements. toward behavioral goals. Over longer intervals
(e.g., 2, 6, or 12 months), ASEBA forms, such as
Integrating Multisource Data. After interpret- the CBCL/6-18 and TRF, as well as the DOF, can
ing data from each source, practitioners must inte- be used to monitor progress and evaluate outcomes.
grate data across sources to identify consistencies If outcome evaluations include standardized test-
and inconsistencies in problem patterns across dif- ing, practitioners can also complete the TOF. Time
ferent situations and relationships. They can then 1 and Time 2 scores on ASEBA scales can be com-
use such data to form hypotheses about consistent pared to help practitioners decide whether to
modify or terminate interventions. ment. For example, the school psychologist could
use the DOF to obtain baseline observations of the
SCHOOL-BASED ASSESSMENTS
child’s classroom behavior across several occa-
Schools are especially important settings for sions. By examining the DOF profile, the school
evaluating children’s cognitive and behavioral and psychologist can learn which types of behavioral
emotional functioning. Some children show more problems are most severe in comparison to norms
problems in the school setting than in other con- for the child’s peers. From DOF scales on which a
texts, such as home. For example, some children child obtains scores in the clinical or borderline
with problems, such as ADHD, show co-occurring ranges, the school psychologist can choose spe-
behavioral, emotional, and learning problems at cific problem behaviors (e.g. doesn’t sit still, rest-
school, but may show fewer problems at home. less; doesn’t concentrate; disturbs other children)
Other children may show more problems or dif- to target for additional direct observations for a
ferent patterns of problems at home or in other functional behavioral assessment of the target be-
contexts than school. The DOF can be especially haviors.
useful for documenting systematic observations of
The functional behavioral assessment would
children’s behavior in school and other group set-
require gathering more baseline data and identify-
tings.
ing antecedents and consequences of the specific
Three-tiered Model and Response-to-Interven- target behaviors. Parents and teachers can also be
tion. The DOF can be especially useful for behav- interviewed and asked to complete the CBCL/6-
ioral assessment in three-tiered and Response-to- 18 and TRF to provide baseline data on a child’s
Intervention (RTI) models for services in schools. problem behaviors. The school psychologist and
The three-tiered model moves from universal con- teacher can use the baseline data to develop hy-
ditions for all children (Tier 1), to targeted inter- potheses about the functions of the child’s prob-
ventions of varying degrees of intensity for indi- lem behaviors (e.g., to gain attention or avoid aver-
vidual children or groups of children (Tier 2), to sive tasks). They can then design interventions to
very intensive interventions for individual children reduce selected target behaviors. For example, the
(Tier 3), as discussed by McConaughy and Ritter teacher might implement classroom accommoda-
(2008) and Tilly (2008). As a first step after a child tions and a positive behavioral support system
is referred, a school practitioner, such as a school to reduce disruptive behavior and improve the
psychologist, would interview the referring teacher child’s academic productivity. After interventions
to learn the teacher’s specific concerns about the are in place, the school psychologist (or other ap-
child’s learning and behavior. During this interview, propriate school staff) would routinely monitor the
the school psychologist would also learn what uni- child’s progress toward specific goals over short
versal conditions (Tier 1) were already in place to intervals to determine whether the interventions are
address behavioral and academic problems in the producing desired results.
child’s classroom. (For example, are there clear
When Tier 1 and Tier 2 interventions are effec-
classroom rules and expectations for the behavior
tively planned, delivered, and assessed for out-
of all children?)
come, and still prove to be ineffective for a child,
If appropriate Tier 1 conditions are in place but then a move to Tier 3 assessment is warranted. Tier
have proven ineffective with the referred child, then 3 involves further assessment of the child and the
the school psychologist would gather more spe- context of the problem behaviors, as well as the
cific data on the child’s problems to develop ap- reasons that previous interventions were not effec-
propriate Tier 2 interventions. The DOF provides tive. Tier 3 also involves more intensive interven-
a standardized format for obtaining direct obser- tion, coupled with more frequent monitoring of a
vations of a child’s behavior for Tier 2 assess- child’s progress. For children with behavioral and
emotional problems, intensive interventions might in essential life areas. Children who are not eli-
take the form of special education services or other gible for special education services may still qualify
special programs in the local school, regional pro- for accommodations in the general education set-
grams at the district level, referral for mental health ting under Section 504 and the ADA. Children with
services, or placement in an intensive hospital- ADHD, in particular, often qualify for Section 504
based or residential program. plans when they do not meet criteria for special
education.
Eligibility for Special Education. Eligibility for
special education services must be determined ac-
cording to the rules and regulations of the Indi-
ASSESSMENT OF ADHD
viduals with Disabilities Education Improvement The DOF and other ASEBA forms are especially
Act of 2004 (IDEA 2004; Public Law 108-446, useful for assessing ADHD in school settings and
2004) and its subsequent reauthorizations. IDEA mental health services. As a routine part of ADHD
2004 requires comprehensive assessment of the assessments, parents of school-age children can
nature, duration, severity, and patterning of a child’s complete the CBCL/6-18 and teachers can com-
problems, as well as assessment of environmental plete the TRF, along with other appropriate rating
circumstances and other factors that may precipi- scales and questionnaires. Interviews should also
tate or maintain the problems. Regulations for com- be conducted with parents and teachers to assess
prehensive evaluations are stipulated by IDEA symptoms and functional impairment at home and
2004, although each state has its own standards at school.
and regulations for interpreting the federal law.
Special education evaluations typically include Direct observations of a child’s behavior are
collecting data from parents and teachers, along often recommended for evaluating ADHD, along
with cognitive and achievement testing of the child. with parent and teacher reports (Barkley, 2006;
The evaluation information is used to determine DuPaul & Stoner, 2003). Direct observations by
whether the child meets criteria for one or more independent observers can be especially important
disabilities defined by IDEA 2004, including spe- as external validators of symptoms reported by
cific learning disability; emotional disturbance; parents and/or teachers. Particularly when there is
disagreement between parents and teachers regard-
speech or language impairment; autism (which can
ing symptom criteria, direct observations can add
include the spectrum of pervasive developmental
essential information for or against a diagnosis of
disorders); other chronic health impairment (which
ADHD. School psychologists and other school staff
can include ADHD), as well as other disabilities
can use the DOF to record and score observations
involving sensory and orthopedic impairments and
of a child in the classroom on several occasions.
traumatic brain injury. Children who qualify for
Cognitive and achievement testing may also be
special education services under any of the above
done to determine whether the child has cognitive
categories may also exhibit behavioral and emo-
and/or academic deficits that interfere with school
tional problems that can be assessed with the
functioning. Test examiners can then complete the
ASEBA forms.
TOF to assess test session behavior.
Section 504 Accommodations. In addition to
Like other ASEBA forms, the DOF and TOF
IDEA 2004, children with disabilities are protected
Profiles include an Attention Problems syndrome
under Section 504 of the Rehabilitation Act of 1973
scale as well as a DSM-oriented Attention Deficit/
(Rehabilitation Act, 1973) and the Americans with
Hyperactivity Problems scale with Inattention and
Disabilities Act of 1990 (ADA), which are civil
Hyperactivity-Impulsivity subscales. If the various
rights statutes. Section 504 and the ADA cover all
ASEBA profiles consistently yield high scores on
the disabilities defined by the IDEA, as well as
other disabilities that affect children’s functioning
the Attention Problems syndrome and/or on the (ii) Emotional disturbance includes schizo-
DSM-oriented Attention Deficit/Hyperactivity phrenia. The term does not apply to children
Problems scale, this would provide quantitative who are socially maladjusted unless it is de-
evidence to support an ADHD diagnosis. If differ- termined that they have an emotional distur-
ent informants’ ratings yield very different scores bance under paragraph (c) (4) (i) of this sec-
on Attention Problems and/or the Attention Defi- tion. (20 U.S.C. 1401 (3); 34 C.F.R §300.8 (c)
cit/Hyperactivity Problems scale, then practitioners (4) (i)).
need to consider how environmental settings and
The above federal definition of emotional dis-
relationships may differ across informants. Infor-
turbance includes five general characteristics, A
mation from the ASEBA forms should also be in-
through E, that describe behavioral and emotional
tegrated with data gathered from parent and teacher
problems. State regulations vary in their interpre-
interviews and other assessment sources to formu-
tations of the five characteristics. Some states in-
late an ADHD diagnosis. Once an ADHD diagno-
clude externalizing problems (e.g., aggressive be-
sis is confirmed, the school MDT can use the as-
havior, conduct disorder), along with internalizing
sessment data to determine whether the child quali-
problems, while other states try to exclude exter-
fies for special education services or a Section 504
nalizing problems. However, research has shown
plan. Medication and behavioral therapy may also
that externalizing and internalizing problems of-
be warranted.
ten co-occur (McConaughy & Skiba, 1993).
ASSESSMENT OF EMOTIONAL All three qualifying conditions listed in para-
DISTURBANCE graph (c) (4) (i) must apply to at least one of the
identified five characteristics (A to E) of emo-
The IDEA 2004 definition of emotional distur- tional disturbance. That is, the characteristic(s)
bance is as follows: must exist over a long period of time, to a marked
(c) (4) (i) Emotional disturbance means a con- degree, and adversely affect educational perfor-
dition exhibiting one or more of the follow- mance. A child who exhibits at least 1 of the 5 char-
ing characteristics over a long period of time acteristics, or has a diagnosis of schizophrenia, and
and to a marked degree that adversely affects meets all three qualifying conditions, is judged to
a child’s educational performance: have emotional disturbance. A child who does not
meet criteria for emotional disturbance is deemed
(A) An inability to learn that cannot be ex- to be ineligible for special education on the basis
plained by intellectual, sensory, or other of that category.
health factors;
To facilitate special education evaluations, Table
(B) An inability to build or maintain satisfac- 5-1 outlines relations between the IDEA 2004 cri-
tory interpersonal relationships with peers and teria for emotional disturbance and the empirically
teachers; based syndromes of the DOF, along with the
(C) Inappropriate types of behavior or feel- CBCL/6-18, TRF, YSR, TOF, and SCICA. The
ings under normal circumstances; table lists the syndrome and DSM-oriented scales
of each instrument next to the characteristic(s) that
(D) A general pervasive mood of unhappi- they most clearly reflect. The table also shows how
ness or depression; scores and scales of the various instruments can
(E) A tendency to develop physical symptoms provide evidence that characteristics have existed
or fears associated with personal or school for a long period of time, to a marked degree, and
problems; adversely affect educational performance (for fur-
ther discussion of ASEBA applications to criteria
for emotional disturbance, see McConaughy &
Achenbach, 2001; McConaughy & Achenbach,
2004; McConaughy & Ritter, 2008). If a child
shows deviance on scales relevant to the criteria
64
a
Attention Problems, DSM-oriented Attention Deficit/Hyperactivity Problems, Immature/Withdrawn, Intrusive, Oppositional, and
Sluggish Cognitive Tempo are scored from classroom observations; Aggressive Behavior is scored from recess observations.
b
Attention Problems, Language/Motor Problems, Withdrawn/Depressed, Self-Control Problems, and Anxious are scored from inter-
viewers’ observations; Aggressive/Rule-Breaking, Anxious/Depressed, and Somatic Complaints are scored from children’s self-
reports during the SCICA. The DSM-oriented scales on the SCICA are scored from interviewer’s observations and children’s self-
reports.
5. Practical Applications and Case Examples
Practitioners can examine the DOF results, along Melinda’s evaluation at the clinic, a teacher aide
with cognitive and achievement test data, to plan (Valerie Stone) used the DOF to obtain four 10-
appropriate interventions and classroom accommo- minute observations of Melinda in her classroom.
dations to address co-occurring behavioral and Ms. Stone also made two 10-minute observations
emotional problems along with academic deficits of each of two of Melinda’s classmates on the same
of children with learning disabilities. days that she observed Melinda. After she com-
pleted all the observations, Ms. Stone mailed the 8
CASE EXAMPLE OF ASSESSMENT
DOFs to the clinic psychologist for computer-scor-
OF ADHD:
ing.
Melinda Brandt, Age 8
For Melinda’s evaluation at the clinic, the psy-
Melinda Brandt is the 8-year-old girl whose chologist administered the Wechsler Intelligence
computer-scored DOF Profile was shown in Chap- Scales for Children-Fourth Edition (WISC-IV;
ter 3. Melinda was the younger of two children in Wechsler, 2003) and the Wechsler Individual
a middle class family that included her mother, Achievement Test-Second Edition (WIAT-II;
father, and older brother. Her mother brought her Wechsler, 2002) to assess her cognitive and aca-
to a mental health outpatient clinic for a psycho- demic functioning. She also administered a com-
logical evaluation because she was concerned about puterized continuous performance test (CPT) to
Melinda’s problems paying attention and her assess Melinda’s impulsivity and ability to sustain
struggles with school work. Melinda’s teacher had attention. After each test, the psychologist com-
sent home several notes complaining about pleted the TOF to provide a standardized assess-
Melinda’s behavior in school and her failure to ment of Melinda’s test session behavior. The psy-
complete work on time. Melinda’s mother was es- chologist also interviewed Ms. Brandt about
pecially worried that Melinda might be retained in Melinda’s developmental and educational history
third grade, which she felt would be a great blow and her behavior at home, and interviewed
to her self-esteem. Melinda’s teacher on the phone.
Melinda was evaluated by a child psychologist Parent and Teacher Reports. The CBCL/6-18
in the mental health clinic who also provided con- completed by Ms. Brandt produced scores in the
tracted consultation services in Melinda’s school borderline clinical range for Externalizing (84th
district. Melinda’s evaluation followed the se- percentile) and Attention Problems (95th percen-
quence illustrated in Figure 5-1 and included the tile), but normal range scores for all other scales.
five assessment axes outlined earlier in Table 1-1 Melinda’s scores on the CBCL/6-18 competence
in Chapter 1. Prior to Melinda’s appointment at scales were also in the normal range, although her
the clinic, Ms. Brandt completed the CBCL/6-18 mother expressed worries about her school perfor-
and a questionnaire about Melinda’s developmen- mance. In a structured diagnostic interview, Ms.
tal and medical history. With Ms. Brandt’s permis- Brandt endorsed 6 of 9 DSM-IV-TR ADHD symp-
sion, Melinda’s third grade teacher completed the toms of inattention, with onset before age 7, but
TRF and provided copies of Melinda’s school no symptoms of hyperactivity-impulsivity. Al-
records. Ms. Brandt also gave permission for the though Ms. Brandt acknowledged that Melinda
clinic psychologist to interview Melinda’s teacher sometimes seemed restless (e.g., had trouble sit-
and to obtain observations of Melinda’s behavior ting still at dinner and in church) and she did not
in the classroom. always “think things through,” Ms. Brandt did
not think that Melinda was unusually “hyperac-
As part of her consultation services to the school
tive” compared to other children in the family.
district, the psychologist had trained teacher aides
in procedures for using the DOF. One week before Ms. Brandt’s main concern was that Melinda’s
attention problems were interfering with her abil- but nothing seemed to work. The teacher confirmed
ity to do schoolwork and that she was falling be- that school staff were considering retention in third
hind in class. Ms. Brandt said that Melinda needed grade due to Melinda’s poor academic and social
constant reminders to do her homework and some- functioning.
times failed to hand in work even when she did
Classroom Observations with the DOF. Fig-
complete it. Melinda’s struggles with schoolwork
ure 2-2 in Chapter 2 showed the observer’s notes
often led to arguments and temper tantrums at
and on-task ratings on the DOF for the first 10-
home. Ms. Brandt had become especially alarmed
minute observation of Melinda. Figures 3-1 to 3-4
when Melinda’s teacher suggested that Melinda
in Chapter 3 displayed Melinda’s computer-scored
might not be ready to move on to fourth grade at
DOF Profile. Melinda scored in the clinical range
the end of the year.
above the 97th percentile on the Attention Prob-
The TRF completed by Melinda’s third grade lems, Intrusive, and Oppositional syndrome scales,
teacher produced scores in the clinical range for and in the borderline clinical range on the Slug-
Externalizing and Total Problems (above 90th per- gish Cognitive Tempo syndrome scale (see Figure
centile), plus clinical range scores on the TRF At- 3-1). These high scores indicated that Melinda ex-
tention Problems syndrome scale, the DSM-ori- hibited many more attention problems and more
ented Attention Deficit/Hyperactivity Problems intrusive and oppositional behavior in the class-
scale, and the Inattention and Hyperactivity-Impul- room than was typical for the DOF normative
sivity subscales (all above 97 th percentile). sample of 6-11-year-old girls. Melinda also scored
Melinda’s scores were in the borderline clinical in the clinical range on the DSM-oriented Atten-
range on the TRF Social Problems, Thought Prob- tion Deficit/Hyperactivity Problems scale and the
lems, Rule-Breaking and Aggressive Behavior syn- Hyperactivity/Impulsivity subscale, and in the bor-
drome scales, as well as the DSM-oriented Oppo- derline range on the Inattention subscale (see Fig-
sitional Defiant and Conduct Problems scales. ure 3-3).
The teacher’s ratings of Melinda’s adaptive Test Scores and Observations with the TOF.
functioning yielded a score in the clinical range On the WISC-IV, Melinda obtained a full scale IQ
below the 10th percentile. The teacher rated Melinda of 107, which was in the average range. She scored
as behaving much less appropriately, learning much in the average range for Verbal Comprehension
less, and somewhat less happy than typical pupils. (VCI = 108), Perceptual Reasoning (PRI = 106),
The teacher also rated Melinda’s academic perfor- and Working Memory (WMI = 107), but low aver-
mance as far below grade level in mathematics, age for Processing Speed (PSI = 88). On the WIAT-
written language, and social studies, somewhat II, Melinda scored in the average range for reading
below grade level in reading, but at grade level in and mathematics, but low average for written ex-
art. The teacher noted that Melinda was a capable pression. On the math subtests, she scored much
and creative child, but she had great difficulty sit- lower for numerical operations than for math rea-
ting still, seemed to talk constantly, and frequently soning. Her scores on the CPT also suggested ten-
disturbed other children. Her school work was of- dencies toward impulsive responding and difficul-
ten messy and incomplete. She failed to listen to ties sustaining attention.
instructions and seemed unconcerned about the
The psychologist’s ratings of Melinda’s test ses-
quality of her work. The teacher felt it was very
sion behavior produced scores in the borderline
challenging to have Melinda in her class. The
range on the TOF Attention Problems syndrome
teacher had tried accommodations to address
and the DSM-oriented Attention Deficit/Hyperac-
Melinda’s attention problems and disruptive be-
tivity Problems scale, as well as borderline scores
havior (e.g., moving her to a quiet corner in the
on the Inattention and Hyperactivity/Impulsivity
class and providing stickers for completed work),
subscales. Melinda also scored in the borderline and intrusive behavior in the classroom, consis-
range on the TOF Oppositional syndrome during tent with the TOF and the teacher’s reports on the
achievement testing, but not during cognitive test- TRF. Although Melinda’s mother did not report
ing. oppositional behavior on the CBCL/6-18, she did
say that attempts to help Melinda with her home-
Data Interpretation and Integration. Class-
work often erupted into arguments and temper tan-
room observations with the DOF were especially
trums at home. Taken together, these findings sug-
useful in Melinda’s case in light of discrepancies
gested a strong association between Melinda’s aca-
between reports by her mother versus her teacher.
demic skill deficits and her oppositional behavior
The DOF Profile showed that Melinda exhibited
when confronted with academic tasks.
many more attention problems than was typical for
the normative sample of 6-11-year-old girls. She Case Management and Outcome Evaluation.
also exhibited many more attention problems than To address problems revealed in the evaluation,
two classmates selected as DOF controls. These the psychologist referred Melinda and her parents
DOF findings corroborated reports of inattention to a child psychiatrist in the clinic for possible
by Melinda’s mother and her teacher. At the same medication for ADHD. The psychologist also con-
time, the DOF also showed high levels of hyper- sulted with Melinda’s teacher to develop accom-
activity and impulsivity in the classroom, consis- modations and behavioral interventions in the
tent with reports by Melinda’s teacher but not her classroom. They moved Melinda’s seat near the
mother. The TOF Profile also indicated high lev- teacher’s desk for closer monitoring of her work.
els of inattention and hyperactivity/impulsivity dur- They also paired Melinda with a peer tutor to work
ing cognitive and achievement testing, but at less on math and writing assignments and created an
severe levels than observed in the classroom with incentive plan to encourage on-task behavior and
the DOF. academic productivity. The school staff incorpo-
rated the accommodations and behavioral interven-
Taken together, the DOF and TOF provided im-
tions into a Section 504 plan for Melinda as an
portant independent evidence of problems with in-
alternative to retention in third grade. As part of
attention and hyperactivity/impulsivity, as reported
the Section 504 plan, a teacher’s aide continued to
by Melinda’s teacher on the TRF. The results from
conduct biweekly classroom observations of
the DOF, TOF, and TRF, in conjunction with de-
Melinda with the DOF. Following an RTI model,
velopmental and educational history, supported a
the school team examined DOF scores for On-task,
DSM-IV-TR diagnosis of ADHD-Combined type.
Attention Problems, and the Attention Deficit/Hy-
However, if the evaluation had relied only on symp-
peractivity Problems scale to monitor Melinda’s
tom reports by Melinda’s mother, without class-
progress toward their behavioral goals. They also
room observations that corroborated reports by her
used curriculum-based measures to monitor
teacher, a diagnosis of ADHD-Combined type
Melinda’s academic progress in math and written
would not have been appropriate.
work.
Melinda’s average scores on the WISC-IV sug-
CASE EXAMPLE OF A SCHOOL-BASED
gested that her problems with schoolwork were not
ASSESSMENT OF BEHAVIOR PROBLEMS:
due to low ability. However, low average WIAT-II
scores for math operations and written expression Ricky Johnson, Age 9
indicated that Melinda was falling behind in these
Ricky Johnson was the youngest child living
basic academic skills. Interestingly, the TOF Pro-
with his mother and two sisters in a low-income
file showed severe oppositional behavior during
inner city neighborhood. Ricky’s fourth grade
achievement testing, but not during cognitive test-
teacher consulted the school MDT because he had
ing. The DOF also revealed severe oppositional
been involved in several fights on the playground.
The teacher was not sure what started the fights or Ricky.
who else was involved, but Ricky was usually the
The DOF Profile scored from Mr. Provo’s ob-
one sent to the principal’s office for in-school sus-
servations of Ricky in the classroom (not shown)
pensions. The teacher also reported that Ricky was
produced a score at the 84th percentile for Total
disruptive in class and seemed to have few friends.
Problems, which fell in the borderline range for 6-
The school psychologist (Harry Provo) contacted
11-year-old boys. Ricky scored in the borderline
Ricky’s mother to express the team’s concerns and
range between the 93rd and 97th percentiles on the
to obtain her permission for a behavioral assess-
DOF Intrusive and Oppositional syndrome scales,
ment. Ms. Johnson agreed to have the school psy-
but in the normal range on the Sluggish Cognitive
chologist observe Ricky’s behavior in school. She
Tempo, Immature/Withdrawn, and Attention Prob-
also agreed to complete the CBCL/6-18 and to have
lems syndrome scales and the DSM-oriented At-
Ricky’s teacher complete the TRF.
tention Deficit/Hyperactivity Problems scale. On
Recess and Classroom Observations with the the Intrusive syndrome, Mr. Provo rated five items
DOF. As an initial step in his evaluation, Mr. Provo as present for Ricky: 8. Difficulty waiting turn in
used the DOF to observe Ricky in his classroom activities or tasks; 21. Disturbs other children; 46.
and during recess. On each of three days, Mr. Provo Disrupts group activities; 55. Demands must be
conducted two observations of Ricky in the class- met immediately; and 65. Talks too much. On the
room and two observations on the playground dur- Oppositional syndrome, Mr. Provo rated four items
ing recess. Mr. Provo also observed two other boys as present: 16. Difficulty following directions; 23.
as control children in the same setting. Doesn’t seem to listen to what is being said; 52.
Shows off, clowns, or acts silly; and 83. Doesn’t
Figure 3-6 in Chapter 3 showed the DOF Pro-
get along with peers. Although the two control
file scored from Mr. Provo’s six observations of
children also showed difficulty waiting their turn
Ricky during recess. On the DOF Profile, Ricky
and talking too much, their scores on all DOF scales
scored in the clinical range above the 97th percen-
were within the normal range.
tile on the Aggressive Behavior syndrome and in
the clinical range above the 90th percentile for To- Parent and Teacher Reports. The CBCL/6-18
tal Problems. These results indicated that Ricky completed by Ms. Johnson yielded a score in the
exhibited many more problems at recess than was clinical range above the 90th percentile for Exter-
typical for the DOF normative sample of 6-11-year- nalizing, along with a borderline score on the Rule-
old boys. On the Aggressive Behavior syndrome, Breaking Behavior syndrome (95th percentile).
Mr. Provo rated six items as present for Ricky: 14. Ricky scored just below the borderline range on
Cruel, bullies, or mean to others; 30.Gets into the Social Problems syndrome (90th percentile) and
physical fights; 31. Gets teased; 47. Screams; 66. Aggressive Behavior syndrome (92nd percentile),
Teases; and 86. Bossy. Ricky also exhibited six but well within the normal range on all other syn-
other problems that contributed to his high DOF drome scales. Ricky’s total competence score on
Total Problems score: 3. Argues; 8. Difficulty wait- the CBCL/6-18 was in the clinical range below the
ing turn in activities or tasks; 20. Disobedient; 22. 10th percentile, with clinical range scores below the
Doesn’t seem to feel guilty after misbehaving; 67. 3rd percentile on the Social and School scales.
Temper tantrums, hot temper, or seems angry; and
In a phone interview with Mr. Provo, Ms.
83. Doesn’t get along with peers. The two control
Johnson said that she was worried that Ricky was
children, by contrast, showed little aggressive be-
hanging out with older boys who had gotten into
havior. However, their borderline clinical score for
trouble in the neighborhood. She said that the po-
Total Problems indicated that they too showed other
lice had come to her house a month ago because
problems on the playground, most notably scream-
some of the boys were caught shoplifting at a local
ing and difficulty waiting their turn, similar to
grocery store. Ms. Johnson believed that Ricky was Aggressive Behavior syndrome. The CBCL/6-18
innocent, but she worried that he might be led on and TRF also produced scores above the 90th per-
by the other boys. Because Ms. Johnson worked centile on the Rule-Breaking Behavior syndrome.
two jobs to support her family, she was not able to Low scores on the CBCL/6-18 Social competence
provide the supervision at home that she felt Ricky scale and high scores on the TRF Social Problems
needed. She also reported that Ricky often had syndrome also indicated problems in social rela-
trouble with his schoolwork. She asked his older tionships.
sisters to help him with homework, but they were
Following the three-tiered model discussed ear-
often too busy with their own work or socializing
lier, the school MDT decided to initiate Tier 2 be-
with friends.
havioral interventions to address Ricky’s problems
The TRF completed by Ricky’s fourth grade in school. As a first step, Mr. Provo examined the
teacher yielded clinical range scores above the 90th profiles from the ASEBA forms to identify prob-
percentile for Externalizing and Total Problems, lems that were consistent across informants and
along with a clinical range score above the 97th settings. He listed several problems from the DOF
percentile on the Social Problems syndrome. Ricky recess observations that were similar to problems
scored near the borderline range on the Rule-Break- reported on the CBCL/6-18 and TRF: 3. Argues;
ing Behavior and Aggressive Behavior syndromes. 20. Disobedient; 22. Doesn’t seem to feel guilty
The teacher also reported several problems on the after misbehaving; 30. Gets into physical fights;
Attention Problems syndrome (e.g., 4. Fails to fin- 31. Gets teased; 66. Teases; 67. Temper tantrums,
ish things he/she starts; 22. Difficulty following hot temper, or seems angry; and 83. Doesn’t get
directions; 92. Underachieving, not working up to along with peers.
potential), but Ricky’s total score was in the nor-
Mr. Provo conducted a functional behavior as-
mal range. Ricky’s scores on the TRF adaptive
sessment to identify antecedents and consequences
functioning scale was in the clinical range below
of the problem behaviors, particularly fighting on
the 10th percentile. The teacher rated Ricky as be-
the playground. He learned that fights usually
having much less appropriately and learning much
erupted after other children teased Ricky and called
less than typical pupils. She also rated Ricky’s aca-
him names or after Ricky argued with them, teased
demic performance as far below grade level in read-
them, or became bossy about the rules of a game.
ing, mathematics, and written language, but at
As the arguing and teasing escalated, Ricky would
grade level in social studies and science. Ricky’s
lose his temper and start hitting and punching. The
teacher was especially concerned about his prob-
other children also lost their tempers and began
lems getting along with other children on the play-
hitting and punching, so that it was not always clear
ground and his disruptive behavior in class. She
who started the fights. On one occasion, the play-
was also concerned that in-school suspensions had
ground supervisor broke up a fight and sent Ricky
caused Ricky to miss instructional time to the point
to the principal’s office. On another occasion, no
that he was falling behind in his schoolwork. When
one intervened and the fight ended when the bell
Ricky was in class, he often clowned around and
rang for children to return to class.
disrupted class activities. He also became easily
frustrated with his work, sometimes to the point of Mr. Provo conducted an additional functional
ripping up his own papers. behavior assessment to identify antecedents and
consequences of problems observed in the class-
Data Interpretation and Integration. Results
room, particularly 21. Disturbs other children; 46.
from the DOF, TRF, and CBCL/6-18 indicated that
Disrupts group activities; and 52. Shows off,
Ricky was showing more externalizing problems
clowns, or acts silly. Mr. Provo learned that Ricky
than most boys his age. All three ASEBA forms
became easily frustrated when academic tasks were
produced scores above the 90th percentile on the
too difficult. He would then start clowning and
acting silly or would disturb other children to avoid ational and sports program, which introduced him
doing his work. When this happened, the teacher to a new peer group and increased adult supervi-
scolded Ricky and made him sit alone in the back sion in after-school hours. To improve his academic
of the room for a time-out. skills, Ricky received daily small group instruc-
tion in reading and math and his teacher checked
Mr. Provo met with Ricky’s teacher and Ms.
his work regularly to ensure that he understood
Johnson to discuss his observations and learn more
directions and was staying on task. Ricky received
about their perspectives on Ricky’s problems. Ms.
additional points on his behavior contract for meet-
Johnson and the teacher agreed that Ricky had dif-
ing academic goals (e.g., handing in assignments
ficulty controlling his temper and that he lacked
on time).
social skills for getting along with other children
his age. They also worried that Ricky might be Case Management and Outcome Evaluation.
learning “delinquent” behaviors from the older Mr. Provo continued to use the DOF to monitor
boys in his neighborhood. Mr. Provo noted that Ricky’s progress in meeting behavioral goals. To
teasing and name-calling seemed to be what do this, he trained a teacher aide in the DOF rating
sparked Ricky’s fights on the playground. He also procedures. The teacher aide used the DOF to make
pointed out that a desire to avoid difficult school two 10-minute observations of Ricky in the class-
work might explain Ricky’s disruptive behavior in room and two 10-minute observations at recess
the classroom. each week over a period of 10 weeks. Mr. Provo
scored each set of weekly DOFs and created graphs
Mr. Provo consulted with Ms. Johnson and
of Ricky’s scores for DOF On-task and the Intru-
Ricky’s teacher to develop Tier 2 interventions to
sive, Oppositional, and Aggressive Behavior syn-
address Ricky’s problems. They delineated a clearer
drome scales. Mr. Provo and the teacher examined
set of playground rules for all children (e.g., no
the graphs each week to evaluate Ricky’s progress.
hitting and fighting, no name calling) and increased
the level of supervision on the playground. They The graphs of DOF scores showed a gradual
developed a behavior contract for Ricky to encour- decline in the DOF syndrome scale scores and an
age positive behaviors (e.g., working quietly, ask- increase in On-task scores over the 10-week inter-
ing for help when needed, and working coopera- vention period. At the end of the 10 weeks, Ricky’s
tively with other children) that would replace un- teacher completed a new TRF and Ms. Johnson
desirable behaviors in the classroom. The behav- completed a new CBCL/6-18 as additional mea-
ior contract also included bonus points for days sures of outcomes for Ricky. The CBCL/6-18 and
when Ricky did not get into fights on the play- TRF both showed lower scores on the Social Prob-
ground. The teacher sent weekly reports home to lems, Rule-Breaking and Aggressive Behavior syn-
Ms. Johnson showing Ricky’s progress toward the drome scales than at baseline. These results, com-
behavioral goals. When Ricky met his weekly bined with the DOF findings, suggested that the
goals, Ms. Johnson provided special rewards at school-based interventions were associated with
home (e.g., watching a DVD, getting a special din- reductions in targeted problem behaviors and an
ner, playing a board game). increase in on-task behavior. Although the MDT
could not attribute direct causal effects to the in-
To improve his social functioning, Ricky was
terventions, they felt that the changes in ASEBA
enrolled in a weekly social skills group conducted
scores justified continuing the Tier 2 interventions
by the school guidance counselor. The teacher and
until the end of the school year.
the guidance counselor also began teaching a so-
cial skills curriculum to the entire class. Ms. If the Tier 2 interventions had not been associ-
Johnson enrolled Ricky in an after-school recre- ated with reductions in Ricky’s problems, then the
MDT would have proceeded with a more compre-
hensive Tier 3 evaluation to determine whether
Ricky was eligible for special education services
under the category of emotional disturbance. The
MDT could use the existing CBCL/6-18, TRF, and
Chapter 6
Constructing the DOF and DOF Profile
In earlier chapters, we described the 2009 DOF counterparts on the CBCL and 86 had counterparts
and the DOF Profile. In this chapter, we summa- on the TRF. Ten of the 96 original DOF items had
rize research to develop previous versions of the no direct counterpart on the CBCL and/or TRF. Ex-
DOF and its scoring profile. We then describe our amples are 2009 DOF items: 81. Easily led by
research to develop and standardize the scales of peers; 85. Tattles; and 86. Bossy (for details
the 2009 DOF Profile. Readers who are not inter- of the 1981 DOF, see Achenbach & Edelbrock,
ested in the details of form and scale construction 1983).
should feel free to skim or skip this chapter.
A revised edition of the DOF (Achenbach, 1986)
included the same 96 problem items, plus an open-
EARLIER VERSIONS OF THE DOF ended item for “other problems.” Some items were
The original version of the DOF (Achenbach, reworded slightly for clarification. An example is
1981) included 96 problem items, plus an open- 1981 DOF item 34. Isolates self from others that
ended item for other problems, and on-task ratings was reworded to 34. Physically isolates self from
for ten 1-minute intervals. To assemble the DOF others.
item set, Achenbach examined early versions of The 1986 DOF Profile displayed scores for six
the CBCL and TRF to find items appropriate for syndrome scales derived from principal compo-
rating direct observations of children’s behavior nents/varimax analyses of 212 clinically referred
in group settings. (For brevity, we use CBCL here 5-to-14-year-old children: Withdrawn-Inattentive,
to refer all versions.) Whenever possible, Nervous-Obsessive, Depressed, Hyperactive, At-
Achenbach retained the original wording of the tention Demanding, and Aggressive. The profile
CBCL and TRF items for DOF items. Examples also displayed scores for Internalizing and Exter-
are 2009 DOF items: 5. Defiant or talks back to nalizing scales derived from second-order factor
staff; 19. Destroys property belonging to others; analyses of scores on the six syndrome scales, plus
41. Physically attacks people; and 71. Unhappy, a Total Problems score (the sum of ratings on all
sad, or depressed. Other CBCL/TRF items were 96 items plus “other problems”) and an On-task
worded slightly differently on the DOF to make score. The 1986 DOF problem scales were normed
them more appropriate for direct observations of on 287 children observed as controls for referred
children’s behavior. Examples are: rewording children in regular classrooms of 45 schools in
CBCL/TRF item 8. Can’t concentrate, can’t pay Vermont, Nebraska, and Oregon. The 1986 DOF
attention for long to DOF item 7. Doesn’t concen- computer-scoring program calculated raw scores
trate or doesn’t pay attention for long; rewording for the syndrome scales, plus raw scores and T
CBCL/TRF item 10. Can’t sit still, restless, or scores for Internalizing, Externalizing, and Total
hyperactive to DOF item 9. Doesn’t sit still, rest- Problems, averaged across observation sessions
less, or hyperactive; and rewording CBCL/TRF separately for the identified child and matched con-
item 19. Demands a lot of attention to DOF item trols. The 1986 DOF profile also provided an On-
17. Demands or tries to get attention of staff (later task score ranging from 1 to 10, averaged across
abbreviated to 17. Tries to get attention of staff). observation sessions for identified and control chil-
Seventy-two of the 96 original DOF items had dren.
71
72 6. Constructing the DOF and DOF Profile
Several studies have reported data on the reli- and 1986 DOF discussed above.
ability, stability, and validity of the 1981 and 1986
As evidence for the validity of the DOF, Reed
versions of the DOF. Achenbach and Edelbrock
and Edelbrock (1983) reported that DOF Total
(1983) reported inter-rater reliabilities (Pearson r)
Problems and On-task scores discriminated signifi-
of .96 for DOF Total Problems and .71 for On-task
cantly between referred and nonreferred boys ob-
for 16 children observed in a residential treatment
served in the same classroom settings by observ-
setting by two trained research assistants. In the
ers blind to referral status. The convergent validity
same residential setting, 6-month stability Pearson
of the DOF was supported by significant correla-
correlations for 36 children were .55 for classroom
tions of .37 to .51 between DOF Total Problems
observations, .59 for recess observations, and .51
and TRF Total Problems (Achenbach & Edelbrock,
for classroom On-task scores. Scores for each child
1986; Reed & Edelbrock, 1983) and significant
were averaged over six 10-minute observations at
associations between DOF scores and CBCL pro-
Time 1 and Time 2. These stability coefficients for
file patterns (McConaughy et al., 1988).
the DOF were similar to the 6-month stability co-
McConaughy et al. (1998, 1999) also found that
efficient of .57 for CBCL ratings of the same chil-
DOF On-task, Internalizing, Nervous-Obsessive,
dren by their mother or a child care worker.
and Depressed scale scores significantly discrimi-
For a sample of 25 public school boys referred nated between outcomes for at-risk children who
for special services for behavioral problems, Reed received different school-based programs to pre-
and Edelbrock (1983) reported inter-rater vent serious emotional disturbance.
reliabilities of .91 for DOF Total Problems and .83
In another study, Skansgaard and Burns (1986)
for On-task ratings summed across six 10-minute
added nine items to the 1986 DOF to create a priori
observations in a 60-minute observation period.
problem scales for Inattention, Hyperactivity/Im-
They also reported mean inter-rater reliabilities of
pulsivity, Oppositional Defiant Disorder/overt
.85 for Total Problems and .71 for On-task for the
Conduct Disorder (ODD/overt CD), and Slow
same sample when rs for each of the 6 sets of 10-
Cognitive Tempo (SCT). For a sample of 24 chil-
minute observations were averaged across sessions.
dren, inter-rater reliabilities were .97, .95, .99, and
For a sample of 62 randomly selected 6-11-year- .69 for each of these four scales, respectively, plus
old boys, McConaughy, Achenbach, and Gent 1.00 for the DOF On-task score. To test the dis-
(1988) reported inter-rater reliabilities of .75 for criminative validity of this 106-item DOF,
DOF Total Problems and .88 for On-task scores Skansgaard and Burns grouped the 24 children into
averaged across one 10-minute classroom obser- an ADHD-Combined subtype (ADHD/C; n = 6),
vation and one 10-minute recess observation for ADHD-Inattentive subtype (ADHD/I; n = 6), and
each child. As part of a school-based prevention matched controls (n = 12), based on percentile
study, McConaughy, Kay, and Fitzgerald (1998, cutpoints on teachers’ ratings of DSM-IV ADHD
1999) reported reliabilities for five pairs of trained symptoms. Despite the small sample sizes,
DOF raters. Each rater pair observed 20 randomly Skansgaard and Burns found that the ADHD-C and
selected elementary school-aged children in class- ADHD-IN groups both scored significantly higher
rooms for one 10-minute period. Inter-rater on the DOF Inattention scale and lower on DOF
reliabilities averaged across rater pairs were .86 On-task scores than matched controls. The ADHD-
for DOF Total Problems and .90 for On-task. C group scored significantly higher than the
ADHD-IN group and controls on the DOF Hyper-
Achenbach and Rescorla (2001) reported mean
activity/Impulsivity and ODD/overt CD scales. The
inter-rater reliabilities of .90 for DOF Total Prob-
ADHD-IN group scored significantly higher than
lems and .84 for On-task scores (after Fisher z trans-
controls on the DOF SCT scale. Similar group dif-
formations), averaged across the studies of the 1981
ferences were reported for teachers’ ratings on com-
6. Constructing the DOF and DOF Profile 73
parable subsets of DSM-IV symptoms, except that settings. As part of the ASEBA, the DOF yields
ADHD-IN scored significantly higher than both scores for observed problems that can be meshed
ADHD-C and controls on SCT. with data from other sources, such as parent re-
ports, teacher reports, children’s self-reports, test
In 2003, we created an expanded version of the
session observations, and observations from child
1986 DOF for use in a large study of children with
clinical interviews. This facilitates a multiaxial
behavioral and emotional problems and matched
approach to assessment, as discussed in Chapter
controls (McConaughy & Achenbach, 2003). The
1.
2003 DOF included 114 specific problem items,
plus an open-ended item for the observer to “write To develop the 2009 DOF and DOF Profile, we
in any observed problems or behaviors not listed used a psychometric approach similar to that used
above.” We retained 95 of the 1986 DOF problem for other ASEBA forms and profiles, including the
items, plus the open-ended item for other problems. CBCL/6-18, TRF, and YSR (Achenbach &
We added 19 new items to expand the 1986 DOF Rescorla, 2001), SCICA (McConaughy &
for research, as follows: Achenbach, 2001), and TOF (McConaughy &
Achenbach, 2004). We used the following proce-
We replaced the 1986 item, 4. Behaves like op-
dures:
posite sex, with a new item, 4. Cheats. We changed
the 1986 item, 75. Underactive, slow moving, lacks 1. We selected and tested a pool of items that de-
energy, or yawns, by removing the word, “yawns” scribe observable aspects of children’s behav-
and created a new item 114. Yawns. We changed ior, affect, and interactions in group settings.
the 1986 item, 83. Fails to express self clearly, in- 2. We obtained observers’ ratings of problem items
cluding speech defects, by removing the words, “infor 6-11-year-old clinically referred children and
cluding speech defects” and created a new item matched control children in the same settings.
97. Speech problem (describe). We created 16 ad-
3. We factor analyzed the observers’ ratings to ag-
ditional items to correspond to TOF items and prob-
gregate problem items into quantitative syn-
lems that observers had written in for the open-
drome scales.
ended item in earlier research, plus items to tap
DSM-IV (American Psychiatric Association, 1994) 4. We constructed a DSM-oriented Attention Defi-
symptoms for ADHD that were not covered by the cit/Hyperactivity Problems scale comprised of
original 1986 DOF items. We also slightly re- problem items that are consistent with DSM-IV-
worded eleven 1986 DOF items. TR criteria for ADHD.
5. We constructed a Total Problems score consist-
We used data from samples rated on the 1986
ing of the sum of ratings on the 89 problem
DOF and the 2003 DOF to develop the 2009 DOF
items.
with 89 problem items and to derive scales for the
2009 DOF Profile, as discussed in the next sec- 6. We assigned standard scores (T scores) and per-
tions. (Appendix D shows the final 89 items of the centiles for the DOF problem scales and the On-
2009 DOF in comparison to the 97 items of the task score that indicate how a child’s scores com-
1986 DOF.) pare with scores for normative samples of chil-
dren.
PSYCHOMETRIC APPROACH TO 7. We tested the problem scales and On-task score
THE 2009 DOF for reliability and validity.
Consistent with previous research, we designed The next sections describe our research for steps
the 2009 DOF to obtain direct observational data 1 through 6. Chapter 7 reports reliability and Chap-
on children’s problems and on-task behavior in ter 8 reports validity for the 2009 DOF scales.
school classrooms, recess, and comparable group
STATISTICAL DERIVATION OF DOF
SYNDROMES FOR CLASSROOM urban, and rural areas. The New York sample in-
OBSERVATIONS cluded children referred to the outpatient clinic of
the Department of Psychiatry at SUNY Upstate
Like other ASEBA forms, the DOF was devel- Medical University in Syracuse, New York. The
oped both to document specific problems and to Pennsylvania sample included children referred to
identify co-occurring problems. We used various the outpatient clinic of The Children’s Hospital of
statistical procedures to identify syndromes of co- Philadelphia. An additional 612 children were ran-
occurring problems. The original Greek meaning domly selected control children in the same class-
of the word syndrome is the act of running together. rooms as the referred children. The total sample of
Although “syndrome” is often equated with dis- 1,261 children included 873 boys and 388 girls,
ease, its most general meaning is “a set of concur- each having 2 to 4 DOFs. Each DOF covered a 10-
rent things” (Gove, 1971). Consistent with this minute observation.
meaning, we performed a series of factor analyses
to identify syndromes of children’s problems that From the total sample of 1,261 children, we se-
were observed in school classrooms. lected two samples for EFAs. One sample included
955 children who were rated on either the 1986 or
Factor analysis refers to a family of statistical 2003 DOF. This sample included 649 referred chil-
methods for identifying patterns of co-occurring dren plus 306 matched control children who scored
items. Because different factor-analytic approaches above the DOF median Total Problems score of
may produce different results, we used several ap- 5.9 that we found for controls. As explained later,
proaches that included both exploratory factor this sample was used for EFAs of 41 high frequency
analysis (EFA) methodology (SPSS, 2007) and problem items included in both the 1986 and 2003
confirmatory factor analysis (CFA) methodology DOF. The second EFA sample was on a subset of
(Mplus; Muthén & Muthén, 2004). EFA yields fac- the first sample of 955 children, which included
tors that summarize the associations among prob- 613 children who were rated only on the 2003 DOF
lem items without testing specific models for the (335 referred and 278 controls). This sample was
factor structure, whereas CFA tests the fit of data used for EFAs of 57 high frequency problem items
to particular measurement structures. included in the 2003 DOF. We used two different
samples for initial EFAs to determine whether the
Samples for Factor Analyses factor structure differed for the 41-item set versus
For the factor analyses, we assembled a sample the larger 57-item set from the 2003 DOF. The
of 1,261 children ages 6-12 who were observed in sample for subsequent CFAs included all 1,261
school classrooms. Of these, 486 were rated on the children (649 referred and 612 controls). We com-
1986 DOF and 775 were rated on the 2003 DOF. bined boys and girls in all analyses to maximize
The total sample included 649 children who were our sample sizes. Table 6-1 summarizes the
clinically referred for evaluations of behavioral and samples used for factor analyses.
emotional problems and/or learning difficulties or
were identified as “at risk” for such problems (for Items for Factor Analyses
brevity, we are labeling this group “referred”). The To select DOF items for factor analyses, we
referred samples were drawn from outpatient clin- omitted the open-ended item for “other problems”
ics and schools in Vermont, New York, and Penn- and combined two new items on the 2003 DOF
sylvania. The Vermont sample included children with two other items with similar wording, reduc-
referred to the outpatient clinic of the University ing the total item set to 112 items. Of these, 95
of Vermont Department of Psychiatry. Children in items were included on both the 1986 DOF and
the Vermont sample were drawn from urban, semi-
Table 6-1
Samples for Factor Analyses to Derive 2009 DOF Syndromes
Boys Girls Total
Total Samplea
Referred children 464 185 649
Controls 409 203 612
Total 873 388 1,261
Sample for EFAs of 41 high frequency items from

1986 & 2003 DOFb
Controls 219 87 306
Total 683 272 955
Sample for EFAs of 57 high frequency items from

2003 DOFc
Controls 197 81 278
Total 428 185 613
Ethnicityd
Non-Latino White 79.4%
African American 14.5%
Latino/Hispanic 2.0%
Mixed or Other 4.1%
a
This sample was used for CFAs.
b
This sample was a subset of N = 1,261.
c
This sample was a subset of N = 955.
d
Percents for N = 1,124 referred and control children for whom ethnicity was known.
2003 DOF. To obtain item frequencies, we aver- and control children combined. We then identified
aged the 0, 1, 2, and 3 ratings for each of 112 DOF 41 of the 57 items from the 2003 DOF that were
items across the 2 to 4 DOFs for each of the 1,261 also rated on the 1986 DOF. We used these two
children in the total sample shown in Table 6-1. item sets for initial EFAs, as described in the next
From the frequency distributions of averaged item section.
ratings, we identified 55 low frequency items that
were rated present (>0.00) for fewer than 5% of
Factor-Analytic Methods
referred children and fewer than 3% of referred EFAs for Deriving Factors. As a general strat-
and control children combined. Omitting these 55 egy, we performed exploratory Maximum Likeli-
low frequency items, we retained 57 items from hood (ML), Unweighted Least Squares (ULS), and
the 2003 DOF that were rated present (>0.00) for Principal Components Analyses (PCA) of Pearson
>5% of referred children and for >3% of referred
correlations among the retained DOF high fre- for each child. The single-factor WLS analyses
quency items. The initial EFAs that yielded 3 to 10 were applied to 3,533 DOFs for the entire sample
factors were subjected to Varimax (orthogonal) of 1,261 children. The single-factor WLS analy-
rotations to produce uncorrelated factors and ses identified five unidimensional factors, with
Oblimin (oblique) rotations to allow correlations 29 items loading on only one factor and 16 items
among factors. Using these general strategies, we loading on more than one factor.
performed six separate EFAs (3 methods x 2 rota-
CFA Tests of the 5-Factor Model. To assign
tions) on the 41 high frequency items included on
items to only one factor for a final 5-factor model,
the 1986 DOF and 2003 DOF, using the sample of
we performed a correlated 5-factor WLS analysis
children rated on either of the two forms (N =
of the candidate factors for the 3,533 DOFs for the
955). We then performed an additional six EFAs
total sample of 1,261 children. From these analy-
on the 57 high frequency items included on the
ses, we identified 43 items meeting criteria (a) and
2003 DOF, using the sample of children rated only
(b) above: 10 for Factor 1; 7 for Factor 2; 12 for
on the 2003 DOF (N = 613). We found similar fac-
Factor 3; 6 for Factor 4; and 8 for Factor 5. An
tor structures from the 1986/2003 DOF 41-item
additional item loaded .18 on Factor 2. Only one
set and the analyses of the 2003 DOF 57-item set.
of the 45 items from the single-factor solutions
We therefore used solutions from the EFAs of the
failed to load significantly on any factor in the test
2003 DOF 57-item set for our next analyses.
of the 5-factor model.
We identified five factors that were similar in
We then examined correlations between di-
the six EFAs of the 2003 DOF 57-item set. We
chotomous item scores (0 vs. 1-2-3) and latent vari-
retained DOF items that had loadings > .20 and p
ables for the five factors for all items dropped from
<.01 on at least one of the five factors. The differ-
the factors in previous analyses. We looked for
ent factor extraction and rotation methods thus
items that had correlations > .40 with at least one
collectively contributed results that were subse-
factor and a difference > .10 between that correla-
quently tested via CFA methods, as described in
tion and correlations with the remaining four fac-
the next two sections.
tors. Seven items met these criteria. To obtain a
CFAs for Evaluating the Unidimensionality of final 5-factor solution, we then tested models with
Factors. To test the unidimensionality of factors and without these seven items for the 3,533 DOFs
derived in the forgoing analyses, we applied single- for the total sample of 1,261 children.
factor Weighted Least Squares (WLS) analyses to
To obtain a final 5-factor solution, we used CFA
candidate items comprising each of the five candi-
methodology in an exploratory manner, rather than
date factors. Items that loaded on multiple versions
seeking “confirmation” of factor models. We ex-
of a particular factor were included if they met the
amined solutions for the following characteristics:
following criteria: (a) the item’s factor loading had
(a) proper convergence; (b) no out-of-range param-
to be significant at p <.01, i.e., the estimated factor
eter estimates; (c) reasonable model fit; and (d) re-
loading had to exceed its standard error by at least
tention of items with factor loadings > .20 and sig-
2.57, and (b) the loading had to be >.20. To avoid
nificant at p <.01. For the final 5-factor solution,
statistical risks associated with low frequency cells,
52 items met criteria (a) through (d): 12 for Factor
we applied the WLS analyses to tetrachoric corre-
1; 10 for Factor 2; 12 for Factor 3; 8 for Factor 4;
lations between item scores dichotomized 0 vs. 1,
and 10 for Factor 5. We evaluated goodness-of-fit
2, and 3. Because Mplus can take account of de-
between the data and the models with the Root
pendency in a data set (i.e., more than one DOF
Mean Square Error of Approximation (RMSEA;
per subject), we were able to analyze ratings from
Browne & Cudek, 1993), which has been recom-
each DOF separately, rather than entering the mean
mended as the best measure of fit (Loehlin, 1998).
of the ratings on each item for all DOFs completed
Table 6-2
Factor Loadings of Items on the DOF Syndrome Scales
for Classroom Observations
DOF Syndrome Scale Factor Loading
I. Sluggish Cognitive Tempo

11. Confused or seems to be in a fog .71
15. Daydreams or gets lost in thoughts .53
27. Forgetful in activities or tasks .47
44. Apathetic, unmotivated, or won’t try .78
51. Slow to respond verbally .55
53. Shy or timid .32
57. Stares blankly .58
60. Yawns .22
70. Underactive, slow moving, tired, or lacks energy .50
71. Unhappy, sad or depressed .61
II. Immature/Withdrawn
1. Acts too young for age .87
25. Difficulty organizing activities or tasks .49
26. Fails to give close attention to details .57
34. Physically isolates self from others .43
39. Loses things .58
49. Avoids or is reluctant to do tasks that require sustained mental effort .62
59. Wants to quit or does quit tasks .66
61. Strange behavior .27
75. Withdrawn, doesn’t get involved with others .31
77. Fails to express self clearly .36
III. Attention Problems

7. Doesn’t concentrate or doesn’t pay attention for long .74
9. Doesn’t sit still, restless, or hyperactive .61
13. Fidgets, including with objects .55
24. Eats, drinks, chews, or mouths things that are not food, excluding junk foods .31
42. Picks or scratches nose, skin, or other parts of body .23
56. Easily distracted by external stimuli .62
76. Sucks thumb, fingers, hand, or arm .28
82. Clumsy, poor motor control .52
IV. Intrusive
8. Difficulty waiting turn in activities or tasks .73
17. Tries to get attention of staff .45
21. Disturbs other children .63
32. Interrupts .68
33. Impulsive or acts without thinking, including calling out in class .66
Table 6-2 (cont.)
DOF Syndrome Scale Factor Loading
45. Responds before instructions are completed .41

46. Disrupts group activities .79
55. Demands must be met immediately, easily frustrated .73
65. Talks too much .50
72. Unusually loud .65
78. Impatient .76
81. Easily led by peers .51
V. Oppositional
2. Makes odd noises .51
3. Argues .68
5. Defiant or talks back to staff .75
16. Difficulty following directions .67
20. Disobedient .76
22. Doesn’t seem to feel guilty after misbehaving .87
23. Doesn’t seem to listen to what is being said .60
43. Runs about or climbs excessively .50
52. Shows off, clowns, or acts silly .40
74. Whining tone of voice .58
83. Doesn’t get along with peers .54
87. Complains .71
Note. N = 1,261 with 3,533 DOFs for the final Weighted Least Squares factor analyses of tetrachoric
correlations of dichotomous item scores; referred children, n = 649; matched controls, n = 612. Values in
bold show the three highest loadings for each factor.
The RMSEA for the final 5-factor solution was the order in which they appear on the DOF Pro-
.024, which was well within the range <.07 gener- file. The order of the syndrome scales was deter-
ally considered to indicate good fit. mined by subsequent second-order factor analyses
described in a later section.
Results of Factor Analyses
The DOF Sluggish Cognitive Tempo syndrome
Table 6-2 shows the five DOF syndrome scales includes 10 items describing confusion, lack of mo-
with the factor loadings for each item derived from tivation, and underactivity. Items with the highest
the final 5-factor solution. The names of the syn- factor loadings were: 11. Confused or seems to be
drome scales reflect the content of the items com- in a fog; 44. Apathetic, unmotivated, or won’t try;
prising each factor. We chose names that were con- and 71. Unhappy, sad, or depressed. Five items
sistent with current literature and with the names (11, 15, 44, 57, and 75) were consistent with the
of scales derived from similar factor analyses of 2007 Sluggish Cognitive Tempo scales created for
the 1986 DOF and other ASEBA forms. The syn- the CBCL/6-18 and TRF (Achenbach & Rescorla,
dromes are numbered in Table 6-2 according to 2007). Interestingly, symptoms of sluggish cogni-
tive tempo were tested for possible inclusion in gressive Behavior and SCICA Self-Control Prob-
the DSM-IV criteria for ADHD, but were not in- lems syndromes. Examples are: 3. Argues; 5. De-
cluded in the final criteria (Frick et al., 1994). As fiant or talks back to staff; and 20. Disobedient.
described in an earlier section, a prior research The Oppositional syndrome does not include prob-
study using an expanded version of the 1986 DOF lems reflecting physical aggression, such as fight-
showed a significant association between a DOF ing, which are unlikely to be observed in class-
scale labeled “Slow Cognitive Tempo” and the In- room settings.
attentive type of ADHD (Skansgaard & Burns,
1998). LOW FREQUENCY ITEMS RETAINED
The DOF Immature/Withdrawn syndrome in- ON THE DOF
cludes 10 items describing problems of immatu- To finalize the DOF, we examined the frequency
rity and disorganization, along with withdrawn be- distributions of averaged item ratings for the
havior. Items with the highest factor loadings were: sample of 649 referred children who were rated on
1.Acts too young for age; 49. Avoids or is reluc- the 1986 DOF or 2003 DOF in their classrooms,
tant to do tasks that require sustained mental ef- plus 232 of the referred children who were rated
fort; and 59. Wants to quit or does quit tasks. Five on the 1986 DOF during recess. We retained items
items (25, 26, 39, 49, and 59) were consistent with that were scored present (>0.00) for >2% of the
items comprising a DOF DSM-oriented Inatten- classroom and recess samples. We retained 23
tion scale described in a later section. Item 75. With- items from classroom observations and an addi-
drawn, doesn’t get involved with others was con- tional 7 items from recess observations. These 30
sistent with similar items on the CBCL/6-18, TRF, items, plus 6 additional items that were not on the
and TOF Withdrawn/Depressed syndrome, as well five DOF syndromes, and open-ended item 89.
as the 1986 DOF Withdrawn-Inattentive syndrome. Other problems not listed above were grouped to-
The DOF Attention Problems syndrome in- gether as “Other Problems,” as shown in Table 6-
cludes eight items describing difficulty with atten- 3. The 37 Other Problems, plus the 52 items on the
tion and restlessness. Items with the highest factor DOF syndromes, are included in the final 2009
loadings were: 7. Doesn’t concentrate or doesn’t version of the DOF, which thus has 88 specific
pay attention for long; 9. Doesn’t sit still, restless, problem items, plus one open-ended item. The 0-
or hyperactive; and 13. Fidgets, including with ob- 1-2-3 ratings on all 89 items are summed to com-
jects. Four items (7, 9, 13, and 56) were consistent pute the DOF Total Problems score, as explained
with items comprising the 1986 DOF Hyperactive in a later section.
scale and the TOF and TRF Attention Problems
scales. AGGRESSIVE BEHAVIOR SYNDROME
FOR RECESS OBSERVATIONS
The DOF Oppositional syndrome was the most
robust factor to emerge from our analyses. It in- As explained in previous sections, five DOF
cludes12 items that reflect oppositional or unco- syndrome scales were derived from observations
operative behavior. The highest loading items were: of children in classroom settings. Because activi-
5. Defiant or talks back to staff; 20. Disobedient; ties in classrooms are often teacher-directed and
and 22. Doesn’t seem to feel guilty after misbehav- structured around a curriculum, children may be
ing. Five items (3, 5, 20, 52, and 105) were consis- less likely to exhibit certain types of problem be-
tent with items on the TOF Oppositional syndrome. haviors in the classroom than in less structured
The Oppositional syndrome also contains items settings, such as recess. Examples are getting into
with counterparts on the CBCL/6-18 and TRF Ag- fights, teasing, and being teased. To determine
whether there were any syndromes for recess ob-
Table 6-3
DOF “Other Problems” Item Set
DOF Items
4. Cheats
6. Brags, boasts
10. Clings to adults or too dependent
12. Cries
14. Cruel, bullies, or mean to others
18. Destroys own things
19. Destroys property belonging to others
28. Out of seat
29. Gets hurt, accident prone
30. Gets in physical fights
31. Gets teased
35. Lies
36. Bites fingernails
37. Nervous, highstrung, or tense
38. Nervous movements, twitching, tics, or other unusual movements (describe):
40. Too fearful or anxious
41. Physically attacks people
47. Screams
48. Secretive, keeps things to self, including refusal to show things to teacher
50. Self-conscious or easily embarrassed
54. Explosive or unpredictable behavior
58. Speech problem (describe):
62. Stubborn, sullen, or irritable
63. Sulks
64. Swears or uses obscene language
66. Teases
67. Temper tantrums, hot temper, or seems angry
68. Threatens people
69. Too concerned with neatness or cleanliness
73. Overly anxious to please
79. Tattles
80. Repeats behavior over & over; compulsions (describe):
84. Runs out of class (or similar setting)
85. Behaves irresponsibly (describe):
86. Bossy
88. Afraid to make mistakes
89. Other problems not listed above
Table 6-4
Factor Loadings of Items on the DOF Aggressive Behavior
Syndrome Scale for Recess Observations
DOF Items Factor Loading
14. Cruel, bullies, or mean to others .40

30. Gets in physical fights .52
31. Gets teased .21
41. Physically attacks people .45
47. Screams .23
63. Sulks .34
66. Teases .24
79. Tattles .33
86. Bossy .23
Note. N = 480 for Unweighted Least Squares single-factor analyses of averaged item scores; referred
children, n = 232; matched controls, n = 248. Values in bold show the three highest factor loadings.
servations that were not identified in classroom syndromes for classroom observations, we retained
observations, we performed additional factor analy- items with (a) factor loadings significant at p <.01,
ses of 35 items from the “Other Problems” shown i.e., the estimated factor loading had to exceed its
in Table 6-3. We excluded item 28. Out of seat, standard error by at least 2.57, and (b) loadings
which was not on the 1986 DOF, and would not >.20. Table 6.4 shows the factor loadings for the
have been relevant for recess observations. We also nine items that met these criteria for a recess ob-
excluded open-ended item 89. The factor analyses servation scale, which we labeled Aggressive Be-
were performed on the sample of 232 clinically re- havior. As expected, the Aggressive Behavior syn-
ferred children ages 6-11 whose recess observa- drome included problems with physical aggression
tions were rated on the 1986 DOF, plus 248 matched as well as other social problems, such as teasing
control children. (Of the 232 referred children, 124 and being teased. The three highest loading items
had two matched control children in the same re- were: 30. Gets into physical fights; 41. Physically
cess setting.) Each child was rated on the DOF for attacks people; and 14. Cruel, bullies, or mean to
two 10-minute observations during recess, alter- others. Most of the items comprising the DOF
nating between control and referred children. Aggressive Behavior syndrome for recess obser-
vations have counterparts on the CBCL/6-18 and
From the total sample of 480 children observed
TRF Aggressive Behavior syndromes (Achenbach
at recess, we obtained frequency distributions of
& Rescorla, 2001).
averaged 0, 1, 2, 3 item ratings for each of the 35
items. From the frequency distributions of averaged
item ratings, we identified 12 DOF items that were
DSM-ORIENTED ATTENTION DEFICIT/
scored present (>0.00) for >5% of referred chil- HYPERACTIVITY PROBLEMS AND
dren and >3% of control children. We then applied INATTENTION AND HYPERACTIVITY-
single-factor ULS analyses to the 12 candidate IMPULSIVITY SUBSCALES
items to test the unidimensionality of a single-fac- To aid practitioners and researchers in diagnos-
tor solution. Consistent with criteria for the five tic assessments, Achenbach and Rescorla (2001)
constructed DSM-oriented scales comprising tained mental effort. Three other DOF items were
CBCL/6-18, TRF, and YSR items that mental also consistent with DSM-IV ADHD symptoms:
health experts judged to be very consistent with 28.Out of seat; 45. Responds before instructions
DSM-IV (American Psychiatric Association, 1994) are completed; and 55. Demands must be met im-
diagnostic categories. To do this, they asked the mediately, easily frustrated. To cover all possible
experts to rate items from all three ASEBA forms DSM-IV ADHD symptoms, we added the above
as very consistent, somewhat consistent, or not 11 items to the 12 items that the experts judged to
consistent with descriptive criteria for several be very consistent with DSM-IV ADHD symptoms.
DSM-IV diagnostic categories. The raters were 22
We then assigned the 23 items to Inattention
highly experienced child psychiatrists and psy-
and Hyperactivity-Impulsivity subscales, as shown
chologists from 16 cultures. All the raters had pub-
in Table 6-5. Of these 23 items, 21 were similar to
lished research on children’s behavioral and emo-
items on the TOF Attention Deficit/Hyperactivity
tional problems. Raters were given the DSM-IV
Problems scale and its Inattention and Hyperac-
criteria for guidance, but one-to-one matching of
tivity-Impulsivity subscales (McConaughy &
DSM-IV criteria to ASEBA items was not neces-
Achenbach, 2004). Items in italic are similar to
sary to justify ratings of very consistent. Some
items identified by experts for the CBCL/6-18 and
ASEBA items could thus be judged as very con-
TRF Attention Deficit/Hyperactivity Problems
sistent with the experts’ concepts of particular
scales, while non-italicized items are the additional
DSM-IV categories, even if the DSM-IV criteria
DOF items consistent with DSM-IV symptoms.
did not include precise counterparts of the ASEBA
The Attention Deficit/Hyperactivity Problems To-
items. ASEBA items that were rated as very con-
tal score is the sum of the 0, 1, 2, and 3 ratings for
sistent with the DSM-IV categories by at least 14
all 23 items.
of the 22 raters were grouped into six DSM-ori-
ented scales: Affective Problems, Anxiety Prob-
lems, Somatic Problems, Attention Deficit/Hyper-
NORMATIVE SAMPLE
activity Problems, Oppositional Defiant Problems, For classroom observations, the DOF norma-
and Conduct Problems (for details, see Achenbach tive sample included 661 children ages 6-11, as
& Rescorla, 2001). shown in Table 6-6. These were randomly selected
To create the DOF DSM-oriented Attention children in general education classrooms in four
Deficit/Hyperactivity Problems scale, we selected states: Arizona (n = 65), New York (n = 146), Penn-
DOF items that were comparable to the CBCL/6- sylvania (n = 172), and Vermont (n = 278). The
18 and TRF items that the experts rated as very DOF normative sample for recess observations
consistent with the DSM-IV diagnosis of ADHD. included 244 Vermont children ages 6-11, who
We identified 12 DOF items that were similar to were a subsample of the normative sample for
CBCL/6-18 and TRF items. classroom observations. Each child in the norma-
tive samples was observed and rated on the DOF
As indicated earlier, to develop the 2003 DOF, for two to four 10-minute periods for classroom
we also wrote new items to tap DSM-IV symp- observations and two 10-minute observations for
toms of ADHD that were not already covered by recess observations. The 0-1-2-3 ratings on each
other items: 8.Difficulty waiting turn in activities of the DOF items were averaged across the 2 to 4
or tasks; 23. Doesn’t seem to listen to what is be- DOFs for each child. The averaged item scores
ing said; 25. Difficulty organizing activities or were then summed to obtain total raw scores for
tasks; 26. Fails to give close attention to details; each of the relevant DOF scales for classroom ob-
27. Forgetful in activities or tasks; 39. Loses things; servations and recess observations.
43. Runs about or climbs excessively; and 49.
Avoids or is reluctant to do tasks that require sus- To test age and gender differences in the nor-
Table 6-5
Items Comprising the DOF DSM-Oriented Attention Deficit/Hyperactivity Problems Scale
and Inattention and Hyperactivity-Impulsivity Subscales
Inattention Subscale
7. Doesn’t concentrate or pay attention for long
16. Difficulty following directions
23. Doesn’t seem to listen to what is being said
25. Difficulty organizing activities or tasks
26. Fails to give close attention to details
27. Forgetful in activities or tasks
39. Loses things
49. Avoids or is reluctant to do tasks that require sustained mental effort
56. Easily distracted by external stimuli
59. Wants to quit or does quit tasks
Hyperactivity-Impulsivity Subscale
8. Difficulty waiting turn in activities or tasks
9. Doesn’t sit still, restless, or hyperactive
13. Fidgets, including with objects
21. Disturbs others
28. Out of seat
32. Interrupts
33. Impulsive or acts without thinking, including calling out in class
43. Runs about or climbs excessively
45. Responds before instructions are completed
46. Disrupts group activities
55. Demands must be met immediately, easily frustrated
65. Talks too much
72. Unusually loud
Note. Items in italics have counterparts on the CBCL/6-18 and TRF Attention Deficit/Hyperactivity
Problems scales. All but two DOF items (21 and 46) have counterparts on the TOF Attention Deficit/
Hyperactivity Problems scale. The Attention Deficit/Hyperactivity Problems scale score is the sum of
0-1-2-3 ratings on the Inattention and Hyperactivity-Impulsivity subscales.
mative sample, we performed a 2 (ages 6-8 vs. ages Total Problems-Classroom, Aggressive Behavior,
9-11) x 2 (boys vs. girls) MANOVA on raw scale Total Problems-Recess, and On-task scores. As
scores for the five DOF syndromes, followed by shown in Table 6-7, boys scored significantly
univariate 2 x 2 ANOVAs on scores for each syn- higher than girls on 6 of 10 DOF scales for class-
drome scale. We performed a similar 2 x 2 room observations. There were no significant gen-
MANOVA, followed by univariate ANOVAs, on der differences for recess observations. Significant
the DOF Inattention and Hyperactivity-Impulsiv- age effects were found only on the Immature/With-
ity scales, and 2 x 2 univariate ANOVAs on the drawn syndrome, on which children ages 6-8
Attention Deficit/Hyperactivity Problems scale, scored significantly higher (Mean = .23, SD = .58)
Table 6-6
Characteristics of Normative Samples for the DOF
Boys Girls Total
Ages
6 79 45 124
7 92 44 136
8 68 59 127
9 73 43 116
10 56 26 82
11 35 41 76
Total 403 258 661
Recess Observations
Ages 32 6 38
6 26 18 44
7 34 18 52
9 32 18 50
10 28 8 36
11 18 6 24
Total 170 74 244
Ethnicitya
Native American 8.9%
Asian 2.1%
Mixed or Other 0.3%
Unknown 1.2%
a
Percentages of total N = 661 for classroom observations. (Recess observations were obtained on a
subsample of children used for classroom observations.)
than children ages 9-11 (Mean = .12, SD = .40), p the items of the DOF problem scales provide con-
= .005, Eta2 = .012. We constructed norms sepa- tinuous distributions of scores that indicate the
rately for boys and girls in each setting, as described degree to which problems are reported for each
in the next section. child on each scale. The DOF On-task score also
provides continuous raw scores ranging from 0 to
ASSIGNING NORMALIZED T SCORES 10 in 0.5 increments. These raw scale scores are
TO RAW SCORES especially useful for statistical analyses, because
they reflect all the variation that is possible on
The sums of the averaged 1, 2, and 3 ratings on each scale. To help users see how an individual
Table 6-7
Means and Standard Deviations of DOF Raw Scale Scores for the Normative Samples
Boys Girls
DOF Scales Mean SD Mean SD Eta2
Empirically Based Scales
Sluggish Cognitive Tempo .91a 1.27 .68 .91 .010
Immature/Withdrawn .21 .56 .13 .44 ns
Attention Problems 4.45a 3.10 3.90 2.66 .007
Intrusive 1.07 1.54 .99 1.43 ns
Oppositional .97a 1.57 .67 1.31 .009
Total Problems-Classroom 8.79a 5.95 7.36 5.09 .014
DSM-Oriented Scales
Attention Deficit/Hyperactivity Problems 5.41a 3.95 4.70 3.52b .007
Inattention subscale 1.76 1.91 1.53 1.70 ns
Hyperactivity-Impulsivity subscale 3.66a 2.63 3.17 3.39 .008
On-task 8.64 1.57 8.86 1.57 ns
Recess Observations
Aggressive Behavior .48 .75 .49 .75 ns
Total Problems-Recess 1.56 2.07 1.56 2.15 ns
Note. N = 661 for classroom observations; N = 244 for recess observations.

a
Boys > girls, p <.05.
b
Not significant when corrected for the number of comparisons (Sakoda, Cohen & Beall, 1954).
child’s scores on each scale compare with scores Scales

from the normative sample, we assigned normal-
We assigned normalized T scores to the total
ized T scores to the total raw scores for each DOF
raw scores of each DOF problem scale according
scale. The T scores are standard scores that com-
pare the child’s standing on a scale with the distri- to the percentiles found for the raw score distribu-
tions in each normative sample. For each DOF
bution of scores obtained by the normative
scale, we computed “midpoint” percentiles for each
sample of children of the same gender for the same
total raw score according to procedures specified
setting (classroom or recess). This enables users
by Crocker and Algina (1986, p. 439). According
to see whether a child’s scale scores are high or
to this procedure, a raw score that occupies a par-
low compared to “normal” peers. The T scores also
ticular percentile of the cumulative frequency dis-
enable users to compare a child’s standing on each
tribution is assumed to also occupy all the next
scale with the child’s standing on the other scales.
lower percentiles down to the percentile occupied
Assigning T scores to the DOF Problem by the next lower raw score. To re-present the range
of percentiles occupied by a raw score, the raw If we based T scores directly on midpoint per-
score is assigned to the midpoint of the percentiles centiles, the lowest T score for the Attention Prob-
that it occupies. lems syndrome for boys 6-11 would be 32, reflect-
ing the 4th midpoint percentile for boys who ob-
For example, on the DOF Attention Problems
tained a score of 0. By contrast, the lowest T score
syndrome for classroom observations, we found
for the Oppositional syndrome would be 43, re-
that a raw score of 6 spanned the 71st through 75th
flecting the 22nd midpoint percentile for boys 6-11
percentiles for the normative sample of 6-11-year-
who obtained a score of 0 on this syndrome scale.
old boys. We therefore assigned the score of 6 to
If these T scores were displayed on a profile for a
the 73rd percentile, which is midway between the
boy whose score was 0 on both syndrome scales,
71st and 75th percentiles. According to the proce-
the T score of 43 might suggest that the boy had
dure for assigning normalized T scores to raw
more problems on the Oppositional syndrome
scores (Abramowitz & Stegun, 1968), the 73rd per-
scale, than on the Attention Problems syndrome
centile score should get a T score of 56. To pro-
scale where the boy’s T score would be 32. This
vide a common metric for the five DOF syndrome
difference in T scores would mask the fact that the
scales for classroom observations, we assigned nor-
boy really had no problems on either syndrome
malized T scores from 50 through 70 according to
scale.
the midpoint percentile procedures.
To avoid misleading impressions like those de-
We followed the same midpoint percentile pro-
scribed above, we truncated the assignment of T
cedure for assigning normalized T scores from 50
scores, as recommended by Petersen, Kolen, and
to 70 to all the DOF problem scales: the five syn-
Hoover (1993), and as done for other ASEBA forms
dromes, DSM-oriented Attention Deficit/Hyperac-
(Achenbach & Rescorla, 2000, 2001). To equalize
tivity Problems scale, Inattention and Hyperactiv-
the starting points for the five syndrome scales for
ity-Impulsivity subscales, and Total Problems for
classroom observations and the DSM-oriented At-
classroom observations, as well as the Aggressive
tention Deficit/Hyperactivity Problems scale and
Behavior syndrome and Total Problems for recess
Inattention and Hyperactivity-Impulsivity
observations. Procedures for truncating lower T
subscales, we assigned a T score of 50 to raw scores
scores at 50 and for assigning T scores >70 are
that fell at approximately the 50th percentile and
described below. Procedures for assigning lower
lower.
and higher T scores to DOF Total Problems scores
for classroom observations are described in a sepa- We also truncated T scores at 50 for the Ag-
rate section. gressive Behavior syndrome scale and Total Prob-
lems for recess observations. That is, we assigned
Truncation of Lower T Scores at 50. The raw
a T score of 50 to raw scores of 0 and then based
scores of the DOF problem scales were all posi-
normalized T scores on midpoint percentiles for
tively skewed in the normative sample, with large
Aggressive Behavior and Total Problems-Recess
proportions of children having scale scores of 0.
up to the 98th percentile (T = 70).
That is, more children in the normative sample re-
ceived very low than very high DOF problem Assignment of a T score of 50 to several raw
scores. Furthermore, because high scores are clini- scale scores prevents users from overinterpreting
cally significant on problem scales, it is more im- small differences among scores that are well within
portant for the scales to make finer discriminations the normal range. It also reduces differentiation
among high scores than among low scores that are among low scores. However, loss of such differ-
at the bottom of the normal range. entiation is of little practical importance, because
it involves differences that are all at the low end of
the normal range. If users nevertheless wish to pre-
serve differences at the low end of the normal We followed the same procedure for assigning
range, they can focus on the total raw scale scores. T scores above 70 to the DSM-oriented Attention
For statistical analyses that do not involve com- Deficit/Hyperactivity Problems scale, the Inatten-
bining data across genders, raw scale scores are tion and Hyperactivity-Impulsivity subscales, and
usually preferable, because they directly reflect all the Aggressive Behavior syndrome for recess ob-
differences among scores without the effects of servations. Our procedures for assigning T scores
truncation or other transformations. to Total Problems are described below.
Assigning T Scores Above 70 (>98th Percen- Assigning T scores to Total Problems
tile). Most children in the normative samples ob-
The DOF Total Problems score consists of the
tained scores that were well below the maximum
sum of the 1, 2, and 3 ratings on all the specific
possible. It was therefore impossible to base the
problem items of the DOF, plus the highest rating
highest T scores on percentiles, because the high-
(1, 2, or 3) for any problems written by the ob-
est possible scores were spread over a tiny per-
server in the spaces for the open-ended item 89.
centage of children in the normative sample. Be-
Item 89 provides two spaces for adding problems
cause there were hardly any children in the norma-
that are not listed elsewhere. However, only the
tive samples on whom to base T scores above the
highest rating for added items is included in order
98th percentile (T >70), we assigned T scores from
to limit the effects of idiosyncratic problems on
71 to 100 in as many increments as there were re-
the Total Problems score. Separate Total Problems
maining raw scores on each scale.
scores are computed for classroom observations
As an example, on the DOF Attention Problems and recess observations. There are gender-specific
syndrome scale, the raw score of 11 (occupying norms for classroom and recess. To provide norm-
the 98th percentile) was assigned a T score of 70 referenced scores for Total Problems, we computed
for boys 6-11. Because there are eight items on the the scores obtained by each gender within each
scale, the maximum possible score is 24 (i.e., if a setting. We then computed midpoint percentiles ac-
boy received a rating of 3 on all eight items, the cording to the procedure described earlier for the
boy’s raw scale score would be 24.) There are 30 other DOF problem scales. We assigned T scores
intervals from 71 to 100, but 26 possible raw scores to midpoint percentiles for Total Problems raw
from 11.5 through 24. (Because of averaging, DOF scores, as described below.
raw scores include scores rounded to .5). To as-
No Truncation of Lower T Scores for Total
sign T scores to the 26 possible raw scores, we
Problems-Classroom. There are more items on the
divided 30 by 26. Because 30/26 = 1.15, we as-
Total Problems scale than on any other scale, and
signed T scores to raw scores in intervals of 1.15.
at least some of the items are endorsed for most
Thus, a raw score of 11.5 was assigned a T score
children. For classroom observations, relatively
of 70 + 1.15 = 71.15, rounded off to 71. A raw
few children in the normative samples obtained ex-
score of 12 was assigned a T score of 71.15 + 1.15
tremely low scores for Total Problems. It was there-
= 72.30, rounded off to 72, and so on. The highest
fore unnecessary to truncate Total Problems T
possible raw score of 24 on Attention Problems
scores at 50 for classroom observations as we did
was assigned a T score of 100. By comparison, on
for other DOF problem scales. For Total Problems-
the Oppositional syndrome, a raw score of 5.5 (oc-
Classroom, the lowest raw score of 0 for boys and
cupying the 98th percentile) was assigned a T score
for girls was assigned a T score of 33 (2nd percen-
of 70 for boys 6-11. The number of items on the
tile). We then based normalized T scores directly
Oppositional syndrome is 12. Therefore, the high-
on midpoint percentiles for scores obtained by the
est possible score on the Oppositional syndrome
normative samples, up to the 98th percentile (T =
is 36, which was assigned a T score of 100.
70).
For consistency in displaying scores on the DOF mined by the predominant activity sampling
Profile, the DOF computer-scoring program does method (i.e., the child must be doing what is ex-
not print Total Problems-Classroom T scores be- pected for more than one half of the 5-second in-
low 50. However, users can obtain these lower T terval). The number of “on-task” intervals are then
scores from the computer-scored data sets. summed for each 10-minute observation period and
are averaged by the DOF computer-scoring pro-
Truncation of Lower T Scores for Total Prob-
gram across multiple observations. The averaged
lems-Recess. For recess observations, 32% of boys
On-task raw score can thus range from 0 to 10, in
and 36% of girls in the normative samples obtained
increments of 0.5.
raw scores of 0 for Total Problems. To take this
positive skew into account, we truncated T scores To provide norm-referenced scores for DOF On-
at 50 for Total Problems-Recess, as done for other task, we obtained averaged raw scores for boys and
DOF problem scales, as explained earlier. girls in the normative samples for classroom ob-
servations. We then computed midpoint percentiles
Assigning T Scores Above 70 (>98th Percen-
according to the procedures described earlier for
tile) for Total Problems. No children in the nor-
the DOF problem scales. The raw scores for DOF
mative or referred samples obtained DOF Total
On-task were all negatively skewed in the norma-
Problems scores close to the maximum scores pos-
tive samples. That is, fewer children in the norma-
sible. If we followed the same procedure as for the
tive sample received very low than received very
other problem scales, we would have compressed
high On-task scores. Furthermore, because low
the Total Problems scores actually obtained into a
scores are clinically significant for On-task, it is
narrow range of T scores. We would also have as-
more important to make finer discriminations
signed a relatively broad range of T scores to raw
among low scores than among high scores. To take
scores obtained by few or no children. To enable
account of the negatively skewed On-task scores
the upper Total Problems-Classroom and Total
and the need for finer discrimination among low
Problems-Recess T scores to reflect differences
than high scores, we assigned T scores to raw scores
among the raw scores that are most likely to occur,
in the following ways:
we did the following: (a) we identified the five
highest scores obtained by boys and girls in the 1. At the low end of the On-task scale, we
normative and referred samples combined, sepa- assigned a T score of 20 to On-task scores
rately for classroom and recess; (b) we computed of 0 for both boys and girls. We then
the mean of the five highest scores for each gender assigned T scores to raw scores of 0.5 to
in each setting; (c) we assigned a T score of 89 to 9.5 based on the midpoint per-centiles. The
the mean of the five highest raw scores for each T scores ranged from 21 to 51 (53rd
gender in each setting; (d) we then assigned T percentile) for girls and 21 to 53 (62nd
scores 90 through 100 in equal intervals to the raw percentile) for boys.
scores that were above those that had been assigned
2. We assigned a T score of 60 to the
T = 89. We followed these procedures for Total
highest possible On-task raw score of
Problems-Classroom T scores >70 and Total Prob-
10 for both boys and girls, which was
lems-Recess T scores >70.
above the 80 th percentile for both
Assigning T Scores to DOF On-Task genders.
DOF On-task is only scored for classroom ob-
servations. To score On-task, an observer records
MEAN T SCORES
whether the child is “on-task” or “off-task” in the Appendix A shows the mean DOF T scores and
last 5 seconds of each 1-minute interval for each raw scores for the normative samples of boys and
10-minute observation period. On-task is deter- girls for classroom observations and recess obser-
vations. For all DOF problem scales, except Total Hyperactivity-Impulsivity subscales for classroom
Problems-Classroom, raw scale score distributions observations, and the Aggressive Behavior scale
are positively skewed and low scores are truncated for recess observations. The borderline range indi-
at T = 50. Consequently, the mean T scores are cates scores that are high enough to be of concern,
above 50 and their standard deviations are below but not so high as to be clearly deviant. T scores
10 in the normative samples. Raw scores are less >69 (>97th percentile) are considered to be in the
skewed for DOF Total Problems-Classroom. Thus, clinical range. T scores below 65 (<93rd percen-
the mean T scores for DOF Total Problems-Class- tile) are considered to be in the normal range.
room are closer to 50, and their standard devia-
T scores from 60 to 63 (84th through 90th per-
tions are closer to 10 in the normative samples.
centiles) are considered to be in the borderline clini-
In contrast to the DOF problem scales, On-task cal range for DOF Total Problems-Classroom and
scores are negatively skewed and high scores are Total Problems-Recess. T scores >63 (>90th per-
truncated at T = 60. Thus, the mean T scores for centile) are considered to be in the clinical range.
on-task are below 50 and their standard deviations T scores below 60 (<84th percentile) are consid-
are below 10 in the normative samples. ered to be in the normal range.
Users should thus keep in mind that the T scores For DOF On-task, T scores from 31 to 35 (3rd to
for most DOF problem scales and T scores for On- 7 percentiles) are considered to be in the border-
th
task deviate from the mean of 50 and standard de- line clinical range. T scores <31 (<3rd percentile)
viation of 10 expected when normal bell-shaped are considered to be in the clinical range. T scores
distributions are transformed directly into T scores. above 35 (>7th percentile) are considered to be in
Users should also keep in mind that the means and the normal range. DOF On-task is scored only for
standard deviations of the DOF scales may vary classroom observations.
from one sample of children to another. In particu-
As reported in Chapter 8, children who were
lar, the means and standard deviations for prob-
referred for mental health or special education ser-
lem scale scores obtained by samples of children
vices scored significantly higher on the DOF prob-
referred for mental health services are typically
lem scales and On-task than matched samples of
higher than for nonreferred children. Examples of
nonreferred control children in the same settings.
this can be seen in Appendix B, which displays
Because scores on the DOF problem scales are
means and standard deviations for scale scores ob-
quantitative measures of the number and degree
tained by matched samples of referred children and
of problems observed for a child, the scores are
nonreferred control children observed in the same
not intended to mark categorical differences be-
settings. Scores for referred children are often less
tween children who are “sick” vs. “well.” Instead,
skewed than for nonreferred children, because
the borderline and clinical ranges help users iden-
fewer referred children obtain very low scores.
tify scores that are of enough concern to warrant
consideration for professional help. Users may
NORMAL, BORDERLINE, AND
choose to apply higher or lower cutpoints for their
CLINICAL RANGES own clinical or research purposes. For example,
On the computer-scored DOF Profile shown in for some cases, or for certain clinical or research
Chapter 3, broken lines are printed across the purposes, scores at the high end of the normal range
graphic displays to demarcate borderline and clini- (e.g., >90th percentile) on the syndrome scales or
cal ranges for DOF scale scores. T scores from 65 Attention Deficit/Hyperactivity Problems scale
to 69 (93rd through 97th percentiles) are considered may also warrant concern. If you wish to classify
to be in the borderline clinical range for the DOF children’s scores dichotomously as clearly in the
syndrome scales, DSM-oriented Attention Deficit/ normal vs. clinical range on the DOF syndrome
Hyperactivity Problems scale and Inattention and scales, Attention Deficit/Hyperactivity Problems
scale, and Aggressive Behavior, we suggest using children’s behavior in classroom and recess set-
T scores below 65 to designate the normal range tings. The DOF covers a 10-minute observation
vs. T scores >65 to designate the clinical range. window. We recommend obtaining 3 to 6 DOFs
For DOF Total Problems, we suggest using T scores for each identified child, plus additional DOFs for
below 60 to designate the normal range vs. T scores control children in the same setting. The 2009 ver-
>60 to designate the clinical range. For DOF On- sion of the DOF contains 88 specific problem
task, we suggest using T scores above 35 to desig- items, plus one open-ended item for “other prob-
nate the normal range vs. T scores <35 to desig- lems.” Each problem is rated on a 0-1-2-3 scale
nate the clinical range. ranging from 0 = no occurrence to 3 = definite oc-
currence with severe intensity, high frequency, or
SUMMARY
3 or more minutes total duration. The DOF also
We developed the DOF from observations of includes an On-task score ranging from 0 to 10,
which can easily be converted to a percentage.
We constructed the DOF syndromes by apply-
ing exploratory and confirmatory factor-analytic
methodology similar to procedures used for other
ASEBA forms, including the CBCL/6-18, TRF,
YSR, SCICA, and TOF. The factor analyses yielded
five syndromes: Sluggish Cognitive Tempo, Im-
mature/Withdrawn, Attention Problems, Intrusive,
and Oppositional. The Attention Problems syn-
drome was similar to syndromes derived from the
CBCL/6-18, TRF, YSR, SCICA, and TOF. The
Oppositional syndrome was similar to the Oppo-
sitional syndrome on the TOF and the Self-Con-
trol Problems syndrome of the SCICA. The Slug-
gish Cognitive Tempo syndrome was similar to the
2007 Sluggish Cognitive Tempo scales scored from
the CBCL/6-18 and TRF.
In addition to the syndrome scales for classroom
observations, we constructed an Attention Deficit/
Hyperactivity Problems scale comprised of items
consistent with DSM-IV symptoms of ADHD. The
Attention Deficit/Hyperactivity Problems scale has
subscales for Inattention and Hyperactivity-Impul-
sivity. We also constructed an Aggressive Behav-
ior syndrome scale that can be scored from recess
observations. DOF Total Problems can be scored
separately for classroom and recess observations
by summing the 0-1-2-3 ratings for the 89 prob-
lem items.
The DOF scales are normed separately for boys
and girls ages 6-11. We assigned normalized T
scores to raw scores on each scale. The T scores
enable users to compare children with peers across
all scales and to compare a child’s standing on each
syndrome with the same child’s standing on each
of the other syndromes. To prevent over-interpre-
Chapter 7
Reliability of the DOF
Reliability refers to agreement between repeated and >.50 large.

assessments when the phenomena being assessed
are expected to remain constant. The DOF is de- INTER-RATER RELIABILITY
signed to obtain observers’ ratings of children’s
functioning in group settings. To assess the reli- To assess inter-rater reliabilities for classroom
ability of such observations, it is important to know observations, pairs of trained observers used the
the degree to which two observers obtain similar DOF to rate one to four 10-minute observations of
results for the same child in the same observation 212 randomly selected children in elementary
period, i.e., the degree of inter-rater reliability. We school classrooms in Pennsylvania, New York, and
present inter-rater reliability between pairs of ob- Vermont. The sample of 212 children included 112
servers for classroom observations of 212 children boys and 100 girls, ages 6-11. Of these, 58 chil-
and for recess observations of 17 children. dren were rated by five pairs of observers in greater
Philadelphia, Pennsylvania; 91 children were rated
It is also important to know the degree to which by four pairs of observers in greater Syracuse, New
observers obtain similar results over periods when York; and 63 children were rated by three pairs of
children’s behavior is not expected to change much, observers in greater Burlington, Vermont, for a to-
i.e., test-retest reliability. In this chapter, we present tal of 12 observer pairs. For training, each pair of
test-test reliability for DOFs completed for two observers simultaneously rated five practice cases
separate sets of observations of 27 children over to learn the DOF procedures, as described in Chap-
intervals averaging 12 days. ter 4. Following training, the observer pairs inde-
Some users may be interested in the internal pendently used the DOF to simultaneously rate 14
consistency of the DOF scales. This refers to the to 24 anonymously selected children. Observers
correlation between half of a scale’s items and the were instructed not to discuss their ratings with
other half of the items. We report Cronbach’s each other until after all reliability data were col-
(1951) alpha as a measure of internal consistency lected. The number of observation periods per child
for each DOF scale for separate samples of referred varied across observer pairs. Nine observer pairs
children and control children in the same settings. completed one DOF per child per observer, while
three observer pairs completed 2 to 4 DOFs per
For direct observations of behavior, reliability child per observer.
coefficients >.70 are generally considered good for
low-stakes screening or program evaluation, while To assess inter-rater reliabilities for recess ob-
coefficients closer to .90 are desirable for high- servations, one pair of trained observers used the
stakes eligibility or diagnostic decisions DOF to rate two 10-minute observations during
(Chafouleas, Christ, Riley-Tillman, Briesch, & recess (and lunch) for 17 anonymously selected
Chanese, 2007; Hintze & Matthews, 2004). In children (14 boys and 3 girls) in a Vermont school
terms of effect sizes, Cohen (1988) considers for children with behavioral/emotional disorders.
Pearson rs of .10 to .29 small, .30 to .49 medium, When multiple observations were obtained per
91
92 7. Reliability of the DOF
child, we averaged the 0-1-2-3 ratings across DOFs in Chapter 6. For recess observations, inter-rater
to obtain an average rating for each of the 88 items reliabilities were .73 for the Aggressive Behavior
for each observer. We then summed the average syndrome and .97 for Total Problems.
ratings for relevant items to obtain raw scores for
The second column in Table 7-1 shows inter-
each DOF problem scale. We also averaged On-
rater reliabilities derived from raw scale scores
task scores across multiple DOFs per child per
obtained only on the first 10-minute observation
observer. When only one DOF was obtained per
with the DOF. The correlations were generally simi-
child per observer, we summed the 0-1-2-3 ratings
lar to those shown in the first column for scores
for relevant items to obtain raw scores for each
averaged across 1 to 4 DOFs: mean r = .78 versus
DOF problem scale and computed the On-task
.79 for all nine problem scales for classroom ob-
score per child per observer.
servations and mean r = .94 versus .97 for On-task.
To obtain reliabilities for classroom observa- 8. For the problem scales, seven rs for scores aver-
tions, we computed Pearson rs between raw scale aged across 1 to 4 DOFs (column 1) were within
scores separately for 10 DOF scales for each of .02 r values for scores obtained from 1 DOF (col-
the 12 observer pairs. Of the 120 Pearson rs for umn 2). To further examine inter-rater reliability
classroom observations, 106 were significant at p for one versus multiple observations per child, we
<.05. We converted each r to Fisher’s z and then computed average rs by Fisher’s z transformation
obtained a mean z for each DOF scale across the for the nine observer pairs who obtained only one
12 observer pairs. We also averaged Fisher’s z DOF per child versus the three observer pairs who
scores across the six DOF empirically based prob- obtained 2 to 4 DOFs per child. For observer pairs
lem scales, the three DSM-oriented scales, and all with only one DOF per child, the mean r was .82
nine problem scales. We converted the mean z across the nine problem scales for classroom ob-
scores back to r for each DOF scale. We also con- servations and .94 for On-task. For observer pairs
verted mean z scores back to r to obtain the mean who obtained 2 to 4 DOFs per child, the mean r
r of the six empirically based scales, mean r of the was .75 across the nine problem scales and .99 for
three DSM-oriented scales, and mean r of all nine On-task.
problem scales. Inter-rater reliabilities for the Ag-
The above findings indicate that inter-rater re-
gressive Behavior syndrome and Total Problems-
liability was generally similar for observer pairs
Recess were obtained directly for one pair of ob-
obtaining only one 10-minute observation per child
servers. Both rs were significant at p <.001.
versus multiple 10-minute observations per child.
As can be seen in the first column of Table 7-1, The small differences between rs are useful to con-
inter-rater reliabilities for the empirically based sider for training purposes, since obtaining mul-
scales ranged from .71 for the Oppositional syn- tiple DOFs per child is more time and labor inten-
drome to .87 for Sluggish Cognitive Tempo and sive than obtaining only one DOF per child. As
.88 for Total Problems-Classroom, with a mean r can be seen in Table 7-1, for most scales, good
of .80. For the DSM-oriented scales, inter-rater inter-rater reliability can be obtained with only one
reliabilities were .80 for the Attention Deficit/Hy- 10-minute observation per child. Chapter 4 dis-
peractivity Problems scale, .70 for the Inattention cusses procedures for training DOF observers.
subscale, and .81 for the Hyperactivity-Impulsiv-
As described in Chapter 6, revisions of the DOF
ity subscale, with a mean r of .77. The mean inter-
entailed adding and testing new items as well as
rater r was .79 across all nine problem scales and
writing rules for rating various items. We analyzed
.97 for On-task. For classroom observations, the
findings from various revisions to identify any sig-
inter-rater reliabilities for DOF Total Problems and
nificant effects on inter-rater reliability. Computed
On-task scores were consistent with previous find-
across the 12 observer pairs, we found similar mean
ings on earlier versions of the DOF, as discussed
7. Reliability of the DOF 93
Table 7-1
Inter-Rater Reliabilities for DOF Scales
Inter-Rater r Inter-Rater r
Scores averaged Scores for first
across 1 to 4 DOFs DOF per child
DOF Scale per child
Sluggish Cognitive Tempo .87 .86
Immature/Withdrawn .79 .73
Attention Problems .72 .74
Intrusive .78 .72
Oppositional .71 .71
Total Problems-Classroom .88 .86
Mean r for empirically based scales .80 .78
DSM-Oriented Scales
Attention Deficit/Hyperactivity Problems .80 .80
Inattention subscale .70 .72
Hyperactivity-Impulsivity subscale .81 .80
Mean r for DSM-oriented scales .77 .78
Mean r for all problem scales .79 .78
On-task .97 .94
Recess Observations
Aggressive Behavior .73 .83
Total Problems-Recess .97 .98
Note. N = 212 for classroom observations; N = 17 for recess observations. For classroom observations,
inter-rater rs were obtained for each of 12 pairs of observers. Mean rs were then computed for each scale
by Fisher z transformation. For recess observations, inter-rater rs were obtained from one pair of observers.
Mean rs across sets of scales for classroom and recess observations were computed by Fisher’s z
transformation.
inter-rater reliabilities for DOF Total Problems items that had scoring rules versus 45 retained
scores computed from the 88 items retained on items without scoring rules (mean r = .81 versus
the 2009 DOF versus Total Problems scores com- .76, respectively).
puted from the 96 items of the 1986 DOF (mean r
= .86 versus .83, respectively). We found similar TEST-RETEST RELIABILITY
mean inter-rater reliabilities for the 43 retained
To assess test-retest reliability, we computed
Pearson rs for DOFs completed for 27 children, reliability was also low for On-task scores. The
who were rated by the same observer over inter- lower test-retest reliabilities versus higher inter-
vals of 7 to 22 days (average interval = 12.4 days). rater reliabilities may also be due to the composi-
The test-retest sample included 19 boys and 8 girls tion of our samples. The test-retest sample was
attending Vermont schools. Ages ranged from 6 to comprised only of clinically referred children, some
12 years, with a mean age of 8.4 (S.D. = 1.9). (Only of whom were in treatment, in contrast to anony-
one child was age 12.) The observer obtained four mously selected control children for the inter-rater
10-minute classroom observations for each child reliabilities.
over two days at Time 1 and four 10-minute class-
Pearson r reflects similarities between the rank
room observations over two days at Time 2. The
orders of scores obtained at Time 1 and Time 2. It
0-1-2-3 item ratings were averaged across the four
is high when rankings of individuals’ scores retain
Time 1 observation sessions and across the four
approximately the same rank from Time 1 to Time
Time 2 observation sessions. The averaged item
2. Because it is not affected by the absolute mag-
ratings were then summed to obtain raw scores for
nitude of scores, r can be high even if all the Time
the five DOF syndrome scales, the DSM-oriented
1 scores differ in magnitude from the Time 2 scores.
Attention Deficit/Hyperactivity Problems scale, In-
To test differences in mean scores relative to their
attention and Hyperactivity-Impulsivity subscales,
variance, we performed dependent t tests of differ-
and Total Problems-Classroom. Averaged raw
ences between Time 1 versus Time 2 scores for
scores were also obtained for On-task.
each of the 10 DOF scales. As shown in Table 7-2,
We computed rs between raw scores obtained Time 1 scores differed significantly (p <.05) from
for Time 1 versus Time 2 for each DOF problem Time 2 scores only for the Immature/Withdrawn
scale, plus On-task. Correlations were significant syndrome, which could be a chance effect (Sakoda,
at p <.05 for 8 of 10 scales. As Table 7-2 shows, et al., 1954).
the significant test-retest rs for the empirically
INTERNAL CONSISTENCY
based syndromes ranged from .48 for the Imma-
ture/Withdrawn syndrome to .73 for the Intrusive To assess internal consistency of the DOF
syndrome. The test-retest r for Total Problems- scales, we computed Cronbach’s alpha (1951) for
Classroom was .72. Test-retest rs were .76 for the each DOF scale. Alpha represents the mean of the
DSM-oriented Attention Deficit/Hyperactivity correlations between all possible sets of half the
Problems scale, .43 for the Inattention subscale, items comprising a scale. The magnitude of alpha
and .77 for the Hyperactivity-Impulsivity subscale. tends to be directly related to the length of the scale,
The mean test-retest rs were .53 across the empiri- because half the items of a short scale provide a
cally based scales, .73 across the DSM-oriented less stable measure than half the items of a long
scales, and .58 across all problem scales. The test- scale.
retest r was .42 for On-task.
Although internal consistency is sometimes re-
Test-retest reliabilities were moderate (.72 to ferred to as “split-half reliability,” it is not “reli-
.77) for the Intrusive syndrome, Total Problems- ability” in the sense of measuring how well a scale
Classroom, the DSM-oriented Attention Deficit/ will produce the same results over different occa-
Hyperactivity Problems scale, and the Hyperactiv- sions when the target phenomena are expected to
ity-Impulsivity subscale. Test-retest reliability was remain constant. Furthermore, some scales with
low for the Sluggish Cognitive Tempo, Attention relatively low internal consistency may be more
Problems, and Oppositional syndromes, and the In- valid than other scales with very high internal con-
attention subscale, suggesting that the problems sistency. As an example, if a scale consists of 20
comprising these scales are more variable than the versions of the same item, it should produce very
problems comprising the other scales. Test-retest high internal consistency, because respondents
7. Reliability of the DOF 95
Table 7-2
Test-Retest Reliabilities, Means, and Standard Deviations for DOF Scales
Test- Time 1 DOF Time 2 DOF

DOF Scale Retest r Mean SD Mean SD
Sluggish Cognitive Tempo (.25) 1.09 .84 1.08 .86
Immature/Withdrawn .48 .69b, c .67 .42 .52
Attention Problems (.35) 8.30 1.68 8.00 1.66
Intrusive .73 2.02 1.56 2.37 1.35
Oppositional .49 2.23 1.48 2.37 1.72
Total Problems-Classroom .72 16.58 4.76 16.34 4.70
Mean r empirically based scalesa .53
DSM-Oriented Scales
Attention Deficit/Hyperactivity Problems .76 10.63 3.00 10.13 2.69
Inattention subscale .43 3.77 1.19 3.42 1.25
Hyperactivity-Impulsivity subscale .77 6.86 2.18 6.72 1.78
Mean r for DSM-oriented scalesa .73
Mean r for all problem scalesa .58
On-task .42c 8.53 1.12 8.65 .91
Note. N = 27. All test-retest observations were done in classrooms. Mean test-retest interval = 12.4 days.
All significant Pearson rs were p <.05. Values in parentheses were not significant.
a
Mean r was computed by Fisher’s z transformation.
b
Time 1 DOF > Time 2 DOF, p <.05.
c
Not significant when corrected for the number of comparisons (Sakoda, et al., 1954).
should give similar answers to the 20 versions of As detailed in Chapter 6, the DOF syndrome
the item. However, such a scale would usually be scales were derived from factor analyses of the
less valid than a scale that uses 20 different items correlations among items. The composition of the
to assess the same phenomenon. Because each of syndrome scales is therefore based on internal con-
the 20 different items is likely to tap different as- sistencies among certain subsets of items. Mea-
pects of the target phenomenon and to be subject sures of internal consistency of the syndrome scales
to different errors of measurement, the 20 differ- are thus somewhat redundant with the inter-item
ent items are likely to provide better measurement correlations on which the scales were based. By
despite lower internal consistency than a scale that contrast, the DOF DSM-oriented scales were de-
uses 20 versions of a single item. veloped a priori, based on experts’ ratings of how
consistent items are with a DSM-IV diagnosis of
Table 7-3
Cronbach’s Alpha Coefficients (Internal Consistency) for DOF Scales
DOF Scale Alpha
Sluggish Cognitive Tempo .49
Immature/Withdrawn .76
Attention Problems .67
Intrusive .80
Oppositional .69
Total Problems-Classroom .87
Mean alpha for empirically based scalesa .73
DSM-Oriented Scales
Attention Deficit/Hyperactivity Problems .81
Inattention subscale .68
Hyperactivity-Impulsivity subscale .72
Mean alpha for DSM-oriented scalesa .74
Recess Observations
Aggressive Behavior .56
Total Problems-Recess .70
Note. Cronbach’s alpha was computed for matched samples of referred children and control children in
the same settings. For classroom observations, N = 332; for recess observations, N =248.
a
Mean alpha computed by Fisher’s z transformation.
ADHD (for details, see Chapter 6). Consequently, cally based syndromes, .68 to .81 for the three
internal consistencies for the DSM-oriented scales DSM-oriented scales, and .87 for Total Problems.
are not redundant with inter-item correlations from For recess observations, alphas were .56 for Ag-
factor analyses. gressive Behavior and .70 for Total Problems.
As shown in Table 7-3, we computed alphas, SUMMARY
derived separately from classroom observations of
For classroom observations, the mean inter-rater
332 children and from recess observations of 248
r was .80 across the five empirically based syn-
children. The samples included equal numbers of
dromes and Total Problems and .77 across the three
referred children and randomly selected control
DSM-oriented scales, with an overall mean r of
children of the same gender in the same settings.
.79 across all DOF problem scales. The r of .97 for
The classroom sample included 224 boys and 108
the DOF On-task score and .88 for Total Problems
girls ages 6-11 (mean age = 8.3, SD = 1.7). The
showed high inter-rater reliability, consistent with
recess sample included 174 boys and 74 girls ages
prior research. For recess observations, the inter-
6-12 (mean age = 8.4, SD = 1.6; only two referred
rater r was .73 for Aggressive Behavior and .97
children were age 12). For classroom observations,
for Total Problems.
alphas ranged from .49 to .80 for the five empiri-
To assess test-retest reliability, a trained observer
completed four DOFs at Time 1 and four DOFs at
Time 2 for 27 children observed in classrooms at
Chapter 8
Validity of the DOF
Validity refers to the accuracy with which in- cators. Pilot editions were tested at multiple sites
struments assess what they are supposed to assess. and revised on the basis of feedback from parents,
The DOF is designed to measure independent ob- paraprofessionals, and clinicians. Details of the
servations of children’s behavioral, emotional, and rationale and procedures for selecting ASEBA
social problems in group settings. Data obtained items have been presented in previous manuals for
from the DOF are intended to mesh with data from the instruments (Achenbach, 1991a, b, c;
other sources, particularly ASEBA instruments for Achenbach & Edelbrock, 1983, 1986, 1987).
obtaining parent reports (CBCL/6-18), teacher re-
As explained in Chapter 6, the original 1981
ports (TRF), self-reports (YSR), clinical interviews
and 1986 DOF had 96 problem items, plus an open-
(SCICA), and test session observations (TOF). Like
ended item for describing and rating problems not
other ASEBA instruments for assessing behavioral
specified on the DOF. To assemble the original
and emotional problems, the validity of the DOF
DOF item sets, Achenbach selected items from
must be evaluated in relation to a variety of crite-
early versions of the CBCL and TRF describing
ria, none of which is definitive by itself. In this
problems that might be directly observed in group
chapter, we present evidence for the content valid-
settings. Whenever possible, he retained the origi-
ity and criterion-related validity of the DOF.
nal wording of the CBCL and TRF items, but re-
CONTENT VALIDITY OF DOF ITEMS worded some items slightly to make them more
appropriate for direct observations. For the 2003
The most basic kind of validity is content va-
research edition of the DOF, we retained 95 of the
lidity, which is the degree to which an instrument’s
1986 DOF items and added 19 new items to corre-
content includes what the instrument is intended
spond to TOF items and to tap DSM-IV and DSM-
to measure. The DOF items were modeled on
IV-TR symptoms for ADHD and other problems
CBCL/6-18 and TRF items that are appropriate for
that were not covered by the original DOF items.
direct observations. The TRF includes 97 items
paralleling those of the CBCL/6-18, plus additional Through analyses described in Chapter 6, we
items appropriate to school settings. Nearly all the reduced the item set for the 2009 version of the
CBCL/6-18 and TRF items discriminated signifi- DOF to 88 specific items, plus one open-ended item
cantly (p <.01) between referred and nonreferred for other problems. Of the 88 specific items on the
children (Achenbach & Rescorla, 2001). DOF, there are the following counterpart items on
other ASEBA forms: 51 on the CBCL/6-18, 63 on
Beginning in the 1960s, ASEBA problem items
the TRF, 49 on the YSR, 69 on the TOF, 60 on the
were developed and refined on the basis of research
SCICA-Observation Form, and 35 on the SCICA
and practical experience (Achenbach, 1966;
Self-Report Form. The content validity of the DOF
Achenbach & Lewis, 1978). The procedures for
items is thus strongly supported by nearly four de-
selecting ASEBA problem items included exami-
cades of research, consultation, feedback, and re-
nation of child/adolescent psychiatric case histo-
finement of comparable ASEBA items. In addition,
ries, extensive literature searches, and consultation
63% of the DOF items significantly discriminated
with mental health professionals and special edu-
between clinically referred and control children,
97
98 8. Validity of the DOF
as described in the next section. lected control children in the same settings. From
the samples shown earlier in Table 6-1, we selected
CRITERION-RELATED VALIDITY 6- to-12-year-olds who had been referred for evalu-
Criterion-related validity refers to the degree ation of behavioral and emotional problems and/
of association between a particular measure, such or learning difficulties and had participated in our
as a scale scored from the DOF, and an external research studies using the DOF (for brevity, we call
criterion for characteristics that the scale is intended this group “referred.”). For each referred child, in-
to measure. One of the main reasons for deriving dependent observers selected one or two control
syndrome scales from ASEBA forms was the lack children of the same gender and in the same class-
of an empirically based taxonomy of child psycho- room as the referred child. For classroom observa-
pathology (Achenbach & McConaughy, 1997; tions, the matched samples included 166 referred
Achenbach & Rescorla, 2001). The ASEBA syn- children ages 6-11 and 263 control children. (For
drome scales provide empirically based groupings classroom observations, 97 referred children had
of items that describe children’s behavioral and two matched controls). For recess observations, the
emotional problems, as reported by key informants. matched samples included 124 referred children
In a similar fashion, the DOF items describe be- ages 6-12 and 248 control children. (For recess ob-
havioral and emotional problems that can be observations, all referred children had two matched
served in group settings, such as classrooms and controls; only two referred children were age 12.)
school playgrounds. The DOF syndromes were Referred children with full scale IQ scores <75
derived to provide empirically based scales for were excluded from both samples.
scoring groups of these problem items that tend to
Table 8-1 shows the characteristics of the
co-occur. The DOF DSM-oriented Attention Defi-
matched samples of boys and girls for classroom
cit/Hyperactivity Problems scale was developed for
observations (N = 430) and recess observations (N
scoring problems consistent with a the DSM-IV
= 372). Ethnicity of the sample for classroom ob-
and DSM-IV-TR diagnosis of ADHD.
servations was 85.2% non-Latino White and 14.8%
An important way to test criterion-related va- other ethnicities. Ethnicity of the sample for re-
lidity of the DOF items and scales is to measure cess observations was 100% non-Latino White.
their ability to discriminate between children who,
independently of their DOF scores, have been MANCOVA of DOF Item Ratings
judged to be at risk for emotional or behavioral
problems and have been referred for mental health To test associations of referral status and de-
mographic variables with DOF item ratings for
or special education evaluations and/or services.
classroom observations, we used a multivariate
We recognize that clinical referral is not an infal-
analysis of covariance (MANCOVA) to analyze
lible criterion of need for help. Some children in
DOF item ratings obtained by the matched samples
our referred samples may not have needed profes-
shown in Table 8-1. For classroom observations,
sional help, while others in our “control” sample
the MANCOVA design was 2 (referred vs. con-
may have needed help. However, actual referral
trols) x 2 (boys vs. girls), with ethnicity (non-Latino
status is as ecologically valid as any other practi-
White vs. Other) as a covariate. For recess obser-
cal alternative for testing criterion-related valid-
vations, we used a 2 (referred vs. controls) x 2 (boys
ity.
vs. girls) MANOVA, with no covariate because
Matched Referred and Control Samples ethnicity for that sample was 100% non-Latino
White. For each of the 88 DOF items, we aver-
To test the criterion-related validity of DOF aged 0-1-2-3 ratings across 2 to 4 DOFs separately
items and scale scores, we used matched samples for each referred child and each control child. To
of clinically referred children and randomly se- create equal sample sizes for referred and control
8. Validity of the DOF 99
Table 8-1
Characteristics of Matched Samples of Referred and Control Children
Boys Girls Total
Control children 179 85 264
Total 291 139 430
Recess Observations
Control children 174 74 248
Total 261 111 372
Ethnicity for Classroom Samplea

Mixed or Other 4.0%
a
Percentages for N = 425 for classroom observations; ethnicity for recess observations was 100% non-
Latino White.
children, we computed the mean of the averaged tus, gender, and ethnicity (p < .01), but no signifi-
item ratings when there were two control children. cant referral status x gender interaction. The over-
For classroom observations, the dependent vari- all MANOVA for recess observations showed sig-
ables for the MANCOVA were mean item ratings nificant effects of referral status and gender (p <
for 166 referred children and 166 averaged ratings .01), but no significant interaction. The first three
for controls. For recess observations, dependent columns of Table 8-2 display significant effect sizes
variables for the MANOVA were mean item rat- (ES) of referral status, gender, and ethnicity for each
ings for 124 referred children and 124 averaged of the 88 specific problem items on the DOF, as
ratings for controls. (For recess observations, ob- obtained from subsequent ANCOVAs of classroom
servers used the 1986 version of the DOF, from observations. The last two columns of Table 8-2
which 72 items were retained on the 2009 DOF.) display significant ES of referral status and gender
Because we found no significant differences on obtained from subsequent ANOVAs of recess ob-
DOF Total Problems for younger (ages 6 to 8) ver- servations. The ES is represented by the percent of
sus older (ages 9 to 12) children, we did not in- variance (partial Eta2) uniquely accounted for by
clude age in the MANCOVA or MANOVA designs. each independent variable that was significant at p
Socioeconomic status (SES) was also not included <.05. According to Cohen’s (1988) criteria for ES
as a covariate because SES was not available for in ANCOVA/ANOVA, effects accounting for 1-
the control sample. 5.8% of variance are small; effects accounting for
5.9-13.7% of variance are medium; and effects ac-
The overall MANCOVA for classroom obser-
counting for >13.8% of variance are large. The ES
vations showed significant effects of referral sta-
Table 8-2
Percent of Variance Accounted for by Significant (p<.05) Effects of Referral Status and
Demographic Variables on DOF Item Scores in ANCOVAs
Classroom Observations Recess Observations

Ref Ref
DOF Item Status Gender Ethnicity
a b c
Statusa Genderb
1. Acts too young for age 5 — 1W, d 2d —

2. Makes odd noises 2 3B, d — — —
3. Argues 1d — 2O, d 2d 2B, d
4. Cheats — — 2O — —
5. Defiant or talks back to staff — — 4O — —
6 Brags, boasts — — — 3 —
7. Doesn’t concentrate or doesn’t pay attention
for long 6 — — — —
8. Difficulty waiting turn in activities or tasks 2 — — — —
9. Doesn’t sit still, restless, orhyperactive 4 2B, d 2W — —
10. Clings to adults or too dependent — — — 4 —
11. Confused or seems to be in a fog 1 — — — —
12. Cries — — 1O — —
13. Fidgets, including with objects 7 — — — 3G, d
14. Cruel, bullies, or mean to others — 2G, d — 3 —
15. Daydreams or gets lost in thoughts 1d — — — —
16. Difficulty following directions 3 — 2O, d — —
17. Tries to get attention of staff — — 5O 2d —
18. Destroys own things — — — — —
19. Destroys property belonging to others — — 2O, d — —
20. Disobedient — — 3O 3 —
21. Disturbs other children — — 5O 2 —
22. Doesn’t seem to feel guilty after misbehaving — — 3O 3 —
23. Doesn’t seem to listen to what is being said 6 — 3O — —
24. Eats, drinks, chews, or mouths things that
are not food, excluding junk foods (describe): 1d 2G, d — — —
25. Difficulty organizing activities or tasks 5 — — — —
26. Fails to give close attention to details 2 — — — —
27. Forgetful in activities or tasks 2 — — — —
28. Out of seat 2 — — — —
29. Gets hurt, accident prone — — — 5 2G, d
30. Gets in physical fights — — — 4 —
31. Gets teased — — — 3 —
32. Interrupts — — 2O — —
33. Impulsive or acts without thinking,
including calling out in class 5 — — — —
34. Physically isolates self from others 1d — — 2d 2B, d
35. Lies — — — — —
Table 8-2 (cont.)

Ref Ref
DOF Item Statusa Genderb Ethnicityc Statusa Genderb
36. Bites fingernails — — — — —

37. Nervous, highstrung, or tense 2 — — — —
38. Nervous movements, twitching, tics or other
unusual movements (describe): 2 — — — —
39. Loses things — — — — —
40. Too fearful or anxious 1d — — — —
41. Physically attacks people — — — 4 3B, d
42. Picks or scratches nose, skin, or other parts
of body (describe): — 4B, d 1O, d — —
43. Runs about or climbs excessively — — — — —
44. Apathetic, unmotivated, or won’t try 2 — 2O, d — —
45. Responds before instructions are completed — — — — —
46. Disrupts group activities 2 — — — —
47. Screams — — — — 4G, d
48. Secretive, keeps things to self, including
refusal to show things to teacher — — — — —
49. Avoids or is reluctant to do tasks that require
sustained mental effort 1d — — — —
50. Self-conscious or easily embarrassed — 2G, d — — —
51. Slow to respond verbally 2 — — — —
52. Shows off, clowns, or acts silly 2 — — 2d —
53. Shy or timid 1d 4G, d — — —
54. Explosive or unpredictable behavior — — — 2d —
55. Demands must be met immediately,
easily frustrated — — — — —
56. Easily distracted by external stimuli — — 5O — —
57. Stares blankly 3 — 3O 2 —
58. Speech problem (describe): — — — — —
59. Wants to quit or does quit tasks 1 — — — —
60. Yawns — — — — —
61. Strange behavior (describe): 2 — — — —
62. Stubborn, sullen, or irritable — — — — —
63. Sulks — — — — —
64. Swears or uses obscene language — — — — —
65. Talks too much 3 — — — —
66. Teases — — 2O — —
67. Temper tantrums, hot temper, or seems angry — — — — —
68. Threatens people — — 2O 2d —
69. Too concerned with neatness or cleanliness — — — — —
Table 8-2 (cont.)

Ref Ref
DOF Item Statusa Genderb Ethnicityc Statusa Genderb
70. Underactive, slow moving, tired, or lacks

energy 1 — — — —
71. Unhappy, sad, or depressed 2 — — — —
72. Unusually loud 4 — — — —
73. Overly anxious to please — — — — —
74. Whining tone of voice — — — — 3G, d
75. Withdrawn, doesn’t get involved with others 2 — — 8 —
76. Sucks thumb, fingers, hand, or arm — — — — —
77. Fails to express self clearly 2 — — — —
78. Impatient — — — — —
79. Tattles — — — 3 —
80. Repeats behavior over & over;
compulsions (describe): — — — — —
81. Easily led by peers — — 3O — —
82. Clumsy, poor motor control — — — 3 —
83. Doesn’t get along with peers — — — 3 —
84. Runs out of class (or similar setting) — — — — —
85. Behaves irresponsibly (describe): — — — — —
86. Bossy — 2G, d — 2d —
87. Complains 1d — 3O 3 3G, d
88. Afraid to make mistakes 2 — — — —
Note. For classroom observations: N = 166 referred children ages 6-11 and 264 matched “controls” in the
same classrooms. Analyses were referral status x gender MANCOVA and ANCOVAs with ethnicity
(Non-Latino White vs. Other) as a covariate. For recess observations: N = 124 referred children ages 6-
12 and 248 matched “controls” in the same setting. Analyses were referral status x gender MANOVA and
ANOVAs with no covariate. The percent of variance uniquely accounted for by each independent variable
is represented by partial Eta2. Scores were item raw scores averaged across 2 to 4 DOFs per child.
a
All significant effects of referral status reflected higher scores for referred than control children.
b
B = boys scored higher; G = girls scored higher.
c
W = Non-Latino White scored higher; O = “Other” scored higher.
d
Not significant when corrected for number of analyses.
in Table 8-2 are values for partial Eta2 rounded to Referral Status Effects. For classroom obser-
the nearest whole number. The superscript d in the vations, referred children scored significantly
table indicates effects that could be regarded as higher (p <.05) than control children on 38 of the
significant by chance when corrected for the num- 88 DOF items. Of these, eight effects could have
ber of analyses for each independent variable, us- occurred by chance, which are marked by the su-
ing a p <.05 protection level (Sakoda et al., 1954). perscript d in Table 8-2. Three DOF items showed
medium ES for referral status: 7. Doesn’t concen- Scale Scores

trate or doesn’t pay attention for long (6%); 13.
To test associations of referral status and de-
Fidgets (7%); and 23. Doesn’t seem to listen to what
mographic characteristics with DOF scale scores
is being said (6%). The remaining 35 significant
for classroom observations, we performed multiple
ES were small, accounting for 1-5% of variance.
regressions on raw scores for each scale (the de-
For recess observations, referred children scored
pendent variable) with the independent variables
significantly higher (p <.05) than control children
of referral status, gender, and ethnicity (non-Latino
on 24 of 67 DOF items. (Five items were scored 0
White versus Other). For multiple regressions of
for 100% of cases and 16 items had missing val-
recess observations, the independent variables were
ues because they were not included on the 1986
referral status and gender. To obtain raw scores for
DOF used for recess observations.) One item, 75.
each problem scale, we first averaged item ratings
Withdrawn, doesn’t get involved with others,
across 2 to 4 DOFs separately for each referred
showed a medium ES for referral status. All other
child and each control child. We then computed
ES for recess observations were small, accounting
the mean of the averaged item ratings when there
for 1-5% of variance. Seventeen items showed sig-
were two control children, as done for the
nificant ES for recess observations but not class-
MANCOVA and MANOVA of item ratings. Raw
room observations. Thus, 55 of 88 (63%) DOF
scores for the DOF scales were the sums of the
items showed significant effects of referral status
averaged ratings for items comprising each scale.
in classroom observations, recess observations, or
For classroom observations, the raw score for To-
both.
tal Problems was the sum of the averaged item rat-
Demographic Effects. The demographic vari- ings for the 88 specific problem items, plus the
ables of gender and ethnicity showed several small open-ended item for additional problems. For re-
ESs (p <.05) on DOF item ratings as follows: For cess observations, the raw score for Total Prob-
classroom observations, there were eight small ES lems was the sum of the averaged item ratings for
for gender, accounting for 2-4% of variance, all of the 72 specific problem items retained from the
which could be due to chance. Boys were rated 1986 DOF, plus the open-ended item for additional
higher on three DOF items and girls were rated problems. For On-task, we averaged the 0 to 10
higher on five items. There were 22 small ES for scores across 2 to 4 DOFs separately for each re-
ethnicity, accounting for 1-5% of variance. Of ferred child and each control child, and then com-
these, eight could be chance effects. Children with puted the mean of the averaged On-task scores
“Other” ethnicity (which included African Ameri- when there were two control children.
can, Latino/Hispanic, and Mixed or other ethnicity)
Table 8-3 displays ESs for associations of re-
were rated higher than non-Latino White children
ferral status and ethnicity with DOF scale scores.
on 20 DOF items, while non-Latino White chil-
The ES is the squared standardized regression co-
dren were rated higher on two items.
efficient, which reflects the percent of variance in
For recess observations, there were eight small scale scores that was uniquely accounted for by
ES for gender, accounting for 1-4% of variance, each independent variable. According to Cohen’s
all of which could be due to chance. Boys were (1988) criteria for ES in multiple regressions, ef-
rated higher on three DOF items and girls were fects accounting for 2-12% of variance are small;
rated higher on five items. (As indicated earlier, effects accounting for 13-25% of variance are me-
ethnicity was not included in analyses of recess dium; and effects accounting for >26% of variance
observations because the entire sample was non- are large. The superscript c in the table indicates
Latino White.) effects that could be regarded as significant by
chance when corrected for the number of analyses
Multiple Regression Analyses of DOF
for each independent variable, using a p <.05 pro-
Table 8-3
Percent of Variance Accounted for by Significant (p <.05) Effects of Referral Status and
Ethnicity on DOF Scale Scores in Multiple Regressions
DOF Scale Referral Statusa Ethnicityb
Sluggish Cognitive Tempo 6c 1O, c
Immature/Withdrawn 6 —
Attention Problems 8 —
Intrusive 4c 3O
Oppositional 7 3O
Total Problems-Classroom 13 2O
DSM-Oriented Scales
Attention Deficit/Hyperactivity Problems 10 1O, c
Inattention subscale 8 2O
Hyperactivity-Impulsivity subscale 9 —
On-task 8 6W
Recess Observations
Aggressive Behavior 10 N/A
Total Problems-Recess 26 N/A
Note. For classroom observations: N = 166 referred children ages 6-11 and 264 matched “controls” in the
same classrooms. For recess observations: N = 124 referred children ages 6-12 and 248 matched “controls”
in the same setting. For classroom observations, analyses were multiple regressions of raw scale scores
on referral status, gender, and ethnicity. For recess observations, analyses were multiple regressions of
raw scale scores on referral status and gender. Percent of variance is represented by the squared standardized
regression weight for each independent variable. There were no significant gender effects on any DOF
scale.
a
Referred children scored significantly (p <.05) higher than control children on all problem scales; control
children scored significantly (p <.05) higher than referred children on On-task.
b
O = “Other” scored higher than non-Latino White; W = non-Latino White scored higher than “Other”.
c
Not significant when corrected for the number of analyses.
tection level (Sakoda et al., 1954). higher than referred children on On-task (8% of
variance). Of the 12 significant ES for referral sta-
Referral Status Effects. Referral status effects
tus, two could have occurred by chance. DOF To-
outweighed demographic effects on all DOF prob-
tal Problems-Recess showed a large ES, account-
lem scales. Referred children scored significantly
ing for 26% variance. DOF Total Problems-Class-
(p <.05) higher than control children on all DOF
room showed a medium ES, accounting for 13%
problem scales, accounting for 4 to 26% of vari-
of variance. All other ES were small according to
ance. Control children scored significantly (p <.05)
Cohen’s (1988) criteria. After the two DOF Total
Problems scales, the next highest ES were for the
DSM-oriented Attention Deficit/Hyperactivity
Mean Scale Scores for Referred and within each set of discriminant analyses.
Control Children Discriminant analyses selectively weight pre-
Table 8-4 displays the mean raw scores and stan- dictors to maximize their collective associations
dard deviations obtained by referred and control with the criterion groups being analyzed. The
children on each DOF scale, derived from weighting process makes use of characteristics of
MANCOVA and ANOVA. MANCOVAs were the sample that may differ from other samples. To
modeled on the regression analyses, treating refer- avoid overestimating the accuracy of the classifi-
ral status and gender as between subject measures cation obtained by discriminant analyses, it is nec-
and ethnicity as a covariate. These included a 2 essary to correct for shrinkage in associations that
(referred vs. control) x 2 (boys vs. girls) would occur when discriminant weights derived
MANCOVA on raw scale scores for the five DOF in one sample are applied to a new sample. To cor-
syndromes, followed by univariate 2 x 2 ANOVAs rect for shrinkage, we employed a “jackknife”
on scores for each syndrome scale. We performed (cross-validation) procedure whereby discriminant
a similar 2 x 2 MANCOVA, followed by univariate functions are computed with a different child’s data
ANOVAs, on the Inattention and Hyperactivity-Im- excluded (“held out”) of the sample each time. Each
pulsivity subscales, and 2 x 2 univariate ANOVAs discriminant function is then cross-validated by
on the Attention Deficit/Hyperactivity Problems testing the accuracy of its predictions for the child
scale, Total Problems-Classroom, and On-task who was held out when the discriminant function
scores. ANOVAs for the Aggressive Behavior syn- was computed. Finally, the percentage of correct
drome and Total Problems-Recess were also mod- predictions is averaged across all the held-out chil-
eled on the multiple regressions, treating referral dren.
status and gender as between subject measures with In addition to discriminant analyses of sets of
no covariate. The results mirrored those of the mul- DOF scales, we obtained cross-validated percent-
tiple regressions. Referred children scored signifi- ages of cases correctly classified by the DOF Total
cantly (p <.05) higher than control children on all Problems-Classroom, Attention Deficit/Hyperac-
DOF problem scales, while control children scored tivity Problems scale, Aggressive Behavior, and To-
significantly (p <.05) higher than referred children tal Problems-Recess as single predictors.
on On-task. There were no significant gender ef-
fects. For each set of predictors, Table 8-5 shows the
cross-validated percentages of children correctly
Discriminant Analyses of DOF Scale classified as referred (sensitivity) versus controls
Scores (specificity). The weighted combination of the five
We used discriminant analyses to determine syndrome scales correctly classified 56% of re-
which weighted combinations of DOF scale scores ferred children and 74% of control children, with
best differentiated referred from control children an overall correct classification rate of 65% and
for the matched samples shown in Table 8-1. When overall misclassification rate of 35%. An additional
there were two matched control children for a re- forward stepwise discriminant analysis indicated
ferred child, we averaged item ratings across the that all but the Intrusive syndrome were signifi-
two controls, as done in previous analyses. For cant (p <.05) predictors in the discriminant func-
classroom observations, we performed one dis- tion. The Sluggish Cognitive Tempo syndrome was
criminant analysis using the five DOF syndromes the strongest predictor (standardized canonical co-
as candidate predictors and another discriminant efficient = .478), with the other three syndromes
analysis using the DSM-oriented Inattention and contributing about equally to the discriminant func-
Hyperactivity-Impulsivity subscales as candidate tion (standardized canonical coefficients = .340
predictors. Predictors were entered simultaneously to .389). The DOF Total Problems-Classroom score
alone showed similar classification rates: 54% of
Table 8-4
Means and Standard Deviations of DOF Raw Scale Scores for Referred and Control Children
Referreda Averaged Controlsb

DOF Scale Mean SD Mean SD
Sluggish Cognitive Tempo 1.2 1.3 0.7 0.8
Immature/Withdrawn 0.8 1.8 0.2 0.4
Attention Problems 5.6 3.0 4.1 2.6
Intrusive 1.9 3.0 1.0 1.4
Oppositional 1.9 1.9 0.9 1.8
Total Problems-Classroom 13.6 8.8 8.0 5.4
DSM-Oriented Scales
Attention Deficit/Hyperactivity Problems 8.2 5.7 5.0 3.6
Inattention subscale 2.8 2.6 1.5 1.8
Hyperactivity-Impulsivity subscale 5.4 3.7 3.5 2.4
On-task 8.0 1.7 8.9 1.6
Recess Observations
Aggressive Behavior 1.4 1.6 0.5 0.6
Total Problems-Recess 4.2 3.2 1.3 1.4
Note. For classroom observations: N = 166 referred children ages 6-11 and averaged ratings for 264
matched “controls” in the same classrooms. For recess observations: N = 124 referred children ages 6-12
and averaged ratings for 248 matched “controls” in the same setting. Problem scale scores were the sums
of averaged item ratings.
a
Referred children scored significantly (p <.05) higher than control children on all problem scales.
b
Control children scored significantly (p <.05) higher than referred children on On-task.
referred children and 75% of control children cor- ward stepwise discriminant analysis indicated that
rectly classified, with an overall correct classifi- both the Inattention and Hyperactivity-Impulsiv-
cation rate of 65% and overall misclassification ity subscales were significant (p <.05) predic-
rate of 35%. tors, with Hyperactivity-Impulsivity contributing
slightly more (standardized canonical coefficient
The weighted combination of the DSM-oriented
= .610) than Inattention (standardized canonical
Inattention and Hyperactivity-Impulsivity
coefficients = .508). The Attention Deficit/Hyper-
subscales correctly classified 53% of referred chil-
activity Problems scale alone showed the same
dren and 73% of control children, with an overall
overall correct classification rate of 63% and
correct classification rate of 63% and overall
misclassi-fication rate of 37%.
misclassification rate of 37%. An additional for-
Table 8-5
Cross-Validated Percents of Cases Correctly Classified as Referred vs. Control
Overall
Averaged Correct
Candidate Predictors Referred Controls Classification
Five syndrome scales 56% 74% 65%
Total Problems-Classroom 54% 75% 65%
DSM-oriented Inatttention & Hyperactivity-
Impulsivity subscales 53% 73% 63%
DSM-oriented Attention Deficit/Hyperactivity
Problems 54% 72% 63%
Recess Observations
Aggressive Behavior 51% 80% 65%
Total Problems-Recess 64% 90% 77%
Note. For classroom observations: N = 166 referred children ages 6-11 and averaged ratings for 264
matched “controls” in the same classrooms. For recess observations: N = 124 referred children ages 6-12
and averaged ratings for 248 matched “controls” in the same setting. Scale scores were the sums of
averaged item ratings for referred and control children.
The Aggressive Behavior syndrome, based on Criterion-related validity was supported by the
recess observations, correctly classified 51% of re- ability of the DOF items and scale scores to dis-
ferred children and 80% of control children, with criminate between matched samples of referred and
an overall correct classification rate of 65% and nonreferred control children. Referred children
overall misclassi-fication rate of 35%. Total Prob- scored significantly higher on 55 of the 88 DOF
lems-Recess alone produced the best classification items for observations in classrooms and/or recess,
rates, correctly classifying 64% of referred chil- with referral status accounting for 1 to 7% of vari-
dren and 90% of control children, with an overall ance. Demographic variables of gender and
correct classification rate of 77% and overall ethnicity showed small effects on item scores. Re-
misclassification rate of 23%. ferred children scored significantly higher than
nonreferred children on all DOF problem scales,
SUMMARY accounting for 4 to 26% of variance. DOF Total
Problems accounted for 13% of variance in class-
This chapter presented several kinds of evidence room observations and 26% of variance in recess
for the validity of the 2009 DOF items and scale observations. Control children scored significantly
scores. Content validity of the DOF items is based higher on DOF On-task, accounting for 8% of vari-
on their derivation from similar items of the CBCL/ ance.
6-18 and TRF, most of which significantly dis-
criminated referred from nonreferred children A weighted combination of the five DOF syn-
(Achenbach & Rescorla, 2001). dromes correctly classified 56% of referred chil-
dren (sensitivity) and 74% of nonreferred children
(specificity). A weighted combination of the DSM-
oriented Inattention and Hyperactivity-Impulsiv-
ity subscales showed only slightly lower sensitiv-
Chapter 9
Answers to Frequently Asked Questions
This chapter answers questions that may arise tions of control children are recommended but
about the DOF. The questions are grouped under optional.
headings pertaining to the DOF form and profile,
3. Why are control children included on
applications of the DOF, relations to other assess-
the DOF?
ment procedures, coordinating data from multiple
sources, and relations to DSM and special educa- Answer: Observations of control children pro-
tion classifications. If you have a question that is vide a standard for evaluating the behavior of the
not answered under one heading, look under the identified child in relation to peers in the same situ-
other headings. The Table of Contents and Index ation. Observers do not need to know the names of
may also help you find answers to questions not control children. Observers should select a control
listed in this chapter. child of the same gender who is situated far enough
away so as not to influence the behavior of the iden-
FEATURES OF THE DOF tified child, if possible. The DOF Module for com-
puter scoring allows up to two control children per
1. What is the DOF?
identified child to score one DOF Profile.
Answer: The Direct Observation Form (DOF) 4. What if it is not possible to match the
is a standardized form for rating observations of gender of a control child to the gender
6- to-11-year-old children in school classrooms, at of the identified child?
recess, and in other group settings. During a 10-
minute period, the observer uses the DOF to write Answer: Although rare, this may occur in some
a narrative description of the child’s behavior, af- settings or special programs where one gender
fect, and interactions. The observer also rates the vastly outnumbers the other. We recommend
child for being on-task or off-task for 5 seconds at matching the gender of control children to the gen-
the end of each 1-minute interval. At the end of der of the identified child because the DOF Profile
the 10-minute observation, the observer rates the has separate norms for boys and girls. However, if
child on 89 problem items using a 0-1-2-3 scale. the gender of a control child is different from the
Chapter 2 of this Manual provides detailed instruc- gender of the identified child, the DOF Module
tions for using the DOF and rating the DOF items. for computer scoring weights the item scores of
that control child in order to derive scale scores
2. How many 10-minute observations are that approximate scores for the correct gender. If
necessary to score the DOF? the identified child is a boy and the control child is
Answer: The DOF Module for computer scor- a girl, then item scores of the control child are ad-
ing requires at least two DOFs (i.e., two 10-minute justed upward. If the identified child is a girl and
observations) and allows up to six DOFs per child the control child is a boy, item scores of the con-
to score one DOF Profile. Because children’s be- trol child are adjusted downward. These adjust-
havior can vary from one occasion to another, we ments are done only for DOF Profiles for class-
recommend 3 to 6 observations of the identified room observations, because there were no signifi-
child on at least two different days. Observers may cant gender differences for recess observations, as
also include 1 to 6 DOFs for each of two control reported in Chapter 8.
children matched to the identified child. Observa-
108
9. Answers to Frequently Asked Questions 109
5. What is the DOF Profile? Answer: The “Other Problems” are problem
items that were not associated strongly to quality
Answer: The DOF Profile is a computer-scored
for any of the syndromes derived from factor analy-
display of item and scale scores from classroom
ses. Therefore, they are not included in the syn-
observations and/or recess observations. The user
drome scales. However, each of the “Other Prob-
must select one setting (class or recess) for com-
lems” items may be important in its own right.
puter-scoring. The DOF can only be scored by com-
There is a different set of “Other Problems” for
puter because of the complexity of averaging item
DOF Profiles based on classroom observations
scores across multiple observation sessions. As
versus recess observations. The relevant “Other
discussed in Chapter 3, the DOF Profile for class-
Problems” item set is included along with items
room observations displays averaged item scores
from the syndrome scales when computing scores
plus raw scores, T scores, and percentiles for five
for Total Problems-Classroom and Total Problems-
empirically based syndrome scales, a DSM-ori-
Recess.
ented Attention Deficit/Hyperactivity Problems
scale with Inattention and Hyperactivity-Impulsiv- 8. Why doesn’t the DOF Profile display
ity subscales, Total Problems, and an On-task score. percentiles or T scores for the “Other
The DOF Profile for recess observations displays Problems”?
averaged item scores plus raw scores, T scores and
Answer: The “Other Problems” do not consti-
percentiles for an empirically based Aggressive
tute a separate scale. They are merely the items
Behavior syndrome scale and Total Problems. Pro-
that did not qualify for the syndrome scales. There
files for both settings also display averaged item
are thus no specific associations among them to
scores for “Other Problems” not scored on the syn-
warrant treating them as a separate scale. How-
drome scales. The DOF Profile has separate norms
ever, each of these problems may be important, and
for boys and girls ages 6 to 11.
they are all included in computing Total Problems
6. What are the DOF syndrome scales? scores.
Answer: As detailed in Chapter 6, the DOF syn- 9. How is the open-ended item 89
drome scales were derived by factor analyzing av- figured into the scale scores?
eraged scores for the DOF items to identify pat-
Answer: If the observer enters any problems in
terns of co-occurring problems. Each of the five
item 89, the highest rating that the observer gave
syndrome scales consists of a set of problem items
to any of these problems (i.e., 1, 2, or 3) is added
that were found to co-occur. The 0-1-2-3 ratings
to the Total Problems score.
on each problem item are averaged across multiple
10-minute observations. A child’s total score on a 10. What are the DSM-oriented Attention
syndrome scale is the sum of the averaged ratings Deficit/Hyperactivity Problems scale
on the items comprising each scale. For total scores and its Inattention and Hyperactivity-
on each syndrome scale, the DOF Profile indi- Impulsivity subscales?
cates standard scores (T scores) and percentiles
Answer: The DSM-oriented Attention Deficit/
based on a normative sample of boys and girls ages
Hyperactivity Problems scale and its Inattention
6-11. Classroom observations are scored on the
and Hyperactivity-Impulsivity subscales consist of
following five syndrome scales: Sluggish Cogni-
items that are consistent with the DSM-IV and
tive Tempo, Immature/Withdrawn, Attention Prob-
DSM-IV-TR diagnostic categories of Attention
lems, Intrusive, and Oppositional. For recess ob-
Deficit/Hyperactivity Disorder (ADHD). The In-
servations, the one syndrome is designated as Ag-
attention subscale has 10 items and the Hyperac-
gressive Behavior.
tivity-Impulsivity subscale has 13 items. The At-
7. What are the “Other Problems”?
110 9. Answers to Frequently Asked Questions
tention Deficit/Hyperactivity Problems total score cal range, while T scores of 60 to 63 (84th to 90th
is the sum of ratings on all 23 items. Twelve of the percentiles) are in the borderline range. For cer-
23 items were similar to CBCL/6-18 and/or TRF tain purposes, such as screening to identify chil-
items that an international panel of experts identi- dren who are at risk for problems, users may choose
fied as being very consistent with the DSM-IV to use lower cutpoints on the problem scales than
symptoms of ADHD, as explained in Chapter 6. those that demarcate the borderline or clinical
Eleven additional items were added to the DOF to range.
tap ADHD symptoms that were not covered by the
13. Should extremely low scores on the
other items.
DOF problem scales be considered
11. Should raw scores, percentiles, or T deviant?
scores be used to report results for
Answer: No. Extremely low scores on the prob-
DOF scales?
lem scales merely reflect an absence of problems
Answer: Percentiles and T scores are usually observed for a particular time frame and setting.
preferable to raw scale scores for reporting find- Because children may manifest problems that are
ings for individual children, because they indicate concentrated in particular areas, it is not unusual
degrees of deviance on each scale in comparison for profiles to have high scores on some scales but
with the normative sample for the child’s gender. low scores on other scales. Low scores on the prob-
However, for statistical analyses of scale scores, lem scales do not necessarily mean that problems
raw scale scores should be used because the T are absent in other contexts, such as the home or
scores on all scales, expect Total Problems-Class- other settings in school.
room, are truncated at 50, as explained in Chapter
14. How should DOF On-task scores be
6. If boys and girls are combined in the same sta-
interpreted?
tistical analyses, it may be useful to assign stan-
dard scores separately for each gender so that scores Answer: As explained in Chapter 2, observers
for each gender have the same mean and the same rate on-task behavior by marking boxes for on-task
standard deviation. On the other hand, if the sta- or off-task that represent the last 5 seconds of each
tistical analyses are intended to test gender differ- 1-minute interval over each 10-minute observation
ences in raw scale scores, then the scores should period. Total On-task scores can thus range from 0
not be standardized by gender. to 10 for each observation. The DOF Module for
computer scoring averages On-task scores across
12. How should high scores on the DOF
multiple observation sessions. Low On-task scores
problem scales be interpreted?
warrant clinical concern, in contrast to high scores
Answer: The DOF Profile shows the letter B for the problem scales. The DOF Profile displays
next to T scores that fall in the borderline clinical mean raw scores, T scores and percentiles for On-
range and the letter C next to T scores that fall in task. T scores <31 (<3rd percentile) are considered
the clinical range for each of the problem scales. to be in the clinical range, while T scores of 31 to
Scores in the borderline range warrant concern but 35 (3rd to 7th percentiles) are in the borderline range.
are not as clearly deviant at those in the clinical The On-task mean raw score can also be translated
range. For the syndrome scales and DSM-oriented into a percentage of on-task behavior, as done in
scales, T scores >69 (>97th percentile) are consid- the DOF Narrative Report.
ered to be in the clinical range, while T scores of
15. How are clinical interpretations of
65 to 69 (93rd to the 97th percentiles) are consid-
the DOF Profile made?
ered to be in the borderline range. For Total Prob-
lems, T scores >63 (>90th percentile) are in the clini- Answer: The DOF is designed to provide a stan-
dardized description of a child’s behavioral and than age 11. However, the farther the departure
emotional problems observed in school classrooms, from the 6-11-year-old norms, the less appropriate
at recess, or in other comparable group settings. the percentiles and T scores may be for interpret-
The T scores and percentiles for the problem scales ing a child’s DOF Profile. Researchers who plan
and On-task provide a basis for comparing an in- to analyze only DOF raw scores (not T scores) may
dividual child to normative samples of peers of the choose to use the DOF for observations of chil-
same gender. The scale scores on the DOF Profile dren outside of the 6 - 11 age range.
can also be compared with analogous scale scores
4. Can the DOF be used to assess child-
on CBCL/6-18, TRF, YSR, SCICA, and TOF pro-
ren who have physical or mental dis-
files to identify similarities and differences between
abilities?
problems reported by different informants in dif-
ferent situations. Information from all available Answer: The DOF provides a standardized de-
sources should then be integrated to form a com- scription of observed behavior. If a child has a
prehensive picture of the child’s functioning, as il- physical or mental disability, then observed behav-
lustrated in case examples in Chapter 5. ior must be interpreted with this in mind. How-
ever, children with physical and mental disabili-
APPLICATIONS OF THE DOF ties were excluded from the DOF normative
sample. T scores and percentiles on the DOF Pro-
1. Who should complete the DOF? file therefore provide comparisons only to peers
Answer: The DOF can be completed by any- without disabilities.
one who has sufficient understanding of the re-
quired observation and rating procedures, as de- RELATIONS TO OTHER ASSESSMENT
scribed in Chapter 2. Observers can be teachers’ PROCEDURES
aides or other school paraprofessionals, under-
1. Can other procedures for assessing
graduate or graduate students, and research assis-
behavioral and emotional problems be
tants, as well as professionals in education, school
used with the DOF?
psychology, clinical psychology, and related disci-
plines. Paraprofessionals and students should use Answer: The DOF obtains samples of behav-
the DOF under the supervision of a qualified pro- ioral and emotional problems observed during
fessional who has knowledge of the theory and multiple 10-minute observations of children in
methodology of standardized assessment. Chapter group settings. As explained in Chapter 2, users
4 provides guidelines for training DOF observers can include up to six DOFs in one “observation
and conducting school observations. set” for each identified child and up to six DOFs
for each of two control children matched to the
2. When should the DOF be completed?
identified child. Scores from one “observation set”
Answer: The DOF should be completed imme- can then be compared with scores from another
diately after the 10-minute observation for which “observation set” for the same identified child. For
it was used. Observers should complete a separate example, you might choose to compare DOF scores
DOF for each 10-minute observation. for observations completed at the beginning of the
school year (e.g., observation set = Fall 2009) and
3. Can the DOF be used below age 6 or
a second set of observations completed at the end
above age 11?
of the school year (e.g., observation set = Spring
Answer: The 2009 version of the DOF was 2010). Or you might compare sets of observations
normed for ages 6 to 11. The DOF may be appro- done before and after an intervention. By compar-
priate for younger children in group settings, such ing DOF scores obtained from observation sets for
as Kindergarten or preschool, and children older different time periods, users can distinguish be-
112 9. Answers to Frequently Asked Questions
tween problems that are quite consistent across Answer: Discrepancies between findings from
time versus those that are more variable. Users may different assessment procedures can be as infor-
also choose to compare observation sets for differ- mative as similarities. For example, if the DOF was
ent situations, such as math class versus reading used to obtain observations of a child in a particu-
class. In addition, the DOF scores can be compared lar school classroom, then DOF scale scores could
with the scores on analogous scales of other be compared to analogous scale scores on the TRF
ASEBA forms, which include counterparts of many completed by the child’s teacher in the same class-
DOF items and scales. Assessment data obtained room and the CBCL/6-18 completed by one or both
from interviews of children, parents, and teachers, parents. DOF scale scores could then be compared
medical exams, cognitive and achievement tests, with scores on analogous scales of the CBCL/6-
and behavioral and family assessment can also be 18 and TRF to see if the child’s observed behavior
compared with DOF data to provide a comprehen- was different from behavior reported by the teacher
sive basis for assessment, as discussed in Chap- or the parents. If a child has more than one teacher,
ters 1 and 5. observations with the DOF could be done in both
teachers’ classrooms and then scored on separate
2. How do DOF scales compare with
DOF Profiles. If the DOF scores from the two class-
scales scored from other ASEBA
room settings differed on certain scales, then fur-
forms?
ther observations and interviews with the teachers
Answer: Although the scales of other ASEBA would be appropriate to determine why the child
forms were derived independently from item pools, may behave differently in the two classrooms.
samples of participants, and raters that differ from Comparisons of DOF scores with test session ob-
those of the DOF, the following DOF scales have servations scored on the TOF and interviewer rat-
counterparts on profiles scored from most ASEBA ings scored on the SCICA may also help to docu-
forms for children and youth: Sluggish Cognitive ment discrepancies and consistencies between
Tempo (similar to 2007 CBCL/6-18 and TRF Slug- problems observed in group settings, such as school
gish Cognitive Tempo), Immature/Withdrawn classrooms, versus elsewhere.
(similar to CBCL/6-18 and TRF Withdrawn/De-
pressed), Attention Problems (similar to CBCL/6- RELATIONS TO DSM AND SPECIAL
18 and TRF Attention Problems), Oppositional EDUCATION CLASSIFICATIONS
(similar to TOF Oppositional and CBCL/6-18 and
TRF Aggressive Behavior), the DSM-oriented At- 1. How can the DOF contribute to an
tention Deficit/Hyperactivity Problems scale and ADHD diagnosis and other DSM
Inattention and Hyperactivity-Impulsivity diagnoses?
subscales (similar to CBCL/6-18, TRF, and TOF Answer: Because the DSM criteria for behav-
DSM-oriented Attention Deficit/Hyperactivity ioral and emotional disorders are not defined in
Problems scale and Inattention and Hyperactivity- terms of specific assessment procedures, scores on
Impulsivity subscales). DOF Total Problem scores the DOF items and scales may be combined with
can also be compared to Total Problems on other other kinds of data in judging whether the criteria
ASEBA forms. for DSM diagnoses are met. The 23 items of the
3. What if there are differences between DOF DSM-oriented Attention Deficit/Hyperactiv-
a child’s pattern of problems on the ity Problems scale have fairly clear counterparts
DOF Profile versus the child’s among the symptom criteria for ADHD as defined
patterns of problems on other ASEBA by DSM-IV (American Psychiatric Association,
profiles? 1994) and DSM-IV-TR (American Psychiatric
Association, 2000). High scores on the DOF At-
tention Problems syndrome may also suggest that time, to formulate DSM diagnoses, DOF results
DSM criteria for ADHD should be considered. must be combined with other assessment data, in-
High scores on the DOF Oppositional syndrome cluding parent reports, teacher reports, and test
may suggest that DSM criteria for Oppositional results as appropriate. The CBCL/6-18, TRF, and
Defiant Disorder should be considered. At the same TOF also have an Attention Problems syndrome
and a DSM-oriented Attention Deficit/Hyperactiv-
ity Problems scale, as well as Inattention and Hy-
peractivity-Impulsivity subscales, that can contrib-
ute useful information for making DSM diagnoses
of ADHD.
2. How can the DOF be used in
determining eligibility for special
education according to disability
categories, such as those defined by
the 2004 Individuals with Disabilities
Education Improvement Act (IDEA
2004)?
Answer: Categories of educational disabilities
are not defined in terms of specific tests and other
assessment procedures. However, IDEA 2004 does
require direct observations of children in school
as part of a comprehensive assessment for deter-
mining eligibility for special education services.
The DOF provides a structured format for conduct-
ing observations and produces a standardized pro-
file that documents the results of the observations.
The DOF, along with other ASEBA forms, can thus
provide important quantitative data for judging
whether children have the kinds of problems for
which particular special education services are in-
tended, as discussed in Chapter 5.
References
Abramowitz, M., & Stegun, I.A. (1968). Handbook Achenbach, T. M., & Edelbrock, C. (1987). Manual
of mathematical functions. Washington, DC: National for the Youth Self-Report and Profile. Burlington, VT:
Bureau of Standards. University of Vermont, Department of Psychiatry.
Achenbach, T. M. (1966). The classification of Achenbach, T. M., & Lewis, M. (1971). A proposed
children’s psychiatric symptoms: A factor-analytic model for clinical research and its application to en-
study. Psychological Monographs, 80, (No. 615). copresis and enuresis. Journal of the American Acad-
emy of Child Psychiatry, 10, 535-554.
Achenbach, T. M. (1981). The Direct Observation
Form of the Child Behavior Checklist (rev. ed.) Achenbach, T. M., & McConaughy, S. H. (1997).
Burlington, VT: University of Vermont, Department Empirically based assessment of child and adoles-
of Psychiatry. cent psychopathology: Practical applications. Thou-
sand Oaks, CA: Sage.
Achenbach, T. M. (1986). The Direct Observation
Form of the Child Behavior Checklist (rev. ed.) Achenbach, T. M., & Rescorla, L. A. (2000). Manual
Burlington, VT: University of Vermont, Department for the ASEBA Preschool Forms & Profiles.
of Psychiatry. Burlington, VT: University of Vermont, Department
of Psychiatry.
Achenbach, T.M. (1991a). Manual for the Child Be-
havior Checklist/4-18 and 1991Profile. Burlington, Achenbach, T. M., & Rescorla, L. A. (2001). Manual
VT: University of Vermont, Department of Psychia- for the ASEBA School Age Forms & Profiles.
try. Burlington, VT: University of Vermont, Research
Achenbach, T.M.(1991b). Manual for the Teacher’s
Report Form and 1991Profile. Burlington, VT: Uni- American Education Research Association, Ameri-
versity of Vermont, Department of Psychiatry. can Psychological Association, & National Council
on Measurement in Education. (1999). Standards
Achenbach, T.M. (1991c). Manual for the Youth Self- for educational and psychological testing. Washing-
Report and 1991 Profile. Burlington, VT: University ton, D.C.: American Education Research Association.
of Vermont, Department of Psychiatry.
American Psychiatric Association. (1994). Diagnos-
Achenbach, T. M., & Edelbrock, C. (1983). Manual tic and statistical manual of mental disorders-fourth
for the Child Behavior Checklist/4-18 and Revised edition. Washington, DC: Author.
Child Behavior Profile. Burlington, VT: University
of Vermont, Department of Psychiatry. American Psychiatric Association. (2000). Diagnos-
tic and statistical manual of mental disorders-fourth
Achenbach, T. M., & Edelbrock, C. (1986). Manual edition-text revision. Washington, DC: Author.
for the Teacher’s Report Form and Teacher Version
of the Child Behavior Profile. Burlington, VT: Uni- Barkley, R. A. (2006). Attention deficit hyperactivity
versity of Vermont, Department of Psychiatry. disorder: A handbook for diagnosis and treatment
(3rd ed.). New York: Guilford Press.
114
References 115
Browne, N. W., & Cudeck, R. (1993). Alternative Academy of Child and Adolescent Psychiatry, 33, 529-
ways of assessing model fit. In K. A. Bollen, & J. S. 539.
Long (Eds.), Testing structural equation models (pp.
136-162). Newbury Park, CA: Sage. Individuals with Disabilities Education Improvement
Act of 2004. Public law. No. 108-446, 118 Stat. 2647
Cohen, J. (1988). Statistical power analysis for the (2004). [Amending 20 U.S.C. §§ 1400 et seq.]
behavioral sciences (2nd ed.). New York: Academic Joint Committee on Testing Practices. (2004). Code
Press. of fair testing practices in education. Washington,
D.C.: American Psychological Association.
Chafouleas, S. M., Christ, T. J., Riley-Tillman, T. C.,
Briesch, A. M., & Chanese, J. A. M. (2007). Kaufman, A.S., & Kaufman, N.L. (1983). Kaufman
Generalizability and dependability of direct behav- Assessment Battery for Children. Circle Pines, MN:
ior ratings to assess social behavior of preschoolers. American Guidance Service.
School Psychology Review, 36, 63-79.
Leff, S. S., & Lakin, R. (2005). Playground-based
Crocker, L., & Algina, J. (1986). Introduction to clas- observational systems: A review and implications for
sical and modern test theory. New York: Holt, practitioners and researchers, School Psychology
Rinehart, Winston. Review, 34, 475-489.
Cronbach, L.J. (1951). Coefficient alpha and the in- Loehlin, J. C. (1998). Latent variable models: An
ternal structure of tests. Psychometrika, 16, 297-334. introduction to factor, path, and structural analysis
(3rd ed.). Mahwah, NJ: Lawrence Erlbaum Associ-
DuPaul, G.J., & Stoner, G. (2003). ADHD in the ates.
schools. (2nd ed.). New York: Guilford Press.
McConaughy, S.H. (2005). Clinical interviews for
Gove, P. (Ed.). (1971). Webster’s third new interna- children and adolescents: Assessment to intervention.
tional dictionary of the English language. Springfield, New York: Guilford Press.
MA: Merriam.
McConaughy, S.H., & Achenbach, T.M. (2003). Di-
Guze, S. (1978). Validating criteria for psychiatric direct Observation Form (Research Ed.). Burlington,
agnosis: The Washington University approach. In VT: University of Vermont, Research Center for Chil-
M.S. Akiskal & W.L. Webb (Eds.), Psychiatric diag- dren, Youth, & Families.
nosis: Exploration of biological predictors (pp. 49-
59). New York: Spectrum. McConaughy, S. H., & Achenbach, T. M. (2001)
Manual for the Semistructured Clinical Interview for
Hintze, J. M. (2005). Psychometrics of direct obser- Children and Adolescents-Second Edition.
vation. School Psychology Review, 34, 507-519. Burlington, VT: University of Vermont, Research
Hintze, J. M., & Matthews, W. J. (2004). The
generalizability of systematic direct observations McConaughy, S.H., & Achenbach, T.M. (2004).
across time and settings: A preliminary investigation Manual for the Test Observation Form for Ages 2-
of the psycholometrics of behavioral observation. 18. Burlington, VT: University of Vermont, Research
School Psychology Review, 33, 258-270. Center for Children, Youth, & Families.
Frick, P. J., Lahey, B. B., Applegate, B., Kerdyck, L., McConaughy, S. H., Achenbach, T. M. & Gent, C. L.
Ollendick, T., Hynd, G. W., et al. (1994). DSM-IV (1988). Multiaxial empirically-based assessment:
field trials for the disruptive behavior disorders: Parent, teacher, observational, cognitive, and person-
Symptom utility estimates. Journal of the American ality correlates of Child Behavior Profiles for 6-11
116 References
year-old boys. Journal of Abnormal Child Psychol- chology, 11, 521-530.

ogy, 16, 485-509.
Rehabilitation Act of 1973, Section 504. (1973). 29
McConaughy, S. H., Kay, P. J., & Fitzgerald, M. U.S.C. § 706, 1996; § 504 [ 30 C.F.R Part 104].
(1999). The Achieving, Behaving, Caring Project for
preventing ED: Two-year outcomes. Journal of Emo- Roid, G. H. (2003). Stanford-Binet Intelligence Scales,
tional and Behavioral Disorders, 7, 224-239. Fifth Edition. Itasca, IL: Riverside Publishing.
McConaughy, S. H., Kay, P. J., & Fitzgerald, M. Sakoda, J. M., Cohen, B. H., & Beall, G. (1954). Test
(1998). Preventing SED though parent-teacher action of significance for a series of statistical tests. Psy-
research and social skills instruction: First-year out- chological Bulletin, 51, 172-175.
comes. Journal of Emotional and Behavioral Disor-
Sattler, J. M. (2008). Assessment of children: Cogni-
ders, 6, 81-93.
tive applications (5th ed.). La Mesa, CA: Author.
McConaughy, S.H., Mattison, R.E., & Peterson, R.L.
Sattler, J. M., & Hoge, R. D. (2006). Assessment of
(1994). Behavioral/emotional problems of children
children: Behavioral, social, and clinical foundations
with serious emotional disturbance and learning dis-
(5th ed.). La Mesa, CA: Jerome Sattler, Publisher, Inc.
abilities. School Psychology Review, 23, 81-98.
Shapiro, E. S., & Heick, P. (2004). School psycholo-
McConaughy, S.H., & Ritter, D. (2008). Best prac-
gist assessment practices in the evaluation of students
tices in multimethod assessment of emotional and
referred for social/behavioral/emotional problems.
behavioral disorders. In A. Thomas & J. Grimes
Psychology in the Schools, 41, 551-561.
(Eds.), Best practices in school psychology-V, Vol-
ume 3, (pp. 697-716), Bethesda, MD: National As- Shapiro, E. S., & Kratochwill, T. R. (Eds.) (2000).
sociation of School Psychologists. Introduction: Conducting a multidimensional behav-
ioral assessment. In E.S. Shapiro & T.R. Kratochwill
McConaughy, S.H., & Skiba, R. (1993). Comorbidity
(Eds.), Conducting school-based assessments of child
of externalizing and internalizing problems. School
and adolescent behavior (pp. 1-20). New York:
Psychology Review, 22, 419-434.
Guilford.
Muthén, L.K., & Muthén, B.O. (2001). Mplus User’s
Skansgaard, E. P., & Burns, G. L. (1998). Compari-
Guide (Version 2). Los Angeles, CA: Muthén &
son of DSM-IV ADHD combined and predominantly
Muthén.
inattention types: Correspondence between teacher
Naglieri, J. A., & Das, J. P. (1997). Cognitive Assess- ratings and direct observations of inattentive, hyper-
ment System. Itasca, IL: Riverside Publishing. activity/impulsivity, slow cognitive tempo, opposi-
tional defiant, and overt conduct disorder symptoms.
Petersen, N. S., Kolen, M. J., & Hoover, H. D. (1993). Child & Family Behavior Therapy, 20, 1-14.
Scaling, norming, and equating. In R.L. Linn (Ed.),
Educational measurement (3rd ed.), (pp.221-262). SPSS. (2007). SPSS Base 15.1 User’s Guide. Chi-
Washington, D.C.: American Council on Education. cago, IL: SPSS.
Reed, M. L., & Edelbrock, C. (1983). Reliability and Tilly, W.D. (2008). The evolution of school psychol-
validity of the Direct Observation Form of the Child ogy to science-based practice: Problem solving and
Behavior Checklist. Journal of Abnormal Child Psy- the three-tiered model. In A. Thomas & J. Grimes
References 117
(Eds.), Best practices in school psychology-V, Vol- tings: A review of seven coding systems. School Psy-
ume 1, (pp. 17-36). Bethesda, MD: National Asso- chology Review, 34, 454-474.
ciation of School Psychologists.
Wechsler, D. (2002). Wechsler Individual Achieve-
Volpe, R.J., & McConaughy, S.H. (2005). (Guest Edi- ment Tests-Second Edition. San Antonio, TX: Psy-
tors). Systematic direct observational assessment of chological Corporation.
student behavior: Its use and interpretation in mul-
tiple settings: An introduction to the Mini-series, Wechsler, D.C. (2003). Wechsler Intelligence Scale
School Psychology Review, 34, 451-453. for Children-Fourth Edition. San Antonio, TX: Psy-
chological Corporation.
Volpe, R. J., DiPerna, J. C., Hintze, J. M., & Shapiro,
E. S. (2005). Observing students in classroom set- Wilson, M.S., & Reschly, D.J. (1996). Assessment in
school psychology training and practice. School Psy-
chology Review, 25, 9-23.
Woodcock, R.W., McGrew, K., & Mather, N. (2001).

Woodcock-Johnson III. Itasca, IL: Riverside Publish-
ing.
118
119
120 Appendix B
121
APPENDIX D
ITEMS COMPRISING THE 2009 DOF AND THE 1986 DOF
2009 DOF Items 1986 DOF Items
1. Acts too young for age 1. Acts too young for age
2. Makes odd noises 2. Makes odd noises
3. Argues 3. Argues
4. Cheats 4. Cheats
5. Defiant or talks back to staff 5. Defiant or talks back to staff
6. Brags, boasts 6. Bragging, boasting
7. Doesn’t concentrate or doesn’t pay attention for long 7. Doesn’t concentrate or doesn’t pay attention for long
8. Difficulty waiting turn in activities or tasks 8. Can’t get mind off certain thoughts; obsessions
(describe):
9. Doesn’t sit still, restless, or hyperactive 9. Doesn’t sit still, restless, or hyperactive
10. Clings to adults or too dependent 10. Clings to adults or too dependent
11. Confused or seems to be in a fog 11. Confused or seems to be in a fog
12. Cries 12. Cries
13. Fidgets, including with objects 13. Fidgets, including with objects
14. Cruel, bullies, or mean to others 14. Cruelty, bullying, or meanness
15. Daydreams or gets lost in thoughts 15. Daydreams or gets lost in thoughts
16. Difficulty following directions 16. Deliberately harms self
17. Tries to get attention of staff 17. Tries to get attention of staff
18. Destroys own things 18. Destroys own things
19. Destroys property belonging to others 19. Destroys property belonging to others
20. Disobedient 20. Disobedient
21. Disturbs other children 21. Disturbs other children
22. Doesn’t seem to feel guilty after misbehaving 22. Doesn’t seem to feel guilty after misbehaving
23. Doesn’t seem to listen to what is being said 23. Shows jealousy
24. Eats, drinks, chews, or mouths things that are not 24. Eats, drinks, chews, or mounths things that are not
food, excluding junk foods (describe): food, excluding tobacco and junk foods (describe):
25. Difficulty organizing activities or tasks 25. Shows fear of specific situations or stimuli (describe):
26. Fails to give close attention to details 26. Says no one likes him/her
27. Forgetful in activities or tasks 27. Says others are out to get him/her
28. Out of seat 28. Expresses feelings of worthlessness or inferiority
29. Gets hurt, accident prone 29. Gets hurt, accident prone
30. Gets in physical fights 30. Gets in physical fights
31. Gets teased 31. Gets teased
32. Interrupts 32. Hears things that aren’t there (describe):
33. Impulsive or acts without thinking, including calling 33. Impulsive or acts without thinking, including calling
out in class out in class
34. Physically isolates self from others 34. Physically isolates self from others
35. Lies 35. Lying
36. Bites fingernails 36. Bites fingernails
37. Nervous, highstrung, or tense 37. Nervous, highstrung, or tense
38. Nervous movements, twitching, tics, or other unusual 38. Nervous movements, twitching, tics or other unusual
movements (describe): movements (describe):
39. Loses things 39. Overconforms to rules
40. Too fearful or anxious 40. Too fearful or anxious
41. Physically attacks people 41. Physically attacks people
Note. Bold font in the first column shows new items added to the 2009 DOF. Bold italic font in the second column shows
1986 DOF items that were not included on the 2009 DOF.
a
Items 62 through 89 on the 2009 DOF have counterparts on the 1986 DOF, but the item numbers were changed as can be
seen by comparing item numbers across columns.
122
Appendix D 123
APPENDIX D (CONT.)
42. Picks or scratches nose, skin, or other parts of body 42. Picks or scratches nose, skin, or other parts of body
(describe): (describe):
43. Runs about or climbs excessively 43. Falls asleep
44. Apathetic, unmotivated, or won’t try 44. Apathetic, unmotivated, or won’t try
45. Responds before instructions are completed 45. Refuses to talk
46. Disrupts group activities 46. Disrupts group activities
47. Screams 47. Screams
48. Secretive, keeps things to self, including refusal to 48. Secretive, keeps things to self, including refusal to
show things to teacher show things to teacher
49. Avoids or is reluctant to do tasks that require 49. Sees things that aren’t there (describe):
sustained mental effort
50. Self-conscious or easily embarrassed 50. Self-conscious or easily embarrassed
51. Slow to respond verbally 51. Sexual activity (describe):
52. Shows off, clowns, or acts silly 52. Shows off, clowns, or acts silly
53. Shy or timid 53. Shy or timid
54. Explosive or unpredictable behavior 54. Explosive or unpredictable behavior
55. Demands must be met immediately, easily frustrated 55. Demands must be met immediately, easily frustrated
56. Easily distracted by external stimuli 56. Easily distracted by external stimuli
57. Stares blankly 57. Stares blankly
58. Speech problem (describe) 58. Acts like feelings are hurt when criticized
59. Wants to quit or does quit tasks 59. Steals
60. Yawns 60. Stores up things he/she doesn’t need, except hobby
items such as marbles (describe):
61. Strange behavior (describe): 61. Strange behavior (describe):
62. Strange ideas (describe):
62. Stubborn, sullen, or irritablea 63. Stubborn, sullen, or irritable
64. Sudden changes in mood or feelings
63. Sulksa 65. Sulks
66. Suspicious
64. Swears or uses obscene languagea 67. Swears or uses obscene language
68. Talks about killing self
65. Talks too mucha 69. Talks too much
66. Teasesa 70. Teases
67. Temper tantrums, hot temper, or seems angrya 71. Temper tantrums, hot temper, or seems angry
72. Verbal expressions of preoccupation with sex
68. Threatens peoplea 73. Threatens people
69. Too concerned with neatness or cleanlinessa 74. Too concerned with neatness or cleanliness
70. Underactive, slow moving, tired, or lacks energya 75. Underactive, slow moving, or lacks energy
71. Unhappy, sad, or depresseda 76. Unhappy, sad, or depressed
72. Unusually louda 77. Unusually loud
73. Overly anxious to pleasea 78. Overly anxious to please
74. Whining tone of voicea 79. Whining tone of voice
75. Withdrawn, doesn’t get involved with othersa 80. Withdrawn, doesn’t get involved with others
81. Worries
76. Sucks thumb, fingers, hand, or arma 82. Sucks thumb, fingers, hand, or arm
77. Fails to express self clearlya 83. Fails to express self clearly
a
124 Appendix D
APPENDIX D (CONT.)
78. Impatienta 84. Impatient

79. Tattlesa 85. Tattles
80. Repeats behavior over & over; compulsions (describe):a 86. Repeats behavior over & over; compulsions
(describe):
81. Easily led by peersa 87. Easily led by peers
82. Clumsy, poor motor controla 88. Clumsy, poor motor control
83. Doesn’t get along with peersa 89. Doesn’t get along with peers
84. Runs out of class (or similar setting)a 90. Runs out of class (or similar setting)
85. Behaves irresponsibly (describe): a 91. Behaves irresponsibly (describe):
86. Bossya 92. Bossy
93. Plays with younger children
87. Complainsa 94. Complains
88. Afraid to make mistakesa 95. Afraid to make mistakes
89. Other problems not listed above: a 96. Acts like poor loser
97. Other problems (specify):
a
Index
A D
Abramowitz, M., 84, 114 Das, J. P., 2, 116
Achenbach, T.M., 2-3, 30, 56, 58, 62, 71-73, 78, Diagnosis, 112
81, 86, 97-98, 114-116 DiPerna, J. C., 1, 117
ADHD, 61, 65, 73, 82, 109 Disabilities, 111
Aggressive Behavior, 1, 33, 37, 79, 81 DOF profile, 108
Algina, J., 84, 115 DSM , 1, 30, 58, 78, 81-82, 109, 112
Alpha, 94-96 DSM-oriented scales 31, 81, 85
American Education Research Association, 114 DuPaul, G. J., 61, 115
American Psychiatric Association, 1, 30, 58, 73, 81, 114 E
Applegate, B., 115
Edelbrock, C., 71-72, 97, 114, 116
Attention Deficit/Hyperactivity Problems , 1, 23, 30-32, 81-83,
Emotional disturbance, 61, 63
109
Ethnicity, 84, 99, 100, 103-104
Attention Problems, 23, 77, 79
B F
Factor analyses, 74, 75, 76, 79
Barkley, R.A., 58, 61, 114
Fitzgerald, M., 72, 115, 116
Beall, G., 85, 116
Frick, P. J., 78, 115
Borderline range, 26-28, 30-31, 37, 88, 110
Functional behavioral assessment, 60, 69
Briesch, A. M., 91, 115
Browne, N. W., 76, 115 G
Burns, G. L., 72, 78, 116 Gender , 100, 103, 108
C Gent, C. L., 72, 115
Gove, P., 73, 115
Case Management, 59, 67, 70
Guidelines for rating problem items, 15
CBCL/6-18, 60-63, 65, 68, 70-71, 78-79, 81, 97
Guze, S., 58, 115
Chafouleas, S. M., 91, 115
Chanese, J. A. M., 91, 115 H
Christ, T. J., 91, 115 Heick, P., 1, 116
Classroom observations, 23-24, 33-35, 66-67, 77, Hintze, J. M., 1, 46, 47, 55, 91, 115, 117
82, 84-85, 87, 93, 95-96, 99-100, 104, 106, 107 Hoge, R. D., 1, 58, 116
Clinical interpretations, 110 Hoover, H. D., 86, 116
Clinical range, 26-28, 30-31, 37, 88, 110 Hynd, G. W., 115
Cohen, B. H., 85, 116 Hyperactivity-Impulsivity, 1, 30-32, 81-83, 109
Cohen, J., 91, 115
I
Computer-scoring program, 23
Content validity, 97 ID number, 10
Continuous recording methods, 2 Identified child, 1, 5, 10, 11, 23, 25, 44
Control children, 1, 5, 10, 11, 23, 25, 45 Immature/Withdrawn, 23, 77-78, 82
Criterion-related validity, 98 Inattention, 1, 30-32, 81-83, 109
Crocker, L., 84, 115 Individualized Education Program (IEP), 59
Cronbach, L.J., 91, 94, 96, 115 Individuals with Disabilities Education Improvement, 115
Inter-observer agreement, 46, 48-50, 52-53
Cudeck, R., 76, 115
Inter-rater reliability, 51, 91-93
Internal consistency, 91, 94
Intrusive, 23, 77
125
126 Index
K R
Kaufman, A.S., 3, 115 Recess observations, 36, 38-40, 67, 79, 81, 84-85,
Kaufman, N.L., 3, 115 87, 93, 96, 99-100, 102, 104, 106, 107
Kay, P. J., 72, 116 Reed, M. L., 72, 116
Kerdyck, L., 115 Referral status, 99-100, 103-107
Kolen, M. J., 86, 116 Rehabilitation Act of 1973, 116
Kratochwill, T. R., 58, 116 Reliability, 91
Reschly, D.J., 1, 117
L
Rescorla, L. A., 2-3, 30, 56, 58, 72-73, 78, 81, 86,
Lahey, B. B., 115 97-98, 114
Lakin, R., 1, 115 Response-to-Intervention (RTI), 60, 67
Learning disabilities, 62 Riley-Tillman, T. C., 91, 115
Leff, S. S., 1, 115 Ritter, D., 60, 62, 116
Lewis, M., 97, 114 Roid, G. H., 116
Loehlin, J. C., 76, 115
Low scores, 110 S
Sakoda, J. M., 85, 95, 99, 116
M
Sattler, J. M., 1, 55, 58, 116
Mather, N., 3, 117 School psychologist, 59-60
Matthews, W. J., 91, 115 SCICA, 58, 63, 79, 97
Mattison, R.E., 62, 116 Section 504 Accommodations 61, 67
McConaughy, S. H., 2-3, 30, 58, 60, 62, 72-73, 98, Setting, 12
114-116 Shapiro, E. S., 1, 58, 116-117
McGrew, K., 3, 117 Skansgaard, E. P., 72, 78, 116
Mean T scores, 88 Skiba, R., 116
Multiaxial assessment, 2-3 Sluggish Cognitive Tempo, 23, 76-78, 105
Multidisciplinary team (MDT), 59 Special education, 60, 112
Multisource data, 59 SPSS, 74, 116
Muthén, B.O., 74, 116 Stegun, I.A., 84, 114
Muthén, L.K., 74, 116 Stoner, G., 61, 115
N Syndromes, 73, 77, 109
Naglieri, J. A., 2, 116 T
Narrative report, 33, 35, 38, 40 T Scores, 25, 28, 30-31, 33, 37, 83, 85-87, 110
Normal range, 26, 28, 30, 31, 37, 88 Test-retest reliability, 91, 93, 95
Normalized T scores 82-84 Three-tiered model, 60, 69
Normative samples, 82, 84-85 Tilly, W.D., 60, 116
O Time sampling, 2
TOF, 58-59, 61, 63, 66, 78-79, 97
Observation set, 11, 23, 25 Total problems, 23, 28, 33, 37, 87
Observer’s notes, 12, 14 Training observers, 41
Ollendick, T., 115 TRF, 60-63, 66, 68, 70-71, 78-79, 81, 97
On-task, 12-14, 23, 30, 47-50, 87-88, 110
Oppositional, 23, 78-79 V
Other problems, 23, 28, 33, 37, 79-80, 109 Validity, 96
Outcome evaluation, 59, 67, 70 Volpe, R.J., 1, 2, 58, 116-117
P W
Percent agreement index, 46 Wechsler, D., 2, 65, 116-117
Percentiles, 25, 28, 30-31, 33, 37, 110 Wilson, M.S., 1, 117
Petersen, N. S., 62, 86, 116 Woodcock, R.W., 2, 117
Peterson, R. L., 62, 116
Problem items, 13, 15, 47, 52-54 Y
Profile, 23 YSR, 63, 81, 97
al for the ASEBA Direct Observation Form McConaughy & Achenbach
the ASEBA Direct Observation Form McConaughy & Achenbach
ibrary of Congress Control Number: xxxxxxx

ISBN 978-1-932975-12-3

2009 DOF Manual 1 PDF

Încărcat de

Informații document

Titlu original

Drepturi de autor

Formate disponibile

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Drepturi de autor:

Formate disponibile

2009 DOF Manual 1 PDF

Încărcat de

Drepturi de autor:

Formate disponibile

Manual for the ASEBA

Direct Observation Form

ibrary of Congress Control Number: xxxxxxx

Stephanie H. McConaughy, University of Vermont

II. Constructing the DOF and DOF Profile .......................................................... Chapter 6

III. Statistical Data on Reliability and Validity

IV. Answers to Frequently Asked Questions ......................................................... Chapter 9

V. Mean DOF Scale Scores for Normative Samples of

VI. Mean DOF Scale Scores for Match Referred Children

2. Using the DOF and Rating the DOF Items ................................................................................................... 5

3. Computer-Scored DOF Profile ..................................................................................................................... 23

4. Training DOF Observers and Conducting School Observations .............................................................. 41

5. Practical Applications and Case Examples ................................................................................................. 56

6. Constructing the DOF and DOF Profile...................................................................................................... 71

ASSIGNING NORMALIZED T SCORES TO RAW SCORES ................................................................... 82

7. Reliability of the DOF ................................................................................................................................... 91

8. Validity of the DOF ....................................................................................................................................... 97

9. Answers to Frequently Asked Questions ................................................................................................... 108

References ......................................................................................................................................................... 114

Index ................................................................................................................................................................. 125

Axis I Axis II Axis III Axis IV Axis V

CBCL/6-18a TRFb TOFc Height, weight DOFd

History School records Ability tests Medical exam SCICAe

Parent Caregiver Achievement tests Neurological YSRf (for age 11)

Language tests Personality tests

Figure 2-1. Page 1 of the Direct Observation Form.

Figure 2-1 (cont.) Page 2 of the Direct Observation Form.

Figure 2-1 (cont.) Page 3 of the Direct Observation Form.

Figure 2-1 (cont.) Page 4 of the Direct Observation Form.

terms for data entry: African American, Asian, Observation Set

through the box for “OFF TASK.” tions.

DOF Scale Borderline Clinical

Total Problems-Classroom T = 60-63 T >63

On-task T = 31-35 T <31

Figure 4-1. Worksheet for computing IOA for On-task.

Figure 4-2. Steps for computing IOA for On-task.

Figure 4-3. Example of IOA for On-task.

Figure 4-4. Worksheet for computing IOA for problem items.

Figure 4-5. Steps for computing IOA for problem items.

Figure 4-6. Example of IOA for problem items.

Boys Girls Total

Sample for EFAs of 41 high frequency items from

Sample for EFAs of 57 high frequency items from

DOF Syndrome Scale Factor Loading

I. Sluggish Cognitive Tempo

III. Attention Problems

Table 6-2 (cont.)

DOF Syndrome Scale Factor Loading

45. Responds before instructions are completed .41

DOF Items Factor Loading

14. Cruel, bullies, or mean to others .40

Boys Girls Total

On-task 8.64 1.57 8.86 1.57 ns

Note. N = 661 for classroom observations; N = 244 for recess observations.

child’s scores on each scale compare with scores Scales

Reliability refers to agreement between repeated and >.50 large.

Mean r for all problem scales .79 .78

On-task .97 .94

Test- Time 1 DOF Time 2 DOF

Mean r for all problem scalesa .58

On-task .42c 8.53 1.12 8.65 .91

DOF Scale Alpha

Boys Girls Total