Upload
others
View
3
Download
0
Embed Size (px)
Citation preview
RESEARCH ARTICLE
The Influence of Parental Concern on the Utility of AutismDiagnostic Instruments
Karoline Alexandra Havdahl , Somer L. Bishop, Pal Sur�en, Anne-Siri Øyen, Catherine Lord,Andrew Pickles, Stephen von Tetzchner, Synnve Schjølberg, Nina Gunnes, Mady Hornig, W. Ian Lipkin,Ezra Susser, Michaeline Bresnahan, Per Magnus, Nina Stenberg, Ted Reichborn-Kjennerud, andCamilla Stoltenberg
The parental report-based Autism Diagnostic Interview-Revised (ADI-R) and the clinician observation-based AutismDiagnostic Observation Schedule (ADOS) have been validated primarily in U.S. clinics specialized in autism spectrumdisorder (ASD), in which most children are referred by their parents because of ASD concern. This study assessed diag-nostic agreement of the ADOS-2 and ADI-R toddler algorithms in a more broadly based sample of 679 toddlers (age35–47 months) from the Norwegian Mother and Child Cohort. We also examined whether parental concern aboutASD influenced instrument performance, comparing toddlers identified based on parental ASD concern (n 5 48) andparent-reported signs of developmental problems (screening) without a specific concern about ASD (n 5 400). TheADOS cutoffs showed consistently well-balanced sensitivity and specificity. The ADI-R cutoffs demonstrated good spe-cificity, but reduced sensitivity, missing 43% of toddlers whose parents were not specifically concerned about ASD.The ADI-R and ADOS dimensional scores agreed well with clinical diagnoses (area under the curve�0.85), contribut-ing additively to their prediction. On the ADI-R, different cutoffs were needed according to presence or absence ofparental ASD concern, in order to achieve comparable balance of sensitivity and specificity. These results highlightthe importance of taking parental concern about ASD into account when interpreting scores from parental report-based instruments such as the ADI-R. While the ADOS cutoffs performed consistently well, the additive contributionsof ADI-R and ADOS scores to the prediction of ASD diagnosis underscore the value of combining instruments basedon parent accounts and clinician observation in evaluation of ASD. Autism Res 2017, 0: 000–000. VC 2017 Interna-tional Society for Autism Research, Wiley Periodicals, Inc.
Keywords: Autism Diagnostic Interview-Revised; Autism Diagnostic Observation Schedule; early diagnosis; screening
Background
Early diagnosis of autism spectrum disorder (ASD) is
important given that interventions in young children
are associated with considerable improvements in
symptoms and functioning [Zwaigenbaum et al., 2015].
However, the time lag from first evaluation to ASD
diagnosis can be long, often more than a year [Crane,
Chester, Goddard, Henry, & Hill, 2016; Wiggins, Baio,
& Rice, 2006; Zuckerman, Lindly, & Sinche, 2015].
Therefore, reliable and valid instruments are crucial to
aid clinicians in making timely and appropriate diagno-
ses of ASD in toddlers. Among the most widely used
assessment instruments for ASD are the Autism Diag-
nostic Interview–Revised (ADI-R) [Rutter, Le Couteur, &
Lord, 2003] and the Autism Diagnostic Observation
Schedule (ADOS) [Lord et al., 2000]. The ADI-R is a
semistructured caregiver interview, in which a trained
interviewer asks questions to elicit detailed descriptions
of the child’s social-communication and repetitive
behaviors. The ADOS is a standardized, semistructured
observational assessment of social communication and
repetitive behaviors and interest, which is administered
and scored by a trained examiner. The ADI-R and ADOS
have demonstrated good agreement with clinical diag-
nosis of ASD, especially when used in combination [see
From the Nic Waals Institute, Lovisenberg Diaconal Hospital, Oslo, Norway (K.A.H., A.-S.Ø.); Norwegian Institute of Public Health, Oslo, Norway
(K.A.H., P.S., A.-S.Ø., S.S., N.G., P.M., T.R.-K., C.S.); MRC Integrative Epidemiology Unit, School of Social and Community Medicine, University of
Bristol, Bristol, UK (K.A.H.); Department of Psychiatry, University of California San Francisco, San Francisco (S.L.B.); Center for Autism and the
Developing Brain, Weill Cornell Medical College, White Plains, New York, New York (C.L.); Department of Biostatistics, King’s College London, Lon-
don, UK (A.P.); Department of Psychology, University of Oslo, Oslo, Norway (S.v.T.); Mailman School of Public Health, Columbia University, New
York, New York (M.H., W.I.L., E.S., M.B.); Neuropsychiatric Unit, Oslo University Hospital, Oslo, Norway (N.S.); Institute of Clinical Medicine, Uni-
versity of Oslo, Oslo, Norway (T.-R.K.); Department of Global Public Health and Primary Care, University of Bergen, Bergen, Norway (C.S.); New
York State Psychiatric Institute, New York, New York (M.H., E.S.)
Received July 17, 2016; accepted for publication May 04, 2017
Address for correspondence and reprints: Alexandra Havdahl, Department of Children’s Health, Norwegian Institute of Public Health, Post Box
4404 Nydalen, N-0403 Oslo, Norway. E-mail: [email protected]
Published online 00 Month 2017 in Wiley Online Library (wileyonlinelibrary.com)
DOI: 10.1002/aur.1817VC 2017 International Society for Autism Research, Wiley Periodicals, Inc.
INSAR Autism Research 00: 00–00, 2017 1
systematic review: Charman & Gotham, 2013; Falkmer,
Anderson, Falkmer, & Horlin, 2013]. Due to findings of
reduced diagnostic validity in certain groups (e.g., tod-
dlers, individuals with very low IQ), the ADI-R and
ADOS algorithms have recently been revised to better
account for influences of age and language level on
ASD symptom ratings [Kim & Lord, 2012b; Lord et al.,
2012]. The new ADI-R Toddler and ADOS-2 algorithms
have shown improved diagnostic agreement compared
to the previous algorithms [Kim & Lord, 2012a].
The ADI-R and ADOS are heavily relied upon for diag-
nostic evaluations of ASD across a range of clinical and
research settings worldwide, yet only a few studies have
examined their validity outside of ASD specialty clinics
in the United States (U.S.). In a Swedish sample of tod-
dlers, Zander, Sturm, and B€olte [2015] found that the
ADOS performed similarly as in the U.S. validation
studies, whereas the ADI-R performed differently (i.e.,
generally lower scores, resulting in increased specificity
and reduced sensitivity). The ADI-R algorithms also
showed reduced diagnostic agreement in another study
of toddlers in Europe and Israel [de Bildt et al., 2015].
ADI-R sensitivity was especially low among toddlers
with ASD using phrase-speech; nearly half of these tod-
dlers scored in the little-to-no concern range. These find-
ings warrant further study given that both the ADI-R
and ADOS are widely used in Europe [e.g., in more
than 80% of ASD diagnostic evaluations of toddlers and
preschoolers in Norway; Larsen, 2015]. Multiple cultural
and contextual factors could contribute to the inconsis-
tent results between U.S. and non-U.S. studies. For
example, parent awareness of ASD symptoms and/or
inclination to report problematic behavior in their tod-
dlers might be generally lower in European countries
compared to in the United States [Zander et al., 2015].
Additionally, parental report could be influenced by
health system differences (e.g., whether an ASD diagno-
sis is required to be eligible for services).
A notable difference between the U.S. and European
studies was sampling from ASD specialized clinics, in
which most children are referred by their parents
because of concern about ASD, compared to from non-
specialized neuropsychiatric clinics or via screening. It
is possible that, among toddlers with ASD, those whose
parents are concerned about ASD may be more severely
impaired than those whose parents have nonspecific
concerns. However, this would be expected to also
result in differences in the performance of the ADOS
(not only the ADI-R); yet the ADOS demonstrated good
diagnostic validity in Swedish toddlers referred to a
nonspecialized neuropsychiatric clinic [Zander et al.,
2015] as well as in a U.S. sample of toddlers identified
based on community screening for signs of develop-
mental delay [Guthrie, Swineford, Nottke, & Wetherby,
2013]. It is also possible that parents who are concerned
about ASD may be more aware of and/or more inclined
to report autism-related behaviors than parents who do
not suspect ASD, thereby affecting the performance of
parent report-based instruments such as the ADI-R.
Information about the role of parental concern in
influencing the performance of the ADI-R and/or ADOS
is needed given widespread use of these instruments
outside of ASD clinics [Molloy, Murray, Akers, Mitchell,
& Manning-Courtney, 2011]. Current best practice
guidelines recommend routine screening for ASD in
general child psychiatry settings followed by diagnostic
assessment if signs of ASD are detected [Volkmar et al.,
2014]. Therefore, use of the ADI-R and ADOS with chil-
dren initially brought for assessment due to nonspecific
developmental and behavioral concerns is increasingly
common.
To date, no study has compared the psychometric
properties of ASD diagnostic instruments among chil-
dren identified for evaluation of ASD in different ways.
This study examined agreement between scores and
cutoffs from the ADI-R and ADOS and clinical diagno-
ses among toddlers recruited from a population-based
study that employed multiple methods for identifica-
tion. First, we aimed to examine diagnostic agreement
in this broadly based Norwegian sample and compare
these estimates with diagnostic agreement reported in
U.S. validation studies carried out in ASD clinics. Sec-
ond, we examined whether parental concern about ASD
influenced the performance of the instruments. This
was possible given that a subgroup of toddlers was
recruited because their parents were concerned about
ASD, whereas other toddlers were recruited because
their parents reported behavioral signs associated with
ASD (e.g., language delay) without a specific concern
about ASD. In particular, we were interested in whether
parental ASD concern influenced the discriminative
utility of the standardized cutoffs and/or the dimen-
sional scores from the ASD assessment instruments.
Methods
The Norwegian Mother and Child Cohort Study
A sample from the Norwegian Mother and Child
Cohort Study (MoBa) received diagnostic evaluations
for ASD as part of a substudy, the Autism Birth Cohort
(ABC) [Stoltenberg et al., 2010; Sur�en et al., 2014].
MoBa is a prospective pregnancy cohort study estab-
lished by the Norwegian Institute of Public Health in
1999, with nationwide recruitment of mothers in asso-
ciation with routine ultrasound examinations (41%
consented) [see Magnus et al., 2006, 2016; Schreuder &
Alsaker, 2014]. The current data were derived from
quality-assured MoBa data files released in 2014 (v7).
The children (n 5 114,500) were born August 1999
2 Havdahl et al./Influence of parental concern on ASD instruments INSAR
through July 2009. The participants are largely of Nor-
wegian or Scandinavian ethnicity (95%) [Myhre, Thore-
sen, Grogaard, & Dyb, 2012].
Multiple strategies were used to identify children
with possible ASD. One strategy was based on parental
concern about ASD, defined as the parent responding
yes to the question of whether the child has autistic
traits/autism in the MoBa questionnaires (ages 3, 5, and
7 years) and/or by self-referral or professional referral
(parental agreement was a prerequisite) to the ABC
research clinic for suspected ASD. Another strategy,
which did not require parental concern about ASD, was
screening for behavioral signs associated with ASD in
the 3-year MoBa questionnaire (response rate: 58.6%).
The screening consisted of questions about language
and social development, as well as the Social Commu-
nication Questionnaire (SCQ) [Rutter, Bailey, & Lord,
2003] (see criteria in Appendix 1). The participation
rate for assessment based on the 3-year-questionnaire
was approximately 50%.
Identification routes also included registered ASD
diagnoses in the Norwegian Patient Registry, siblings of
referred/screen-positive children, and random selection
of controls (flow chart in Appendix 2). The clinical
assessments were undertaken in 2005–2012 at Lovisen-
berg Diaconal Hospital in Oslo, in collaboration with
the Norwegian Institute of Public Health and Columbia
University.
Participant Flow
Of the 114,500 children in MoBa, 1,033 children were
clinically assessed for ASD (not invited n 5 112,255;
declined assessment n 5 1,212). Since the present study
focused on the ADI-R and ADOS in toddlers (age <48
months), children assessed at older ages (n 5 201) or
who did not complete the ADI-R/ADOS (n 5 153) were
excluded (see flow chart in Appendix 2). Hence, the ini-
tial sample consisted of 679 toddlers (aged 35–47
months). Children with severe sensory (sight/hearing)
and/or motor impairments and/or nonverbal mental
age below 10 months (n 5 14) were excluded, as the
instruments have not been validated for children with
such impairments [Kim & Lord, 2012a; Lord et al.,
2012].
ASD was defined as clinical diagnoses of Autistic Disor-
der (n 5 41), Pervasive Developmental Disorder Not Oth-
erwise Specified (n 5 24), and Asperger’s Disorder (n 5 1)
(Diagnostic and Statistical Manual of Mental Disorders,
Fourth Edition, Text Revision, DSM-IV-TR; American Psy-
chiatric Association, 2000). These three DSM-IV-TR sub-
categories of ASD have been found to be well accounted
for by a single ASD category [Frazier et al., 2012]. For
comparability with previous studies, Rett’s Disorder and
Childhood Disintegrative Disorder were not included in
the ASD case definition (excluded n 5 2). Non-ASD diag-
noses were assigned to 303 toddlers, primarily language
disorders (n 5 204), intellectual disability (n 5 38), and
attention-deficit/hyperactivity disorder (n 5 20). The
remaining 294 children did not meet criteria for any
DSM-IV-TR diagnosis (as shown in the flow chart in
Appendix 2, the majority of these were recruited ran-
domly as controls). The sample characteristics are pre-
sented in Table I.
Measures
The ADI-R (Norwegian translation) was administered by
trained research assistants who had demonstrated
research reliability in using the instrument [Rutter, Le
Couteur, et al., 2003]. The original ADI-R algorithm
used in the assessments provides a classification for
Autistic Disorder. The ADI-R Toddler algorithms, con-
sisting of ADI-R items that have demonstrated high sen-
sitivity and specificity in toddlers of three separate
language levels (nonverbal, single words, phrase-speech)
[Kim & Lord, 2012b], were retroactively calculated.
Each algorithm provides a cutoff prioritizing sensitivity
(clinical cutoff), a cutoff prioritizing specificity (research
cutoff), as well as ranges of concern about ASD (the
clinical cutoff differentiates “mild-to-moderate” from
“little-to-no” concern). The Toddler algorithms were
used when analyzing dimensional ADI-R scores. When
small subgroup sample sizes necessitated collapsing
across the language levels, we used the 10 items appli-
cable to children of all language levels.
The ADOS (Norwegian translation) was administered
by licensed clinical psychologists who had demon-
strated research reliability in using the instrument [Lord
et al., 2000]. Revised algorithms from the ADOS-2 [Lord
et al., 2012] and calibrated severity scores [CSS, range
1–10; Gotham, Pickles, & Lord, 2009] were calculated
retrospectively. The CSS was used when analyzing
dimensional ADOS scores.
Age-equivalent scores derived from the Vineland Adap-
tive Behavior Scales were used to measure expressive
language ability [Sparrow, Balla, & Cicchetti, 1984].
Language level was defined by ADI-R item 30 (i.e., non-
verbal, single words, phrase-speech or better). IQ was
measured with standard scores from the Stanford–Binet
Intelligence Scales-5th Edition for most participants
(SB5; full version:n 5 401, abbreviated version:n 5 200)
[Roid, 2003]. Toddlers with lower developmental levels
than required for the SB5 received the Mullen Scales of
Early Learning (MSEL;n 5 56) [Mullen, 1995]. To avoid
floor effects on the MSEL, the ratio full scale IQ was
derived from age-equivalent scores (mental age/chrono-
logical age*100) [Bishop, Guthrie, Coffing, & Lord, 2011].
Behavior problems not specific to ASD (attention
problems/hyperactivity, aggressive behavior, and
INSAR Havdahl et al./Influence of parental concern on ASD instruments 3
Table
I.To
tal
Sam
ple
Char
acte
rist
ics
by
Expre
ssiv
eLa
ngu
age
Leve
lan
dD
iagn
osi
s
Phra
se-s
pee
chSi
ngl
ew
ord
sNonve
rbal
ASD
dia
gnose
sOth
erdia
gnos
esNo
dia
gnose
sASD
dia
gnose
sOth
erdia
gnos
esASD
dia
gnos
esOth
erdia
gnose
s
NM
(SD)
NM
(SD)
NM
(SD)
NM
(SD)
NM
(SD)
NM
(SD)
NM
(SD)
Age
inm
onth
s35
41.2
(2.6
)231
41.5
(2.1
)294
42.1
(2.2
)20
41.1
(3.3
)58
41.0
(2.4
)11
41.4
(3.2
)14
41.8
(2.6
)
Mal
ese
x[%
]32
[91.
4]
167
[72.
3]
165
[56.1
]17
[85.0
]48
[82.8
]6
[54.5
]10
[71.4
]
IQ34
81.3
(20.
5)
230
92.5
(15.0
)294
103
.6(1
2.2)
18
64.0
(19.
9)
57
72.0
(15.
9)
11
37.7
(11.
2)
13
52.
7(1
9.9)
Langua
ge
age
33
29.5
(7.6
)222
33.2
(9.1
)283
43.2
(8.9
)20
16.9
(3.4
)58
18.8
(4.4
)11
7.5
(4.3
)14
11.9
(4.6
)
Beh
avio
rpro
ble
ms
(ite
mm
ean)
Par
ent-
report
ed35
0.6
(0.2
)223
0.6
(0.3
)286
0.5
(0.2
)20
0.8
(0.3
)56
0.6
(0.3
)11
0.6
(0.2
)13
0.5
(0.2
)
Clin
icia
n-o
bse
rved
35
0.5
(0.4
)231
0.2
(0.3
)294
0.2
(0.2
)20
0.4
(0.4
)58
0.3
(0.4
)11
0.5
(0.5
)14
0.2
(0.4
)
ADOS-
CSS
35
5.9
(2.0
)231
1.8
(1.6
)294
1.3
(1.0
)20
6.1
(2.4
)58
2.3
(1.7
)11
7.8
(1.5
)14
2.6
(2.3
)
ADI-
R(f
ull
dia
gnost
ical
gori
thm
)35
13.6
(7.0
)231
4.0
(4.1
)294
1.7
(2.7
)20
12.2
(5.3
)58
4.8
(4.8
)11
15.1
(7.0
)14
5.0
(4.3
)
ADI-
R(c
om
mon
item
s)35
7.1
(4.1
)231
1.9
(2.2
)294
1.0
(1.6
)20
8.1
(3.5
)58
3.3
(3.1
)11
11.6
(5.2
)14
4.4
(3.7
)
Iden
tifica
tion
route
,n
[%]
Par
enta
lco
nce
rnfo
rASD
13
[37%
]14
[6%
]2
[1%
]9
[45%
]4
[7%
]4
[36%
]2
[14%
]
Par
ent-
report
edASD
signs
wit
hout
conce
rnab
out
ASD
21
[60%
]187
[81%
]112
[38%
]11
[55%
]52
[90%
]5
[46%
]12
[86%
]
Contr
ols
/oth
er1
[3%
]30
[13%
]180
[61%
]0
2[3
%]
2[1
8%]
0
Mat
ernal
educa
tion
�12
year
s,n
[%]a
9[2
6%]
73
[33%
]79
[28%
]10
[50%
]23
[40%
]3
[38%
]5
[39%
]
Not
e.La
nguag
ele
vel
stra
taar
ebas
edon
ADI-
Rit
em30
(ove
rall
leve
lof
languag
e).
aM
issi
ng
on
mat
ernal
educa
tion
for
n5
54,
per
centa
ge
of
non
-mis
sing
isre
port
ed.
ASD
,au
tism
spec
trum
dis
order
;ADOS-
CSS,
stan
dar
diz
edco
mpar
ison
scor
esfr
om
the
auti
smdia
gnos
tic
obse
rvat
ion
sched
ule
;ADI-
RTo
ddle
r(c
om
mon
item
s),
sum
of
the
10
ADI-
Rdia
gnos
tic
item
s
com
mon
acro
ssla
ngua
ge
leve
ls(a
tten
tion
tovo
ice,
dir
ect
gaz
e,se
ekin
gto
shar
een
joym
ent,
range
of
faci
alex
pres
sions,
appro
pri
aten
ess
of
soci
alre
sponse
,in
tere
stin
childre
n,
resp
onse
to
appro
aches
of
childre
n,
han
dan
dfinger
man
ner
ism
s,oth
erco
mple
xm
anner
ism
s,unusu
alse
nso
ryin
tere
sts)
.
4 Havdahl et al./Influence of parental concern on ASD instruments INSAR
anxiety; hereafter “behavior problems”) were measured
by parental report in the MoBa 3-year-questionnaire
[scale derived from the Child Behavior Checklist,
Achenbach & Rescorla, 2000] [see validity data in Biele,
Zeiner, & Aase, 2014; Zachrisson, Dearing, Lekhal, &
Toppelberg, 2013], and by clinician observation in the
ADOS section for “Other Abnormal Behaviors,” which
is not included in ASD algorithms [Havdahl, Hus Bal,
et al., 2016a].
Parental concern about ASD was defined as the
parent answering yes to the question of whether the
child has autistic traits in the MoBa 3-year-
questionnaire and/or by self-referral or professional
referral (parental agreement was a prerequisite) to the
ABC research clinic for suspected ASD (n 5 48). Most
toddlers in the ASD concern group also met the screen-
ing criteria based on parent-reported behavioral signs
(n 5 39 of 47, missing information for one child).
Behavioral signs without concern about ASD was opera-
tionalized as meeting the ABC Study screening criteria
for parent-reported behaviors associated with ASD while
answering no to the question of whether the child has
autistic traits (n 5 400). The screening criteria (see Appen-
dix 1) required parent concern about some aspect of the
child’s behavior or development (although not necessar-
ily about ASD).
Procedure
Informed consent was obtained from caregivers, using
forms approved by the Regional Committee for Medical
and Health Research Ethics in South-Eastern Norway
and the Columbia University Medical Center Institu-
tional Review Board. Participants underwent a compre-
hensive multidisciplinary clinical evaluation during the
course of one or two days. The assessment team did not
have access to information from MoBa questionnaires
or previous evaluations. Separate examiners adminis-
tered the ADOS and ADI-R, without knowledge of
results from the other instrument. Inter-rater reliability
on both instruments was continuously monitored.
There were multiple other sources of information. A
child psychiatrist or other physician administered a
parental interview of developmental history and cur-
rent concerns, a physical exam, and a standardized
observation of mother-child play interaction. Parents
and daycare staff completed questionnaires covering
signs of ASD and other psychiatric disorders, language
abilities, and executive functioning [see Sur�en et al.,
2014]. Psychiatric symptom assessment included the
Preschool Age Psychiatric Assessment [PAPA; Egger
et al., 2006] or the Early Childhood Inventory-4th Edi-
tion [ECI-4; Gadow & Sprafkin, 2000]. Following the
assessment, the team met to review all available infor-
mation and discuss clinical impressions. In accordance
with the current gold standard for diagnosing ASD
[Falkmer et al., 2013], licensed and experienced clini-
cians used clinical judgment informed by the full multi-
disciplinary evaluation to assign a consensus best-
estimate diagnosis (DSM-IV-TR criteria).
Data Analysis
All analyses were performed in SPSS 22.0 (IBM Corp.,
USA) or STATA 13 (StataCorp LP, USA). Significance
level was set at 0.05. Group differences were examined
with two-samples t-tests and chi-square tests or Fisher’s
exact tests. Effect sizes are reported as Cohen’s d (d;
small:0.20–0.49, medium: 0.50–0.79, large: �0.80), or
Cramer’s V (V; small: 0.10–0.29, moderate: 0.30–0.49,
large: �0.50).
Agreement between instrument scores and clinical
diagnoses was estimated using area under the curve
(AUC) from receiver operating characteristic (ROC) anal-
yses, a well-established method for assessing the overall
discriminative performance of a dimensionally scored
instrument compared against a dichotomously defined
reference standard (e.g., clinical diagnosis) [Janes, Long-
ton, & Pepe, 2009]. The ROC curve is a plot of the true
positive rate (the proportion of children with ASD diag-
nosis correctly classified as ASD) against the false positive
rate (the proportion of children without ASD diagnosis
misclassified by the instrument as ASD), across the com-
plete range of possible cutoffs. The Stata procedure roc-
comp was used to compare AUCs. Logistic regression was
used to examine whether scores from the ADOS and ADI-
R contributed independently to prediction of ASD diag-
noses (odds ratios [ORs] are reported).
Agreement between ADI-R/ADOS classifications and
clinical diagnoses was examined by calculating sensitiv-
ity (the proportion of children with ASD diagnoses clas-
sified as ASD) and specificity (the proportion of
children without ASD diagnoses classified as non-ASD).
Likelihood ratios (LR) are also reported. LR1 is the ratio
of the probability of scoring above the cutoff in chil-
dren with ASD to the probability in children without
ASD (sensitivity/1-specificity), and estimates >1 indicate
increased probability of ASD (small:2–4, moderate:5–10,
large:>10). LR– is the ratio of the probability of scoring
below the cutoff in children with ASD to the probabil-
ity in children without ASD (1-sensitivity/specificity),
and estimates <1 indicate reduced probability of ASD
(small:0.5–0.3, moderate:0.2–0.1, large:<0.1). For com-
parability with the U.S. validation study by Kim and
Lord [2012a], sensitivity and specificity is reported for
the ADOS-2 ASD cutoff and the ADI-R Toddler clinical
cutoffs, and by language level as defined by ADI-R item
30 (i.e., no words, single words, phrase-speech or
higher).
INSAR Havdahl et al./Influence of parental concern on ASD instruments 5
Prior to assessing the influence of parental ASD con-
cern on instrument performance, we examined how
parental ASD concern was associated with other rele-
vant child and parent characteristics (i.e., age, language
and cognitive abilities, behavior problems, ASD diagno-
sis, maternal education). To examine the influence on
ADI-R and ADOS performance, we employed ROC
regression methods [Janes & Pepe, 2008; Janes et al.,
2009] (Stata procedure rocreg with linear covariate
adjustment and 1,000 bootstrap resamples). This
approach allowed assessment of the influence of paren-
tal ASD concern on (1) threshold (cutoff) performance,
and (2) overall scale performance (the ROC curve). In
the first step, parental ASD concern was entered as a
predictor of ADI-R/ADOS scores among children with-
out ASD (linear regression coefficients are reported). As
measures of the effect size of the influence of parental
ASD concern on threshold performance, we report the
sensitivity and specificity of the cutoffs in toddlers
with parental ASD concern compared to those without.
In the second step, parental concern about ASD was
assessed as a predictor of the capacity of ADI-R/ADOS
dimensional scores to differentiate between toddlers
with and without ASD diagnosis. Probit regression coef-
ficients and AUC’s are reported. The ROC regression
analyses were also carried out with adjustment for age,
language abilities, nonverbal IQ and behavior
problems.
Results
Instrument Performance in the Total Sample
Discriminative performance of the cutoffs. Among
children without any clinical diagnosis (all of whom
were using phrase-speech), specificity of the ASD cutoff
was 96% for the ADOS and 99% for the ADI-R (98% and
99% when restricting to children recruited randomly as
controls). To facilitate comparison with previous studies
and because the ADI-R and ADOS are intended for differ-
entiation between ASD and other clinical disorders, diag-
nostic agreement is detailed below and in Table II for the
comparison of children with ASD versus non-ASD diag-
noses. In this subsample, the ADI-R (clinical) cutoff
showed good specificity, but sensitivity was modest.
Children meeting the ADI-R cutoff were 4 to 10 times
more likely to be diagnosed with ASD than with a non-
ASD disorder (see estimates of LR in Table II, which are
calculated from the ratio of sensitivity and specificity
estimates). Scoring below the ADI-R cutoff reduced the
probability of ASD diagnosis somewhat among children
with single words or less (LR–<0.3), but only modestly
among children with phrase-speech (LR– 5 0.5).
The ADOS (ASD cutoff) showed well-balanced sensi-
tivity and specificity (89% and 83%, respectively). Meet-
ing the ADOS cutoff increased the probability of ASD
diagnosis by a factor of 3–4 in children with single
words or less, and by 7 in children with phrase-speech.
Table II. Agreement of the Instrument Classifications With Clinical Diagnoses of ASD Versus Other Disorders
N ASD
N Other
disordersSensitivity
95% CI
Specificity
95% CI LR1 LR–TP FN TN FP
Total
ADOS 59 7 252 51 89%, 79–96 83%, 79–87 5 0.13
ADI-R 44 22 275 28 67%, 54–78 91%, 87–94 7 0.37
ADI-R & ADOS 39 27 294 9 59%, 46–71 97%, 94–99 20 0.42
ADI-R or ADOS 64 2 233 70 97%, 90–100 77%, 72–82 4 0.04
Phrase-speech (PS)
ADOS 31 4 200 31 89%, 73–97 87%, 82–91 7 0.13
ADI-R 20 15 216 15 57%, 39–74 94%, 90–96 9 0.46
ADI-R & ADOS 17 18 225 6 49%, 31–66 97%, 94–99 19 0.53
ADI-R or ADOS 34 1 191 40 97%, 85–100 83%, 77–87 6 0.03
Single words (SW)
ADOS 17 3 42 16 85%, 62–97 72%, 59–83 3 0.21
ADI-R 16 4 46 12 80%, 56–94 79%, 67–89 4 0.25
ADI-R & ADOS 14 6 55 3 70%, 46–88 95%, 86–99 14 0.32
ADI-R or ADOS 19 1 33 25 95%, 75–100 57%, 43–70 2 0.09
Nonverbal (NV)
ADOS 11 0 10 4 100%, 72–100 71%, 42–92 4 <0.01
ADI-R 8 3 13 1 73%, 39–94 93%, 66–100 10 0.29
ADI-R & ADOS 8 3 14 0 73%, 39–94 100%, 77–100 na 0.27
ADI-R or ADOS 11 0 9 5 100%, 72–100 64%, 35–87 3 <0.01
Note. ADI-R (clinical) cutoff: NV 5 11, SW 5 8, PS 5 13. ADOS-2 (ASD) cutoff: Module 1-no words 5 11, Module 1-some words 5 8, Module 2-
phrase-speech 5 7. Language level is defined by ADI-R item 30 (overall level of language).
ADI-R, autism diagnostic interview-revised; ADOS, autism diagnostic observation schedule; ASD, autism spectrum disorder; LR1, positive likeli-
hood ratio; LR–, negative likelihood ratio; TP, true positives; TN, true negatives; FP, false positives; FN, false negatives; CI, confidence interval.
6 Havdahl et al./Influence of parental concern on ASD instruments INSAR
Conversely, not meeting the ADOS cutoff was associ-
ated with greatly reduced probability of ASD diagnosis
across language levels (LR–�0.2).
Requiring toddlers to meet both the ADOS (ASD) and
ADI-R (clinical) cutoffs resulted in excellent specificity
(95–100%), and greatly increased the probability of ASD
diagnosis across language levels (LR1�14). For children
with single words or less, not meeting the combined cri-
terion was associated with moderately reduced probabil-
ity of ASD diagnosis (LR–�0.3). Given that sensitivity
was low among children with phrase-speech (49%), not
meeting the combined criterion was associated with lit-
tle reduction in probability of ASD diagnosis in this
subgroup (LR– 5 0.5). The least restrictive criterion,
requiring children to meet cutoff on either ADOS or
ADI-R, resulted in very high sensitivities (95–100%).
Not meeting this criterion was associated with greatly
reduced probability of ASD diagnosis across language
levels (LR–<0.1). Among children with single words or
less, the tradeoff was low specificity (57–64%), whereas
among children with phrase-speech, specificity was
good (83%).
Characteristics of the misclassified children are shown
in Appendix 4. Among children ultimately diagnosed
with ASD, those missed by the ADI-R (clinical) cutoff
(false negatives) had higher intellectual (d 5 0.63,
P 5 0.02) and language abilities (d 5 0.76, P<0.01).
Among children diagnosed with disorders other than
ASD, those meeting the ADI-R cutoff (false positives)
had lower intellectual (d 5 0.75, P<0.001) and language
abilities (d 5 0.68, P<0.01) and more parent-reported
behavior problems (d 5 0.77, P < 0.001). Although no
statistically significant differences were found between
children with ASD who received ADOS classifications of
non-ASD compared to ASD, the subgroup of false nega-
tives was small (n 5 7). Among children with diagnoses
other than ASD, those meeting the ADOS cutoff (false
positives) had lower intellectual (d 5 0.62, P<0.001)
and language abilities (d 5 0.37, P 5 0.02) and more
clinician-observed behavior problems during the ADOS
administration (d 5 0.79, P<0.001).
Discriminative performance of the dimensional
instrument scores. The Pearson’s correlation
between scores on the ADI-R and ADOS was 0.63 (using
the full ADI-R Toddler algorithm: phrase-speech r 5 0.56,
single words r 5 0.52, nonverbal r 5 0.77). Dimensional
scores from both instruments differentiated well between
children with and without ASD. AUC was 0.93 for the
ADI-R (95% confidence interval [CI] 5 0.91–0.96) (ASD
versus other disorder: AUC 5 0.90) and 0.95 for the
ADOS (95% CI 5 0.92–0.97) (ASD versus other disorder:
AUC 5 0.92). There was no statistically significant differ-
ence between ADOS and ADI-R AUC scores (v2 5 0.47,
P 5 0.49). Scores from the ADI-R and ADOS contributed
independently to the prediction of ASD diagnoses in
logistic regression (ADOS: OR 5 1.96, P<0.001, ADI-R:
OR 5 1.42, P<0.001, v2(2, N 5 663) 5 254.44, P<0.001),
with independent contributions from the social commu-
nication (P<0.001) and repetitive behavior domains
(P<0.04) of both instruments.
The Role of Parental ASD Concern
As described in the methods, the study groups for these
analyses were: (1) parental ASD concern (referral to the
ABC research clinic for suspected ASD and/or parental
endorsement of autistic traits during screening; n 5 48),
and (2) parent-reported behavioral signs of ASD without
a specific concern about ASD (n 5 400). Parental ASD
concern was associated with diagnosis of ASD (54% ver-
sus 9% of toddlers were diagnosed with ASD in the
group with and without parental ASD concern, respec-
tively). Within diagnostic group (ASD and non-ASD),
toddlers with and without parental ASD concern were
largely comparable with regard to age, IQ, and language
abilities (see Table III). Among toddlers ultimately diag-
nosed with ASD, those whose parents had concern
about ASD had more parent-reported behavior problems
(d 5 0.56).
Influence on Cutoff Performance
The first step of the ROC regression showed that among
toddlers without ASD, parental ASD concern was signifi-
cantly associated with higher ADI-R scores (B 5 2.82
[95% CI 5 1.79–3.85], P<0.001), but not with ADOS
scores (B 5 0.25 [95% CI 5 20.45–0.94], P 5 0.49). The
association with elevated ADI-R scores remained
(B 5 2.37, P<0.001) after adjusting for IQ, language
abilities, and parent-reported behavior problems (all
P-values<0.01).
The substantial effect size of the influence of parental
concern about ASD on ADI-R cutoff performance is
shown by the differences in sensitivity and specificity
of the ADI-R cutoffs (Fig. 1). Among toddlers with
parental ASD concern, the ADI-R clinical cutoff had
85% sensitivity (95% CI 5 65–96%) and 68% specificity
(95% CI 5 45–86%). These estimates are largely consis-
tent with those reported in the algorithm development
study [Kim & Lord, 2012b: sensitivity 80–94%, specific-
ity 70–81%]. In contrast, the ADI-R classifications
showed considerably lower sensitivity and higher spe-
cificity among toddlers without parental concern about
ASD (Fig. 1). A sizable proportion of the toddlers ulti-
mately diagnosed with ASD scored below the ADI-R
clinical cutoff (43%). Sensitivity and specificity of the
ADOS cutoffs were similar in toddlers with and without
parental concern about ASD (Fig. 1).
INSAR Havdahl et al./Influence of parental concern on ASD instruments 7
Influence on the Discriminative Performance of theDimensional Instrument Scores
Parental ASD concern was associated not only with ASD
diagnosis, but also with increased ADI-R scores indepen-
dent of ASD diagnosis (i.e., among the toddlers without
ASD). Hence, the presence or absence of parental ASD
concern could affect estimates of the agreement
between ADI-R scores and ASD diagnosis (parental con-
cern about ASD may in itself drive a sizable portion of the
agreement between ADI-R scores and ASD diagnosis).
After adjusting for the influence of parental ASD concern,
agreement between ADI-R scores and clinical diagnoses
was still good, resulting in AUC 5 0.85 (95% CI 5 0.80–
0.91), and parental ASD concern had no statistically
significant effect on the ROC curve (i.e., the ability of
ADI-R scores to distinguish between toddlers with and
without ASD) (coefficient 5 20.28 [95% CI 5 21.14–
0.57], P 5 0.51). Furthermore, ADI-R scores contributed
independently to the prediction of ASD diagnoses also in
the subgroup of toddlers without parental concern about
ASD (ADI-R: OR 5 1.37, P<0.001, ADOS: OR 5 1.99,
P<0.001, v2(2, N 5 400) 5 130.08, P<0.001). Still, as
shown by plotting the balance between sensitivity and
specificity across all possible ADI-R cutoffs (Fig. 2), differ-
ent cutoffs were required for optimal balance in toddlers
Figure 1. Sensitivity and specificity of ADI-R (clinical) and ADOS (ASD) cutoffs in children with and without parental concernabout ASD. Note. ASD, autism spectrum disorder; ADI-R, Autism Diagnostic Interview-Revised (toddler algorithms); ADOS, AutismDiagnostic Observation Schedule (ADOS-2 algorithms). V, Cramer�s V. ***P< 0.001; **P< 0.01; *P< 0.05.
Table III. Characteristics of Toddlers Ultimately Diagnosed With and Without ASD by Level of Concern About ASD
Diagnosed with ASD Not diagnosed with ASD
Parental concern about ASD Parental concern about ASD
Yes No Yes No
N M (SD) N M (SD) P N M (SD) N M (SD) P
Demographics
Age, months 26 41.5 (3.2) 37 41.2 (2.7) 0.64 22 41.4 (3.0) 363 41.6 (2.1) 0.71
Male sex, [%] 23 [88.5] 29 [78.4] 0.30 14 [63.6] 275 [75.8] 0.20
Maternal education �12yb 10 [41.7] 12 [33.3] 0.51 7 [41.2] 129 [36.3] 0.69
Developmental level
IQa 25 70.5 (24.7) 35 68.8 (25.6) 0.80 21 84.4 (17.1) 361 90.3 (18.7) 0.17
Language agea, months 25 21.6 (10.3) 36 22.3 (10.2) 0.82 19 27.9 (14.1) 351 31.6 (10.5) 0.15
Phrase-speech, [%] 13 [50.0] 21 [56.8] 0.60 16 [72.7] 299 [82.4] 0.26
Behavior problems
Parent-reporteda (range:0–2) 26 0.7 (0.3) 37 0.6 (0.2) 0.03 20 0.6 (0.4) 356 0.6 (0.3) 0.35
Clinician-observed (range:0–2) 26 0.4 (0.4) 37 0.5 (0.5) 0.32 22 0.2 (0.3) 363 0.2 (0.3) 0.61
ASD diagnostic instruments
ADOS CSS (range:1–10) 26 6.2 (2.3) 37 6.3 (2.0) 0.87 22 2.1 (1.7) 363 1.8 (1.6) 0.49
ADI-R (common items) (range:0–20) 26 9.9 (4.6) 37 6.9 (3.6) <0.01 22 4.8 (3.3) 363 2.0 (2.3) <0.01
a A few cases had missing information (see N).b Missing by subgroup from left to right: n 5 2, n 5 1, n 5 5, and n 5 8. Proportions are calculated based on non-missing n.
ASD, autism spectrum disorder; ADOS-CSS, standardized comparison scores from the Autism Diagnostic Observation Schedule; ADI-Diag10, sum of
the 10 items common across the Autism Diagnostic Interview-Revised Toddler algorithms [Kim & Lord, 2012b].
8 Havdahl et al./Influence of parental concern on ASD instruments INSAR
with and without parental concern about ASD (shown in
blue and red, respectively).
Since ADOS scores were not significantly influenced
by parental ASD concern, there was no need to adjust
for this when estimating the ROC curve (AUC 5 0.93
[95% CI 5 0.90–0.96]). There was also no statistically
significant influence of parental ASD concern on the
ability of ADOS scores to distinguish between toddlers
with and without ASD (coefficient 5 0.18 [95%
CI 5 21.30–0.94], P 5 0.76.
Sensitivity Analyses
In contrast to most previous studies, a few of the chil-
dren who received clinical diagnoses had been recruited
based on random selection (n 5 1 ASD, n 5 29 other
non-ASD disorder). The results were very similar when
excluding randomly selected children (Appendix 3).
Given that the rate of participant exclusion was higher
in the group with parental ASD concern compared to
the group without, we also carried out the analyses
with no exclusions and found essentially unchanged
results.
Results were similar when using the unrevised instru-
ment algorithms. The ADI-R Autism criteria performed
similarly as the ADI-R Toddler Research cutoff, and the
previously proposed relaxed ADI-R ASD criteria [Risi
et al., 2006] performed similarly as the ADI-R Toddler
Clinical cutoff. Due to the small sample sizes in sub-
groups stratified by language level, diagnosis, and
parental ASD concern, the analyses of parental ASD
concern were performed with collapsed language levels.
The ADOS CSS already takes language level into
account, and for the ADI-R we used only the algorithm
items applicable to all children across language levels.
The results were similar when using a mean item score
from the full language-specific algorithms.
Discussion
ASD diagnostic instruments have been validated pri-
marily in children referred for suspected ASD to special-
ized clinics in the United States, resulting in
standardized thresholds that may not be appropriate for
children identified in other ways (e.g., through general
population screening). Accordingly, the first aim of this
study was to examine diagnostic agreement in a
broadly based Norwegian sample. Compared with diag-
nostic agreement reported in validation studies carried
out in ASD clinics in the United States, we found simi-
lar estimates for the ADOS cutoffs (85–100% sensitivity
and 71–87% specificity). The ADI-R cutoffs showed
reduced sensitivity (57–80%) and increased specificity
(79–94%). Toddlers with ASD diagnoses who were
missed by the ADOS/ADI-R cutoffs had relatively strong
intellectual and language abilities. The reverse was
found for toddlers with other diagnoses who were mis-
classified as false positives by the instruments. False
positives also tended to have more behavior problems
not specific to ASD, such as hyperactivity, irritability,
and anxiety. These associations appeared to be rela-
tively informant-specific, with parent-reported behavior
problems significantly associated with ADI-R misclassifi-
cations, and clinician-observed behavior problems sig-
nificantly associated with ADOS misclassifications. This
pattern of informant-specific associations between rat-
ings of ASD symptoms and behavior problems was also
found in a recent study of primarily school-aged chil-
dren from the United States [Havdahl, et al., 2016a].
Our second aim was to examine whether parental
ASD concern influences the performance of the ASD
diagnostic instruments. The low sensitivity of the ADI-
R cutoffs, especially in toddlers using phrase-speech, is
consistent with studies of toddlers recruited primarily
through screening and/or referral for general concerns
(rather than self-referred due to suspected ASD) [de
Bildt et al., 2015; Zander et al., 2015]. Comparing tod-
dlers identified by parental ASD concern versus parent-
reported ASD signs (screening) without concern about
ASD, the ADOS cutoffs had consistently high sensitivity
and specificity. In contrast, the ADI-R cutoffs showed
reduced sensitivity and increased specificity among tod-
dlers whose parents did not have a specific concern
about ASD. The influence of parental ASD concern on
ADI-R scores appeared to be independent of other fac-
tors that typically affect ADI-R scores, such as IQ and
language level, and of clinician-observed ASD features
(ADOS scores). These findings indicate that in addition
to child features of ASD, scores on the parental report-
based ADI-R are also affected by parents’ concern about
ASD per se. Even though trained interviewers rate the
ADI-R items after eliciting detailed parent descriptions
Figure 2. Balance of sensitivity and specificity across ADI-Rscores in children with and without parental concern aboutASD. Note. ASD, autism spectrum disorder; ADI-R scores, sum ofthe 10 items common across the Autism Diagnostic Interview-Revised Toddler algorithms.
INSAR Havdahl et al./Influence of parental concern on ASD instruments 9
of the child’s actual behaviors, parents who are con-
cerned that their child has ASD are likely to be more
aware of behavioral features associated with ASD, give
more examples and/or provide clearer descriptions of
these features. Accordingly, cutoffs derived from spe-
cialized ASD clinic settings may miss many children
whose parents do not have a specific concern about
ASD.
Although the categorical ADI-R cutoffs performed
below expectations in parents who were not concerned
about ASD, the dimensional ADI-R scores were useful
for detecting toddlers with ASD even when parents
were not concerned about ASD (AUC�0.85). When tak-
ing parental concern about ASD into account (i.e., only
comparing children with parental ASD concern and
children without parental ASD concern) ADI-R scores
had comparable accuracy in detecting toddlers with
ASD across the two groups (see Fig. 2). This suggests
that parents who do not have a specific concern about
ASD are able to report as diagnostically useful informa-
tion about their child’s social communication and
repetitive behaviors as those who are concerned about
ASD, but the scores need to be interpreted in light of
the lack of ASD concern. Moreover, although the ADOS
cutoffs performed well alone, ADI-R scores contributed
independently to the prediction of ASD diagnosis over
and above the ADOS scores. Thus, parental report
remains an important source of information about past
and current ASD features beyond what can be observed
during clinic visits. The instruments’ additive contribu-
tions underscore the value of combining direct observa-
tion and parent accounts in diagnostic evaluations of
ASD [Kim & Lord, 2012a; Zander et al., 2015].
The findings illustrate that sensitivity and specificity
of instrument cutoffs can vary widely depending on the
characteristics of the sample. The ADI-R cutoffs, which
have been derived from samples of children seen in spe-
cialized ASD clinics, missed nearly half of toddlers with
ASD whose parents did not have a specific concern
about ASD. This demonstrates the importance of con-
sidering factors that may influence the performance of
instruments used in ASD assessment. While the present
study focused on the influence of parent concern about
ASD, which may affect the performance of parental
report based instruments in particular, direct observa-
tion based instruments may be influenced by other fac-
tors (e.g., child behavior problems in that context).
When cutoffs are relied upon without critical clinical
judgment, instrument misclassifications can lead to
false reassurance, loss of valuable time for appropriate
interventions, and/or inappropriate interventions [Hav-
dahl, von Tetzchner, Huerta, Lord, & Bishop, 2016b].
Interventions for ASD in young children are associated
with clinically significant improvements in symptoms,
skills and functioning [Zwaigenbaum et al., 2015]. Loss
of time for ASD interventions could potentially increase
the likelihood of more significant impairment and sup-
port needs later, as brain maturation and behavioral
patterns may become more fixed and less plastic at
older ages.
The results have significant implications for instru-
ment development and validation efforts. Studies of
samples in ASD specialty clinics have been valuable in
forming a primary base for the development, valida-
tion, and refinement of diagnostic instruments. While
this has positively influenced assessment practices in
such clinics, caution must be exercised when applying
identical practices to other settings. Parents who are
not specifically concerned about ASD cannot necessarily
be expected to respond to questions about their child’s
behavior in the same way as parents who have sought
an ASD evaluation for their child. Therefore, to achieve
an optimal balance of sensitivity and specificity, modi-
fied criteria may be required (e.g., alternative cutoffs
and/or item combinations). Due to the complexity and
ramifications of adjusting algorithms and cutoffs, such
changes are not warranted on the basis of a single sam-
ple. Replication studies are needed to determine
whether and how parental concern about ASD should
be taken into account at the level of instrument design.
In the meantime, it behooves clinicians and researchers
to consider parental concern about ASD when interpret-
ing scores based on parental report of ASD features. For
example, it would be useful to keep in mind that the
standard cutoffs on the ADI-R may be too high for tod-
dlers whose parents are not specifically concerned about
ASD, and that it is important to integrate parent
accounts with information from other sources (e.g.,
direct observation, daycare staff).
Strengths and Limitations
Strengths of the present study included the indepen-
dent administration of the ADOS and ADI-R by separate
clinicians blinded to information from the other instru-
ment, developmental history, and previous evaluations.
Nevertheless, as in the majority of studies of ADOS/
ADI-R validity, the assessment team was not blinded to
information from these instruments in determining
clinical diagnoses [Kim & Lord, 2012a; Zander et al.,
2015]. Currently, the most widely accepted gold stan-
dard for ASD diagnosis is expert clinical judgment
informed by all available information from a multidisci-
plinary evaluation that includes multiple sources of
information and standardized instruments [Falkmer
et al., 2013; Kim, Macari, Koller, & Chawarska, 2015].
Beyond the ADOS and ADI-R, the assessment comprised
several other sources of information about ASD symp-
toms, including observation of play interaction, parent
10 Havdahl et al./Influence of parental concern on ASD instruments INSAR
interview, and questionnaires completed by parents
and daycare staff. Additionally, the assessment team
included a specialist clinician who had conducted par-
ent interview and direct child observation, but who was
not involved in the administration or scoring of the
ADOS or ADI-R. Finally, even though the availability of
information from these instruments could have contrib-
uted to overestimation of diagnostic agreement, this
would be expected both for children with and without
parental ASD concern.
DSM-IV-TR diagnostic criteria were used in this study
and while findings suggest that the vast majority of
children diagnosed under DSM-IV can be expected to
receive the same classification (ASD/non-ASD) under
DSM-5 [Huerta, Bishop, Duncan, Hus, & Lord, 2012] it
is possible that use of DSM-5 criteria could have
affected the results.
Whereas this was a relatively large sample of toddlers,
sample sizes were small when stratifying by diagnosis
and identification method. Therefore, the results should
be interpreted with caution and with consideration of
their confidence intervals. In addition, the study was
conducted in a single culture and generalization to
other countries cannot necessarily be assumed. Selec-
tion bias could also potentially limit the generalizability
of the results. MoBa is population-based, but compari-
son with the general Norwegian population indicates
underrepresentation of young, single, and less educated
mothers, as well as lower prevalence of certain expo-
sures (e.g., smoking in pregnancy) [Nilsen et al., 2013;
Statistics Norway, 2016]. Self-selection bias may also
have been associated with participation in the clinical
assessment. Nonetheless, the participation rate for invi-
tations based on the 3-year-questionnaire was relatively
high (parental ASD concern: 52.7%, parent-reported
behavioral signs but no concern about ASD: 50.4%).
Conclusion
In this broadly based sample of toddlers, the ADOS cut-
offs showed consistent and well-balanced sensitivity and
specificity. However, the ADI-R cutoffs demonstrated low
sensitivity, particularly in identifying toddlers with ASD
in the absence of specific parental concern about ASD.
Different ADI-R cutoffs were needed according to pres-
ence or absence of parental ASD concern in order to
achieve comparable balance of sensitivity and specificity.
When using these different cutoffs, ADI-R scores differ-
entiated toddlers with and without ASD equally well in
the presence and absence of parental ASD concern. These
results highlight the importance of taking parental con-
cern about ASD into account when interpreting scores
from parental report-based instruments such as the ADI-
R. Although the ADOS cutoffs performed consistently
well, the additive contributions of ADI-R and ADOS
scores to the prediction of ASD diagnosis underscore the
value of combining instruments based on parent
accounts and clinician observation in evaluation of ASD.
Future studies should examine the influence of parental
ASD concern on the ADI-R and other parental report-
based instruments in other samples, and the utility of
adjusting cutoffs and/or algorithms based on whether
parents are concerned specifically about ASD.
Acknowledgments
The authors gratefully acknowledge the contribution of
the families that participated in the study. We thank
Kari Kveim Lie, Britt Kveim Svendsen, Elin Tandberg
and the Autism Birth Cohort Study clinic and research
staff for their invaluable contributions to the data col-
lection. This work was supported by the South-Eastern
Norway Regional Health Authority (Grant number:
2012101) and National Institutes of Health/National
Institute of Neurological Disorders and Stroke (Grant
number: U01 NS047537). Somer L. Bishop, Catherine
Lord, and Andrew Pickles receive royalties for instru-
ments they have co-authored (ADOS, ADI-R, SCQ). The
other authors declare no conflicts of interest.
Appendix
Appendix 1: 36-Month Screening Criteria in theAutism Birth Cohort (ABC) Study
Criteria:
1. Parent reports that the child has autistic traits or has
been referred to a specialist for autistic traits
2. SCQ-33 score of �12
3. Repetitive behavior subdomain on the SCQ-33 5 9
(full score)
4. Parent reports that the child has been referred to a
specialist for language delay
5. Parent reports that the child shows very little interest
in playing with other children
6. Parent reports that others (well-baby nurse, teacher,
family member) have expressed concern about the
child�s development
36-month screening algorithm:
� Meeting criterion 1
� Meeting any of criteria 2–5 AND criterion 6
Note. SCQ, Social Communication Questionnaire
(Rutter et al., 2003); SCQ-33, the 33 of the 40 items
which are applicable to both verbal and nonverbal chil-
dren. Criteria 2 and 3 were based on SCQ scores, while
the remaining criteria required the parent to tick off for
“yes” when asked about the particular concern.
INSAR Havdahl et al./Influence of parental concern on ASD instruments 11
Appendix 2: Participant Flow
Note. Colored boxes mark stratification used in analyses of the influence of parental concern for ASD.
ADI-R, autism diagnostic interview-revised; ADOS, autism diagnostic observation schedule; NVMA, nonverbal mental age; Severesens/motor imp., severe sensory/motor impairment with profound intellectual disability and autistic traits; CDD, childhood disinte-grative disorder.; ASD, autism spectrum disorder diagnoses; dx, diagnoses.
12 Havdahl et al./Influence of parental concern on ASD instruments INSAR
Appendix 3: Agreement of the Instrument Classifications with Clinical Diagnoses of ASD Versus OtherDisorders (Excluding Children Recruited Randomly as Controls)
N ASD
N Other
disordersSensitivity
95% CI
Specificity
95% CI LR1 LR–TP FN TN FP
Total
ADOS 59 6 226 48 91%, 81–97 83%, 78–87 5 0.11
ADI-research (res) 30 35 262 12 46%, 34–59 96%, 93–98 11 0.56
ADI-clinical (clin) 44 21 247 27 68%, 55–79 90%, 86–93 7 0.36
ADI-clin & ADOS 39 26 265 9 60%, 47–72 97%, 94–99 18 0.41
ADI-clin or ADOS 64 1 208 66 99%, 92–100 76%, 70–81 4 0.02
Phrase-speech (PS)
ADOS 31 3 176 28 91%, 76–98 86%, 81–91 7 0.10
ADI-res 11 23 198 6 32%, 17–51 97%, 94–99 11 0.70
ADI-clin 20 14 190 14 59%, 41–75 93%, 89–96 9 0.44
ADI-clin & ADOS 17 17 198 6 50%, 32–68 97%, 94–99 17 0.52
ADI-clin or ADOS 34 0 168 36 100%, 90–100 82%, 76–87 6 <0.01
Single words (SW)
ADOS 17 3 40 16 85%, 62–97 71%, 58–83 3 0.21
ADI-res 11 9 51 5 55%, 32–77 91%, 80–97 6 0.49
ADI-clin 16 4 44 12 80%, 56–94 79%, 66–88 4 0.25
ADI-clin & ADOS 14 6 53 3 70%, 46–88 95%, 85–99 13 0.32
ADI-clin or ADOS 19 1 31 25 95%, 75–100 55%, 42–69 2 0.09
Nonverbal (NV)
ADOS 11 0 10 4 100%, 72–100 71%, 42–92 4 <0.01
ADI-res 8 3 13 1 73%, 39–94 93%, 66–100 10 0.29
ADI-clin 8 3 13 1 73%, 39–94 93%, 66–100 10 0.29
ADI-clin & ADOS 8 3 14 0 73%, 39–94 100%, 77–100 na 0.27
ADI-clin or ADOS 11 0 9 5 100%, 72–100 64%, 35–87 3 <0.01
ADI, autism diagnostic interview-revised; ADOS, autism diagnostic observation schedule; ASD, autism spectrum disorder; LR1, positive likelihood
ratio; LR–, negative likelihood ratio. TP5true positives, TN5true negatives, FP, false positives; FN, false negatives; CI, confidence interval.
ADI-research cutoffs: NV 5 13, SW 5 13, PS 5 16. ADI-clinical cutoffs: NV 5 11, SW 5 8, PS 5 13. ADOS-2 ASD cutoff: Module 1-no words 5 11,
Module 1-some words58, Module 2-phrase-speech 5 7.
Age IQ Language age
Behavior problems not specific to ASD
Parent reported Clinician observed
N M (SD) N M (SD) N M (SD) N M (SD) N M (SD)
ADI-RASD diagnoses – True positive 44 41.1 (3.0) 43 64.0 (25.0) 42 19.2 (9.9) 44 0.7 (0.2) 44 0.5 (0.4)
ASD diagnoses – False negative 22 41.4 (2.6) 20 79.1 (21.5)* 22 26.7 (10.0)** 22 0.6 (0.2) 22 0.5 (0.5)
Other diagnoses – False positive 28 40.9 (2.4) 28 74.3 (19.0)*** 27 22.8 (10.3)** 27 0.8 (0.3)*** 28 0.4 (0.3)a
Other diagnoses – True negative 275 41.5 (2.1) 272 88.2 (18.3) 267 30.0 (10.6) 265 0.5 (0.3) 275 0.2 (0.3)
ADOSASD diagnoses – True positive 59 41.3 (2.9) 56 67.9 (25.3) 57 21.5 (10.7) 59 0.6 (0.2) 59 0.5 (0.4)
ASD diagnoses – False negative 7 40.5 (2.7) 7 75.6 (20.9) 7 24.1 (8.7) 7 0.6 (0.2) 7 0.3 (0.3)
Other diagnoses – False positive 51 41.3 (2.2) 49 77.5 (21.6)*** 49 26.0 (11.2)* 48 0.6 (0.2) 51 0.4 (0.4)***
Other diagnoses – True negative 252 41.5 (2.2) 251 88.8 (17.7) 245 30.0 (10.5) 244 0.6 (0.3) 252 0.2 (0.3)
a P 5 0.050
***P< 0.001;**P< 0.01; *P< 0.05.
ADI-R, autism diagnostic interview-revised; ADOS, autism diagnostic observation schedule; ASD, autism spectrum disorder. Cutoffs: ADI-R Clinical
[Kim & Lord, 2012], ADOS ASD [Lord et al., 2012].
Appendix 4: Characteristics of Misclassified Children
INSAR Havdahl et al./Influence of parental concern on ASD instruments 13
References
Achenbach, T.M., & Rescorla, L.A. (2000). Manual for the
ASEBA preschool forms & profiles. Burlington, VT: Univer-
sity of Vermont.
American Psychiatric Association. (2000). Diagnostic and statis-
tical manual of mental disorders (4th ed., text rev.). Wash-
ington, DC: American Psychiatric Association.
Biele, G., Zeiner, P., & Aase, H. (2014). Convergent and dis-
criminant validity of psychiatric symptoms reported in The
Norwegian Mother and Child Cohort Study at age 3 years
with independent clinical assessment in the Longitudinal
ADHD Cohort Study. Norsk Epidemiologi 5 Norwegian
Journal of Epidemiology, 24, 169–176.
Bishop, S.L., Guthrie, W., Coffing, M., & Lord, C. (2011). Con-
vergent validity of the Mullen Scales of Early Learning and
the Differential Ability Scales in children with autism spec-
trum disorders. American Journal on Intellectual and Devel-
opmental Disabilities, 116, 331–343.
Charman, T., & Gotham, K. (2013). Measurement issues:
Screening and diagnostic instruments for autism spectrum
disorders - lessons from research and practice. Child and
Adolescent Mental Health, 18, 52–63.
Crane, L., Chester, J.W., Goddard, L., Henry, L.A., & Hill, E.
(2016). Experiences of autism diagnosis: A survey of over
1000 parents in the United Kingdom. Autism, 20, 153–162.
de Bildt, A., Sytema, S., Zander, E., Bolte, S., Sturm, H.,
Yirmiya, N., . . . Oosterling, I.J. (2015). Autism Diagnostic
Interview-Revised (ADI-R) algorithms for toddlers and
young preschoolers: Application in a non-US sample of
1,104 children. Journal of Autism and Developmental Dis-
orders, 45, 2076–2091.
Egger, H.L., Erkanli, A., Keeler, G., Potts, E., Walter, B.K., &
Angold, A. (2006). Test-retest reliability of the Preschool
Age Psychiatric Assessment (PAPA). Journal of the American
Academy of Child and Adolescent Psychiatry, 45, 538–549.
Falkmer, T., Anderson, K., Falkmer, M., & Horlin, C. (2013).
Diagnostic procedures in autism spectrum disorders: A sys-
tematic literature review. European Child and Adolescent
Psychiatry, 22, 329–340.
Frazier, T.W., Youngstrom, E.A., Speer, L., Embacher, R., Law,
P., Constantino, J., . . . Eng, C. (2012). Validation of pro-
posed DSM-5 criteria for autism spectrum disorder. Journal
of the American Academy of Child and Adolescent Psychia-
try, 51, 28–40. e23.
Gadow, K., & Sprafkin, J. (2000). Early Childhood Inventory-4
screening manual. Stony Brook, NY: Checkmate Plus.
Gotham, K., Pickles, A., & Lord, C. (2009). Standardizing
ADOS scores for a measure of severity in autism spectrum
disorders. Journal of Autism and Developmental Disorders,
39, 693–705.
Guthrie, W., Swineford, L.B., Nottke, C., & Wetherby, A.M.
(2013). Early diagnosis of autism spectrum disorder: Stabil-
ity and change in clinical diagnosis and symptom presenta-
tion. Journal of Child Psychology and Psychiatry, 54, 582–
590.
Havdahl, K.A., Hus Bal, V., Huerta, M., Pickles, A., Øyen, A.S.,
Stoltenberg, C., . . . Bishop, S.L. (2016a). Multidimensional
influences on autism symptom measures: Implications for
use in etiological research. Journal of the American
Academy of Child and Adolescent Psychiatry, 55, 1054–
1063. e1053.
Havdahl, K.A., von Tetzchner, S., Huerta, M., Lord, C., &
Bishop, S.L. (2016b). Utility of the Child Behavior Checklist
as a screener for autism spectrum disorder. Autism
Research, 9, 33–42.
Huerta, M., Bishop, S.L., Duncan, A., Hus, V., & Lord, C.
(2012). Application of DSM-5 criteria for autism spectrum
disorder to three samples of children with DSM-IV diagno-
ses of pervasive developmental disorders. The American
Journal of Psychiatry, 169, 1056–1064.
Janes, H., Longton, G., & Pepe, M. (2009). Accommodating
covariates in ROC analysis. The Stata Journal, 9, 17–39.
Janes, H., & Pepe, M.S. (2008). Adjusting for covariates in stud-
ies of diagnostic, screening, or prognostic markers: An old
concept in a new setting. American Journal of Epidemiol-
ogy, 168, 89–97.
Kim, S.H., & Lord, C. (2012a). Combining information from
multiple sources for the diagnosis of autism spectrum disor-
ders for toddlers and young preschoolers from 12 to 47
months of age. Journal of Child Psychology and Psychiatry,
53, 143–151.
Kim, S.H., & Lord, C. (2012b). New autism diagnostic
interview-revised algorithms for toddlers and young pre-
schoolers from 12 to 47 months of age. Journal of Autism
and Developmental Disorders, 42, 82–93.
Kim, S.H., Macari, S., Koller, J., & Chawarska, K. (2015). Exam-
ining the phenotypic heterogeneity of early autism spec-
trum disorder: Subtypes and short-term outcomes. Journal
of Child Psychology and Psychiatry, 57, 93–102.
Larsen, K. (2015). The early diagnosis of preschool children with
autism spectrum disorder in Norway: A study of diagnostic
age and its associated factors. Scandinavian Journal of Child
and Adolescent Psychiatry and Psychology, 3, 136–145.
Lord, C., Risi, S., Lambrecht, L., Cook, E.H., Jr., Leventhal,
B.L., DiLavore, P.C., . . . Rutter, M. (2000). The Autism Diag-
nostic Observation Schedule-Generic: A standard measure
of social and communication deficits associated with the
spectrum of autism. Journal of Autism and Developmental
Disorders, 30, 205–223.
Lord, C., Rutter, M., DiLavore, P.C., Risi, S., Gotham, K., &
Bishop, S.L. (2012). Autism Diagnostic Observation Sched-
ule, second edition (ADOS-2) modules 1–4. Los Angeles,
CA: Western Psychological Services.
Magnus, P., Birke, C., Vejrup, K., Haugan, A., Alsaker, E.,
Daltveit, A.K., . . . Stoltenberg, C. (2016). Cohort profile
update: The Norwegian Mother and Child Cohort Study
(MoBa). International Journal of Epidemiology, 45, 382–
388.
Magnus, P., Irgens, L.M., Haug, K., Nystad, W., Skjaerven, R.,
Stoltenberg, C., MoBa Study Group. (2006). Cohort profile:
The Norwegian Mother and Child Cohort Study (MoBa).
International Journal of Epidemiology, 35, 1146–1150.
Molloy, C.A., Murray, D.S., Akers, R., Mitchell, T., & Manning-
Courtney, P. (2011). Use of the Autism Diagnostic Observation
Schedule (ADOS) in a clinical setting. Autism, 15, 143–162.
Mullen, E. (1995). Mullen scales of early learning. Minneapolis,
MN: Pearson Assessments.
Myhre, M.C., Thoresen, S., Grogaard, J.B., & Dyb, G. (2012).
Familial factors and child characteristics as predictors of
14 Havdahl et al./Influence of parental concern on ASD instruments INSAR
injuries in toddlers: A prospective cohort study. BMJ Open,
2, e000740.
Nilsen, R.M., Suren, P., Gunnes, N., Alsaker, E.R., Bresnahan, M.,
Hirtz, D., . . . Stoltenberg, C. (2013). Analysis of self-selection
bias in a population-based cohort study of autism spectrum
disorders. Paediatric and Perinatal Epidemiology, 27, 553–563.
Risi, S., Lord, C., Gotham, K., Corsello, C., Chrysler, C.,
Szatmari, P., . . . Pickles, A. (2006). Combining information
from multiple sources in the diagnosis of autism spectrum
disorders. Journal of the American Academy of Child and
Adolescent Psychiatry, 45, 1094–1103.
Roid, G.H. (2003). Stanford-Binet intelligence scales (5th ed.).
Itasca, IL: Riverside.
Rutter, M., Bailey, A., & Lord, C. (2003). Social Communica-
tion Questionnaire. Los Angeles, CA: Western Psychological
Services.
Rutter, M., Le Couteur, A., & Lord, C. (2003). Autism Diagnos-
tic Interview-Revised (ADI-R). Los Angeles, CA: Western
Psychological Services.
Schreuder, P., & Alsaker, E. (2014). The Norwegian Mother and
Child Cohort Study (MoBa)–MoBa recruitment and logistics.
Norsk Epidemiologi 5 Norwegian Journal of Epidemiology, 24,
23–27.
Sparrow, S., Balla, D., & Cicchetti, D. (1984). Vineland Adaptive
Behavior Scales. Circle Pines, MN: American Guidance Services.
Statistics Norway. (2016). Population’s level of education.
Retrieved 20th November 2016 from https://http://www.
ssb.no/statistikkbanken/selecttable/hovedtabellHjem.asp?
KortNavnWeb5utniv&CMSSubjectArea5utdanning&
PLanguage51&checked5true
Stoltenberg, C., Schjolberg, S., Bresnahan, M., Hornig, M.,
Hirtz, D., Dahl, C., ABC Study Group. (2010). The Autism
Birth Cohort: A paradigm for gene-environment-timing
research. Molecular Psychiatry, 15, 676–680.
Sur�en, P., Schjølberg, S., Øyen, A.-S., Lie, K.K., Hornig, M.,
Bresnahan, M., . . . Stoltenberg, C. (2014). The Autism Birth
Cohort (ABC): A study of autism spectrum disorders in
MoBa. Norsk Epidemiologi 5 Norwegian Journal of Epidemi-
ology, 24, 39–50.
Volkmar, F., Siegel, M., Woodbury-Smith, M., King, B.,
McCracken, J., State, M., . . . American Academy of Child
and Adolescent Psychiatry (AACAP) Committee on Quality
Issues (CQI). (2014). Practice parameter for the assessment
and treatment of children and adolescents with autism
spectrum disorder. Journal of the American Academy of
Child and Adolescent Psychiatry, 53, 237–257.
Wiggins, L.D., Baio, J., & Rice, C. (2006). Examination of the
time between first evaluation and first autism spectrum
diagnosis in a population-based sample. Journal of Develop-
mental and Behavioral Pediatrics, 27, S79–S87.
Zachrisson, H.D., Dearing, E., Lekhal, R., & Toppelberg, C.O.
(2013). Little evidence that time in child care causes exter-
nalizing problems during early childhood in Norway. Child
Development, 84, 1152–1170.
Zander, E., Sturm, H., & B€olte, S. (2015). The added value of
the combined use of the Autism Diagnostic Interview-
Revised and the Autism Diagnostic Observation Schedule:
Diagnostic validity in a clinical Swedish sample of toddlers
and young preschoolers. Autism, 19, 187–199.
Zuckerman, K.E., Lindly, O.J., & Sinche, B.K. (2015). Parental
concerns, provider response, and timeliness of autism spec-
trum disorder diagnosis. The Journal of Pediatrics, 166,
1431–1439. e1431.
Zwaigenbaum, L., Bauman, M.L., Choueiri, R., Kasari, C.,
Carter, A., Granpeesheh, D., . . . Natowicz, M.R. (2015).
Early intervention for children with autism spectrum disor-
der under 3 years of age: Recommendations for practice
and research. Pediatrics, 136(Suppl 1), S60–S81.
INSAR Havdahl et al./Influence of parental concern on ASD instruments 15