Development and psychometric evaluation of a new patient-reported outcome instrument measuring the functional impact of insomnia

Development and psychometric evaluation of a newpatient-reported outcome instrument measuringthe functional impact of insomnia

Christopher Bell • Lori D. McLeod •

Lauren M. Nelson • Sheri E. Fehnel •

Laurie J. Zografos • Brian Bowers

Accepted: 5 March 2011 / Published online: 20 April 2011

� Springer Science+Business Media B.V. 2011

Abstract

Purpose The objective of this study was to develop and

validate a patient-reported outcome instrument to com-

prehensively assess the consequences of inadequate sleep

for use in insomnia-related studies.

Methods To inform item development, relevant constructs

were identified through patient focus groups, literature review,

and expert input. Following a translatability assessment for

United States (US) English, US Spanish, and French, the draft

items were refined through iterative sets of patient interviews

in the United States and France. Psychometric properties were

evaluated using patient responses from a validation study

including 432 participants with either a diagnosis of primary

insomnia or no history of insomnia.

Results Psychometric analyses supported item reduction

from 38 to 26 items, yielding a unidimensional scale and

preserving the original content (mood, tiredness/energy,

memory/concentration, motivation, daily performance,

social interaction, sexual functioning). Evidence of internal

consistency (coefficient a = 0.97), convergent validity,

and known-groups validity also was documented.

Conclusions The Sleep Functional Impact Scale (SFIS) is

a psychometrically sound measure targeting the impact of

insomnia on patient functioning. When administered with a

sleep diary, this instrument has the ability to provide a

more comprehensive assessment of treatment response in

clinical studies.

Keywords Insomnia � Questionnaire � Impact �Psychometrics � Patient-reported outcome � PRO

Abbreviations

ANOVA Analysis of variance

DSM-IV-TR Diagnostic and Statistical Manual of

Mental Disorders, Fourth Edition,

Text Revision

EFA Exploratory factor analysis

ESS Epworth Sleepiness Scale

FOSQ Functional Outcomes of Sleep

Questionnaire

IRT Item response theory

ISI Insomnia Severity Index

KN Knowledge Networks

MOS-Sleep Medical Outcomes Study Sleep Scale

PRO Patient-reported outcome

PSQI Pittsburgh Sleep Quality Index

SFIS Sleep Functional Impact Scale

US United States

WPAI–GH Work Productivity and Activity

Impairment–General Health Questionnaire

Introduction

Insomnia, the most common sleep disturbance reported in

the United States (US), is characterized by difficulty falling

Disclaimer: Funding for this study was provided by

GlaxoSmithKline, Research Triangle Park, NC. All listed authors

meet the criteria for authorship set forth by the International

Committee for Medical Journal Editors.

Electronic supplementary material The online version of thisarticle (doi:10.1007/s11136-011-9885-8) contains supplementarymaterial, which is available to authorized users.

C. Bell (&) � B. Bowers

GlaxoSmithKline, Research Triangle Park, NC, USA

e-mail: [email protected]

L. D. McLeod � L. M. Nelson � S. E. Fehnel � L. J. Zografos

RTI Health Solutions, Research Triangle Park, NC, USA

123

Qual Life Res (2011) 20:1457–1468

DOI 10.1007/s11136-011-9885-8

http://dx.doi.org/10.1007/s11136-011-9885-8

asleep, difficulty maintaining sleep, waking too early in

the morning, and/or an inability to experience restorative

sleep [1–3]. Prevalence estimates in the United States vary

from 2 to 42.5%, primarily because researchers have

operationally defined the insomnia population in different

ways across studies [2, 4, 5]. When insomnia cannot be

attributed to a medical or psychiatric condition or to dis-

rupting environmental factors, it is referred to as primary

insomnia [6], with the prevalence of this more narrow

definition ranging between 6 and 10% of the US adult

population [3].

In addition to the nighttime factors related to sleep

disturbance, insomnia leads to a severe decline in func-

tioning during the day. These daytime aspects related to

functional and emotional impairment coupled with the

nighttime characteristics are reflected in the Diagnostic

and Statistical Manual of Mental Disorders, Fourth Edi-

tion, Text Revision (DSM-IV-TR) criteria for diagnosing

insomnia. Given the inclusion of daytime functioning in

the DSM-IV-TR criteria, it is important to measure this

aspect for diagnostic purposes. Furthermore, significant

changes in daytime functioning can be a convincing out-

come when comparing medical interventions for the

management of insomnia. However, there are no patient-

reported outcome (PRO) measures designed to evaluate

daytime functioning that meet current regulatory require-

ments as described in the Food and Drug Administration’s

guidance, Patient-Reported Outcome Measures: Use in

Medical Product Development to Support Labeling Claims

[7]. In addition, the European Medicines Agency’s draft

guidance on insomnia recommends evaluating daytime

functioning in clinical trials of new drugs to treat

insomnia [8]. Given these demands, there is a need for an

instrument that meets regulatory requirements and can

evaluate the aspects of daytime functioning associated

with insomnia.

This article focuses on a promising new PRO instru-

ment, the Sleep Functional Impact Scale (SFIS), which was

developed for use in clinical trials of new treatments for

insomnia. The SFIS specifically targets the impact of pri-

mary insomnia on patients’ subjective experiences with

their psychological and cognitive functioning, social

activities, and work productivity utilizing a weekly refer-

ence period.

This article presents the development and psychometric

evaluation of the SFIS, including the qualitative research

completed to identify concepts relevant to the functional

impact of insomnia, cognitive debriefing interviews to

refine the item wording, and the quantitative evaluation of

the psychometric properties based on data from a group of

adults who self-reported symptoms of primary insomnia

and another group of adults who reported no such

problems.

Methods

The SFIS was developed and evaluated in three sequential

stages: (1) qualitative research using patient focus groups, a

literature review, and expert panel input to generate the

initial set of items for a draft questionnaire; (2) cognitive

debriefing to optimize question wording and response

scales; and (3) psychometric evaluation. All stages were

conducted in the United States (US) with the exception of

cognitive debriefing, where interviews conducted in the

United States were augmented with interviews conducted

in Lyon, France. Relevant ethics and institutional review

board approvals were obtained, and all participants pro-

vided written informed consent.

Item generation and draft questionnaire

Key SFIS constructs were elicited during five focus groups

conducted in the United States. Patients were recruited

directly from physicians’ offices. Focus group eligibility

was determined on the basis of the following inclusion

criteria: physician-confirmed, primary insomnia diagnosis

of 3–6 months’ duration before screening; age 18–64 years;

fluent in English; and capable of participating in a 2-hour

(2-h) focus group. Patients were not eligible for participa-

tion if their sleep disturbances were due to the comorbidities

known to interfere significantly with sleep (e.g., depression,

generalized anxiety disorder, circadian rhythm disorder,

narcolepsy, restless leg syndrome, and chronic pain syn-

dromes); the direct psychological effects of alcoholism or

drug addiction; or the direct effects of a medication. The use

of sleep medications was not part of the inclusion or

exclusion criteria. However, the use of such medications was

discussed in the focus groups, with respondents reporting use

of common over-the-counter (e.g., acetaminophen and

diphenhydramine HCl [Tylenol PM]) and prescription

(e.g., zolpidem and eszopiclone) sleep medications. Table 1

provides demographic information for the 49 individuals

participating in the focus groups.

Two experienced moderators conducted each focus

group using a semi-structured discussion guide, with the

goal of having patients describe, in their own words, their

experiences with insomnia (e.g., general experiences,

symptoms, and direct and functional impacts). Each session

was audio-recorded to facilitate transcription and sub-

sequent analysis. From the results of the focus groups, a

targeted search of the relevant literature, input from a six-

member advisory panel of sleep specialists (see Acknowl-

edgements), and an initial translatability assessment (US

English, US Spanish, and French), a 50-item draft version

of the SFIS was created. The draft SFIS reflected a set of

questions that were meaningful and relevant to participants

when evaluating the impact of insomnia.

1458 Qual Life Res (2011) 20:1457–1468

123

Cognitive debriefing interviews

The draft SFIS was tested via face-to-face cognitive

debriefing interviews of patients with primary insomnia in

the United States and France. The purpose of the interviews

was to examine the practicality, as well as to support the face

and content validity of the SFIS. A total of 46 patients par-

ticipated in two rounds of face-to-face interviews (Table 1),

representing the three target languages: US English (13

interviews), US Spanish (16 interviews), and French (17

interviews). Patients were recruited using the same meth-

odology implemented for the focus groups and were required

to meet the same inclusion criteria, with the exception that

patients had to be fluent in the language for which they were

recruited. Patients were asked to complete the draft SFIS in

the presence of an interviewer and to clarify the compre-

hension and response process. Item reduction based on the

results of this qualitative research yielded a 38-item ques-

tionnaire reflecting the impact of insomnia on the following

aspects: mood (11 items), tiredness/energy (4 items),

memory/concentration (4 items), motivation (6 items), daily

performance (8 items), social interaction (4 items), and

sexual functioning (1 item). A 5-point Likert-type scale was

used for all questions relating to quantity (‘‘Much worse than

usual’’ to ‘‘Much better than usual’’), impact (‘‘Not at all’’ to

‘‘Extremely’’), and frequency (‘‘None of the time’’ to ‘‘All of

the time’’). The recall period throughout the questionnaire

was the past 7 days.

Quantitative psychometric evaluation

The psychometric properties and item performance of the

SFIS were evaluated in a large, Web-based observational

study of US patients. From a Knowledge Networks (KN)

online panel, 32,000 men and women aged 18 years and

older residing in the United States were randomly selected

for potential participation in this cross-sectional, popula-

tion-based survey. The KN panel, a proprietary Web-

enabled panel of research subjects who have agreed to

participate in ongoing survey research, is based on a ran-

dom-digit-dialing sample of the full US population (i.e., a

probability-based panel), ensuring that the panel is repre-

sentative of the US population. KN provides panelists

‘‘points’’ that can be redeemed for cash or Web-enabled

technology at regular intervals for panel participation.

KN administers an annual survey to gather and update

health and demographic information. Individuals who were

pre-identified from the panelists’ health profiles as having

insomnia were selected as candidates for the insomnia

group and those who were not pre-identified were candi-

dates for the control group. Screening questions were tar-

geted for each group to ensure participants met inclusion

Table 1 Qualitative development: study sample characteristics

Focus group Cognitive debriefing interviews–round 1 Cognitive debriefing interviews–round 2

US English US Spanish French US English US Spanish French

Sample size (n) 49 8 11 12 5 5 5

Female (%) 73.5% 87.5% 72.7% 58. 3% 100.0% 60.0% 80.0%

Age, mean (min, max) 46 (18–68) 49 (18–61) 46 (29–60) 43 (32–55) 47 (43–52) 30 (23–36) 44 (32–55)

Race (%)

White 63.30% 87.50% 0.00% N/A 40% 0.00% N/A

Other 37.70% 12.50% 100.0% N/A 60% 100.0% N/A

General health

Excellent 18.40% 12.50% 27.30% 41.70% 0.00% 0.00% –

Very good 30.60% 0% 36.40% 0.00% 60.00% 20.00% –

Good 36.70% 75.00% 27.30% 50.00% 40.00% 40.00% –

Fair 14.30% 12.50% 0.00% 8.30% 0.00% 20.00% –

Poor 0.00% 0.00% 0.00% 0.00% 0.00% 20.00% –

Insomnia severity

Very mild 10.20% 0% 9.10% 0.00% 0.00% 20.00% –

Mild 10.20% 12.50% 0.00% 8.30% 0.00% 0.00% –

Moderate 53.10% 37.50% 27.30% 66.70% 100% 40.00% –

Severe 18.40% 25% 54.50% 25.00% 0.00% 40.00% –

Very severe 2.00% 25% 0.00% 0.00% 0.00% 0.00% –

Unknown 6.10% 0% 9.10% 0.00% 0.00% 0.00% –

– Data are unavailable

n Sample size, US United States

Qual Life Res (2011) 20:1457–1468 1459

123

criteria for the study. Panelists eligible for the insomnia

group met the criteria described in Table 2.

In addition to the criteria described in Table 2, eligible

participants from both groups indicated that they had not

been diagnosed with a specific sleep disorder (e.g., circa-

dian rhythm sleep disorder, narcolepsy, parasomnia, peri-

odic limb movement disorder, restless legs syndrome, or

sleep apnea), or specific medical conditions with associated

sleep impairment, such as bipolar disorder, cancer, chronic

fatigue syndrome, congestive heart failure, dementia or

Alzheimer’s disease, HIV/AIDS, lupus, Parkinson’s dis-

ease, rheumatoid arthritis, or schizophrenia. Further study

inclusion criteria required eligible participants to consume

fewer than six caffeinated drinks per day and, if employed,

to work the day shift. Additionally, recruitment rules

specified that 75% of the enrolled participants were

employed at least 20 h per week.

Measures

The Web-based survey included the 38-item draft SFIS

(Electronic supplementary material) along with several

additional sleep questionnaires and a questionnaire designed

to measure health-related impacts on work productivity and

activities. Table 3 provides summary information for these

measures and each is described briefly below.

Medical Outcomes Study Sleep Scale (MOS-Sleep)

The MOS-Sleep is a 12-item, self-report instrument that

assesses both sleep duration and sleep quality. The MOS-

Sleep yields an index characterizing the severity of sleep

problems, as well as six sleep attribute scale scores

addressing Sleep Initiation, Maintenance, Perceived Ade-

quacy, Regularity, Daytime Somnolence, and Respiratory

Impairment [9]. Respondents are asked to recall their sleep

patterns over the past 4 weeks. Hays and Stewart [9] pro-

vide a detailed scoring algorithm that entails rescaling the

original numeric values to a 0 to 100 scale. This step

transforms the ranges for the summary index and six

subscales to 0–100, such that each score represents a per-

centage of the total possible points. Higher scores reflect

more of the attribute. Hays and colleagues [10] extensively

validated the MOS-Sleep in disease-specific clinical trials

and in several nonclinical samples and found acceptable

psychometric properties including reliability, validity, and

responsiveness.

Epworth Sleepiness Scale (ESS)

The ESS is an eight-item, self-report instrument designed

to measure sleep propensity in two domains: level of

daytime sleepiness and somnolence [11]. Respondents are

instructed to refer to their ‘‘usual way of life in recent

times’’ while answering the items. A composite is com-

puted as the sum of nonmissing items and total scores

range from 0 to 24. Higher scores indicate greater levels of

daytime sleepiness. Although the ESS is widely used in

sleep apnea and sleep disorder populations, Milten and

Hanly [12] report reservation regarding the reliability and

responsiveness of the ESS.

Functional Outcomes of Sleep Questionnaire (FOSQ)

The FOSQ is a 30-item, self-report instrument designed to

evaluate the impact of disorders of excessive sleepiness on

five factors related to activities of daily living: Activity

Level, Vigilance, Intimacy and Sexual Relationships, Gen-

eral Productivity, and Social Outcome [13]. Respondents are

asked to rate the difficulty they have generally performing a

given activity. Five subscales (ranging from 0 to 20) and a

global measure (ranging from 0 to 100) are reported, with

higher scores associated with greater function.

Weaver and colleagues [13] reported psychometric

properties of the FOSQ via a study with individuals with no

sleep problems, patients seeking medical attention for a

sleep problem, and patients with diagnosed obstructive

sleep apnea. A factor analysis confirmed the five subscales;

internal reliability was found to be excellent for both

subscales and the total measure, and test–retest reliability

Table 2 Psychometric evaluation: enrollment criteria

Insomnia Control

Reported an insomnia diagnosis from a physician Reported no diagnosis (current or former) of insomnia

Reported trouble falling asleep at the time of screening that

started over 3 months prior to interview for at least

3–5 nights per week

Reported not currently experiencing trouble falling asleep

more than 1 day per week and not currently using a sleep

aid more than 1 time per week

Reported feeling tired during the day at least 3–5 days per week

Reported sleep problems that were at least moderately bothersome

Reported insomnia that was not explained by a physical condition

(e.g., reflux, pregnancy, pain, breathing problems), or an emotional condition

1460 Qual Life Res (2011) 20:1457–1468

123

yielded coefficients ranging from r = 0.81 to 0.90. The

study also found that the FOSQ successfully discriminated

between normal sleep subjects and those seeking medical

attention for a sleep problem.

Pittsburgh Sleep Quality Index (PSQI)

The PSQI is a 19-item, self-rated questionnaire that assesses

sleep quality and disturbances over a 1-month time interval

[14]. Seven composite scores measure Subjective Sleep

Quality, Sleep Latency, Sleep Duration, Habitual Sleep

Efficiency, Sleep Disturbances, Use of Sleeping Medica-

tion, and Daytime Dysfunction. The composite scores are

summed yielding a global measure (range 0–21). Buysse

and colleagues [14] assessed the psychometric properties of

the PSQI over an 18-month period, with a control group of

individuals with normal sleep and a group of individuals

with either depression or sleep disorders, and reported

acceptable measures of internal consistency reliability, test–

retest reliability, and validity.

Insomnia Severity Index (ISI)

The ISI is a brief, seven-item, self-report instrument

designed to measure a patient’s perception of his or her

insomnia based on the severity of sleep-onset and sleep-

maintenance difficulties [15]. Respondents are asked to

reflect over the last 2 weeks. A total score, ranging from 0

to 28, is formed by summing over the nonmissing items.

Higher scores suggest more severe insomnia. Bastien and

colleagues [15] evaluated the psychometric properties of

the ISI and reported adequate internal consistency, indi-

cating the ISI is a reliable self-report measure to evaluate

perceived sleep difficulties, and concluded that the ISI is

sensitive to changes in perceived sleep difficulties with

treatment. Additionally, they reported high agreement

between patient- and clinician-rated scores.

Work Productivity and Activity Impairment–General

Health Questionnaire (WPAI-GH)

The WPAI–GH is a six-item instrument that measures the

percentage of work time missed due to health (absentee-

ism), productivity impairment while working (presentee-

ism), activity impairment, and loss of work productivity due

to health problems. The WPAI–GH uses a recall period of

the past 7 days. Scores are reported as percentages (0–100),

with higher percentages indicating greater impairment and

less productivity [16, 17].

Analysis

Descriptive statistics

Response distributions for the SFIS were examined for the

survey overall and separately for the two classifications

(i.e., insomnia and controls) to identify any potential floor

or ceiling effects and other response anomalies. Item-level

descriptive statistics (e.g., mean, standard deviation) were

computed for the SFIS items overall and separately for the

insomnia and control groups. Item-level effect size esti-

mates (e.g., Cohen’s d) were calculated between the

insomnia and control group.

Correlational analyses

Inter-item and item-total correlational analyses were con-

ducted to examine relationships among items and identify

Table 3 PRO assessments administered

Instrument

(# of Items)

Reference

period

Domains

MOS-Sleep (12) Past 4 weeks Sleep problems index

Sleep disturbance

Sleep adequacy

Daytime somnolence

Snoring

Respiratory impairment

Sleep quantity

ESS (8) Recent times Daytime sleepiness

Somnolence

FOSQ (30) Generally Impact on:

Activity level

Vigilance

Intimacy and sexual relationships

General productivity

Social outcome

PSQI (19) 1 month Subjective sleep quality

Sleep latency

Sleep duration

Habitual sleep efficiency

Sleep disturbances

Use of sleeping medication

Daytime dysfunction

ISI (7) Last 2 weeks Total severity

WPAI–GH (6) Past 7 days Absenteeism

Presenteeism

Work productivity loss

Activity impairment

ESS Epworth Sleepiness Scale, FOSQ Functional Outcomes of Sleep

Questionnaire, ISI Insomnia Severity Index, MOS-Sleep Medical

Outcomes Study Sleep Scale 12, PRO Patient-reported outcome,

PSQI Pittsburgh Sleep Quality Index, WPAI–GH Work Productivity

and Activity Impairment–General Health Questionnaire

Qual Life Res (2011) 20:1457–1468 1461

123

potential candidates for revision or removal from the SFIS.

First, inter-item polychoric correlations among the 38 SFIS

items were examined to identify important relationships.

Item pairs that produced an extremely high correlation

(e.g., greater than r = 0.80) were reviewed for redundancy.

Second, item-total correlations were computed for the 38

SFIS items for the entire sample and for the insomnia and the

control group.

Exploratory factor analysis

An exploratory factor analysis (EFA) was employed to

investigate the structure of the SFIS. Specifically, EFA was

performed using the inter-item polychoric correlation matrix

with data from the entire sample, and, as a sensitivity anal-

ysis, the matrix was computed by group. The size of the

eigenvalues [18] and the scree test results [19] from the

principal components analysis guided the choice of the

dimensionality of the tested models. Exploratory factor

analyses retaining varying numbers of factors were con-

ducted. Squared multiple correlations were used as commu-

nality estimates, with unweighted least squares estimation.

The initial orthogonal rotation (varimax) was followed with

an oblique rotation (quartimin), allowing correlation of the

factors. The number of factors selected was based on the size

and pattern of the factor loadings and the interpretability of

the factors. As a sensitivity analysis, the EFA was also

performed using data from the insomnia group alone.

Item Response Theory (IRT) Analysis

The structure of the SFIS was further examined through IRT

analysis to compute item parameter estimates describing the

relation between each item and the latent construct under-

lying the SFIS scale (MULTILOG v. 7.0 [20]). Samejima’s

[21] graded response model was fitted to assess the sensi-

tivity and discriminating ability of each item of the SFIS to

inform item reduction, subscale refinement, and possible

weighting techniques for scoring the final questionnaire.

Reliability and Validity

The internal consistency for the proposed subscales of the

SFIS was evaluated by computing Cronbach’s [22] coeffi-

cient alpha using item-level data. Correlations between the

SFIS and the other sleep-related instruments were computed.

It was hypothesized that the SFIS would moderately corre-

late (r [ |0.40|) with other measures of insomnia—helping

to demonstrate convergent validity. With respect to diver-

gent validity, it was expected that the SFIS would correlate

more highly with scales having similar domain content, such

as the FOSQ and ISI, than with instruments addressing the

mechanics of sleep (e.g., MOS-Sleep and PSQI).

Analysis of variance (ANOVA) was used to examine

mean differences in SFIS scores between individuals with

insomnia and those in the control group, providing evi-

dence of known-groups validity. Planned analyses also

included comparisons based on the number of hours of

sleep per night reported by participants on the MOS-Sleep

(i.e., a report of 7 to 8 h of sleep per night were considered

in the normal or adequate range and those outside would be

compared alongside) and groups categorized using the

global score on the PSQI (i.e., a score greater than 5 was

used to distinguish subjects with poor sleep quality).

Results

Sample demographics

Table 4 summarizes the demographic characteristics of the

432 participants by insomnia and control group. Enrolled

participants in the insomnia group ranged in age from 27 to

81 years, with a mean age of 53.1 (Standard deviation

(SD) = 11.4), while participants in the control group ran-

ged from 19 to 87 years of age, with a mean age of

46.7 years (SD = 17.5). The majority of participants in the

insomnia group were women (n = 140 [81.9%]), and

Table 4 Psychometric evaluation: study sample demographics

Demographic information Insomnia

(n = 171)

Control

(n = 261)

Mean age in years (SD) 53.1 (11.4) 46.7(17.5)

Gender

Male (%) 31 (18.1) 143 (54.8)

Female (%) 140 (81.9) 118 (45.2)

Race

White, Non-Hispanic (%) 164 (95.9) 186 (71.3)

Black, Non-Hispanic (%) 0 29 (11.1)

Other, Non-Hispanic (%) 3 (1.75) 10 (3.8)

Hispanic (%) 1 (0.6) 31 (11.9)

Multiracial, Non-Hispanic (%) 3 (1.75) 5 (1.9)

Marital status

Married (%) 115 (67.2) 159 (60.9)

Single (%) 20 (11.7) 76 (29.1)

Divorced (%) 25 (14.6) 18 (6.9)

Widowed (%) 9 (5.3) 3 (1.2)

Separated (%) 2 (1.2) 5 (1.9)

Education

Less than high school (%) 3 (1.8) 23 (8.8)

High school (%) 30 (17.5) 85 (32.6)

Some college (%) 62 (36.3) 78 (29.9)

College degree (%) 76 (44.4) 75 (28.7)

n Sample size, SD Standard deviation

1462 Qual Life Res (2011) 20:1457–1468

123

44.4% had obtained a college degree. The control group

had slightly more men (n = 143 [54.8%]), and approxi-

mately thirty percent (28.7%) had obtained a college

degree. Both the insomnia and control groups were pre-

dominantly white (95.9 and 71%, respectively) and married

(67.2 and 60.9%, respectively).

Item-level results

The response frequency distributions for 20 of the 38 SFIS

items were negatively skewed (i.e., problematic floor

effects), indicating that sleep had only a slight to moderate

impact on daily functioning. Additionally, the majority of

Table 5 Item-level differences overall and between control and Insomnia

Item Overall Insomnia Control Effect

sizen Mean

(SD)

n Mean

(SD)

n Mean

(SD)

1 (1a): Tired during the day? 431 2.8 (1.0) 170 3.6 (0.7) 261 2.3 (0.9) 1.7

2 (1b): Exhausted during the day? 430 2.0 (1.0) 169 2.6 (1.0) 261 1.6 (0.8) 1.2

3 (1c): Sleepy during the day? 431 2.4 (1.0) 170 2.9 (0.9) 261 2.0 (0.8) 1.0

4 (1d): Like you had less energy during the day? 432 2.6 (1.1) 171 3.5 (0.8) 261 2.0 (0.9) 1.7

5 (2a): Motivation to do usual daily activities? 431 2.1 (1.1) 171 2.9 (0.9) 260 1.7 (0.9) 1.3

6 (2b): Motivation to care for yourself? 432 1.5 (0.9) 171 1.8 (1.0) 261 1.3 (0.7) 0.6

7 (3a): Difficult to focus on what you were doing? 431 1.9 (1.0) 171 2.6 (0.9) 260 1.5 (0.8) 1.3

8 (3b): Difficult to stay alert? 432 1.9 (1.0) 171 2.6 (0.9) 261 1.5 (0.7) 1.3

9 (3c): Difficult to remember things? 431 2.1 (1.1) 170 2.8 (1.0) 261 1.6 (0.8) 1.4

10 (3d): Difficult to make decisions? 432 1.7 (1.0) 171 2.4 (1.0) 261 1.3 (0.7) 1.3

11 (4): How good was the work you did? 431 2.9 (0.6) 171 3.3 (0.6) 260 3.0 (0.5) 0.6

12 (5a): How often do you put off doing things? 431 2.4 (1.0) 171 3.0 (0.9) 260 2.0 (0.9) 1.1

13 (5b): How often do you take longer to finish things? 432 2.2 (1.1) 171 3.1 (1.0) 261 1.7 (0.8) 1.6

14 (5c): How often do you have to force yourself to start things? 432 2.4 (1.2) 171 3.3 (0.9) 261 1.8 (0.9) 1.6

15 (5d): How often do you have to force yourself to finish things? 431 2.3 (1.2) 171 3.1 (1.0) 260 1.8 (0.9) 1.4

16 (5e): How often do you do just enough work to get by? 431 2.1 (1.0) 171 2.7 (1.0) 260 1.8 (0.9) 1.0

17 (5f): How often do you take time off from school/work? 431 1.7 (0.9) 170 2.1 (1.0) 261 1.5 (0.8) 0.7

18 (5g): How often do miss appointments? 432 1.2 (0.5) 171 1.3 (0.6) 261 1.1 (0.3) 0.4

19 (5h): How often do you get to school/work late? 430 1.6 (0.9) 171 1.9 (1.1) 259 1.3 (0.7) 0.7

20 (5i): How often do you doze off while working? 429 1.4 (0.7) 170 1.6 (0.9) 259 1.2 (0.6) 0.5

21 (5j): How often do you forget what you were doing while working? 432 1.6 (0.8) 171 2.1 (1.0) 261 1.2 (0.5) 1.3

22 (5k): How often do you make mistakes while doing something you had to do? 430 1.7 (0.8) 169 2.2 (0.8) 261 1.4 (0.6) 1.2

23 (6a): Negatively affected your enjoyment of social activities? 432 1.9 (1.0) 171 2.6 (1.0) 261 1.5 (0.8) 1.3

24 (6b): Negatively affected your relationships? 432 1.8 (1.0) 171 2.4 (1.0) 261 1.4 (0.8) 1.1

25 (6c): Negatively affected your interest in intimacy? 426 2.0 (1.2) 168 2.8 (1.3) 258 1.5 (0.8) 1.3

26 (7a): How often have you turned down or canceled plans? 432 1.6 (0.9) 171 2.1 (1.0) 261 1.2 (0.6) 1.1

27 (7b): How often have you wanted to be left alone? 432 2.2 (1.1) 171 2.9 (1.1) 261 1.8 (0.9) 1.1

28 (8a): How often have you felt irritable? 432 2.1 (0.9) 171 2.7 (0.9) 261 1.8 (0.8) 1.1

29 (8b): How often have you felt frustrated? 432 2.1 (1.0) 171 2.7 (0.9) 261 1.7 (0.8) 1.2

30 (8c): How often have you felt down? 431 1.8 (1.0) 170 2.4 (1.0) 261 1.5 (0.7) 1.1

31 (8d): How often have you felt anxious? 430 1.8 (1.0) 170 2.3 (1.0) 260 1.4 (0.8) 1.0

32 (8e): How often have you felt stressed? 432 2.3 (1.1) 171 2.9 (1.0) 261 1.8 (1.0) 1.2

33 (8f): How often have you felt overwhelmed? 431 2.0 (1.1) 170 2.7 (1.2) 261 1.6 (0.9) 1.1

34 (9a): How frustrated by lack of energy? 432 2.4 (1.2) 171 3.3 (1.0) 261 1.7 (0.9) 1.6

35 (9b): How frustrated by many mistakes you made? 431 1.8 (0.9) 170 2.4 (1.1) 261 1.4 (0.6) 1.3

36 (9c): How frustrated by things you would usually ignore? 431 1.9 (1.0) 171 2.6 (1.1) 260 1.5 (0.7) 1.2

37 (10a): How bothered by getting less done than planned? 431 2.4 (1.2) 170 3.3 (1.1) 261 1.8 (0.9) 1.5

38 (10b): How bothered by trouble focusing on what you were doing? 431 2.1 (1.1) 170 2.9 (1.1) 261 1.5 (0.8) 1.6

n Sample size, SD Standard deviation

Qual Life Res (2011) 20:1457–1468 1463

123

items exhibited these floor effects in the control group, in

agreement with a lack of sleep problems in that group.

However, only four items (items 6, 18, 19, and 20) dem-

onstrated floor effects in both the insomnia and control

groups, indicating that most items did allow for an ade-

quate range of severity for the insomnia group. In addition

to the evaluation for floor and ceiling effects, items with

response categories that were never endorsed, indicating a

restricted response scale range, were flagged. This anomaly

occurred for three items: items 11 (Quality of work), 18

(Missed appointments), and 22 (Mistakes). For item 11,

over 90% of respondents endorsed only two categories

(i.e., categories 3 and 4); while for items 18 and 22, the

highest categories were never endorsed.

Displayed in Table 5 are the item-level descriptive sta-

tistics for the 38 SFIS items across the entire sample and by

insomnia and control groups, as well as the item-level

effect size estimates between the two groups (i.e., Cohen’s

d). In agreement with the item-level response frequency

distributions, the SFIS item-level means are lower than 3.0

and have a standard deviation of approximately 1, reflect-

ing that sleep, in general, had a mild impact on daily

functioning for the participants. The means for the whole

sample ranged from 2.9 for item 11 (SD = 0.6; Quality of

work) to a low of 1.2 for item 18 (SD = 0.5; Missed

appointments).

Effect sizes that are positive and greater than 1 indicate

that the insomnia group reported greater impact of sleep on

functioning than the control group. Overall, item-level

SFIS effect sizes can be characterized as large, indicating

that the majority of items are capable of distinguishing

between the two groups. Moderate effects were observed

for items 6 (d = 0.59; Care for self), 11 (d = 0.62; Quality

of work), and 20 (d = 0.50; Doze off); while small

effects were observed for item 18 (d = 0.38; Missed

appointments).

Correlational analyses

Of the 703 possible item pairs that are correlated, 25 pairs

of items have inter-correlations that exceed 0.80. Stem and

content overlap between these 25 item pairs are likely

causing redundancies, with multiple items assessing func-

tioning such as physical, cognitive, performing daily

activity, social, and sexual.

The majority of item-total correlations exceeded 0.70.

Item-total correlations less than or equal to |0.40| in either

the entire sample, or by group, were flagged as potential

candidates for removal. The low item-total correlations are

0.30 for item 3 (Feeling sleepy) in the insomnia group,

-0.39 for item 11 (Quality of work) across the whole

sample (-0.19 for control and -0.36 for insomnia), 0.40

for item 18 (Missed appointments) in the control group,

and 0.39 for item 20 (Doze off) across the whole sample

(0.35 for control and 0.31 for insomnia).

Item reduction

Based on the item-level descriptive statistics (e.g., floor

effects, restricted range, effect sizes, and the correlational

analyses), item reduction was performed and 12 out of 38

draft SFIS items were eliminated, yielding a final 26-item

version (SFIS). The deleted items included the following:

item 3 (Feeling sleepy) has low item-total correlation in the

insomnia group; item 6 (Grooming) and item 19 (Lateness)

exhibit floor effects in the whole sample and in the

insomnia and control groups; item 11 (Work quality), the

only reverse-scored item, has low item-total correlation and

a restricted response range across the whole sample and in

the two groups; item 20 (Doze off) has low item-total

correlation and exhibits floor effects across the whole

sample and in the two groups; while item 18 (Missed

appointments) and item 22 (Mistakes) have low item-total

correlation, exhibit floor effects, and have a restricted

response range across the whole sample and in the two

groups. In addition, items 34 through 38 were removed

because they may be measuring the impact of lack of

functioning rather than a lack of functioning due to sleep.

Further item reduction, through selection of one item from

each highly correlated item pair, was not considered at this

time.

Exploratory factor analysis

Eigenvalues from the principal components analysis were

examined to inform the number of dimensions underlying

the proposed 26-item SFIS. For the 26-item SFIS, the

first eigenvalue was 14.9, accounting for nearly 57% of

the variance. The second eigenvalue, 1.4, accounted for

an additional 5.3% of the variance, and the remaining 24

eigenvalues ranged from 1.2 to 0.09 (variance explained

from 4.7 to 0.4%). To assess the strength of the rela-

tionship between each of the 26 items and an underlying

factor, a one-factor EFA model was considered. Table 6

displays the factor loadings for the 26 items (all strong

and greater than 0.70). Considering the strong factor

loadings and that the addition of more factors would

explain a negligible amount of variance, the parsimoni-

ous one-factor model is recommended for the 26-item

SFIS.

The results were very similar for the EFA based on data

from the insomnia group alone. The first eigenvalue was

10.9, accounting for approximately 42% of variance; the

second was 2.2 (8.3%); the third was 1.6 (6.2%); and the

remaining eigenvalues ranged from 1.35 (5.2%) to 0.13

1464 Qual Life Res (2011) 20:1457–1468

123

(0.05%). The one-factor solution yielded factor loadings all

greater than 0.40.

Item response theory

According to the results of the EFA, the SFIS was ade-

quately represented by a unidimensional construct; there-

fore, all 26 items were considered in a graded response IRT

model. Results of the IRT analysis suggested that the SFIS

is more precise in the upper portion of the scale (i.e., the

majority of the item thresholds were in the positive region

of the construct), where comparisons among patients with

decreased functioning would most likely occur. The item

slope parameter estimates range from 1.38 to 3.42, with all

26 items being strongly and similarly related to the

underlying construct, a desirable property when sum-

scoring is applied.

Scale-level analyses

The descriptive statistics for the 26-item SFIS summary

measure and the other measures (i.e., ESS, FOSQ, ISI, MOS-

Sleep, WPAI–GH, and PSQI) used to explore content

validity are presented in Electronic supplementary material

across the entire sample and by group. Additionally, effect

size estimates between the nsomnia and control groups (i.e.,

Cohen’s d) are included. Given the EFA and IRT results,

SFIS scores were computed as sums across component items

with a possible range from 26 to 130. Higher SFIS scores

reflect greater impact of sleep on daily functioning. As

expected, the insomnia group scored higher on the SFIS than

the control group, 70.6 (SD = 16.4) versus 42.3 (SD =

13.8), with an effect size of 2.05. Effect sizes were moderate

to large and in the anticipated direction for the specific

instrument (i.e., insomnia group has greater dysfunction).

Reliability and construct validity

The internal consistency reliability of the 26-item SFIS is

very high at 0.97 for the entire sample and 0.95 for the

insomnia group. The construct validity correlations are

shown in Table 7. All correlations were statistically sig-

nificant. Strong correlations were found between the

26-item SFIS and the ISI Total Composite (r = 0.82), the

PSQI Overall Composite (r = 0.78), the FOSQ Activity

subscale (r = -0.76), the MOS-Sleep Problems Index I

and II (r = 0.74 and 0.76, respectively), and the PSQI

subscales Sleep Dysfunction and Subjective Sleep Quality

(r = 0.75 and 0.70, respectively). Moderately strong cor-

relations were observed between the SFIS and the FOSQ

Composite (r = -0.69), the WPAI–GH Activities

Impairment (r = 0.68), the two MOS subscales Sleep

Disturbance (r = 0.68) and Somnolence (r = 0.61), and

the PSQI Sleep Disturbance subscale (r = 0.63). The

remaining measures correlated moderately highly, ranging

from -0.56 (FOSQ Productive) to -0.42 (FOSQ Social).

Modest correlations were observed between the SFIS and

WPAI–GH Work Missed due to health (r = 0.31), and the

two MOS subscales pertaining to snoring and shortness of

breath (r = 0.26 and 0.27, respectively).

Consistent with expectations, the FOSQ Overall Com-

posite and ISI were strongly correlated with the SFIS, as

were the PSQI Composite and the MOS-Sleep Problems

indices. The weakest correlations were observed between

the SFIS and the MOS subscales that referred to snoring

and breathing.

Known-groups validity

SFIS scores were significantly higher (i.e., poorer func-

tion) in the insomnia group than the control group,

Table 6 26-Item SFIS EFA factor loadings

Item Factor 1

1 (1a): Tired during the day? 0.79

2 (1b): Exhausted during the day? 0.77

4 (1d): Like you had less energy during the day? 0.82

5 (2a): Motivation to do usual daily activities? 0.83

7 (3a): Difficult to focus on what you were doing? 0.84

8 (3b): Difficult to stay alert? 0.82

9 (3c): Difficult to remember things? 0.80

10 (3d): Difficult to make decisions? 0.83

12 (5a): How often do you put off doing things? 0.80

13 (5b): How often do you take longer to finish things? 0.85

14 (5c): How often do you have to force yourself

to start things?

0.86

15 (5d): How often do you have to force yourself

to finish things?

0.88

16 (5e): How often do you do just enough work to get by? 0.72

17 (5f): How often do you take time off from school/work? 0.64

21 (5j): How often do you forget what you were doing

while working?

0.80

22 (5k): How often do you make mistakes while

doing something you had to do?

0.79

23 (6a): Negatively affected your enjoyment

of social activities?

0.81

24 (6b): Negatively affected your relationships? 0.72

25 (6c): Negatively affected your interest in intimacy? 0.81

26 (7a): How often have you turned down or canceled

plans?

0.78

27 (7b): How often have you wanted to be left alone? 0.81

28 (8a): How often have you felt irritable? 0.86

29 (8b): How often have you felt frustrated? 0.81

30 (8c): How often have you felt down? 0.77

31 (8d): How often have you felt anxious? 0.82

32 (8e): How often have you felt stressed? 0.82

Qual Life Res (2011) 20:1457–1468 1465

123

t(430) = 19.36, P \ 0.0001, yielding an effect size of 2.05

(insomnia: mean = 70.59 and control: mean = 42.27,

SD = 13.79). Additionally, as anticipated, the SFIS scores

were significantly lower for those who reported an adequate

amount of sleep per night verses those who did not,

t(429) = -12.18, P \ 0.001 (effect size = -1.04, at least

7 h: mean = 42.61 and less than 7 h: mean = 63.24,

SD = 19.9) and for those with higher, hence, poorer PSQI

Sleep Quality scores, t(430) = 17.06, P \ 0.0001 (effect

size = 2.33, higher than 5 PSQI: mean = 65.01 and lower:

mean = 39.06, SD = 11.15).

Discussion

Patient focus groups, the published literature, and an

advisory panel generated the item content for the SFIS, and

information from cognitive debriefing interviews was used

to refine the draft questionnaire. Results from the psycho-

metric analyses of 432 patients were used to finalize the

item content and to evaluate the psychometric properties of

the SFIS. Based on the item-level evaluations (i.e., fre-

quencies, descriptives, effect sizes, and item correlations),

12 items were eliminated from the SFIS, and psychometric

analyses were performed on the 26-item SFIS scale.

Results of the principal components and factor analyses

provided overwhelming evidence that the SFIS is

unidimensional.

The factor analyses results are a bit surprising because

review of the instrument’s content, including concepts or

aspects addressed by groups of items, suggested that these

items might form separate, interpretable factors, or SFIS

subscales. The SFIS was developed to address seven

aspects related to the functional impact of insomnia: (1)

mood; (2) tiredness/energy; (3) memory/concentration; (4)

motivation; (5) daily performance; (6) social interaction;

and (7) sexual functioning. While the computation of

subscales could be considered, the single-factor scoring

structure of the SFIS-revised was corroborated by the

moderate to strong inter-item correlations and additional

IRT analyses, suggesting that computation of a total score

is most appropriate.

The results of the factor analyses, as well as the corre-

lational and IRT analytic results, support the use of con-

ventional unit-weighting of items for scoring the SFIS.

When construct validity correlations were evaluated, the

results were generally positive, providing satisfactory

preliminary support for the convergent construct validity of

the SFIS. The SFIS composite was most highly correlated

with the ISI, the PSQI Overall Composite, FOSQ Com-

posite and Activity subscale, the MOS-Sleep Problems

Index I and II, and the two PSQI subscales (Sleep Dys-

function and Subjective Sleep Quality). Moderately high

correlations were observed with the ESS total composite,

the remaining FOSQ subscales (Vigilance, Intimacy, Pro-

ductive, and Social) and PSQI subscales (Sleep Latency,

Duration, Efficiency, Disturbance, and Medication Usage).

The magnitude of the correlations indicates that the SFIS

composite correlates reasonably well with the other mea-

sures, but is not redundant with them.

The pattern of divergent correlations regarding reference

period was not identical to what was hypothesized,

reflecting that the impact of sleep disruption on functioning

was relatively stable for the participants over the assess-

ment period. As anticipated, the weakest correlations were

observed between the SFIS composite and the MOS

Table 7 Pearson validity correlations for the 26-item SFIS

(N = 272–432)

Scale 26-item

SFIS

ESS Total composite 0.46****

FOSQ: Activity -0.76****

FOSQ: Vigilance -0.54****

FOSQ: Intimacy -0.45****

FOSQ: Productive -0.56****

FOSQ: Social -0.42****

FOSQ: Composite -0.69****

ISI Total 0.82****

MOS: Sleep disturbance scale 0.68****

MOS: Snoring scale 0.26****

MOS: Short of breath scale 0.27****

MOS: Sleep adequacy -0.46****

MOS: Somnolence scale 0.61****

MOS: Sleep problems index I 0.74****

MOS: Sleep problems index II 0.76****

MOS: Sleep quantity (Raw Score) -0.48****

MOS: Optimal sleep scale -0.50****

WPAI–GH: Pct work time missed due to health 0.31****

WPAI–GH: Pct overall work impairment due to health 0.56****

WPAI–GH: Pct impaired while working due to health 0.55****

WPAI–GH: Pct other activities impaired due to health 0.68****

PSQI: Subjective sleep quality 0.70****

PSQI: Sleep latency 0.60****

PSQI: Sleep duration 0.47****

PSQI: Sleep efficiency 0.54****

PSQI: Sleep disturbance 0.63****

PSQI: Sleep medication usage 0.57****

PSQI: Sleep dysfunction 0.75****

PSQI: Overall composite 0.78****

**** P \ 0.0001

ESS Epworth Sleepiness Scale, FOSQ Functional Outcomes of Sleep

Questionnaire, ISI Insomnia Severity Index, MOS Medical Outcomes

Study, PSQI Pittsburgh Sleep Quality Index, SFIS Sleep Functional

Impact Scale, WPAI–GH Work Productivity and Activity Impair-

ment–General Health Questionnaire

1466 Qual Life Res (2011) 20:1457–1468

123

subscales that referred to disruption of sleep from snoring

and breathing.

The known-groups validity of the SFIS composite was

very strong. Participants in the insomnia group had higher

SFIS scores than those in the control group, reflecting

greater disturbance on functioning due to sleep deficiency.

Additionally, the group reporting adequate sleep had lower

SFIS scores and the group reporting poorer sleep quality on

the PSQI had higher SFIS scores. All three comparisons

were statistically significant at P \ 0.001.

Finally, the effect size of the difference in SFIS between

the insomnia and control groups was quite large (Cohen’s

d = 2.05), indicating that the SFIS was very capable of

discriminating between these two groups. Comparable

effect sizes of the validating instruments were also notably

large. The maximum was observed for the ISI total and

PSQI composite, both at approximately 4.0. Interestingly,

these two instruments had the strongest correlation with the

SFIS (r & 0.80) despite the longer reference periods and

different content domains. The MOS-Sleep Problems Index

I and II had somewhat higher effect sizes than the SFIS (at

2.4 and 2.7, respectively). The FOSQ subscales and com-

posite, ESS total, and WPAI–GH subscales were lower

than the SFIS. Selecting instruments based solely on the

magnitude of effect sizes alone is not recommended. The

content domain and reference periods are notably different;

while the SFIS, FOSQ, WPAI–GH, and ESS focus on the

impact of sleep on functioning, and the quality of sleep is a

major content domain for the ISI, PSQI, and MOS.

Recommendations

The 26-item SFIS was evaluated and exhibited satisfactory

psychometric properties. The 26 items preserve the aspects

covered by the draft 38-item SFIS: mood, tiredness/energy,

memory/concentration, motivation, daily performance,

social interaction, and sexual functioning. The results of

the evaluation also indicate that the 26-item SFIS scale is

extremely reliable and contains additional pairs of items

that are highly related. At this point, highly correlated item

pairs were retained because the present analyses provide

evidence that including them does not detrimentally impact

the scale. With respect to the scoring of the SFIS com-

posite, the 26 items should be summed, yielding a score

that can range from 26 to 130. Higher SFIS scores reflect

worse outcomes such that experiencing sleep difficulties

(e.g., difficulty initiating or maintaining sleep, nonrestor-

ative sleep) severely affects functioning. In cases where

item-level SFIS data are missing, we tentatively propose

that a respondent must answer at least 13 items or the SFIS

score should be set to missing.

Future studies might focus on further item reduction to

reduce administration time or explore potential differences

related to mode of administration. Gathering input from

patients (in a qualitative study) could be helpful in this

regard, both to identify the optimal items within the

redundant pairs and to ensure that the content validity of

the SFIS is preserved following item reduction or change in

mode. However, the current work does provide evidence to

support the SFIS as a psychometrically sound measure

targeting the impact of insomnia on patient functioning.

Administered along with a sleep diary, this instrument has

the ability to provide a more comprehensive assessment of

treatment response in clinical studies.

Acknowledgments We would like to acknowledge Mapi Values for

their work in conceptualizing and developing the SFIS, as well as

Knowledge Networks for the use of the KnowledgePanel� and for

administering the questionnaire. We thank the advisory panel par-

ticipants: Dr. Ruth Benca, University of Wisconsin; Dr. W. Vaughn

McCall, Wake Forest University; Dr. Martin Scharf, Tristate Sleep

Disorders Center; Dr. Jame K. Walsh, St. Luke’s Hospital; Dr. Gary

Zammit, Clinilabs, Inc.; and Dr. Phyllis Zee, Northwestern Univer-

sity. Funding for this study was provided by GlaxoSmithKline,

Research Triangle Park, NC.

Source(s) of Support GlaxoSmithKline

References

1. Merrit, S. L. (2000). Putting sleep disorders to rest. RN, 63(7),

26–30.

2. Lichstein, K. L., Durrence, H. H., Taylor, D. J., Bush, A. J., &

Riedel, B. W. (2003). Quantitative criteria for insomnia. BehaviorResearch and Therapy, 41(4), 427–445.

3. Roth, T. (2007). Insomnia: Definition, prevalence, etiology and

consequences. Journal of Clinical Sleep Medicine, 3(5), 7–10.

4. Ohayon, M. M. (1997). Prevalence of DSM-IV diagnostic criteria

of insomnia: Distinguishing insomnia related to mental disorders

from sleep disorders. Journal of Psychiatric Research, 31(3),

333–346.

5. Lineberger, M. D., Carney, C. E., Edinger, J. D., & Means, M. K.

(2006). Defining insomnia: Quantitative criteria for insomnia

severity and frequency. Sleep, 29(4), 479–485.

6. American Psychiatric Association. (1994). Primary insomnia. In:

Diagnostic and statistical manual of mental disorders, 4th edn,

text revision (DSM-IV-TR) (pp. 551–557). Washington, DC:

APA.

7. US Department of Health and Human Services. (2009). Guidance

for industry. Patient-reported outcome measures: Use in medical prod-

uct development to support labeling claims. http://www.fda.gov/

downloads/Drugs/GuidanceCompliance.RegulatoryInformation/

Guidances/UCM193282.pdf. Accessed 11 November 2010.

8. European Medicines Agency. (2009). Guideline on medicinal

products for the treatment of insomnia. http://www.ema.europa.

eu/docs/en_GB/document_library/Scientific_guideline/2009/11/

WC500011987.pdf. Accessed 4 January 2011.

9. Hays, R. D., & Stewart, A. L. (1992). Sleep measures. In A.

L. Stewart & J. E. Ware (Eds.), Measuring functional and well-being: The Medical Outcomes Study approach (pp. 235–259).

Durham, NC: Duke University Press.

10. Hays, R. D., Martin, S. A., Sesti, A. M., & Spritzer, K. L. (2005).

Psychometric properties of the medical outcomes study sleep

measure. Sleep Medicine, 6(1), 41–44.

Qual Life Res (2011) 20:1457–1468 1467

123

http://www.fda.gov/downloads/Drugs/GuidanceCompliance.RegulatoryInformation/Guidances/UCM193282.pdf



http://www.ema.europa.eu/docs/en_GB/document_library/Scientific_guideline/2009/11/WC500011987.pdf



11. Johns, M. W. (1991). A new method for measuring daytime

sleepiness: The Epworth Sleepiness Scale. Sleep, 14(6), 540–545.

12. Milten, M. S., & Hanly, P. J. (2003). Measurement properties of

the Epworth sleepiness scale. Sleep, 4(3), 195–199.

13. Weaver, T., Laizner, A., Evans, L., Maislin, G., Chugh, D., Lyon, K.,

et al. (1997). An instrument to measure functional status outcomes

for disorders of excessive sleepiness. Sleep, 20(10), 835–843.

14. Buysse, D., Reynolds, C., Monk, T., Berman, S., & Kupfer, D.

(1989). The Pittsburgh sleep quality index: A new instrument for

psychiatric practice and research. Psychiatry Research, 28(2),

193–213.

15. Bastien, C., Vallieres, A., & Morin, C. (2001). Validation of the

Insomnia Severity Index as an outcome measure for insomnia

research. Sleep Medicine, 2(4), 297–307.

16. Reilly, M. C., Zbrozek, A. S., & Duke, E. M. (1993). The validity

and reproducibility of a work productivity and activity impair-

ment instrument. Pharmacoeconomics, 4(5), 353–365.

17. Reilly, M. C. (2007). Work Productivity and Activity Impairment

Questionnaire (WPAI). http://www.reillyassociates.net/. Acces-

sed 28 September 2007.

18. Kaiser, H. (1960). The application of electronic computers to

factor analysis. Educational Psychological Measurement, 20(1),

141–151.

19. Cattell, R. B. (1966). The scree test for the number of factors.

Multivariate Behavioral Research, 1(2), 245–267.

20. Thissen, D., Chen, W., & Bock, R. D. (2003). MultilogTM version7.0. Lincolnwood, IL: Scientific Software International, Inc.

21. Samejima, F. (1969). Estimation of latent ability using a response

pattern of graded scores. Psychometric Monograph, No. 17.

22. Cronbach, L. (1951). Coefficient alpha and the internal structure

of tests. Psychometrika, 16(3), 294–334.

1468 Qual Life Res (2011) 20:1457–1468

123

http://www.reillyassociates.net/

Documents

Development and psychometric evaluation of a new patient-reported outcome instrument measuring the functional impact of insomnia