pure.ulster.ac.uk€¦ · Web viewThe aim of this review was to evaluate the psychometric...
35
Title Page Psychometric properties and cultural adaptation of Sleep Disturbance Measures in Arabic Speaking Populations: A Systematic Review Authors No Author Credentials Titles Author affiliations 1 Mohammed Al Maqbali RN., Dip.Admin., B.Sc. (Hons)., M.Sc., PhD Student Institute of Nursing and Health Research, Ulster University, Newtownabbey, UK 2 Lynn Dunwoody PhD, AFBPsS, C.Psychol, FHEA Lecturer in Health Psychology Psychology Research Institute, Ulster University, Coleraine, UK 3 Jane Rankin B.Sc. (Hons), M.Sc. Physiotherapy Clinical Lead Physiotherapy Department, Cancer Centre, Belfast Health and Social Care Trust, Belfast, UK 4 Eileen Hacker PhD, APN, AOCN, FAAN Professor and Department Chair, Science of Nursing Care, Indiana University School of Nursing, and Visiting Professor, Ulster University. School of Nursing, Indiana University, USA 5 Ciara Hughes PhD Senior Lecturer, School of Health Sciences Institute of Nursing and Health Research, Ulster University, Newtownabbey, UK 6 Jackie Gracey (Corresponded Author) PhD, BSc (Hons) Physiotherapy Lecturer, School of Health Sciences Institute of Nursing and Health Research, Ulster University, Newtownabbey, UK Running Title: Psychometric properties for Arabic Sleep Disturbance Scales Disclosures and Acknowledgement: I certify that there is no actual or potential conflict of interest in relation to this article. This review was conducted as part of PhD thesis, which was funded by the Ulster University Vice-Chancellor's Research Scholarship. Authors contributorship 1
pure.ulster.ac.uk€¦ · Web viewThe aim of this review was to evaluate the psychometric properties and cross-cultural adaptation of sleep disturbance scales that have been translated
Psychometric properties and cultural adaptation of Sleep
Disturbance Measures in Arabic Speaking Populations: A Systematic
Review
Authors
No
Author
Credentials
Titles
PhD Student
Institute of Nursing and Health Research, Ulster
University, Newtownabbey, UK
2
3
Physiotherapy Department, Cancer Centre, Belfast Health and
Social Care Trust, Belfast, UK
4
Professor and Department Chair, Science of Nursing Care, Indiana
University School of Nursing, and Visiting Professor, Ulster
University.
School of Nursing, Indiana University, USA
5
Institute of Nursing and Health Research, Ulster
University, Newtownabbey, UK
6
Institute of Nursing and Health Research, Ulster
University, Newtownabbey, UK
Running Title: Psychometric properties for Arabic Sleep Disturbance
Scales
Disclosures and Acknowledgement:
I certify that there is no actual or potential conflict of interest
in relation to this article. This review was conducted as part of
PhD thesis, which was funded by the Ulster University
Vice-Chancellor's Research Scholarship.
Authors contributorship
Al Maqbali completed the data collection, statistical support and
provided analysis. All authors contributed to the conceptualization
and design and the manuscript preparation.
Text word count
Summary:
The aim of this review was to evaluate the psychometric properties
and cross-cultural adaptation of sleep disturbance scales that have
been translated into Arabic or originally developed in Arabic and
to identify appropriate scales that can be used in research and
clinical practice intended for Arabic speaking participants. The
following databases were searched: CINAHL (2003 - 2019), MEDLINE
(1946 - 2019), EMBASE (1980 - 2019), PsycINFO (1806 - 2019), and
Cochrane Library (1806 - 2019). This review was conducted following
PRISMA guidelines. Terwee et al. (2007) quality assessment was used
to evaluate the psychometric properties of the studies and
cross-cultural adaptation was assessed using criteria from
Guillemin et al. (1993). Seven studies met the inclusion criteria,
which included 4 scales: the Epworth Sleepiness Scales (ESS),
Insomnia Severity Index (ISI), Pittsburgh Sleep Quality Index
(PSQI) and Arabic Scale of Insomnia (ASI). Cross-cultural
adaptations scored between good and poor; psychometric properties
information was missing for most scales. The review suggested that
PSQI may be a useful scale to measure sleep disturbance, as the
scale show good cultural adaptation and acceptable psychometric
properties in an Arabic population. Further, the scales measures
seven different aspects of sleep quality. This review provides
options to help researchers and clinicians select the most
appropriate instrument for their practice. Further psychometric
testing and cultural adaptation is required for sleep scales used
in Arabic clinical populations to ensure validity and reliability
in outcome measurement for research studies.
Key words: Sleep disturbance; Arabic; Review; Cross-cultural
adaptation; Measurement properties.
Introduction:
Sleep disturbance is a significant stressor that can affect the
psychological and physical health along with quality of life in
healthy and clinical populations. The National Institutes of Health
defined sleep disturbance as ‘dysregulation of sleep homeostasis,
sleep deficiency, sleep fragmentation, insufficient sleep or
impairment of sleep quality or quantity caused by a sleep disorder’
(National Institutes of Health, 2011). In the general population,
the prevalence rate of sleep disturbance ranges between 36% to 50%
(Heinzer et al., 2015; Nowicki et al., 2016). However, research
studies have reported the prevalence of sleep disturbance among
specific clinical populations such as cancer to be between 50% to
75% (Ancoli-Israel, 2015; Berger et al., 2005), psychiatric
disorders between 40% to 50% (Walia and Mehra, 2016) and
cardiovascular disease between 47% to 83% (Kasai et al., 2012;
Matsuda et al., 2017)
Sleep disturbances are associated with decreased quality of life,
increased use of the health care resources and poor mood. This is
evidenced by surveys from the United States of America and 5
European countries which found that people with insomnia had
significantly worse health related quality of life compared to
people with normal sleep (DiBonaventura et al., 2015). Sleep
disturbance can cause changes in metabolism, (Zimberg et al.,
2012), immune pathways (Besedovsky et al., 2012) and endocrine
functioning (Morgan and Tsai, 2015). In addition, sleep disturbance
can significantly increase the risk of developing type 2 diabetes
(Holliday et al., 2013), hypertension (Anothaisintawee et al.,
2016) and increase the risk of cardiovascular disease (Khan and
Aouad, 2017). Moreover, it is important to note that consequences
of sleep disturbance is not confined to the individual; they may
also have implications at a societal level, in terms of public
health and economic burdens (Colten et al., 2006). Although
objective tests, such as polysomnography, are the gold standard for
diagnosing sleep disorders, such as sleep apnea, self-reporting
scales are a brief and efficient method to assess sleep disturbance
which provide information on the individuals’ assessment of their
sleep health.
Despite the 300 million people that have Arabic as their native
language across 27 countries in the Middle East and North Africa,
evidence regarding the prevalence of sleep disorders in this
population is limited (Horesh and Cotter, 2016). The main reason
for this may relate to the fact that there is a lack of sleep
disturbance scales that have been specifically developed for Arabic
speaking populations. Some studies have translated or back
translated English versions of scales into Arabic (Al-Abri et al.,
2013; Suleiman et al., 2012) and compared the psychometric
properties of the translated scale to the original version. There
can be linguistic problems in relation to idiomatic expressions
that may not be present in the target language and also a need to
adjust verb tenses. Hence, validating a scale for use in another
language and culture is not as simple as back translation (Acquadro
et al., 2014). In addition, the original scale may not take into
consideration cultural differences, such as sleeping habits
associated with decreases in afternoon activity and temporal
aspects of religious obligations (Ahmed, 2014; Epstein et al.,
2015); therefore researchers need to consider cross cultural
adaptation of scales (Beaton et al., 2000; Bullinger et al., 1998).
With this in mind, the aim of this review is to explore the
validity, reliability and cultural adaptation of sleep disturbance
scales that have been translated into Arabic or originally
developed in Arabic in order to identify scales that can be used in
research and clinical practice in Arabic countries.
Methods:
Sleep disturbance scales were identified by searches utilising the
following electronic databases CINAHL (1937 until
10/February/2019), MEDLINE (1946 to 10/February/2019), EMBASE (1980
until 10/February/2019), PsycINFO (1806 until 10/February/2019) and
Cochrane Library (1980 until 10/February/2019). The search method
used both text words and MeSH terms of the following terms :
"Medicine, Arabic" OR "Arabs" OR "Arabic", "Scales", "Instrument
Validation", "Psychometrics" OR "Measurement Issues and
Assessments", "Outcome Assessment" OR "Measurement Issues and
Assessments", "daytime sleepiness", "Narcolepsy", "Insomnia", sleep
disturbances, "Sleep" OR "Sleep Apnea, Central" OR "Sleep
Disorders, Circadian Rhythm" OR "Sleep Apnea, Obstructive" OR
"Sleep-Wake Transition Disorders" OR "Sleep Disorders, Intrinsic"
OR "Sleep Disorders". Footnote chasing was used to identify
articles by backwards chaining references that were cited in
previous studies on the topic (Booth et al., 2013). The time
setting was not restricted, and all papers published in English
were included.
Assessment of measurement:
Quality assessment of psychometric studies evaluating sleep
disturbance scales that had been translated into Arabic, or
originally developed in Arabic, were assessed using Terwee et
al's., (2007) quality assessment criteria for evaluating the
psychometric properties of scales and Guillemin et al’s (1993)
guidelines for cultural adaptation of scales. Psychometric
properties included 9 aspects: content validity, internal
consistency, criterion validity, construct validity,
reproducibility, responsiveness, floor and ceiling effects, and
interpretation. For each of these attributes, a specific criterion
was defined by which the instrument should be reviewed (See
table1). The 9 aspects were scored as: ‘+’ positive rating; ‘?’
indeterminate rating; ‘-’ negative rating; ‘0’ no information
available. Terwee et al., (2007) recommended presentation of the
results in tabular form, rather than utilising an overall quality
score. This is because an overall score would assume an equal
importance of each of the psychometric properties, but in practice
this is not the case.
For cross-cultural adaptation, studies were evaluated on the extent
to which they employed the five steps suggested by Guillemin et al.
(1993) that are used to preserve the sensibility of a scale and
target culture: (1) translations, (2) back-translations, (3)
committee review, (4) pre-testing and (5) re-examination of the
weighting of scores (See table 2).
Translation is recommended using at least two translators with
experience in the use of the scale to translate from the original
language to the target language (Brislin, 1970). Back-translation
comprises two independent translators translating the new target
language version back to the original language. This step is to
ensure that the translated new version reflects the content of the
original instrument. The third step constitutes a committee review
to develop the pre-final version for pre-testing. The fourth step
is pre-testing which seeks to test the pre-final version on 30-40
participants of the target population. The last step is weighting
scores, which is the consideration of the weights of scores to the
cultural context. Each step of Guillemin et al., (1993)was assessed
by applying one of the following scores: 1=poor, 2=moderate and
3=good. The overall score was calculated by taking the mean of the
items measured. However, if the instrument was developed in Arabic,
the steps of translation and back translation were not required,
therefore not be assessed. This review has been reported according
to the Preferred Reporting Items Systematic Reviews and
Meta-Analysis (PRISMA) guidelines (Moher et al., 2010).
Selected Criteria:
The studies were eligible for inclusion in the review according to
the following criteria. All types of study designs that validated
scales in terms of sleep disturbance in Arabic speaking population,
in participants over the age of 18 years. Protocol papers and
conference abstracts were excluded.
Results:
Studies selection:
The search strategy identified 55 studies, these were reviewed by
abstract and title (Figure 1 shows the PRISMA flow chart). Seven
studies met the inclusion criteria; these studies evaluated four
sleep disturbance scales. (See table 3 for description of the
studies included in the review).
Identification of Sleep Disturbance Scales
Three sleep disturbance scales had been translated into Arabic: The
Epworth Sleepiness Scales (ESS, (Ahmed, 2014; Ahmed et al., 2014;
Al-Abri et al., 2013), the Insomnia Severity Index (ISI, Ahmed,
2015; Suleiman and Yates, 2011) and the Pittsburgh Sleep Quality
Index (PSQI, (Suleiman et al., 2012; Suleiman et al., 2010). One
scale was originally developed in the Arabic language and validated
in an Arabic population – the Arabic Scale of Insomnia (ASI,
Abdel-Khalek, 2008).(See table 4 for description of scales)
Quality Assessment:
A critical evaluation of the psychometric reporting employed in the
studies was guided by the Terwee et al., (2007) checklist (see
table 5 for a summary of the evaluation). The review evaluated 7
studies, one of which reported on two scales (Ahmed, 2014).
Content validity was assessed in 5 studies and only 3 received a
positive rating (Ahmed et al., 2014; Al-Abri et al., 2013; Suleiman
et al., 2010), two of these were assessed as being intermediate in
reporting the content validity (Ahmed, 2014; Suleiman et al.,
2012). Internal consistency was tested in all the studies. Only two
studies achieved a positive score for criterion validity
(Abdel-Khalek, 2008; Suleiman and Yates, 2011). Six studies gave
information about construct validation, two with intermediate
ratings (Ahmed, 2014; Suleiman et al., 2012) and 4 with positive
ratings (Abdel-Khalek, 2008; Ahmed et al., 2014; Suleiman et al.,
2010; Suleiman and Yates, 2011). Agreement was assessed by one
study (Al-Abri et al., 2013), which received a positive score. Two
studies tested reliability (test-retest) and had positive score
ratings (Abdel-Khalek, 2008; Ahmed et al., 2014), whereas the
remaining 5 studies did not provide any information.
Responsiveness, floor and ceiling effects and interpretation were
not reported in any of the studies.
A description of how studies adequately addressed cross-cultural
adaptation is provided in table 6. Three of the sleep scales had
been translated from English to Arabic (PSQI, ISI and ESS) and only
one, the ASI, was originally developed in Arabic and validated in
an Arabic population. The method of cultural adaptation, reported
in 4 studies, was heterogeneous. Only one study (Suleiman et al.,
2010) solely relied on translation from source to target language,
whereas the PSQI employed all five steps of the cultural adaptation
and scored 3 overall, indicating that cross-cultural adaptation was
adequately addressed. One study involving the Insomnia Severity
Index (ISI) included a four-step process but failed to complete
committee review. The ISI translation achieved a moderate weighting
score of 1.5 overall. Studies investigating the Epworth Sleepiness
Scale (ESS) reported the translations and back-translations steps
only and received a score of between 1.3 to 3 of the three studies
(Ahmed et al., 2014; Ahmed, 2014; Al-Abri et al., 2013).
Pittsburgh Sleep Quality Index (PSQI):
The Pittsburgh Sleep Quality Index (PSQI) is a 19-item instrument
designed to measure different aspects of sleep quality and sleep
disturbance (Buysse et al., 1989). The scale reports sleep quality
over the previous month. The PSQI is divided into 7 subscale
scores: subjective sleep quality, sleep latency (time of full
sleep), duration, habitual sleep efficiency (proportion between
total sleep time and time in bed), sleep disturbances (waking up
during the night), use of sleeping medication, and daytime
dysfunction (difficulty staying awake during daytime). The overall
score ranges between 0-21, and a high score indicates poor sleep
quality.
The original PSQI was tested with psychiatric patients and also
with a normal population; it has been validated in different
clinical populations, such as cancer (Beck et al., 2004; Otte et
al., 2013), renal transplant (Burkhalter et al., 2011), diabetes
(Cunha et al., 2008), multiple sclerosis (Moreira et al., 2008),
rheumatoid arthritis (Nicassio et al., 2014), heterogeneous
patients (Carpenter and Andrykowski, 1998) and those with sleep
disorders (Backhaus et al., 2002).
The PSQI has been translated into 51 languages including: Spanish
(Hita-Contreras et al., 2014), Sinhala (Anandakumar et al., 2016),
Hungarian (Takács et al., 2016), Chinese (Lin et al., 2016),
Persian (Chehri et al., 2016) and German (Lang et al., 2017).
Suleiman et al., (2010) translated the PSQI into Arabic and tested
it in 35 healthy Arabic bilingual participants. The original PSQI
(Buysse et al., 1989) showed very good internal consistency
(Cronbach's alpha = .89); however, the translated Arabic PSQI
demonstrated borderline acceptability (Cronbach's alpha = .65)
(Buysse et al., 1989). Another psychometric study that investigated
the PSQI in Arabic cardiac patients found the internal consistency
reliability was 0.74 (Suleiman et al., 2012). Based on the review
findings, the PSQI meets the quality assessment criteria for
content, construct validity, internal consistency; however,
criterion validity agreement, reliability, responsiveness, floor
and ceiling effects and interpretation have not been reported. The
translation process of PSQI Arabic was deemed to be good, as it
reported the five steps, recommended by Guillemin et al. (1993),
resulting in a score of 3.
Epworth Sleepiness Scales (ESS)
Epworth Sleepiness Scales (ESS) was constructed to assess the
daytime sleepiness and diagnose sleep disorders using 8 items on a
4-point Likert scale (Johns, 1991). The total score ranged between
0 (normal sleep) to 24 (very sleepy). Johns (1991) reported the
internal consistency as reliable (Cronbach's alpha = 0.72). The
scale was translated and validated into Greek (Tsara et al., 2004),
Korean (Cho et al., 2011), Norwegian (Beiske et al., 2009),
Portuguese (Bertolazi et al., 2009) and Dutch (Sauter et al.,
2007).
Three studies tested the psychometric properties of ESS in an
Arabic population. Ahmed et al., (2014) involved 90 healthy
participants from Saudi Arabia and found the internal consistency
to be very good (Cronbach's alpha= 0.86) and test-retest
reliability was r=0.86. The results seem to indicate that the
Arabic version of the ESS is a reliable and valid measure of
daytime sleepiness, however the author only addressed two aspects
of cross cultural adaptation (Translation and Back
Translation).
Al-Abri et al., (2013) use the Bland Altman plot method in
validating the ESS in an Arabic population. The researchers
recruited 97 healthy, bilingual (English/Arabic) participants to
compare the Arabic translated version with the original English
version. Pearson's correlation coefficients among the two versions
was excellent at r=0.914. Furthermore, Bland and Altman's method
indicated an agreement between the Arabic and English versions of
ESS. The cross-cultural adaptation was not adapted for all 5 steps,
with only two steps reported, translated and back translated, with
an overall score of 2.
Ahmed, (2015) conducted a further validation study with 83 Sudanese
participants and reported acceptable internal consistency
reliability of the Arabic version (Cronbach's alpha = 0.84). Three
steps were used to address cultural adaption, but the author did
not provide a detailed description of the process. The overall
score was 1.3.
Based on the review findings, the ESS meets the quality assessment
criteria for content, construct validity, internal consistency,
agreement, reliability; however, criterion validity responsiveness,
floor and ceiling effects interpretation has not been assessed.
Cross cultural adaptation of the Arabic version of the ESS only
employed the translation process by back-translation, and there was
very little description about committee approach step.
Insomnia Severity Index (ISI)
The Insomnia Severity Index (ISI) is a 7 item, 5-point Likert scale
with three subscales assessing the nature, severity and impact of
insomnia, over the last two weeks. Overall scores range between
0-28, with higher scores indicating more severe insomnia. The total
score can be categorized as follows: 0-7 (absence of insomnia);
8-14 (sub-threshold insomnia); 15-24 (moderate insomnia) and 22-28
(severe insomnia). Bastien et al., (2001) recommended that a
cut-off score of 10 should be used for categorising insomnia in
community sample.
Suleiman and Yates, (2011) validated the ISI with 35 healthy Arabic
participants; the internal consistency reliability was acceptable
(Cronbach's alpha = 0.84), however the sample size was very small
(n=35). Additionally, Ahmed, (2015) validated the ISI with 83
Sudanese participants, resulting in acceptable internal consistency
(Cronbach's alpha = 0.87). The ISI was originally developed to
assist with the clinical evaluation of patients. However, the
samples of both studies were healthy participants which may have
implications for generalisation of findings. It would appear that
in a healthy population, the ISI meets the quality assessment
criteria for content, criterion, construct validity, internal
consistency, but further work is needed in agreement, reliability,
responsiveness, floor and ceiling effects and interpretation in
clinical populations. The cultural adaptation of ISI translation
process fulfilled 3 steps of the Guillermin et al (1993)
guidelines; translation process by back-translation and committee
approach steps was reported. However, the steps were not clearly
described and overall score was 2.3 for Suleiman and Yates (2011)
and 1.3 for Ahmed (2014).
Arabic Scale of Insomnia (ASI)
The Arabic Scale of Insomnia (ASI) was developed by Abdel-Khalek,
(2008) for measuring sleep disorders amongst the Arabic population,
as an Arabic version. The ASI comprises 12-items. Participants were
asked to respond to each item on a 5-point Likert scale. The scales
of ASI were translated into English. The study involved three
categories from the general population: adolescents (n= 5044),
college students (2210) and government employees (1247). The
internal consistency reliability of ASI was acceptable (Cronbach's
alpha between = 0.84 to 0.87). The scale was test-retested after
one week and the Cronbach's alpha was between 0.70 and 0.90. Based
on the review findings, the ASI meets the quality assessment
criteria for content, criterion, construct validity, internal
consistency reliability. Further work is needed in agreement,
responsiveness, floor and ceiling effects, and interpretation has
not been reported. The ASI was translated into English but the
cross-cultural adaption cannot be assessed, as the author did not
give details about the process. The scale received an overall score
of 1.5 in translation to English.
Discussion
Measuring sleep disturbance is important as it is a problem that
can involve complex aetiologies with many associated factors (Roth,
2007), such as comorbid depression or pain. Understanding the links
between such comorbidities and sleep disturbance is critically
important; thus, reliable and valid scales to assess sleep
disturbances are needed. There are a limited number of scales
developed and validated in the Arabic population that measure sleep
quality. Devine et al., (2005) published a literature review to
identify scales that were used to measure sleep dysfunction and
found twenty-two that were validated in English, however the
present review found only 4 scales that have been validated in an
Arabic population. Most of these scales have been tested
psychometrically in English clinical populations, such as dementia,
cancer, back pain (Marin et al., 2006; Savard M et al., 2005; Spira
et al., 2012). It is apparent from the results of the current
review that there is need to validate these scales further, using
Arabic clinical populations, in order to assess their utility,
psychometric properties and cross-cultural sensitivity.
The cross-cultural adaptation of studies included in the review was
evaluated by Guillemin criteria of forward translation,
back-translations, testing on lay panels, committee review, and
re-examination of the weighting of scores (Guillemin et al., 1993).
However, after assessing each study using the Guillemin criteria,
it was apparent that most of the studies in this review were
lacking in some of the steps involved in cross-cultural adaptation.
Only one study, using the PSQI (Suleiman et al., 2010), employed
all five steps proposed by Guillemin. It is suggested that
researchers consider employing Guillemin’s guidelines to translate
sleep disturbance scales into Arabic to maximize the linguistic
structure and cultural equivalence of the instrument. This may help
to ensure that the sensitivity of such scales is not contaminated
or confounded by cultural or language differences within the target
population, and that the instrument measures what it was originally
intended to measure.
This review used guidelines by Terwee et al., (2007) to evaluate
the psychometric properties of the scales. The importance of this
was to identify the strengths and weaknesses of the scales.
Analysis indicated that comprehensive psychometric testing of sleep
disturbance scales to be used in the Arabic population has not been
fully realized; indeed, many psychometric properties remain
untested. This is worrisome as it can impact on the
generalizability and interpretation of the outcome measures. Hence,
clinicians and researchers need to exercise caution in using these
sleep scales, as there is uncertainty in whether they can be useful
in their Arabic cultural setting and with clinical
populations.
In terms of selecting the best instrument to measure sleep
disturbance, there are several factors that need to be taken into
consideration for clinical practice or research purposes (Cole et
al., 2007). It is necessary to determine what exactly the
questionnaire is deemed to measure, for example, to diagnose,
screen, or monitor sleep disorder in specific population clinical
or nonclinical. Secondly, the instrument scoring system such as
only PSQI have seven subscales and each one has own scores whereas
the other three scales (ISI, ESS and ASI) did not have subscales
which calculated total. Thirdly, how sleep has been defined in
terms of being insomnia or poor sleep. In addition, the length of
the scale, duration over which insomnia or poor sleep is measured
(presently, last month or last two weeks), assessment of dimensions
(subscales needed to measure), validity, reliability and cultural
adaption of the scale are important considerations.
This review found that most of the studies had incomplete reporting
of the cross-cultural adaptation and translation steps and this may
have resulted in imprecise use of Arabic terminology. Only the PSQI
had reported all steps of cultural adaptation recommended by
Guillemin et al., (1993). Moreover, no study met all of the Tewee’s
quality assessment criteria that evaluated the psychometric
properties of sleep related measurements (Terwee et al., 2007).
Further assessment is needed regarding criterion validity,
reliability, agreement, floor and ceiling effects, responsiveness
and interpretation (Table 4). The psychometrics properties of the
instrument may vary due to the different populations and settings
(Streiner et al., 2015). Investigating different populations and
using different methods may help to improve the psychometrics
properties of the instrument. The judgment of the instrument use
depends on the purpose and applicability in terms of population,
disease and treatment (Hyland, 2002).
The results of the review indicated that the PSQI, which measures
different aspects of sleep disturbance, with seven components has
been culturally adapted for use in Arabic population. The PSQI was
found to have acceptable psychometric properties and, may be of use
in Arabic clinical or non-clinical populations, but further testing
is required in criterion validity, agreement, reliability,
responsiveness, floor and ceiling effects and interpretation.
Conclusion:
The review has provided information for researchers and clinicians
to facilitate the choice between the existing Arabic versions of
sleep disturbance scales for certain applications in specific
populations and countries. This review showed inadequate evidence
of psychometric and cultural adaptation of the scales; it is
therefore recommended that further investigation of the performance
of Arabic versions of sleep disturbance measurement properties is
carried out.
References:
Abdel-Khalek, A.M. The development and validation of the Arabic
Scale of Insomnia (ASI). Sleep Hypn., 2008, 10: 3–10.
Acquadro, C., Bayles, A., Juniper, E. Translating patient-reported
outcome measures: a multi-step process is essential. J. Bras.
Pneumol., 2014, 40: 211–212.
Ahmed, A.E. Validation of Arabic versions of three sleep surveys.
Qatar Med. J., 2014, 20: 130–137.
Ahmed, A.E., Fatani, A., Al-Harbi, A., et al. Validation of the
Arabic version of the Epworth Sleepiness Scale. J. Epidemiol. Glob.
Health, 2014, 4: 297–302.
Al-Abri, M., Al-Hamhami, A., Al-Nabhani, H., Al-Zakwani, I.
Validation of the arabic version of the epworth sleepiness scale in
oman. Oman Med. J., 2013, 28: 454.
Anandakumar, D., Dayabandara, M., Ratnatunga, S.S., Hanwella, R.,
de Silva, V.A. Validation of the Sinhala version of the Pittsburgh
Sleep Quality Index. Ceylon Med. J., 2016, 61:.
Ancoli-Israel, S. Sleep Disturbances in Cancer: A Review. Sleep
Med. Res., 2015, 6: 45–49.
Anothaisintawee, T., Reutrakul, S., Van Cauter, E., Thakkinstian,
A. Sleep disturbances compared to traditional risk factors for
diabetes development: systematic review and meta-analysis. Sleep
Med. Rev., 2016, 30: 11–24.
Backhaus, J., Junghanns, K., Broocks, A., Riemann, D., Hohagen, F.
Test–retest reliability and validity of the Pittsburgh Sleep
Quality Index in primary insomnia. J. Psychosom. Res., 2002, 53:
737–740.
Bastien, C.H., VallieÁres, A., Morin, C.M. Validation of the
Insomnia Severity Index as an outcome measure for insomnia
research. Sleep Med., 2001, 2: 297–307.
Beaton, D.E., Bombardier, C., Guillemin, F., Ferraz, M.B.
Guidelines for the process of cross-cultural adaptation of
self-report measures. Spine, 2000, 25: 3186–3191.
Beck, S.L., Schwartz, A.L., Towsley, G., Dudley, W., Barsevick, A.
Psychometric evaluation of the Pittsburgh sleep quality index in
cancer patients. J. Pain Symptom Manage., 2004, 27: 140–148.
Beiske, K.K., Kjelsberg, F.N., Ruud, E.A., Stavem, K. Reliability
and validity of a Norwegian version of the Epworth sleepiness
scale. Sleep Breath., 2009, 13: 65–72.
Berger, A.M., Parker, K.P., Young-McCaughan, S., et al. Sleep/Wake
Disturbances in People With Cancer and Their Caregivers: State of
the Science. Oncology nursing forum (2005).
Bertolazi, A.N., Fagondes, S.C., Hoff, L.S., et al.
Portuguese-language version of the Epworth sleepiness scale:
validation for use in Brazil. J. Bras. Pneumol., 2009, 35:
877–883.
Besedovsky, L., Lange, T., Born, J. Sleep and immune function.
Pflüg. Arch.-Eur. J. Physiol., 2012, 463: 121–137.
Booth, A., Harris, J., Croot, E., Springett, J., Campbell, F.,
Wilkins, E. Towards a methodology for cluster searching to provide
conceptual and contextual “richness” for systematic reviews of
complex interventions: case study (CLUSTER). BMC Med. Res.
Methodol., 2013, 13: 118.
Brislin, R.W. Back-Translation for Cross-Cultural Research. J.
Cross-Cult. Psychol., 1970, 1: 185–216.
Bullinger, M., Alonso, J., Apolone, G., et al. Translating health
status questionnaires and evaluating their quality: the IQOLA
project approach. J. Clin. Epidemiol., 1998, 51: 913–923.
Burkhalter, H., Sereika, S.M., Engberg, S., Wirz-Justice, A.,
Steiger, J., De Geest, S. Validity of 2 sleep quality items to be
used in a large cohort study of kidney transplant recipients. Prog.
Transplant., 2011, 21: 27–35.
Buysse, D.J., Reynolds, C.F., Monk, T.H., Berman, S.R., Kupfer,
D.J. The Pittsburgh Sleep Quality Index: a new instrument for
psychiatric practice and research. Psychiatry Res., 1989, 28:
193–213.
Carpenter, J.S., Andrykowski, M.A. Psychometric evaluation of the
Pittsburgh sleep quality index. J. Psychosom. Res., 1998, 45:
5–13.
Chehri, A., Kiamanesh, A., Ahadi, H., Khazaie, H. Psychometric
properties of the Persian version of Sleep Hygiene Index in the
general population. Iran. J. Psychiatry Behav. Sci., 2016,
10:.
Cho, Y.W., Lee, J.H., Son, H.K., Lee, S.H., Shin, C., Johns, M.W.
The reliability and validity of the Korean version of the Epworth
sleepiness scale. Sleep Breath., 2011, 15: 377–384.
Cole, J.C., Dubois, D., Kosinski, M. Use of patient-reported sleep
measures in clinical trials of pain treatment: a literature review
and synthesis of current sleep measures and a conceptual model of
sleep disturbance in pain. Clin. Ther., 2007, 29: 2580–2588.
Colten, H.R., Altevogt, B.M., Institute of Medicine (U.S.),
Committee on Sleep Medicine and Research Sleep disorders and sleep
deprivation: an unmet public health problem. Institute of Medicine:
National Academies Press, Washington, D.C. , 2006.
Cunha, M.C.B. da, Zanetti, M.L., Hass, V.J. Sleep quality in type 2
diabetics. Rev. Lat. Am. Enfermagem, 2008, 16: 850–855.
Devine, E.B., Hakim, Z., Green, J. A Systematic Review of
Patient-Reported Outcome Instruments Measuring Sleep Dysfunction in
Adults: PharmacoEconomics, 2005, 23: 889–912.
DiBonaventura, M., Richard, L., Kumar, M., Forsythe, A., Flores,
N.M., Moline, M. The association between insomnia and insomnia
treatment side effects on health status, work productivity, and
healthcare resource use. PloS One, 2015, 10: e0137117.
Epstein, J., Santo, R.M., Guillemin, F. A review of guidelines for
cross-cultural adaptation of questionnaires could not bring out a
consensus. J. Clin. Epidemiol., 2015, 68: 435–441.
Guillemin, F., Bombardier, C., Beaton, D. Cross-cultural adaptation
of health-related quality of life measures: Literature review and
proposed guidelines. J. Clin. Epidemiol., 1993, 46:
1417–1432.
Heinzer, R., Vat, S., Marques-Vidal, P., et al. Prevalence of
sleep-disordered breathing in the general population: the HypnoLaus
study. Lancet Respir. Med., 2015, 3: 310–318.
Hita-Contreras, F., Martínez-López, E., Latorre-Román, P.A.,
Garrido, F., Santos, M.A., Martínez-Amat, A. Reliability and
validity of the Spanish version of the Pittsburgh Sleep Quality
Index (PSQI) in patients with fibromyalgia. Rheumatol. Int., 2014,
34: 929–936.
Holliday, E.G., Magee, C.A., Kritharides, L., Banks, E., Attia, J.
Short sleep duration is associated with risk of future diabetes but
not cardiovascular disease: a prospective study and meta-analysis.
PloS One, 2013, 8: e82305.
Horesh, U., Cotter, W.M. Current Research on Linguistic Variation
in the Arabic-Speaking World. Lang. Linguist. Compass, 2016, 10:
370–381.
Hyland, M.E. Recommendations from quality of life scales are not
simple, 2002.
Johns, M.W. A New Method for Measuring Daytime Sleepiness: The
Epworth Sleepiness Scale. Sleep, 1991, 14: 540–545.
Kasai, T., Floras, J.S., Bradley, T.D. Sleep apnea and
cardiovascular disease: a bidirectional relationship. Circulation,
2012, 126: 1495–1510.
Khan, M.S., Aouad, R. The effects of insomnia and sleep loss on
cardiovascular disease. Sleep Med. Clin., 2017, 12: 167–177.
Lang, C., Brand, S., Holsboer-Trachsler, E., Pühse, U., Colledge,
F., Gerber, M. Validation of the German version of the short form
of the dysfunctional beliefs and attitudes about sleep scale
(DBAS-16). Neurol. Sci., 2017, 38: 1047–1058.
Lin, K.Y., Frawley, H.C., Denehy, L., Feil, D., Granger, C.L.
Exercise interventions for patients with gynaecological cancer: a
systematic review and meta-analysis. Physiotherapy, 2016, 102:
309–319.
Marin, R., Cyhan, T., Miklos, W. Sleep disturbance in patients with
chronic low back pain. Am. J. Phys. Med. Rehabil., 2006, 85:
430–435.
Matsuda, R., Kohno, T., Kohsaka, S., et al. The prevalence of poor
sleep quality and its association with depression and anxiety
scores in patients admitted for cardiovascular disease: A
cross-sectional designed study. Int. J. Cardiol., 2017, 228:
977–982.
Moher, D., Liberati, A., Tetzlaff, J., Altman, D.G. Preferred
reporting items for systematic reviews and meta-analyses: the
PRISMA statement. Int. J. Surg., 2010, 8: 336–341.
Moreira, N.C.V., Damasceno, R.S., Medeiros, C.A.M., et al. Restless
leg syndrome, sleep quality and fatigue in multiple sclerosis
patients. Braz. J. Med. Biol. Res., 2008, 41: 932–937.
Morgan, D., Tsai, S.C. Sleep and the endocrine system. Crit. Care
Clin., 2015, 31: 403–418.
Morin, C.M. Insomnia: Psychological assessment and management.
Guilford Press, New York, NY, US , 1993.
National Institutes of Health National institutes of health sleep
disorders research plan. National Institutes of Health, Maryland,
USA , 2011.
Nicassio, P.M., Ormseth, S.R., Custodio, M.K., Olmstead, R.,
Weisman, M.H., Irwin, M.R. Confirmatory factor analysis of the
Pittsburgh Sleep Quality Index in rheumatoid arthritis patients.
Behav. Sleep. Med., 2014, 12: 1–12.
Nowicki, Z., Grabowski, K., Cubaa, W.J., et al. Prevalence of
self-reported insomnia in general population of Poland. Psychiatr.
Pol., 2016, 50: 165–173.
Otte, J.L., Rand, K.L., Carpenter, J.S., Russell, K.M., Champion,
V.L. Factor Analysis of the Pittsburgh Sleep Quality Index in
Breast Cancer Survivors. J. Pain Symptom Manage., 2013, 45:
620–627.
Roth, T. Insomnia: definition, prevalence, etiology, and
consequences. J. Clin. Sleep Med. JCSM Off. Publ. Am. Acad. Sleep
Med., 2007, 3: S7.
Sauter, C., Popp, R., Danker-Hopfe, H., et al. Normative values of
the German Epworth sleepiness scale. Somnologie-Schlafforschung
Schlafmed., 2007, 11: 272–278.
Savard M, Savard J, Simard S, Ivers H Empirical validation of the
Insomnia Severity Index in cancer patients. Psychooncology., 2005,
14: 429–441.
Spira, A.P., Beaudreau, S.A., Stone, K.L., et al. Reliability and
Validity of the Pittsburgh Sleep Quality Index and the Epworth
Sleepiness Scale in Older Men. J. Gerontol. Ser. A, 2012, 67A:
433–439.
Streiner, D.L., Norman, G.R., Cairney, J. Health Measurement
Scales: A Practical Guide to Their Development and Use. Oxford
University Press, Oxford , 2015.
Suleiman, K., Hadid, L.A.-, Duhni, A. Psychometric Testing of the
Arabic version of the Pittsburgh Sleep Quality Index (A-PSQI) among
Coronary Artery Disease Patients in Jordan. 2012, 6.
Suleiman, K.H., Yates, B.C. Translating the insomnia severity index
into Arabic. J. Nurs. Scholarsh., 2011, 43: 49–53.
Suleiman, K.H., Yates, B.C., Berger, A.M., Pozehl, B., Meza, J.
Translating the Pittsburgh sleep quality index into Arabic. West.
J. Nurs. Res., 2010, 32: 250–268.
Takács, J., Bódizs, R., Ujma, P.P., Horváth, K., Rajna, P., Harmat,
L. Reliability and validity of the Hungarian version of the
Pittsburgh Sleep Quality Index (PSQI-HUN): comparing psychiatric
patients with control subjects. Sleep Breath., 2016, 20:
1045–1051.
Terwee, C.B., Bot, S.D., de Boer, M.R., et al. Quality criteria
were proposed for measurement properties of health status
questionnaires. J. Clin. Epidemiol., 2007, 60: 34–42.
Tsara, V., Serasli, E., Amfilochiou, A., Constantinidis, T.,
Christaki, P. Greek version of the Epworth sleepiness scale. Sleep
Breath., 2004, 8: 91–95.
Walia, H.K., Mehra, R. Overview of common sleep disorders and
intersection with dermatologic conditions. Int. J. Mol. Sci., 2016,
17: 654.
Zimberg, I.Z., Dâmaso, A., Del Re, M., et al. Short sleep duration
and obesity: mechanisms and future perspectives. Cell Biochem.
Funct., 2012, 30: 524–529.
17
Property
Definition
Content validity
+
?
-
2
Internal
Consistency
+
?
-
Cronbach's alpha (s)! 0.70 or O0.95, despite adequate design and
method;
0
3
Criterion validity
+
?
-
Correlation with gold standard! 0.70, continuous adequate design
and method;
0
4
Construct validity
+
?
-
Less than 75% of hypotheses were confirmed, despite adequate design
and Methods;
0
5
Reproducibility
5.1. Agreement
+
?
-
MIC> SDC OR MIC equals or inside LOA, despite adequate design
and method;
0
5.2. Reliability
+
?
-
ICC or weighed Kappa! 0.70, despite adequate design and
method;
0
6
Responsiveness
Clinically important changes over time
+
?
-
SDC or SDC> MIC OR MIC equals or inside LOA OR RR <1.96 OR
AUC! 0.70, despite adequate design and methods
0
7
Floor and ceiling Effects
+
?
-
<15% of the respondents achieved the highest or lowest possible
scores, strict adequate design and methods;
0
8
Interp4retatability
+
?
Doubtful design or method OR less than four subgroups OR no MIC
defined;
0
No information found on interpretation.
MIC= minimal important change; SDC=smallest detectable change;
LOA=limits of agreement; ICC=Intraclass correlation; SD, standard
deviation.
a + = positive rating; ?=indeterminate rating; - =negative rating;
0=no information available.
b Doubtful design or method= lacking of a clear description of the
design or methods of the study, sample size smaller than 50
subjects (should be at
least 50 in every (subgroup) analysis), or any important
methodological weakness in the design or execution of the
study.
* This table adapted from Terwee et al. 2007 page 39
Table 2: Guidelines to preserve equivalence in cross-cultural
adaptation of measures
Translation Technique
Back-Translation Technique
Committee Approach
Use appropriate back-translators
Membership of the committee should be multidisciplinary
Use structured techniques to resolve discrepancies Modify
instructions or format, modify/reject inappropriate items, generate
new items
Ensure that the translation is fully comprehensible
Verify cross-cultural equivalence of source and final
versions
Check for equivalence in source and final versions using a pre-test
technique Either use a probe technique
Or submit the source and final versions to bilingual lay
people
Immigrants: Choose the language of administration or use a
dual-format measure
Consider adapting the
* This table adapted from Guillemin et al. 1993 page 1422
Table 3: Characteristics of the Studies Including the Review
Scale with references
(Suleiman et al., 2010)
Cronbach’s α ranges from .36 to .84
PSQI (Suleiman et al., 2012)
Jordan
Cross-sectional
Insomnia Severity Index(ISI) (Suleiman and Yates, 2011)
USA
Cross-sectional
Cronbach’s α ranges from .49 to 0.92
ISI (Ahmed, 2014)
Epworth Sleepiness Scale (ESS) (Ahmed et al., 2014)
Saudi Arabia
Re-test 0.98
Oman
Cross-sectional
NR
Arabic Scale of Insomnia (ASI) (Abdel-Khalek, 2008)
Kuwait
Cross-sectional
Very large sample
Author tested in on other study in clinical population
Table 4: Description of the identified Sleep Disturbance Scales
(Arabic Version)
Instrument
Item
19
0 to 3
1-5 good sleeper
5-21 poor sleeper
No subscale
8
0
No subscale
Less than 5 min
12
0
5 min
Past month
Table 5: The assessment of measurement properties of Sleep
Disturbance Scales
Instrument
+
+
?
+
+
+
(Abdel-Khalek, 2008)
Study and instrument
3
3
3
3
3
3
2
2
NR
NR
NR
2
(Abdel-Khalek, 2008)
2
1
NR
NR
NR
1.5
Rating: 3 = good, 2 = moderate, 1 = poor, NR = information not
reported, NA: not applicable
17