34
June 18, 2022 Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. [email protected] RCMAR/EXPORT September 15, 2008, 3-4pm http://www.chime.ucla.edu/measurement/measureme nt.htm

Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. [email protected] RCMAR/EXPORT September 15, 2008, 3-4pm

Embed Size (px)

Citation preview

Page 1: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

April 21, 2023

Construction and Evaluation of

Multi-item Scales

Ron D. Hays, Ph.D. [email protected]

RCMAR/EXPORT

September 15, 2008, 3-4pm

http://www.chime.ucla.edu/measurement/measurement.htm

Page 2: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

2

http://www.ispor.org/TaskForces/PROInstrumentsUse.asp

Developing a new PRO instrument is a labor and time intensive task.

Use of existing instruments is generally preferable to developing a new instrument or modifying an instrument.

If a PRO instrument is modified, additional validation studies may be needed to confirm the adequacy of the modified instrument’s measurement properties. The extent of additional validation recommended depends on the type of modification made.

The FDA intends to consider a modified instrument as a different instrument from the original and will consider measurement properties to be version-specific.

http://www.fda.gov/cder/guidance/6910dft.htm

Page 3: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

3

End goal is measure that is “Psychometrically Sound”

• Same people get same scores

• Different people get different scores and differ in the way you expect

• Measure works the same way for different groups (age, gender, race/ethnicity)

• Measure is practical

Page 4: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

4

Item-scale correlation matrixItem-scale correlation matrix

Depress Anxiety Anger Item #1 0.50* 0.50 0.50 Item #2 0.50* 0.50 0.50 Item #3 0.50* 0.50 0.50 Item #4 0.50 0.50* 0.50 Item #5 0.50 0.50* 0.50 Item #6 0.50 0.50* 0.50 Item #7 0.50 0.50 0.50* Item #8 0.50 0.50 0.50* Item #9 0.50 0.50 0.50* *Item-scale correlation, corrected for overlap.

Page 5: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

5

Item-scale correlation matrixItem-scale correlation matrix

Depress Anxiety Anger Item #1 0.80* 0.20 0.20 Item #2 0.80* 0.20 0.20 Item #3 0.80* 0.20 0.20 Item #4 0.20 0.80* 0.20 Item #5 0.20 0.80* 0.20 Item #6 0.20 0.80* 0.20 Item #7 0.20 0.20 0.80* Item #8 0.20 0.20 0.80* Item #9 0.20 0.20 0.80* *Item-scale correlation, corrected for overlap.

Page 6: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

6

Correlation Negative Positive

Small −0.3 to −0.1 0.1 to 0.3

Medium −0.5 to −0.3 0.3 to 0.5

Large −1.0 to −0.5 0.5 to 1.0

Construct validity• Requires prior hypotheses about direction

and size of associations of measure with other variables, e.g.,– Physical functioning will be negatively

associated with age and the association will be of medium size.

Page 7: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

7

Documenting content validity

+ Chronology of all item development activities+ Protocols for qualitative interviews, focus groups, cognitive interviewsand other research used to identify concepts, generate items, or revisean existing instrument, including training of interviewers+ Development of response options, modes of administration and scoring+ Size, characteristics, location, and (if requested) transcripts of eachqualitative interview and focus group+ Documentation on how saturation was achieved (i.e. no new informationwas obtained from additional qualitative interviews or focus groups)+ Description of any pilot test, including cognitive interviewing, cognitiveinterview transcripts (if requested)+ Versions of the instrument at various milestones of development+ Item tracking table that list the source of each item in the final instrument, and how it changed during development+ A summary statement of qualitative research in support of contentvalidity of the PRO instrument

D. Patrick et al, Value in Health 2007, 10, S125-37

Page 8: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

8

30 Second Break

Page 9: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

9

First law of survey development:Do not attempt it at home

Page 10: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

10

Second law: Know thy respondent

Page 11: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

11

Third law: Practice before you play

“Cut and try, see how it looks and sounds, see how people react to it, and then cut again, and try again” Converse & Presser (1986, p. 78)

Identify problems with

– Comprehension of items (stem/response options)– Retrieval of information– Skip patterns– Response burden

Page 12: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

12

Fourth law: Keep it simple and short

Page 13: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

13

Use only as many words, items and response options as needed

Page 14: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

14

Survey Instructions

Thank you for taking the time to fill out this survey. The purpose of this survey is to learn about your experiences as a cancer patient. The information you provide is very important. It will help to improve cancer services for other patients.

Many of the questions ask about your experiences at your “Cancer Center.” A Cancer Center refers to the hospital, center, or institute where you receive most of your cancer care. A Cancer Center also refers to the doctors, nurses, and other health care professionals who work with the hospital, center, or institute. In some places, the Cancer Center is all in one building. In other places the doctors, nurses, and other health care professionals who work with the Cancer Center are in different locations.

Many of the questions ask about your “Cancer Care Team.” A cancer care team refers to the doctors, nurses, and other health care professionals who provide your cancer care. Your cancer care team might also include social workers, counselors, patient navigators and others who help with your cancer care.

You may feel that some questions are easier to answer and some are harder to answer. Please remember that there are no right or wrong answers! We want to know about your experiences, both good and bad. If you truly do not know the answer to a question, then it is okay to check the box that says “Don’t know.”

Page 15: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

15

Modified

The purpose of this survey is to learn about your experiences as a cancer patient, both good and bad.

Some of the questions ask about your “Cancer Center” and others ask about your “Cancer Care Team.”

A Cancer Center is the hospital, center, or institute where you receive most of your cancer care and includes all the health care professionals who work there.

A Cancer Care Team refers to the doctors, nurses, and

other health care professionals (social workers, counselors, patient navigators, etc.) who help provide your cancer care.

Page 16: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

16

Item stem

During the PAST MONTH, how often have you had trouble sleeping because you…

Cannot get to sleep within 30 minutes?

Wake up in the middle of the night or early morning?

Have to get up to use the bathroom?

Page 17: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

17

Use a recall period that is as short as possible, but long enough

• Now

• Last 24 hours

• Last 4 weeks versus last month

• Last 6 months

• Last 12 months

Page 18: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

18

Response options

• None of the time

• A little of the time

• Some of the time

• Most of the time

• All of the time

A good bit of the time

Page 19: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

19

3-5 response options is enough

Strongly correlated with items when administered using more responses options

Miller, D. G. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychology Review, 2, 81-96.

Page 20: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

20

Fifth law: Believe the survey respondent, but only so much

Page 21: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

21

Page 22: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

22

Cognitive interview probe

• Questions 20 and 21 were similar (read both questions again).

• Which answer choices were easiest for you? Never to always or Poor to Excellent?

• Why?

• Do you think these two questions measure the same thing?

Page 23: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

23

The “age of item response theory”

• Unlimited number of items in the bank

• Different items administered to different people (tailored)

• Evaluation of how well each person fits the underlying model

Page 24: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

24

Item characteristic Curves (ICCs)

Models, in probabilistic terms, the relationship between a person’s level on the underlying construct and his/her response to a question.

{Thanks to Bryce Reeve for IRT analyses summarized in the next set of slides.}

Page 25: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

25

ICCs tell us for each item

• The appropriate number of response categories

• Where the person needs to be located on the underlying construct to endorse each response category

Page 26: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

26

0.00

0.25

0.50

0.75

1.00

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Pro

bability

ExcellentPoor

Never

B16. …your follow-up care doctor listen carefully to you?

SometimesUsually

Always

CAHPS Item

Page 27: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

27

0.00

0.25

0.50

0.75

1.00

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Pro

bability

ExcellentPoor

Never

B25. …you leave your follow-up care doctor’s office or clinic with unanswered questions related to your cancer?

Sometimes

Usually

Always

New Item

Page 28: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

28

0

50

100

150

200

250

300

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Frequency

0.00

0.25

0.50

0.75

1.00

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Probability

ExcellentPoor

B16*. …your follow-up care doctor listen carefully to you?

Page 29: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

29

Information curves

Information curves indicate the range over the measured construct where an item (or scale) is best at differentiating among individuals. Higher information denotes more precision for measuring a person’s perception of communication.

Page 30: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

30

0.00

0.50

1.00

1.50

2.00

2.50

3.00

3.50

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Information

B16*. …your follow-up care doctor listen carefully to you?

0.00

0.25

0.50

0.75

1.00

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Probability

ExcellentPoor

Page 31: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

31

0

50

100

150

200

250

300

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Frequency

0.00

0.50

1.00

1.50

2.00

2.50

3.00

3.50

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Information

B16*. …your follow-up care doctor listen carefully to you?

ExcellentPoor

Page 32: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

32

0

50

100

150

200

250

300

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Frequency

ExcellentPoor

0.00

0.50

1.00

1.50

2.00

2.50

3.00

3.50

4.00

4.50

5.00

5.50

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Information

Page 33: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

33

0.00

5.00

10.00

15.00

20.00

25.00

30.00

35.00

-3.00 -2.00 -1.00 0.00 1.00 2.00 3.00

Communication

Inform

ation

ExcellentPoor

Scale Information

Page 34: Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. drhays@ucla.edu RCMAR/EXPORT September 15, 2008, 3-4pm

34

Thank you