Computerized Adaptive Testing (CAT) Shu-chuan Kao, PhD, Senior Manager, Measurement & Testing

Page 1: Computerized Adaptive Testing (CAT)

Computerized Adaptive Testing (CAT)

Shu-chuan Kao, PhD, Senior Manager, Measurement & Testing

Page 2: Computerized Adaptive Testing (CAT)

Objectives

By the end of the session, participants will be able to:

• Understand how Computerized Adaptive Testing (CAT) works
• Recognize REx-PN® CAT components
  • Variable Length Exam
  • Content Balancing
  • Exam stopping rules

Page 3: Computerized Adaptive Testing (CAT)

What is “Adaptive” testing?

Exams are customized for individual candidates. The difficulty of each test is tailored to the person taking it.

• High-ability candidates will receive more difficult items
  • Easy items provide little information about a high performer’s ability
• Low-ability candidates will receive fewer difficult items
  • Avoid guessing on items that are too difficult
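The point that easy items tell us little about a high performer can be illustrated numerically. The sketch below assumes a Rasch (one-parameter logistic) model, which the slides do not name; the function names and the ability and difficulty values are hypothetical and chosen only for illustration.

```python
import math

def p_correct(ability: float, difficulty: float) -> float:
    """Assumed Rasch model: probability of a correct response (logit scale)."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

def item_information(ability: float, difficulty: float) -> float:
    """Fisher information of one Rasch item: p * (1 - p), largest when p = 0.5."""
    p = p_correct(ability, difficulty)
    return p * (1.0 - p)

high_ability = 2.0  # hypothetical high-ability candidate
for label, difficulty in [("easy item", -2.0), ("matched item", 2.0)]:
    p = p_correct(high_ability, difficulty)
    info = item_information(high_ability, difficulty)
    print(f"{label}: P(correct)={p:.3f}, information={info:.3f}")
```

Under these assumptions the easy item is answered correctly about 98% of the time and carries roughly 0.018 units of information, while the matched item carries 0.25, the maximum a single Rasch item can provide.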

Page 4: Computerized Adaptive Testing (CAT)

What is “Adaptive” testing? (Cont.)

• Every time you answer an item, the computer re-estimates your ability based on all the previous answers and the difficulty of those items.
• Then the computer selects the next item, one that the candidate has about a 50/50 chance of answering correctly.
  • Items are not too easy or too hard
  • The goal is to get as much information as possible about your true ability
  • You should find each item challenging, as each item is targeted to your ability
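The 50/50 targeting described above can be sketched as a simple selection step. This is a minimal illustration, not the REx-PN algorithm itself: it assumes a Rasch model, where an item whose difficulty equals the ability estimate has a 50% chance of a correct response, and the item bank, item IDs and difficulty values are hypothetical.

```python
def select_next_item(ability_estimate, item_bank, administered):
    """Return the ID of the unused item whose difficulty is closest to the
    current ability estimate (about a 50/50 item under a Rasch model).

    item_bank: dict mapping item ID -> difficulty in logits (hypothetical data).
    administered: set of item IDs already shown to the candidate.
    """
    candidates = {item_id: difficulty
                  for item_id, difficulty in item_bank.items()
                  if item_id not in administered}
    if not candidates:
        raise ValueError("item bank exhausted")
    return min(candidates, key=lambda item_id: abs(candidates[item_id] - ability_estimate))

bank = {"A": -1.5, "B": -0.2, "C": 0.4, "D": 1.8}  # hypothetical difficulties
first = select_next_item(0.3, bank, administered=set())
second = select_next_item(0.3, bank, administered={first})
print(first, second)
```

With an ability estimate of 0.3, item C (difficulty 0.4) is selected first; once C has been used, item B (difficulty -0.2) is the next closest.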

Page 5: Computerized Adaptive Testing (CAT)

What is “Adaptive” testing? (Cont.)

• As you answer more items, the computer re-estimates your ability after each response, and the estimate becomes more precise.

Answer → Re-estimate candidate’s ability → Answer → More precise ability estimate
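The answer and re-estimate cycle can be sketched as follows. The slides do not say which estimator is used, so this example assumes a Rasch model and a Newton-Raphson maximum-likelihood estimate; the function name, the response data and the starting value are all hypothetical.

```python
import math

def estimate_ability(responses, start=0.0, max_iterations=50):
    """Maximum-likelihood ability estimate under an assumed Rasch model.

    responses: list of (item_difficulty, answered_correctly) pairs.
    Returns (ability_estimate, standard_error), both in logits.
    """
    n_correct = sum(1 for _, correct in responses if correct)
    if n_correct == 0 or n_correct == len(responses):
        # The likelihood has no finite maximum for all-correct or all-incorrect data.
        raise ValueError("need at least one correct and one incorrect response")

    def probabilities(theta):
        return [1.0 / (1.0 + math.exp(-(theta - b))) for b, _ in responses]

    theta = start
    for _ in range(max_iterations):
        probs = probabilities(theta)
        gradient = sum((1.0 if correct else 0.0) - p
                       for (_, correct), p in zip(responses, probs))
        information = sum(p * (1.0 - p) for p in probs)
        step = gradient / information
        theta += max(-1.0, min(1.0, step))  # clamp the step for stability
        if abs(step) < 1e-8:
            break

    information = sum(p * (1.0 - p) for p in probabilities(theta))
    return theta, 1.0 / math.sqrt(information)

# Hypothetical response history: (difficulty, correct?)
answers = [(0.0, True), (0.5, False), (0.2, True), (0.4, False),
           (0.3, True), (0.1, False), (0.25, True), (0.35, False)]
for n in (2, 4, 8):
    ability, se = estimate_ability(answers[:n])
    print(f"after {n} items: ability={ability:.2f}, SE={se:.2f}")
```

In this sketch the standard error shrinks as responses accumulate, which is the "more precise ability estimate" the slide refers to.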

Page 6: Computerized Adaptive Testing (CAT)

[Figure: Example of CAT. Item difficulty (Easy to Hard) and candidate ability (Low to High) plotted against the number of operational items (0 to 15), with the passing standard marked. Legend: item administered, correct response, incorrect response, ability estimate, 95% confidence interval.]

Page 7: Computerized Adaptive Testing (CAT)

Components of REx-PN® CAT

• Exam Stopping Rules
• Variable Length Exam
• Content Balancing

Page 8: Computerized Adaptive Testing (CAT)

Variable Length Exam

The number of questions a candidate receives depends on their ability:

• Minimum: 90 items (60 operational + 30 pretest)
• Maximum: 150 items (120 operational + 30 pretest)

• Shorter exams: ability is “far” from the passing standard (clearly above or clearly below)
• Longer exams: ability is “close” to the passing standard

Page 9: Computerized Adaptive Testing (CAT)

Content Balancing

• The REx-PN Test Plan dictates a specific percentage of the operational items in each content area

Page 10: Computerized Adaptive Testing (CAT)

Content Balancing

• Determine the content area that differs the most from the test plan specifications
• Identify items in the content area and select an item with difficulty that matches the ability estimate
• Note: item type is [illegible] part of the selection algorithms
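A minimal sketch of content-balanced selection follows. It reads "differs the most from the test plan" as "falls furthest below its target percentage", which is an interpretation rather than something the slides state. The content area names, target percentages, item bank and function names are hypothetical.

```python
def choose_content_area(targets, counts):
    """Return the content area whose observed share of operational items
    falls furthest below its test-plan target.

    targets: dict area -> target proportion (hypothetical values).
    counts:  dict area -> operational items administered so far.
    """
    total = sum(counts.values())

    def shortfall(area):
        observed = counts.get(area, 0) / total if total else 0.0
        return targets[area] - observed

    return max(targets, key=shortfall)

def select_balanced_item(ability, targets, counts, bank, administered):
    """Pick the content area first, then the unused item in that area whose
    difficulty is closest to the current ability estimate."""
    area = choose_content_area(targets, counts)
    candidates = [item for item in bank
                  if item["area"] == area and item["id"] not in administered]
    if not candidates:
        raise ValueError(f"no unused items left in {area}")
    return min(candidates, key=lambda item: abs(item["difficulty"] - ability))

targets = {"area_1": 0.5, "area_2": 0.3, "area_3": 0.2}  # hypothetical test plan
counts = {"area_1": 6, "area_2": 3, "area_3": 1}          # items given so far
bank = [
    {"id": "w", "area": "area_1", "difficulty": 0.5},
    {"id": "x", "area": "area_3", "difficulty": -1.0},
    {"id": "y", "area": "area_3", "difficulty": 0.7},
]
chosen = select_balanced_item(0.5, targets, counts, bank, administered=set())
print(chosen["id"])
```

Here area_3 has received 10% of the items against a 20% target, so it is chosen even though item w matches the ability estimate exactly; within area_3, item y is the closest in difficulty.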

Page 11: Computerized Adaptive Testing (CAT)

Stopping Rules and Pass/Fail Decisions

• Performance is reported only as a pass-fail classification.
• To pass, a candidate must answer at least 60 operational items within the 4-hour exam time.
• Three stopping rules are used to terminate an exam and make the pass/fail decision.
  ➢ 95% Confidence Interval
  ➢ Maximum Length
  ➢ Run Out Of Time

Page 12: Computerized Adaptive Testing (CAT)

Stopping Rule #1: 95% Confidence Interval

• Respond to at least 60 operational items.
• The exam continues while the confidence interval straddles the passing standard.
• The exam stops when the confidence interval is clearly above or below the passing standard.
  ➢ Above the passing standard → Pass
  ➢ Below the passing standard → Fail
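The confidence interval rule can be sketched as a small decision function. The slides give the 60-item minimum and the 95% level but not the interval formula, so the two-sided normal interval (ability ± 1.96 × standard error) used here is an assumption, as are the function name and the passing standard of 0.0 logits.

```python
def confidence_interval_decision(ability, standard_error, passing_standard,
                                 operational_answered, minimum_items=60, z=1.96):
    """Return "Pass", "Fail", or None (keep testing).

    Assumes a two-sided normal interval: ability +/- z * standard_error.
    No decision is made before the minimum number of operational items.
    """
    if operational_answered < minimum_items:
        return None
    lower = ability - z * standard_error
    upper = ability + z * standard_error
    if lower > passing_standard:
        return "Pass"   # whole interval is above the passing standard
    if upper < passing_standard:
        return "Fail"   # whole interval is below the passing standard
    return None         # interval straddles the standard: exam continues

PASSING_STANDARD = 0.0  # hypothetical value in logits
print(confidence_interval_decision(0.6, 0.25, PASSING_STANDARD, 60))
print(confidence_interval_decision(0.3, 0.25, PASSING_STANDARD, 75))
print(confidence_interval_decision(-0.7, 0.25, PASSING_STANDARD, 60))
```

The first call passes at the minimum length, the second keeps testing because the interval still straddles the standard, and the third fails at the minimum length.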

Page 13: Computerized Adaptive Testing (CAT)

[Figure: Sample REx-PN Minimum Item Fail Chart. Vertical axis: -4 to 4; horizontal axis: number of operational items (0 to 65). Annotations: Begin Evaluation; Fail, 95% Confidence Interval is below the passing standard.]

Page 14: Computerized Adaptive Testing (CAT)

[Figure: Sample REx-PN Medium Length Pass Chart. Vertical axis: -4 to 4; horizontal axis: number of operational items (0 to 130). Annotations: Begin Evaluation; Pass, 95% Confidence Interval is above the passing standard.]

Page 15: Computerized Adaptive Testing (CAT)

Stopping Rule #2: Maximum Length

• Candidates with abilities very close to the passing standard may receive maximum length examinations.
• When a candidate answers 120 operational items, the computer disregards the 95% certainty requirement.
• The outcome depends on the final ability estimate.
  ➢ ABOVE the passing standard → Pass
  ➢ AT or BELOW the passing standard → Fail
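As a sketch, the maximum length rule reduces to a comparison of the final ability estimate with the passing standard once 120 operational items have been answered. The function name and the passing standard of 0.0 logits are hypothetical.

```python
def maximum_length_decision(ability, passing_standard, operational_answered,
                            maximum_items=120):
    """Return "Pass" or "Fail" at maximum length, or None if the candidate
    has not yet answered the maximum number of operational items."""
    if operational_answered < maximum_items:
        return None
    # The confidence interval is ignored; only the point estimate matters.
    # An estimate exactly AT the passing standard is a Fail.
    return "Pass" if ability > passing_standard else "Fail"

PASSING_STANDARD = 0.0  # hypothetical value in logits
print(maximum_length_decision(0.05, PASSING_STANDARD, 120))
print(maximum_length_decision(0.0, PASSING_STANDARD, 120))
print(maximum_length_decision(0.05, PASSING_STANDARD, 100))
```

Note the strict comparison: an estimate exactly at the passing standard results in a fail, matching the "AT or BELOW" wording on the slide.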

Page 16: Computerized Adaptive Testing (CAT)

[Figure: Sample REx-PN Maximum Length Pass Chart. Vertical axis: -4 to 4; horizontal axis: number of operational items (0 to 130). Annotations: Begin Evaluation; Maximum length exam; Evaluate the last ability estimate; Pass.]

Page 17: Computerized Adaptive Testing (CAT)

[Figure: Sample REx-PN Maximum Length Fail Chart. Vertical axis: -4 to 4; horizontal axis: number of operational items (0 to 130). Annotations: Begin Evaluation; Maximum length exam; Evaluate the last ability estimate; Fail.]

Page 18: Computerized Adaptive Testing (CAT)

Stopping Rule #3: Run-Out-Of-Time

• A candidate spends 4 hours on the exam, but the exam has not been terminated by another stopping rule.
• Responded to at least 60 OP (operational) items AND the final ability estimate is above the passing standard → Pass
• Responded to fewer than 60 OP items OR the final ability estimate is at or below the passing standard → Fail
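The run-out-of-time rule can be sketched in the same style. Both conditions from the slide must hold for a pass; the function name and the passing standard of 0.0 logits are hypothetical.

```python
def run_out_of_time_decision(ability, passing_standard, operational_answered,
                             minimum_items=60):
    """Decision when the 4-hour limit is reached before another rule stops
    the exam. A pass needs BOTH enough operational items and a final ability
    estimate strictly above the passing standard."""
    if operational_answered >= minimum_items and ability > passing_standard:
        return "Pass"
    return "Fail"

PASSING_STANDARD = 0.0  # hypothetical value in logits
print(run_out_of_time_decision(0.4, PASSING_STANDARD, 85))   # enough items, above
print(run_out_of_time_decision(0.4, PASSING_STANDARD, 52))   # too few items
print(run_out_of_time_decision(-0.2, PASSING_STANDARD, 85))  # below the standard
```

The second call fails despite a high ability estimate because fewer than 60 operational items were answered.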

Page 19: Computerized Adaptive Testing (CAT)

[Figure: Sample REx-PN Run-Out-Of-Time Pass Chart. Vertical axis: -4 to 4; horizontal axis: number of operational items (0 to 130). Annotations: Begin Evaluation; Run out of time; Evaluate the last ability estimate; Pass.]

Page 20: Computerized Adaptive Testing (CAT)

[Figure: Sample REx-PN Run-Out-Of-Time Fail Chart. Vertical axis: -4 to 4; horizontal axis: number of operational items (0 to 130). Annotations: Begin Evaluation; Run out of time; Fail.]

Page 21: Computerized Adaptive Testing (CAT)

[Figure: Sample REx-PN Run-Out-Of-Time Fail Chart. Vertical axis: -4 to 4; horizontal axis: number of operational items (0 to 130). Annotations: Begin Evaluation; Run out of time; Evaluate the last ability; Fail.]

Page 22: Computerized Adaptive Testing (CAT)

More Information

• For more information about REx-PN and CAT, visit rexpn.com

Page 23: Computerized Adaptive Testing (CAT)

Questions?