ABCs of IRT

November 18, 2010

Diane M. Talley, MAStephen B. Johnson, PhDJames A. Penny, PhD

Castle Worldwide

Psychometrics as Science and Art

2010 ICE Educational Conference

IRT and Classical Concepts of IRT

– A logit– The abc’s

Benefits– Pre-equating– immediate scoring– Population invariance

Assumptions Implications

The right tools for the job

Data Program Tool

Versus

Classical versus IRT model

Classical versus IRT

Classical Model IRT Model Traditional Modern

Requires less strict adherence to assumptions

Requires stricter adherence to assumptions

Sample dependent Population invariant

Statistics (p – diff, p-biserial – disc)

Probability-based statistics (b-diff, a-disc, c-guessing)

Simple scoring model (raw score)

Scoring is more complex

What’s a logit?

Ability

The Performance

StandardProbability

b (difficulty)

-2.3 -2

-1.3 -1

-0.3 0

Paint by Numbers Leonardo

a (discrimination) and b

0.25 0.5

0.75 1

1.25 1.5

1.75 2

2.25 2.5

a, b, and c (guessing)

0.25 0.5

0.75 1

1.25 1.5

1.75 2

2.25 2.5

Fit statistics

Comparison of Infit and Outfit

Infit Outfit

ICE 2010 Conference Atlanta Georgia

Outfit Mean Square Plot

0 5 10 15 20 25 30Item Order

Infit Mean Square Plot

00.20.40.60.8

11.21.41.6

0 5 10 15 20 25 30Item Order

Population Invariance

Low Performing

High Performing

Item 1 .15 .50

Item 2 .60 .80

Item 3 .70 .92

Classical Difficulty Values IRT Difficulty Values

Low Performing

High Performing

Item 1 1.50 1.50

Item 2 0.00 0.00

Item 3 -.75 -.75

IRT Pre-Equating

What does it mean? Why would you want to do it? What does it mean for building item banks

and forms?

Test Information Function (TIF)

Comparison of Test Information Functions

-3 -2.75 -2.5 -2.25 -2 -1.75 -1.5 -1.25 -1 -0.75 -0.5 -0.25 0 0.25 0.5 0.775 1.025 1.275 1.525 1.775 2.025 2.275 2.525 2.775 3.025

Form A

Form B

Assumptions

Unidimensionality Local Independence

Implications

Item writing– Leave those scored items alone!– Focused item writing targeting the performance standard

Assembly– Items selected for a form should be around the standard

Testing and Reporting – Field test items for pre-equating/on-demand scoring– Form assignment– Scoring – Recalibration– Harder to explain to stakeholders

Does IRT make sense for you? What is the size and maturity of your program and

item bank? Do you like to tinker with items? Do your program requirements change frequently?

How experienced/capable are your item writers? How do you score candidates?

IRT or number correct Do you hold scores or do immediate scoring?

Can you afford a psychometrician?

Questions?

Diane M. Talley dtalley@castleworldwide.comJames A. Penny jpenny@castleworldwide.com Stephen B. Johnson sjohnson@castleworldwide.com

919.572.6880

ABCs of IRT

Documents

ABCs of Airplanes

ABCs of Medicaid

ABCs of Budgeting

ABCs of WWII

CAT5508 2 001 003 - 5508(2) TA-2 - Accent Bearings · 2016. 4. 1. · irt 1212-1 irt 1216-1 irt 1222-1 irt 1216-1 irt 1220-1 irt 1215-2 irt 1220-2 irt 1225-2 irt 1215-2 irt 1225-2

Abcs of Probes

ABCs of Rotary

ABCs of Communication

Product specification FlexTrack IRT 501 … specification - Robot user documentation 3HAC024534-001 Product manual - FlexTrack IRT 501-66 IRT 501-66R IRT 501-90 IRT 501-90R 3HAW050008590

IRT 4-20 PCAUTO IRT 4-10 PCAUTO IRT 3-20 PCD IRT ......IRT 4-20 PCAUTO IRT 4-10 PCAUTO IRT 3-20 PCD IRT COMBI 4-10 IR-UVA IRT COMBI 4-20 IR-UVA GB DE FR SE IT ES Assembly Manual Montageanleitung

ABCs of Autolisp.pdf

IRT 424 DTP IRT 425 DTP IRT 428 DTP - Hedson...Assembly manual Montageanleitung Manuel d’Installation Monteringsanvisning Manuale di montaggio Manual de ensamblado IRT 424 DTP IRT

Abcs of Adcs

ABCs Of Webinars

ABCS of Mojo

Abcs of Concrete

ABCs of Meteorology

ABCs of Stuttering

ABCs of Docker

Abcs of Cpdos