Testing in the classroom: Using tests to promote learning Richard P. Phelps Universidad Finis Terrae, Santiago, Chile January 7, 2014

Embed Size (px)

Citation preview

  • Slide 1
  • Testing in the classroom: Using tests to promote learning Richard P. Phelps Universidad Finis Terrae, Santiago, Chile January 7, 2014
  • Slide 2
  • Q. What is a standardized test? A. An assessment with at least one aspect in its content or administration standardized. Q. What is the key advantage of standardized testing? A. It is standardized.
  • Slide 3
  • 2012, Richard P PHELPSWorld Association of Education Research, 17th Congress, Reims, June, 2012 3 Meta-analysis A method for summarizing a large research literature, with a single, comparable measure.
  • Slide 4
  • John Hatties meta-analyses of meta-analyses
  • Slide 5
  • John Hatties list Acceleration Classroom behavioral techniques Vocabulary programs Repeated reading programs Creativity programs Student prior achievement Self-questioning by students Study skills Problem-solving teaching Not labeling students Student-centered teaching Classroom cohesion Pre-term birth weight Peer influences Classroom management techniques Outdoor-adventure programs Home environment Socio-economic status 1. 11. 21. 31.
  • Slide 6
  • 2012, Richard P PHELPS The effect of testing on student achievement: 1910-2010 Richard P. PHELPS
  • Slide 7
  • 2012, Richard P PHELPSWorld Association of Education Research, 17th Congress, Reims, June, 2012 7 The effect of testing on student achievement 12-year long study analyzed close to 700 separate studies, and more than 1,600 separate effects 2,000 other studies were reviewed and found incomplete or inappropriate lacking sufficient time and money, hundreds of other studies will not be reviewed
  • Slide 8
  • 2012, Richard P PHELPSWorld Association of Education Research, 17th Congress, Reims, June, 2012 8 Studies included in the meta-analyses 2.when: a test is newly introduced, or newly removed quantity of testing is increased or reduced test stakes are introduced or increased, or removed or reduced
  • Slide 9
  • 2012, Richard P PHELPSWorld Association of Education Research, 17th Congress, Reims, June, 2012 9 Number of studies of effects, by methodology type Methodology type Number of studies Number of effects Quantitative177640 Surveys and public opinion polls (US & Canada) 247813 Qualitative245 TOTAL6691698
  • Slide 10
  • 2012, Richard P PHELPSWorld Association of Education Research, 17th Congress, Reims, June, 2012 10 Effect size: Interpretation d between 0.25 & 0.50 weak effect d between 0.50 et 0.75 medium effect d more than 0.75 strong effect
  • Slide 11
  • 2012, Richard P PHELPSWorld Association of Education Research, 17th Congress, Reims, June, 2012 11 Which predictors matter? Treatment Group Mean Effect Size is made aware of performance, and control group is not+0.98 receives targeted instruction (e.g., remediation)+0.96 is tested with higher stakes than control group+0.87 is tested more frequently than control group+0.85
  • Slide 12
  • Why tests? Students tend to study more, and learn more, when: they know they will be tested, but not precisely what will be tested (e.g.) Experiment comparing gains of students with take-home tests to those with in class tests -- the latter learned substantially more. when there is reinforcement of material already studied Mastery learning experiments of 1960s1980s: Students learn more when asked to recall what they have learned. Up to a point, the more students are made to actively process information, and describe it to others, the better they learn.
  • Slide 13
  • 2012, Richard P PHELPSWorld Association of Education Research, 17th Congress, Reims, June, 2012 13 Surveys and opinion polls: Regular standardized tests, performance tests Regular tests (N 125) Performance tests (N 50) Respondent opiniondd Achievement is increased1.0 weighted by size of study population1.90.5 Instruction is improved1.0 weighted by size of study population0.9 Tests help align instruction1.0 weighted by size of study population0.50.9
  • Slide 14
  • 2012, Richard P PHELPSWorld Association of Education Research, 17th Congress, Reims, June, 2012 14 Qualitative studies: Effect on student achievement Direction of effect Number of studiesPercent of studies Percent without the inferred Positive2048493 Positive inferred2410 Mixed522 No change834 Negative311 TOTAL244100 244 studies conducted in the past century in over 30 countries
  • Slide 15
  • Repeated retrieval during learning is the key to long-term retention.
  • Slide 16
  • 10 benefits of testing and their applications to education Roediger, Putnam and Smith Direct effects of testing SOURCE: Roediger, Putnam, & Smith, Ten benefits of testing and their applications to educational practice, Psychology of Learning and Motivation, 55, 2011. Retrieval practice during tests enhances retention of the retrieved information (relative to not testing or even to studying) -- the testing effect Repeated retrieval produces knowledge that can be retrieved flexibly and transferred to other situations On open-ended assessments (e.g., essay tests) retrieval practice induced by tests helps students organize information into a coherent knowledge base. Repeated retrieval leads to easier retrieval of related information
  • Slide 17
  • 10 benefits of testing and their applications to education Roediger, Putnam and Smith Indirect effects of testing SOURCE: Roediger, Putnam, & Smith, Ten benefits of testing and their applications to educational practice, Psychology of Learning and Motivation, 55, 2011. Students tested frequently study more and with more regularity. Tests permit students to discover gaps in their knowledge and adjust their study efforts to focus on difficult material. Students who study after taking a test learn more than if they had not taken a test. Students who self-test or are tested more frequently in class learn more.
  • Slide 18
  • 10 benefits of testing and their applications to education Roediger, Putnam and Smith SOURCE: Roediger, Putnam, & Smith, Ten benefits of testing and their applications to educational practice, Psychology of Learning and Motivation, 55, 2011. Benefit 1: The Testing Effect: Retrieval Aids Later Retention Benefit 2: Testing Identifies Gaps in Knowledge Benefit 3: Testing Causes Students to Learn More from the Next Study Episode Benefit 4: Testing Produces Better Organization of Knowledge Benefit 5: Testing Improves Transfer of Knowledge to New Contexts Benefit 6: Testing can Facilitate Retrieval of Material That was not Tested Benefit 7: Testing Improves Metacognitive Monitoring Benefit 8: Testing Prevents Interference from Prior Material when Learning New Material Benefit 9: Testing Provides Feedback to Instructors Benefit 10: Frequent Testing Encourages Students to Study
  • Slide 19
  • 10 benefits of testing and their applications to education Roediger, Putnam and Smith SOURCE: Roediger, Putnam, & Smith, Ten benefits of testing and their applications to educational practice, Psychology of Learning and Motivation, 55, 2011. Benefit 1: The Testing Effect: Retrieval Aids Later Retention Benefit 2: Testing Identifies Gaps in Knowledge Benefit 3: Testing Causes Students to Learn More from the Next Study Episode Benefit 4: Testing Produces Better Organization of Knowledge Benefit 5: Testing Improves Transfer of Knowledge to New Contexts Benefit 6: Testing can Facilitate Retrieval of Material That was not Tested Benefit 7: Testing Improves Metacognitive Monitoring Benefit 8: Testing Prevents Interference from Prior Material when Learning New Material Benefit 9: Testing Provides Feedback to Instructors Benefit 10: Frequent Testing Encourages Students to Study
  • Slide 20
  • 10 benefits of testing and their applications to education Roediger, Putnam and Smith Most teachers should be testing much more frequently, with smaller, shorter, less consequential tests. Students learns more when they test. But learn best when the tests are spaced. What is the optimal lapse of time between tests? The best time to test again is just before students start forgetting the information. This time lapse is shorter with discrete material, like mathematics, than with other subjects. Some studies suggest that math students should be tested at least once a week.
  • Slide 21
  • The more high-stakes decision points, the better the student performance ? SOURCE: Phelps, Benchmarking to the best in mathematics, Evaluation Review, 2001
  • Slide 22
  • Quality control has proportionally greater effect in poorer countries SOURCE: Phelps, Benchmarking to the best in mathematics, Evaluation Review, 2001
  • Slide 23
  • What testing skills do teachers need for interpreting information from large-scale tests? Basic understanding of statistics: - distributions, mean, median, skewness, kurtosis - sampling error, measurement error - type 1 / type 2 error, statistical power - sampling (size, representativeness) Protocols to help them explain tests to others: - to students - to parents - to the media
  • Slide 24
  • What testing skills do teachers need for developing and administering classroom tests? Practice (with each other) in writing items / prompts / rubrics : - unambiguous, relevant, un-biased Learn the optimal frequency, spacing of tests for your subject field and grade level. Understand that useful assessment can be very simple: -e.g., save the last few minutes of each class to assess by asking students to record 2-3 concepts they learned that day
  • Slide 25
  • But, you can only know what students are learning if you assess. It is easy to know what you are teaching.