Investigating Robustness of Item Response Theory Proficiency Estimators to Atypical Response Behaviors Under Two‐Stage Multistage Testing

  • Kim S
  • Moses T
N/ACitations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

The purpose of this study is to evaluate the extent to which item response theory ( IRT ) proficiency estimation methods are robust to the presence of aberrant responses under the GRE ® General Test multistage adaptive testing ( MST ) design. To that end, a wide range of atypical response behaviors affecting as much as 10% of the test items were simulated using a generic GRE 2‐stage MST and the 2‐parameter logistic ( 2PL ) IRT model. As expected, some differences were found among the 5 estimators in terms of the recovery of the true theta ability; for example, Bayesian estimators had lower error variance and their estimates were regressed to the mean. Once the IRT theta estimates were scaled onto a comparable reporting score scale, however, it was found that all the estimation methods investigated, including the one currently used to score GRE MSTs , were equally robust under the simulated conditions. Report Number: ETS RR‐16–22

Cite

CITATION STYLE

APA

Kim, S., & Moses, T. (2016). Investigating Robustness of Item Response Theory Proficiency Estimators to Atypical Response Behaviors Under Two‐Stage Multistage Testing. ETS Research Report Series, 2016(2), 1–23. https://doi.org/10.1002/ets2.12111

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free