Abstract
Existing tests of interrater agreement have high statistical power; however, they lack specificity. If the ratings of two raters do not show agreement but are not random, current tests, some of which are based on Cohen’s kappa, will often reject the null hypothesis, leading to the wrong conclusion that agreement is present. A new test of interrater agreement, applicable to nominal or ordinal categories, is presented. The test statistic can be expressed as a ratio (labeled QA, ranging from 0 to infinity) or as a proportion (labeled PA, ranging from 0 to 1). This test weighs information supporting agreement against information supporting disagreement. The new test’s effectiveness (power and specificity) is compared with that of five other tests of interrater agreement in a series of Monte Carlo simulations. The new test, although slightly less powerful than the other tests reviewed, is the only one sensitive to agreement alone. We also introduce confidence intervals on the proportion of agreement.
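The abstract does not give the computational definition of QA and PA; for the exact statistic, consult the paper itself. As a hedged sketch only, the Python snippet below illustrates the generic relationship between a ratio statistic on [0, infinity) and a proportion statistic on [0, 1] via PA = QA / (1 + QA), using a naive count of agreements versus disagreements between two raters as a stand-in for the "information" being weighed. The function name and the counting rule are illustrative assumptions, not the authors' method.

```python
# Hedged illustration only: NOT the Cousineau-Laurencelle statistic.
# It shows how a ratio on [0, inf) (here called qa) maps to a
# proportion on [0, 1] (here called pa) via pa = qa / (1 + qa),
# with a naive agreement/disagreement count standing in for the
# information supporting agreement and disagreement.

def naive_agreement_stats(ratings_a, ratings_b):
    """Return (qa, pa) from two raters' nominal ratings.

    qa: agreements / disagreements, in [0, inf)
    pa: qa / (1 + qa) = agreements / total items, in [0, 1]
    """
    if len(ratings_a) != len(ratings_b):
        raise ValueError("Both raters must rate the same items.")
    agree = sum(a == b for a, b in zip(ratings_a, ratings_b))
    disagree = len(ratings_a) - agree
    qa = float("inf") if disagree == 0 else agree / disagree
    pa = agree / len(ratings_a)  # equals qa / (1 + qa) when qa is finite
    return qa, pa

# Example: two raters classify 8 items into categories "x", "y", "z".
rater1 = ["x", "x", "y", "z", "y", "x", "z", "y"]
rater2 = ["x", "y", "y", "z", "y", "x", "z", "x"]
print(naive_agreement_stats(rater1, rater2))  # (3.0, 0.75)
```

Note that the two forms carry the same information: a ratio of 3 agreements per disagreement corresponds to a proportion of 0.75, which is why the abstract can describe the statistic equivalently on either scale.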
Citation
Cousineau, D., & Laurencelle, L. (2015). A Ratio Test of Interrater Agreement With High Specificity. Educational and Psychological Measurement, 75(6), 979–1001. https://doi.org/10.1177/0013164415574086