A Ratio Test of Interrater Agreement With High Specificity

Abstract

Existing tests of interrater agreement have high statistical power; however, they lack specificity. If the ratings of two raters do not show agreement but are nonetheless non-random, the current tests, some of which are based on Cohen’s kappa, will often reject the null hypothesis, leading to the wrong conclusion that agreement is present. A new test of interrater agreement, applicable to nominal or ordinal categories, is presented. The test statistic can be expressed as a ratio (labeled QA, ranging from 0 to infinity) or as a proportion (labeled PA, ranging from 0 to 1). This test weighs information supporting agreement against information supporting disagreement. The new test’s effectiveness (power and specificity) is compared with that of five other tests of interrater agreement in a series of Monte Carlo simulations. The new test, although slightly less powerful than the other tests reviewed, is the only one that is sensitive to agreement alone. We also introduce confidence intervals on the proportion of agreement.
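The specificity problem described above can be illustrated with a small sketch. It does not implement the QA or PA statistic, whose formula is not given in this abstract. Instead it uses a Pearson chi-square test of independence as an assumed stand-in for a test that reacts to any non-random structure; it is not claimed to be one of the five tests the authors compared. The ratings table is hypothetical: the second rater always picks the category following the one chosen by the first rater, so the raters never agree, yet their ratings are perfectly dependent.

```python
# Hypothetical 3x3 contingency table: rows = rater A, columns = rater B.
# Rater B always chooses the "next" category, so the diagonal is empty.
table = [
    [0, 20, 0],
    [0, 0, 20],
    [20, 0, 0],
]


def cohen_kappa(table):
    """Return (observed agreement, Cohen's kappa) for a square table."""
    k = len(table)
    n = sum(sum(row) for row in table)
    row_tot = [sum(row) for row in table]
    col_tot = [sum(table[i][j] for i in range(k)) for j in range(k)]
    p_obs = sum(table[i][i] for i in range(k)) / n
    # Chance agreement expected from the marginal totals.
    p_exp = sum(row_tot[i] * col_tot[i] for i in range(k)) / n ** 2
    return p_obs, (p_obs - p_exp) / (1 - p_exp)


def pearson_chi2(table):
    """Return the Pearson chi-square statistic for independence."""
    k = len(table)
    n = sum(sum(row) for row in table)
    row_tot = [sum(row) for row in table]
    col_tot = [sum(table[i][j] for i in range(k)) for j in range(k)]
    chi2 = 0.0
    for i in range(k):
        for j in range(k):
            expected = row_tot[i] * col_tot[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected
    return chi2


p_obs, kappa = cohen_kappa(table)
chi2 = pearson_chi2(table)

# Critical value of chi-square with (3-1)*(3-1) = 4 degrees of freedom
# at alpha = .05.
CRITICAL_CHI2_DF4 = 9.488
rejects_independence = chi2 > CRITICAL_CHI2_DF4

print(f"observed agreement = {p_obs:.2f}, kappa = {kappa:.2f}")
print(f"chi-square = {chi2:.1f}, rejects independence: {rejects_independence}")
```

In this constructed case the observed agreement is zero and kappa is negative, yet the independence test rejects its null hypothesis decisively. A test that equates "not random" with "agreement" would therefore draw the wrong conclusion, which is the situation the abstract describes.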

Citation (APA)

Cousineau, D., & Laurencelle, L. (2015). A Ratio Test of Interrater Agreement With High Specificity. Educational and Psychological Measurement, 75(6), 979–1001. https://doi.org/10.1177/0013164415574086
