Peirce's i and Cohen's κ for 2×2 Measures of Rater Reliability

This study examined a historical mixture model approach to the evaluation of ratings made in “gold standard” and two-rater 2×2 contingency tables. Peirce's i and the derived i average were discussed in relation to a widely used index of reliability in the behavioral sciences, Cohen's κ....

Full description

Saved in:
Bibliographic Details
Main Authors: Beau Abar, Eric Loken
Format: Article
Language:English
Published: Wiley 2010-01-01
Series:Journal of Probability and Statistics
Online Access:http://dx.doi.org/10.1155/2010/480364
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This study examined a historical mixture model approach to the evaluation of ratings made in “gold standard” and two-rater 2×2 contingency tables. Peirce's i and the derived i average were discussed in relation to a widely used index of reliability in the behavioral sciences, Cohen's κ. Sample size, population base rate of occurrence, the true “science of the method”, and guessing rates were manipulated across simulations. In “gold standard” situations, Peirce's i tended to recover the true reliability of ratings as well as better than κ. In two-rater situations, iave tended to recover the true reliability as well as better than κ in most situations. The empirical utility and potential theoretical benefits of mixture model methods in estimating reliability are discussed, as are the associations between the i statistics and other modern mixture model approaches.
ISSN:1687-952X
1687-9538