4.6 Article

Bayesian approaches to the weighted kappa-like inter-rater agreement measures

期刊

STATISTICAL METHODS IN MEDICAL RESEARCH
卷 30, 期 10, 页码 2329-2351

出版社

SAGE PUBLICATIONS LTD
DOI: 10.1177/09622802211037068

关键词

Agreement table; prior distribution; order restriction; ordinal data; rater; weighted kappa

资金

  1. VIED
  2. RMIT

向作者/读者索取更多资源

Bayesian approaches are proposed in this study for the estimation of inter-rater agreement measures, to include prior information on raters' assessment behavior and impose order restrictions on scores. These approaches improve accuracy and mitigate anomalies, with theoretical and practical implications discussed for Bayesian estimation of five agreement measures with three different weights using an agreement table with grey zones. Monte Carlo simulation study evaluates classification accuracy of Bayesian and classical approaches, providing recommendations for selecting the highest performing agreement measure and weight combination based on table structure and sample size.
Inter-rater agreement measures are used to estimate the degree of agreement between two or more assessors. When the agreement table is ordinal, different weight functions that incorporate row and column scores are used along with the agreement measures. The selection of row and column scores is effectual on the estimated degree of agreement. The weighted measures are prone to the anomalies frequently seen in agreement tables such as unbalanced table structures or grey zones due to the assessment behaviour of the raters. In this study, Bayesian approaches for the estimation of inter-rater agreement measures are proposed. The Bayesian approaches make it possible to include prior information on the assessment behaviour of the raters in the analysis and impose order restrictions on the row and column scores. In this way, we improve the accuracy of the agreement measures and mitigate the impact of the anomalies in the estimation of the strength of agreement between the raters. The elicitation of prior distributions is described theoretically and practically for the Bayesian estimation of five agreement measures with three different weights using an agreement table having two grey zones. A Monte Carlo simulation study is conducted to assess the classification accuracy of the Bayesian and classical approaches for the considered agreement measures for a given level of agreement. Recommendations for the selection of the highest performing agreement measure and weight combination are made in the breakdown of the table structure and sample size.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据