Journal
Proceedings of the ACM Web Conference 2022 (WWW '22), Pages 1720-1730
Publisher
Association for Computing Machinery
DOI: 10.1145/3485447.3512242
Keywords
annotation; labeling; inter-annotator agreement; quality assurance
Funding
- Knight Foundation
- Micron Foundation
- Good Systems, a UT Austin Grand Challenge to develop responsible AI technologies
This study investigates and proposes new measures for assessing inter-annotator agreement (IAA) in complex labeling tasks.
When annotators label data, a key metric for quality assurance is inter-annotator agreement (IAA): the extent to which annotators agree on their labels. Though many IAA measures exist for simple categorical and ordinal labeling tasks, relatively little work has considered more complex labeling tasks, such as structured, multi-object, and free-text annotations. Krippendorff's α, best known for use with simpler labeling tasks, does have a distance-based formulation with broader applicability, but little work has studied its efficacy and consistency across complex annotation tasks. We investigate the design and evaluation of IAA measures for complex annotation tasks, with evaluation spanning seven diverse tasks: image bounding boxes, image keypoints, text sequence tagging, ranked lists, free-text translations, numeric vectors, and syntax trees. We identify the difficulty of interpretability and the complexity of choosing a distance function as key obstacles in applying Krippendorff's α generally across these tasks. We propose two novel, more interpretable measures, showing that they yield more consistent IAA estimates across tasks and annotation distance functions.
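To make the distance-based formulation referenced in the abstract concrete, here is a minimal sketch (not the paper's implementation, and not its proposed measures) of Krippendorff's α computed as α = 1 − D_o/D_e, where D_o is the observed within-item disagreement and D_e is the expected disagreement over all labels pooled across items; the `krippendorff_alpha` helper, its arguments, and the example data are illustrative assumptions.

```python
# Illustrative sketch of distance-based Krippendorff's alpha: alpha = 1 - D_o / D_e.
# Any task-specific distance (e.g., bounding-box IoU distance, edit distance on
# translations, tree distance on syntax trees) can be plugged in as `distance`.
from itertools import combinations

def krippendorff_alpha(labels_by_item, distance):
    """labels_by_item: one list of annotators' labels per item (unit).
    distance: a symmetric, task-specific distance function d(a, b) >= 0."""
    n = sum(len(labels) for labels in labels_by_item)
    # Observed disagreement D_o: within-item pairwise distances,
    # each item normalized by (m_u - 1), averaged over all n labels.
    d_o = 0.0
    for labels in labels_by_item:
        m = len(labels)
        if m < 2:
            continue  # items with a single label contribute no pairs
        pair_sum = sum(distance(a, b) for a, b in combinations(labels, 2))
        d_o += 2.0 * pair_sum / (m - 1)  # 2x counts both ordered directions
    d_o /= n
    # Expected disagreement D_e: pairwise distances over all labels pooled
    # across items, averaged over the n * (n - 1) ordered pairs.
    pooled = [lab for labels in labels_by_item for lab in labels]
    pooled_sum = sum(distance(a, b) for a, b in combinations(pooled, 2))
    d_e = 2.0 * pooled_sum / (n * (n - 1))
    return 1.0 - d_o / d_e if d_e > 0 else 1.0

# Example with the nominal distance (0 if labels match, 1 otherwise):
nominal = lambda a, b: 0.0 if a == b else 1.0
items = [["cat", "cat", "dog"], ["dog", "dog", "dog"], ["cat", "dog", "dog"]]
print(krippendorff_alpha(items, nominal))  # ~0.11: low agreement
```

As the abstract notes, the resulting α value depends heavily on the choice of distance function for complex annotations, which is part of the interpretability problem the paper's proposed measures aim to address.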