Inter-annotator Agreement

Abstract

This chapter touches upon several issues in the calculation and assessment of inter annotator agreement. It gives an introduction to the theory behind agreement coefficients and examples of their application to linguistic annotation tasks. Specific examples explore variation in annotator performance due to heterogeneous data, complex labels, item difficulty, and annotator differences, showing how global agreement coefficients may mask these sources of variation, and how detailed agreement studies can give insight into both the annotation process and the nature of the underlying data. The chapter also reviews recent work on using machine learning to exploit the variation among annotators and learn detailed models from which accurate labels can be inferred. I therefore advocate an approach where agreement studies are not used merely as a means to accept or reject a particular annotation scheme, but as a tool for exploring patterns in the data that are being annotated.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 17, 2017
Accession Number
AD1158943

Entities

People

  • Ron Artstein

Organizations

  • University of Southern California

Tags

Communities of Interest

  • Biomedical

DTIC Thesaurus Topics

  • Accuracy
  • Agreements
  • Coefficients
  • Computational Linguistics
  • Computational Processes
  • Computer Programming
  • Computer Science
  • Dialogue Systems
  • Language
  • Linguistics
  • Machine Learning
  • Natural Language Processing
  • Probability
  • Ratings
  • Reliability
  • Standards
  • Test And Evaluation
  • United States

Readers

  • Computational Linguistics
  • Electromagnetic Wave Scattering and Antenna Radiation Engineering
  • Neural Network Machine Learning.

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks