Impact of Annotator Demographics on Sentiment Dataset Labeling

Abstract

As machine learning methods become more powerful and capture more nuances of human behavior, biases in the dataset can shape what the model learns and is evaluated on. This paper explores and attempts to quantify the uncertainties and biases due to annotator demographics when creating sentiment analysis datasets. We ask >1000 crowdworkers to provide their demographic information and annotations for multimodal sentiment data and its component modalities. We show that demographic differences among annotators impute a significant effect on their ratings, and that these effects also occur in each component modality. We compare predictions of different state-of-the-art multimodal machine learning algorithms against annotations provided by different demographic groups, and find that changing annotator demographics can cause >4.5 in accuracy difference when determining positive versus negative sentiment. Our findings underscore the importance of accounting for crowdworker attributes, such as demographics, when building datasets, evaluating algorithms, and interpreting results for sentiment analysis.

Document Details

Document Type
Pub Defense Publication
Publication Date
Nov 07, 2022
Source ID
10.1145/3555632

Entities

People

  • Jacob You
  • Jennifer Jacobs
  • Pradeep Sen
  • Tobias Hollerer
  • Tonja-katrin Machulla
  • Yi Ding

Organizations

  • Georgia State University
  • National Science Foundation
  • Office of Naval Research
  • Technical University of Dortmund
  • University of California, Santa Barbara

Tags

Fields of Study

  • Computer science

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Neural Network Machine Learning.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Neural Networks