Impact of Annotator Demographics on Sentiment Dataset Labeling

Abstract

As machine learning methods become more powerful and capture more nuances of human behavior, biases in the dataset can shape what the model learns and is evaluated on. This paper explores and attempts to quantify the uncertainties and biases due to annotator demographics when creating sentiment analysis datasets. We ask >1000 crowdworkers to provide their demographic information and annotations for multimodal sentiment data and its component modalities. We show that demographic differences among annotators impute a significant effect on their ratings, and that these effects also occur in each component modality. We compare predictions of different state-of-the-art multimodal machine learning algorithms against annotations provided by different demographic groups, and find that changing annotator demographics can cause >4.5 in accuracy difference when determining positive versus negative sentiment. Our findings underscore the importance of accounting for crowdworker attributes, such as demographics, when building datasets, evaluating algorithms, and interpreting results for sentiment analysis.

Document Details

Document Type: Pub Defense Publication
Publication Date: Nov 07, 2022
Source ID: 10.1145/3555632

Entities

People

Jacob You
Jennifer Jacobs
Pradeep Sen
Tobias Hollerer
Tonja-katrin Machulla
Yi Ding

Organizations

Georgia State University
National Science Foundation
Office of Naval Research
Technical University of Dortmund
University of California, Santa Barbara

Impact of Annotator Demographics on Sentiment Dataset Labeling

Abstract

Document Details

Entities

People

Organizations

Tags

Fields of Study

Readers

Technology Areas