Moral Foundations Twitter Corpus: A Collection of 35k Tweets Annotated for Moral Sentiment

Abstract

Research has shown that accounting for moral sentiment in natural language can yield insight into a variety of on- and off-line phenomena such as message diffusion, protest dynamics, and social distancing. However, measuring moral sentiment in natural language is challenging, and the difficulty of this task is exacerbated by the limited availability of annotated data. To address this issue, we introduce the Moral Foundations Twitter Corpus, a collection of 35,108 tweets that have been curated from seven distinct domains of discourse and hand annotated by at least three trained annotators for 10 categories of moral sentiment. To facilitate investigations of annotator response dynamics, we also provide psychological and demographic metadata for each annotator. Finally, we report moral sentiment classification baselines for this corpus using a range of popular methodologies.

Document Details

Document Type
Pub Defense Publication
Publication Date
Feb 19, 2020
Source ID
10.1177/1948550619876629

Entities

People

  • Aida Mostafazadeh Davani
  • Arineh Mirinjian
  • Brendan Kennedy
  • Christian Leong
  • Christina Park
  • Gabriela Moreno
  • Gwenyth Portillo-wightman
  • Jenna Chin
  • Joe Hoover
  • Jun Yen Leung
  • Leigh Yeh
  • Madelyn Mendlen
  • Mohammad Atari
  • Morteza Dehghani
  • Shreya Havaldar
  • Tingyee E. Chang
  • Ying Lin
  • Zahra Kamel

Organizations

  • Rensselaer Polytechnic Institute
  • United States Army Research Laboratory
  • University of Southern California

Tags

Readers

  • Neural Network Machine Learning.
  • Political Violence and Terrorism Studies.