Sociolinguistically Informed Natural Language Processing: Automating Irony Detection

Abstract

Irony detection is an important problem in computational sociolinguistic tasks, such as automatic community detection on the web. But the ironic/sincere distinction has proven to be a particularly difficult classification problem. Existing Machine Learning (ML) and Natural Language Processing (NLP) approaches, which tend to rely on simple statistical models built on top of word counts, are not very good at it. We hypothesize that this is because, in contrast to most text classification problems, word counts and syntactic features alone do not constitute an adequate representation for verbal irony detection. Indeed, sociolinguistic theories of verbal irony imply that a model of the speaker is a necessary condition for irony detection, at least in certain cases.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 13, 2015
Accession Number
ADA623456

Entities

People

  • Byron C Wallace

Organizations

  • Brown University

Tags

Communities of Interest

  • Autonomy

DTIC Thesaurus Topics

  • Cognitive Science
  • Computational Linguistics
  • Computational Science
  • Computer Science
  • Detection
  • Digital Media
  • Engineering
  • Language
  • Linguistics
  • Machine Learning
  • Natural Language Processing
  • Natural Languages
  • Network Science
  • Online Communications
  • Social Media
  • Standards
  • Students

Readers

  • Computational Linguistics
  • Computer Vision.
  • Personnel Management and Statistics in the Military and Department of Defense

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Machine Translation
  • AI & ML - Neural Networks