Sociolinguistically Informed Natural Language Processing: Automating Irony Detection

Abstract

Irony detection is an important problem in computational sociolinguistic tasks, such as automatic community detection on the web. But the ironic/sincere distinction has proven to be a particularly difficult classification problem. Existing Machine Learning (ML) and Natural Language Processing (NLP) approaches, which tend to rely on simple statistical models built on top of word counts, are not very good at it. We hypothesize that this is because, in contrast to most text classification problems, word counts and syntactic features alone do not constitute an adequate representation for verbal irony detection. Indeed, sociolinguistic theories of verbal irony imply that a model of the speaker is a necessary condition for irony detection, at least in certain cases.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: Apr 13, 2015
Accession Number: ADA623456

Entities

People

Byron C Wallace

Organizations

Brown University

Sociolinguistically Informed Natural Language Processing: Automating Irony Detection

Abstract

Document Details

Entities

People

Organizations

Tags

Communities of Interest

DTIC Thesaurus Topics

Readers

Technology Areas