RECAST

Abstract

With the widespread use of toxic language online, platforms are increasingly using automated systems that leverage advances in natural language processing to automatically flag and remove toxic comments. However, most automated systems, when detecting and moderating toxic language, do not provide feedback to their users, let alone provide an avenue of recourse for these users to make actionable changes. We present our work, RECAST, an interactive, open-sourced web tool for visualizing these models' toxic predictions, while providing alternative suggestions for flagged toxic language. Our work also provides users with a new path of recourse when using these automated moderation tools. RECAST highlights text responsible for classifying toxicity, and allows users to interactively substitute potentially toxic phrases with neutral alternatives. We examined the effect of RECAST via two large-scale user evaluations, and found that RECAST was highly effective at helping users reduce toxicity as detected through the model. Users also gained a stronger understanding of the underlying toxicity criterion used by black-box models, enabling transparency and recourse. In addition, we found that when users focus on optimizing language for these models instead of their own judgement (which is the implied incentive and goal of deploying automated models), these models cease to be effective classifiers of toxicity compared to human annotations. This opens a discussion for how toxicity detection models work and should work, and their effect on the future of online discourse.

Document Details

Document Type
Pub Defense Publication
Publication Date
Apr 13, 2021
Source ID
10.1145/3449280

Entities

People

  • Austin P. Wright
  • Diyi Yang
  • Duen Horng (polo) Chau
  • Haekyu Park
  • Muhammed Ahmed
  • Omar Shaikh
  • Stephane Pinel
  • Will Epperson

Organizations

  • Defense Advanced Research Projects Agency
  • Georgia Tech
  • Mailchimp
  • National Science Foundation

Tags

Fields of Study

  • Computer science
  • Environmental science

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Systems Analysis and Design
  • Toxicology/Environmental Toxicology

Technology Areas

  • AI & ML