The ICSI Meeting Recorder Dialog Act (MRDA) Corpus

Abstract

We describe a new corpus of over 180,000 hand- annotated dialog act tags and accompanying adjacency pair annotations for roughly 72 hours of speech from 75 naturally-occurring meetings. We provide a brief summary of the annotation system and labeling procedure, inter-annotator reliability statistics, overall distributional statistics, a description of auxiliary files distributed with the corpus, and information on how to obtain the data.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2004
Accession Number
ADA460980

Entities

People

  • Elizabeth Shriberg
  • Hannah Carvey
  • Jeremy Ang
  • Raj Dhillon
  • Sonali Bhagat

Organizations

  • International Computer Science Institute

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Agreements
  • Audio Files
  • Automated Speech Recognition
  • Computer Science
  • Computer Vision
  • Computers
  • Hot Spots
  • Information Operations
  • Language
  • Natural Language Processing
  • Natural Languages
  • Recognition
  • Recording Systems
  • Reliability
  • Statistics

Readers

  • Computational Linguistics
  • Computer Science/Computer Engineering/Data Science/Digital Signal Processing.
  • Speech Processing/Speech Recognition.