Creating Spoken Dialogue Characters from Corpora without Annotations

Abstract

Virtual humans are being used in a number of applications, including simulation-based training, multi-player games, and museum kiosks. Natural language dialogue capabilities are an essential part of their human-like persona. These dialogue systems have a goal of being believable and generally have to operate within the bounds of their restricted domains. Most dialogue systems operate on a dialogue-act level and require extensive annotation efforts. Semantic annotation and rule authoring have long been known as bottlenecks for developing dialogue systems for new domains. In this paper, we investigate several dialogue models for virtual humans that are trained on an unannotated human-human corpus. These are inspired by information retrieval and work on the surface text level. We evaluate these in text-based and spoken interactions and also against the upper baseline of human-human dialogues.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 27, 2007
Accession Number
AD1158330

Entities

People

  • David R Traum
  • Sudeep Gandhe

Organizations

  • University of Southern California

Tags

Communities of Interest

  • Biomedical

DTIC Thesaurus Topics

  • Amoebic Dysentery
  • Artificial Intelligence
  • Artificial Intelligence Software
  • Automated Speech Recognition
  • Case Studies
  • Computational Science
  • Computer Programs
  • Dialogue Systems
  • Information Retrieval
  • Language
  • Linguistics
  • Models
  • Natural Languages
  • Personality
  • Recognition
  • Segmented
  • Simulations
  • Training
  • United States

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Team-Based Human-Centered Cognitive Task Decision Making and Information Performance.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation