Subject-Based Evaluation Measures for Interactive Spoken Language Systems

Abstract

The DARPA Spoken Language effort has profited greatly from its emphasis on tasks and common evaluation metrics. Common, standardized evaluation procedures have helped the community to focus research effort, to measure progress, and to encourage communication among participating sites. The task and the evaluation metrics, however, must be consistent with the goals of the Spoken Language program, namely interactive problem solving. Our evaluation methods have evolved with the technology, moving from evaluation of read speech from a fixed corpus through evaluation of isolated canned sentences to evaluation of spontaneous speech in context in a canned corpus. A key component missing from current evaluations is the role of subject interaction with the system. Because of the great variability across subjects, however, it is necessary to use either a large number of subjects or a within-subject design. This paper proposes a within-subject design comparing the results of a software-sharing exercise carried out jointly by MIT and SRI.

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1992
Accession Number
ADA460272

Entities

People

  • Elizabeth Shriberg
  • Elizabeth Wade
  • Lynette Hirschman
  • Patti Price

Organizations

  • SRI International

Tags

Communities of Interest

  • Air Platforms
  • Energy and Power Technologies
  • Ground and Sea Platforms

DTIC Thesaurus Topics

  • Acquisition
  • Aircraft
  • Automated Speech Recognition
  • Contracts
  • Databases
  • Debriefing
  • Errors
  • Hybrid Systems
  • Information Operations
  • Language
  • Natural Language Understanding
  • Natural Languages
  • Recognition
  • Resource Management
  • Standards
  • Test And Evaluation
  • Transport Aircraft

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Educational Psychology
  • Regression Analysis