Subject-Based Evaluation Measures for Interactive Spoken Language Systems

Abstract

The DARPA Spoken Language effort has profited greatly from its emphasis on tasks and common evaluation metrics. Common, standardized evaluation procedures have helped the community to focus research effort, to measure progress, and to encourage communication among participating sites. The task and the evaluation metrics, however, must be consistent with the goals of the Spoken Language program, namely interactive problem solving. Our evaluation methods have evolved with the technology, moving from evaluation of read speech from a fixed corpus through evaluation of isolated canned sentences to evaluation of spontaneous speech in context in a canned corpus. A key component missed in current evaluations is the role of subject interaction with the system. Because of the great variability across subjects, however, it is necessary to use either a large number of subjects or a within-subject design. This paper proposes a within-subject design comparing the results of a software-sharing exercise carried out jointly by MIT and SRI.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: Jan 01, 1992
Accession Number: ADA460272

Entities

People

Elizabeth Shrlberg
Elizabeth Wade
Lynette Hirschman
Patti Price

Organizations

SRI International

Subject-Based Evaluation Measures for Interactive Spoken Language Systems

Abstract

Document Details

Entities

People

Organizations

Tags

Communities of Interest

DTIC Thesaurus Topics

Fields of Study

Readers