Informedia News-on-Demand: Using Speech Recognition to Create a Digital Video Library

Abstract

In theory, speech recognition technology can make any spoken words in video or audio media usable for text indexing, search and retrieval. This article describes the News-on-Demand application created within the Informedia(TM) Digital Video Library project and discusses how speech recognition is used in transcript creation from video, alignment with closed-captioned transcripts, audio paragraph segmentation and a spoken query interface. Speech recognition accuracy varies dramatically depending on the quality and type of data used. Informal information retrieval tests show that reasonable recall and precision can be obtained with only moderate speech recognition accuracy.

Document Details

Document Type: Technical Report
Publication Date: Mar 19, 1998
Accession Number: ADA350404

Entities

People

Alexander G. Hauptmann
Howard D. Wactlar
Michael J. Witbrock

Organizations

Carnegie Mellon University
