Speech-Based Information Retrieval for Digital Libraries

Abstract

Libraries and archives collect recorded speech and multimedia objects that contain recorded speech, and such material may comprise a substantial portion of the collection in future digital libraries. Presently, access to most of this material is provided using a combination of manually annotated metadata and linear search. Recent advances in speech processing technology have produced a number of techniques for extracting features from recorded speech that could provide a useful basis for the retrieval of speech or multimedia objects in large digital library collections. Among these features are the semantic content of the speech, the identity of the speaker, and the language in which the speech was spoken. We propose to develop a graphical and auditory user interface for speech-based information retrieval that exploits these features to facilitate selection of recorded speech and multimedia information objects that include recorded speech. We plan to use that interface to evaluate the effectiveness and usability of alternative ways of exploiting those features and as a testbed for the evaluation of advanced retrieval techniques such as cross-language speech retrieval.

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 1997
Accession Number
ADA458105

Entities

People

  • Douglas W. Oard

Organizations

  • University of Maryland

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Contracts
  • Information Operations
  • Information Retrieval
  • Instructions
  • Language
  • Materials
  • Metadata
  • Military Research
  • Multimedia
  • Standards
  • Universities
  • User Interface

Fields of Study

  • Computer science

Readers

  • Database Systems and Applications
  • Geospatial Intelligence and Artificial Intelligence Analytics
  • Speech Processing/Speech Recognition

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval