Fast Unconstrained Audio Search in Numerous Human Languages

Abstract

We present a system to index and search conversational speech using a scoring heuristic on the expected posterior counts of phone n-grams in recognition lattices. We report significant improvements in retrieval effectiveness on five human languages over a strong 1-best baseline. The method is shown to improve the utility "mean average precision" of the retrieved lattices? rank order and to do so with a search cost negligible compared to the fastest yet known methods for the linear scanning of phonetic lattices.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 2007
Accession Number
ADA489878

Entities

People

  • J. S. Olsson
  • Jonathan Wintrode
  • Matthew F Lee

Organizations

  • University of Maryland

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Accuracy
  • Algorithms
  • Automated Speech Recognition
  • Decoding
  • Demographic Cohorts
  • Department Of Defense
  • Errors
  • Indexes
  • Information Retrieval
  • Language
  • Natural Languages
  • Precision
  • Probability
  • Recognition
  • Scanning
  • Sequences

Readers

  • Computational Linguistics
  • Operations Research
  • Vision Science/Vision Psychology/Cognitive Neuroscience.