Efficient CEPSTRAL Normalization for Robust Speech Recognition

Abstract

In this paper we describe and compare the performance of a series of cepstrum-based procedures that enable the CMU SPHINX-II speech recognition system to maintain a high level of recognition accuracy over a wide variety of acoustical environments. We describe the MFCDCN algorithm, an environment-independent extension of the efficient SDCN and FCDCN algorithms developed previously. We compare the performance of these algorithms with the very simple RASTA and cepstral mean normalization procedures, describing the performance of these algorithms in the context of the 1992 DARPA CSR evaluation using secondary microphones, and in the DARPA stress-test evaluation.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1993
Accession Number
ADA458659

Entities

People

  • Alejandro Acero
  • Fu-hua Liu
  • Richard M. Stern
  • Xuedong Huang

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Accuracy
  • Additives (Chemicals)
  • Algorithms
  • Automated Speech Recognition
  • Computer Science
  • Digital Signal Processing
  • Filters
  • Filtration
  • Language
  • Linear Filtering
  • Natural Language Processing
  • Probability
  • Recognition
  • Signal Processing
  • Stress Tests
  • Test And Evaluation
  • Test Methods

Fields of Study

  • Computer science

Readers

  • Parallel and Distributed Computing.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML