Phoneme Class Based Adaptation for Mismatch Acoustic Modeling of Distant Noisy Speech (Preprint)

Abstract

A new adaptation strategy for distant noisy speech is created by phoneme class based approaches for context independent acoustic models. Unlike the previous approaches such as MLLR-MAP adaptation which adapts acoustic model to the features, our phoneme-class based adaptation (PCBA) adapts the distant data features to our acoustic model which has trained on close microphone TIMIT sentences. The essence of PCBA is to create a transformation strategy which makes the distribution of phoneme-classes of distant noisy speech be similar to those of close microphone acoustic model in thirteen dimensional MFCC space (mostly in c0-c1 plane). It creates a mean, orientation and variance adaptation scheme for each phoneme class to compensate the mismatch. New adapted features and new and improved acoustic models which are produced by PCBA are outperforming those created by MLLR-MAP adaptation for ASR and KWS. And PCBA offers a new powerful understanding in acoustic-modeling of distant speech.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 2012
Accession Number
ADA568361

Entities

People

  • John H. Hansen
  • Seckin Uluskan

Tags

Communities of Interest

  • Air Platforms
  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Acoustics
  • Air Force
  • Air Force Research Laboratories
  • Architectural Acoustics
  • Automated Speech Recognition
  • Contracts
  • Covariance
  • Detection
  • Eigenvalues
  • Eigenvectors
  • Electrical Engineering
  • Markov Models
  • Microphones
  • Models
  • Neural Networks
  • Orientation (Direction)
  • Two Dimensional

Readers

  • Robotics and Automation.
  • Speech Processing/Speech Recognition.

Technology Areas

  • Space