Isolated Word Recognition From In-Ear Microphone Data Using Hidden Markov Models (HMM)

Abstract

This thesis is part of an ongoing larger scale research study started in 2004 at the Naval Postgraduate School (NPS) which aims to develop a speech-driven human-machine interface for the operation of semi-autonomous military robots in noisy operational environments. Earlier work included collecting a small database of isolated word utterances of seven words from 20 adult subjects using an in-ear microphone. The research conducted here develops a speaker-independent isolated word recognizer from these acoustic signals based on a discrete observation Hidden Markov Model (HMM) The study implements the HMM-based isolated word recognizer in three steps. The first step performs the endpoint detection and speech segmentation by using short-term temporal analysis. The second step includes speech feature extraction using static and dynamic MFCC parameters and vector quantization of continuous-valued speech features. Finally, the last step involves the discrete-observation HMM-based classifier for isolated word recognition. Experimental results show the average classification performance around 92.77%. The most significant result of this study is that the acoustic signals originating from speech organs and collected within the external ear canal via the in-ear microphone can be used for isolated word recognition.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 2006
Accession Number
ADA445459

Entities

People

  • Remzi S. Kurcan

Organizations

  • Naval Postgraduate School

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Acoustic Propagation
  • Acoustic Signals
  • Computational Science
  • Databases
  • Detection
  • Ear
  • Feature Extraction
  • Frequency Bands
  • Hidden Markov Models
  • Information Science
  • Markov Models
  • Neural Networks
  • Pattern Recognition
  • Probabilistic Models
  • Recognition
  • Signal Processing
  • Speech Compression

Readers

  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation
  • Autonomy