Real-Time Speech Recognition System for Robotic Control Applications Using an Ear-Microphone

Abstract

This study is part of ongoing research, begun in 2004 at the Naval Postgraduate School (NPS), into a human-machine interface command-and-control package for controlling robotic units in operational environments. An ear microphone collects the voice commands, providing hands-free control in noisy environments [Kurcan, 2006; Bulbuller, 2006]. This study presents the hardware implementation of a theoretical Isolated Word Recognition (IWR) system designed in an earlier study. The recognizer combines a detection scheme based on short-term energy and zero crossings with a discrete hidden Markov model (HMM) recognizer designed to recognize seven isolated words. Mel-frequency cepstral coefficients (MFCCs) serve as the discriminating features in the recognition phase. The implemented hardware system uses commercial off-the-shelf (COTS) electronic components and an in-ear microphone, is portable, and costs under $50.00. The speech capturing system uses the ear microphone and the Si3000 Audio Codec to capture and sample speech clearly. The microprocessor processes the detected speech in real time. The microprocessor's I/O devices work effectively with the audio codec and the computer for sampling and training, without communication problems or data loss. The current implementation takes 1.181 msec to process each 15 msec data frame. The resulting recognition performance averages around 73.72%.
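The detection stage named in the abstract can be illustrated with a minimal sketch. It is not the report's implementation: the 8 kHz sample rate, the threshold parameters, the decision rule, and all function names are assumptions chosen for illustration. Only the 15 msec frame length comes from the abstract.

```python
# Illustrative sketch of frame-based speech detection using short-term
# energy and zero-crossing rate. Names and thresholds are hypothetical.

SAMPLE_RATE_HZ = 8000   # assumption: the abstract does not state the rate
FRAME_MS = 15           # frame length stated in the abstract
FRAME_LEN = SAMPLE_RATE_HZ * FRAME_MS // 1000   # 120 samples per frame


def short_term_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def detect_speech(samples, energy_hi, energy_lo, zcr_thresh,
                  frame_len=FRAME_LEN):
    """Return one boolean per full frame.

    Assumed rule: a frame counts as speech if its energy exceeds
    energy_hi (typical of voiced sounds), or if its energy exceeds
    energy_lo and its zero-crossing rate exceeds zcr_thresh
    (typical of weak, noise-like unvoiced sounds).
    """
    flags = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = short_term_energy(frame)
        zcr = zero_crossing_rate(frame)
        flags.append(
            energy > energy_hi or (energy > energy_lo and zcr > zcr_thresh)
        )
    return flags
```

In practice the thresholds would be set from a measurement of background noise rather than fixed by hand. On the timing figures in the abstract, 1.181 msec of processing per 15 msec frame occupies about 7.9% of each frame period.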


Document Details

Document Type
Technical Report
Publication Date
Jun 01, 2007
Accession Number
ADA470422

Entities

People

  • Dimitrios S. Koliousis

Organizations

  • Naval Postgraduate School

Tags

Communities of Interest

  • Autonomy
  • C4I
  • Energy and Power Technologies
  • Human Systems

DTIC Thesaurus Topics

  • Automated Speech Recognition
  • Computational Science
  • Computer Programming
  • Computers
  • Digital Signal Processing
  • Ear
  • Electrical Engineering
  • Electronic Components
  • Frequency
  • Hidden Markov Models
  • Identification
  • Integrated Circuits
  • Markov Models
  • Mathematical Models
  • Recognition
  • Signal Processing
  • Word Recognition

Readers

  • Computer Science/Computer Engineering/Data Science/Digital Signal Processing
  • Speech Processing/Speech Recognition

Technology Areas

  • AI & ML
  • AI & ML - Autonomous Systems
  • Autonomy
  • Fully Networked C3
  • Fully Networked C3 - Command and Control
  • Microelectronics
  • Microelectronics - Microelectromechanical Systems