Language Identification Through Parallel Phone Recognition.

Abstract

Language identification systems that employ acoustic likelihoods from language-dependent phoneme recognizers to perform language classification have been shown to yield high performance on clean speech. In this report, such a method was applied to language identification of telephone speech. Phoneme recognizers were developed for English, German, Japanese, Mandarin, and Spanish using hidden Markov models. Each of these processed the input speech and output a phoneme sequence in their respective languages along with a likelihood score. The language of the incoming speech was hypothesized as the language of the model having the highest likelihood. The main differences between this system and those developed in the past are that this system processed telephone speech, could identify up to five languages, and used phonetic transcriptions to train the language-specific models. The five-language, forced-choice recognition rate on 45-s utterances was 71.9%. On 10-s utterances the recognition decreased to 70.3%. In addition, it was found that adding word-specific phonemes to the training set had a negligible effect on language identification results. (AN)

Open PDF

Document Details

Document Type
Technical Report
Publication Date
May 19, 1995
Accession Number
ADA295381

Entities

People

  • C. S. Chou
  • M. A. Zissman

Organizations

  • Massachusetts Institute of Technology

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Accuracy
  • Biometric Security
  • Classification
  • Department Of Defense
  • Education
  • Grammars
  • Hidden Markov Models
  • Identification
  • Identification Systems
  • Language
  • Markov Models
  • Models
  • Probability
  • Recognition
  • Security
  • Training

Readers

  • Computational Linguistics
  • Speech Processing/Speech Recognition.