Speaker Indexing in Large Audio Databases Using Anchor Models

Abstract

This paper introduces the technique of anchor modeling in the applications of speaker detection and speaker indexing. The anchor modeling algorithm is refined by pruning the number of models needed. The system is applied to the speaker detection problem where its performance is shown to fall short of the state-of-the-art Gaussian Mixture Model with Universal Background Model (GMM-UBM) system. However, it is further shown that its computational efficiency lends itself to speaker indexing for searching large audio databases for desired speakers. Here, excessive computation may prohibit the use of the GMM-UBM recognition system. Finally, the paper presents a method for cascading anchor model and GMM-UBM detectors for speaker indexing. This approach benefits from the efficiency of anchor modeling and high accuracy of GMM-UBM recognition.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2001
Accession Number
ADA525859

Entities

People

  • D. A. Reynolds
  • D. E. Sturim
  • E. Singer
  • J. P. Campbell

Organizations

  • Massachusetts Institute of Technology

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Accuracy
  • Air Force
  • Algorithms
  • Computations
  • Databases
  • Detection
  • Detectors
  • Efficiency
  • Errors
  • False Alarms
  • Hidden Markov Models
  • Models
  • Precision
  • Probability
  • Recognition
  • Warning Systems

Fields of Study

  • Computer science

Readers

  • Computational Modeling and Simulation
  • Computer Vision.
  • Speech Processing/Speech Recognition.