Data Selection for Within-Class Covariance Estimation

Abstract

Methods for performing channel and session compensation in conjunction with subspace techniques have been a focus of considerable study recently and have led to significant gains in speaker recognition performance. While developers have typically exploited the vast archive of speaker labeled data available from earlier NIST evaluations to train the within-class and across-class covariance matrices required by these techniques, little attention has been paid to the characteristics of the data required to perform the training efficiently. This paper focuses on within-class covariance normalization and shows that a reduction in training data requirements can be achieved by proper data selection. In particular, it is shown that the key variables are the total amount of data and the degree of handset variability, with total calls per handset playing a smaller role. The study offers insight into efficient within-class covariance matrix training data collection in real world applications.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 08, 2016
Accession Number
AD1033828

Entities

People

  • Douglas Reynolds
  • Elliot Singer
  • Tyler Campbell

Organizations

  • MIT Lincoln Laboratory

Tags

DTIC Thesaurus Topics

  • Acquisition
  • Air Force
  • Compensation
  • Computational Complexity
  • Covariance
  • Data Science
  • Department Of Defense
  • Dimensionality Reduction
  • Discriminant Analysis
  • Feature Extraction
  • Information Science
  • Mobile Phones
  • Recognition
  • Test And Evaluation
  • Training
  • United States
  • United States Government

Readers

  • Regression Analysis.
  • Speech Processing/Speech Recognition.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms