Improving the Effectiveness of Speaker Verification Domain Adaptation With Inadequate In-Domain Data

Abstract

This paper addresses speaker verification domain adaptation with inadequate in-domain data. Specifically, we explore the cases where in-domain data sets do not include speaker labels, contain speakers with few samples, or contain speakers with low channel diversity. Existing domain adaptation methods are reviewed, and their shortcomings are discussed. We derive an unsupervised version of fully Bayesian adaptation which reduces the reliance on rich in-domain data. When applied to domain adaptation with inadequate in-domain data, the proposed approach yields competitive results when the samples per speaker are reduced, and outperforms existing supervised methods when the channel diversity is low, even without requiring speaker labels. These results are validated on the SRE16, which uses a highly inadequate in-domain data set.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 20, 2017
Accession Number
AD1032785

Entities

People

  • Bengt J. Borgström
  • Douglas A. Reynolds
  • Elliot Singer
  • Omid Sadjadi

Organizations

  • MIT Lincoln Laboratory

Tags

DTIC Thesaurus Topics

  • Additives (Chemicals)
  • Air Force
  • Bayesian Networks
  • Coefficients
  • Covariance
  • Data Sets
  • Degradation
  • Department Of Defense
  • Discriminant Analysis
  • English Language
  • Equations
  • Gaussian Distributions
  • Models
  • Random Variables
  • United States
  • United States Government
  • Verification

Fields of Study

  • Computer science

Readers

  • Neural Network Machine Learning.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation