Speaker Recognition on Lossy Compressed Speech Using the Speex Codec

Abstract

This paper examines the impact of lossy speech coding with Speex on GMM-UBM speaker recognition (SR). Audio from 120 speakers was compressed with Speex into twelve data sets, each with a different level of compression quality from 0 (most compressed) to 10 (least), plus uncompressed. Experiments looked at performance under matched and mismatched compression conditions, using models conditioned for the coded environment, and Speex coding applied to improving SR performance on other coders. Results show that Speex is effective for compression of data used in SR and that Speex coding can improve performance on data compressed by the GSM codec.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 2009
Accession Number
ADA516549

Entities

People

  • A. D. Lawson
  • A. R. Stauffer

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Audio Files
  • Compression
  • Computational Complexity
  • Data Compression
  • Environment
  • Identification
  • Mobile Communications
  • Recognition
  • Speech
  • Speech Compression
  • Standards
  • Test And Evaluation
  • Time Compression

Fields of Study

  • Computer science

Readers

  • Computer Programming and Software Development.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML