The MMSR Bilingual and Crosschannel Corpora for Speaker Recognition Research and Evaluation

Abstract

We describe efforts to create corpora to support and evaluate systems that meet the challenge of speaker recognition in the face of both channel and language variation. In addition to addressing ongoing evaluation of speaker recognition systems, these corpora are aimed at the bilingual and crosschannel dimensions. We report on specific data collection efforts at the Linguistic Data Consortium, the 2004 speaker recognition evaluation program organized by the National Institute of Standards and Technology (NIST), and the research ongoing at the US Federal Bureau of Investigation and MIT Lincoln Laboratory. We cover the design and requirements, the collections and evaluation integrating discussions of the data preparation, research, technology development and evaluation on a grand scale.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2004
Accession Number
ADA525635

Entities

People

  • Alvin F. Martin
  • Christopher Cieri
  • David Miler
  • Hirotaka Nakasone
  • Joseph P. Campbell
  • Kevin Walker
  • Mark A. Przybocki

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • Autonomy

DTIC Thesaurus Topics

  • Attrition
  • Automatic
  • Computer Science
  • Computers
  • Detection
  • English Language
  • Foreign Languages
  • Governments
  • Identification
  • Language
  • Laptop Computers
  • Microphones
  • Mobile Phones
  • Recognition
  • Recording Systems
  • Test And Evaluation
  • United States

Fields of Study

  • Computer science

Readers

  • Defense Technology Research and Development.
  • Speech Processing/Speech Recognition.
  • Technical Research and Report Writing.

Technology Areas

  • AI & ML
  • AI & ML - DoD AI Strategy
  • AI & ML - Machine Translation