Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004

Abstract

This paper discusses some of the factors that should be considered when designing a speech corpus collection to be used for text-independent speaker recognition evaluation. The factors include telephone handset type, telephone transmission type, language, and (non-telephone) microphone type. The paper describes the design of the new corpus collection being undertaken by the Linguistic Data Consortium (LDC) to support the 2004 and subsequent NIST speech recognition evaluations. Some preliminary information on the resulting 2004 evaluation test set is offered.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2004
Accession Number
ADA525784

Entities

People

  • Alvin Martin
  • David A. B. Miller
  • Hirotaka Nakasone
  • Joseph Campbell
  • Mark Przybocki

Organizations

  • National Institute of Standards and Technology

Tags

DTIC Thesaurus Topics

  • Air Force
  • Automated Speech Recognition
  • Data Sets
  • Education
  • Identification
  • Information Operations
  • Language
  • Microphones
  • Mobile Devices
  • Mobile Phones
  • Recognition
  • Recording Systems
  • Spanish Language
  • Test And Evaluation
  • Training
  • United States
  • United States Government

Fields of Study

  • Computer science

Readers

  • Geospatial Intelligence and Artificial Intelligence Analytics
  • Speech Processing/Speech Recognition.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation