Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004
Abstract
This paper discusses some of the factors that should be considered when designing a speech corpus collection to be used for text-independent speaker recognition evaluation. The factors include telephone handset type, telephone transmission type, language, and (non-telephone) microphone type. The paper describes the design of the new corpus collection being undertaken by the Linguistic Data Consortium (LDC) to support the 2004 and subsequent NIST speaker recognition evaluations. Some preliminary information on the resulting 2004 evaluation test set is offered.
Document Details
- Document Type: Technical Report
- Publication Date: Jan 01, 2004
- Accession Number: ADA525784
Entities
People
- Alvin Martin
- David A. B. Miller
- Hirotaka Nakasone
- Joseph Campbell
- Mark Przybocki
Organizations
- National Institute of Standards and Technology