Development of Domain-Specific Scenarios for Training and Evaluation of Two-Way, Free Form, Spoken Language Translation Devices
Abstract
Effective and accurate two-way, free-form, spoken language translation devices require appropriate training data. The goal of the Defense Advanced Research Projects Agency (DARPA) Spoken Language Communication and Translation System for Tactical Use (TRANSTAC) program is to demonstrate the capability to rapidly develop and field this technology so that speakers of different languages can communicate in real-world tactical situations. A critical component of the program is generating data sets to train and evaluate the technologies. A novel approach was developed to collect these data, employing innovative data-collection and evaluation scenarios. This article describes the scenario methodology used for the TRANSTAC data collections and evaluations.
Document Details
- Document Type: Technical Report
- Publication Date: Mar 01, 2010
- Accession Number: ADA522914
Entities
People
- Brian A. Weiss
- Marnie Menzel
Organizations
- National Institute of Standards and Technology