IBM MASTOR SYSTEM: Multilingual Automatic Speech-to-speech Translator
Abstract
In this paper, we describe the IBM MASTOR, a speech-to-speech translation system that can translate spontaneous free-form speech in real-time on both laptop and hand-held PDAs. Challenges include speech recognition and machine translation in adverse environments, lack of training data and linguistic resources for under-studied languages, and the need to rapidly develop capabilities for new languages. Another challenge is designing algorithms and building models in a scalable manner to perform well even on memory and CPU deficient hand-held computers. We describe our approaches, experience, and success in building working free-form S2S systems that can handle two language pairs (including a low-resource language).
Document Details
- Document Type
- Technical Report
- Publication Date
- Jan 01, 2006
- Accession Number
- ADA456612
Entities
People
- Bowen Zhou
- Charles Prosser
- Hong-kwang Kuo
- Liang Gu
- Mohamed Afify
- Ruhi Sarikaya
- Wei Zhang
- Wei-zhong Zhu
- Yonggang Deng
- Yuqing Gao
Organizations
- IBM Thomas J. Watson Research Center