Active Learning for Automatic Audio Processing of Unwritten Languages (ALAPUL)

Abstract

This work addresses automatic transcription for languages without (usable) written resources. Previous work has addressed this problem using entirely unsupervised methodologies. Our approach in contrast investigates the use of linguistic and speaker knowledge which are often available even if text resources are not. We create a framework that benefits from such resources, not assuming orthographic representations and avoiding manual generation of word-level transcriptions. We adapt a universal phone recognizer to the target language and use it to convert audio into a searchable phone string for lexical unit discovery via fuzzy sub-string matching. Linguistic knowledge is used to constrain phone recognition output. Target language speakers are used to assist a linguist in creating phonetic transcriptions for the adaptation of acoustic and language models, by re-speaking more clearly a small portion of the target language audio. We also explore robust features and feature transform through deep auto-encoders for better phone recognition performance. We target iterative learning to improve the system through multiple iterations of user feedback.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jul 01, 2016
Accession Number
AD1020704

Entities

People

  • Andreas Kathol
  • Chris Bartels
  • Collen Richey
  • Dimitra Vergyri
  • Julian Vanhout
  • Vikramjit Mitra
  • Wen Wang

Organizations

  • SRI International

Tags

Communities of Interest

  • C4I

DTIC Thesaurus Topics

  • Accuracy
  • Air Force
  • Air Force Research Laboratories
  • Artificial Intelligence Software
  • Automata Theory
  • Automated Speech Recognition
  • Automatic
  • Bayesian Networks
  • Computational Science
  • Computer Languages
  • Detection
  • Governments
  • Information Science
  • Intelligence Community (United States)
  • Language
  • Neural Networks
  • Recognition

Fields of Study

  • Computer science
  • Linguistics

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Computational Linguistics
  • Sensor Fusion and Tracking Systems.