Adding Statistical Machine Translation Adaptation to Computer-Assisted Translation

Abstract

Statistical machine translation (SMT) has proven effective for general purpose language translation but not for highly specialized domains like medicine, military operations, and law enforcement, which have their own technical jargon. We present a novel approach for iteratively incorporating a human translator in the loop to adapt SMT models to a particular domain. We show how these models can be made accessible via Web services and integrated with computer-assisted translation (CAT) tools. In this report, we describe a novel human-in-the-loop post-editing domain adaptation algorithm for refining SMT models using the Joshua decoder and integrate it with a CAT tool called OmegaT.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 2013
Accession Number
ADA585928

Entities

People

  • John J. Morgan
  • Robert P . Winkler
  • Somiya Metu
  • Stephen A. Larocca

Organizations

  • United States Army Research Laboratory

Tags

Communities of Interest

  • Biomedical

DTIC Thesaurus Topics

  • Afghanistan Conflict
  • Algorithms
  • Computational Linguistics
  • Computational Science
  • Computer Languages
  • Computers
  • Language
  • Linguistics
  • Machine Translation
  • Military Operations
  • Natural Language Processing
  • Shell Scripts
  • Translations
  • Translators
  • United States Central Command
  • Web Service
  • Word Processors

Fields of Study

  • Computer science

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Computational Linguistics

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation