Incremental Syntactic Language Models for Phrase-Based Translation

Abstract

This paper describes a novel technique for incorporating syntactic knowledge into phrase-based machine translation through incremental syntactic parsing. Bottom-up and top-down parsers typically require a completed string as input. This requirement makes it difficult to incorporate them into phrase-based translation, which generates partial hypothesized translations from left-to-right. Incremental syntactic language models score sentences in a similar left-to-right fashion, and are therefore a good mechanism for incorporating syntax into phrase-based translation. We give a formal definition of one such linear-time syntactic language model, detail its relation to phrase-based decoding, and integrate the model with the Moses phrase-based translation system. We present empirical results on a constrained Urdu-English translation task that demonstrate a significant BLEU score improvement and a large decrease in perplexity.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 01, 2011
Accession Number
ADA558560

Entities

People

  • Chris Callison-burch
  • Lane Schwartz
  • Stephen Wu
  • William Schuler

Organizations

  • Air Force Research Laboratory

Tags

DTIC Thesaurus Topics

  • Air Force
  • Artificial Intelligence
  • Automated Speech Recognition
  • Computational Linguistics
  • Computational Science
  • Computer Science
  • Decoding
  • Grammars
  • Language
  • Linguistics
  • Machine Translation
  • Markov Models
  • Natural Language Processing
  • Natural Languages
  • Probability
  • Probability Distributions
  • Random Variables

Fields of Study

  • Computer science

Readers

  • Computational Linguistics

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation