Towards a Unified Approach to Memory- and Statistical-Based Machine Translation

Abstract

We present a set of algorithms that enable us to translate natural language sentences by exploiting both a translation memory and a statistical-based translation model. Our results show that an automatically derived translation memory can be used within a statistical framework to often find translations of higher probability than those found using solely a statistical model. The translations produced using both the translation memory and the statistical model are significantly better than translations produced by two commercial systems: our hybrid system translated perfectly 58% of the 505 sentences in a test collection, while the commercial systems translated perfectly only 40-42% of them.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2001
Accession Number
ADA461149

Entities

People

  • Daniel Marcu

Organizations

  • University of Southern California

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Decoding
  • Fertility
  • Grammars
  • Hybrid Systems
  • Information Operations
  • Information Science
  • Language
  • Linguistics
  • Machine Translation
  • Natural Language Processing
  • Natural Languages
  • Probability
  • Test And Evaluation
  • Translations

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Computational Modeling and Simulation

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Translation