Oxygen: A Language Independent Linerization Engine

Abstract

This paper describes a language independent linearization engine, oxyGen. This system compiles target language grammars into programs that take feature graphs as inputs and generate word lattices that can be passed along to the statistical extraction module of the generation system Nitrogen. The grammars are written using a flexible and powerful language, oxyL, that has the power of a programming language but focuses on natural language realization. This engine have been used successfully in creating an English linearization program that is currently used as part of a Chinese-English machine translation system.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 01, 2000
Accession Number
ADA458719

Entities

People

  • Nizar Habash

Organizations

  • University of Maryland

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Automated Text Summarization
  • Compilers
  • Computer Programming
  • Computers
  • Demographic Cohorts
  • Extraction
  • Grammars
  • Hierarchies
  • Information Science
  • Language
  • Machine Translation
  • Natural Languages
  • Programming Languages
  • Translations
  • United States
  • Universities

Fields of Study

  • Computer science

Readers

  • Computational Linguistics

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation