Principle-Based Parsing for Machine Translation

Abstract

Many syntactic parsing strategies for machine translation systems are based entirely on context-free grammars. These parsers require an overwhelming number of rules; thus, translation systems using rule-based parsers either have limited linguistic coverage, or they have poor performance due to formidable grammar size. This report shows how a principle-based parser with a co-routine design improves parsing for translation. The parser consists of a skeletal structure-building mechanism that operates in conjunction with a linguistically based constraint module, passing control back and forth until a set of underspecified skeletal phrase-structures is converted into a fully instantiated parse tree. The modularity of the parsing design accommodates linguistic generalization, reduces the grammar size, allows extension to other languages, and is compatible with studies of human language processing. Keywords: Natural language processing, Interlingual translation, Parsing, Subroutines, Principles vs. Rules, Co-routine design, Linguistic constraints.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: Dec 01, 1987
Accession Number: ADA199183

Entities

People

Bonnie J. Dorr

Organizations

Massachusetts Institute of Technology

Principle-Based Parsing for Machine Translation

Abstract

Document Details

Entities

People

Organizations

Tags

Communities of Interest

DTIC Thesaurus Topics

Readers

Technology Areas