Robust Processing of Real-World Natural-Language Texts

Abstract

It is often assumed that when natural language processing meets the real world, the ideal of aiming for complete and correct interpretations has to be abandoned. However, our experience with TACITUS, especially in the MUC-3 evaluation, has shown that principled techniques for syntactic and pragmatic analysis can be bolstered with methods for achieving robustness. We describe and evaluate a method for dealing with unknown words and a method for filtering out sentences irrelevant to the task. We describe three techniques for making syntactic analysis more robust-an agenda-based scheduling parser, a recovery technique for failed parses, and a new technique called terminal substring parsing. For pragmatics processing, we describe how the method of abductive inference is inherently robust, in that an interpretation is always possible, so that in the absence of the required world knowledge, performance degrades gracefully. Each of these techniques have been evaluated and the results of the evaluations are presented.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1991
Accession Number
ADA258837

Entities

People

  • David Magerman
  • Douglas E. Appelt
  • Jerry R. Hobbs
  • John Bear
  • Mabry Tyson

Organizations

  • SRI International

Tags

Communities of Interest

  • Weapons Technologies

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Artificial Intelligence
  • Automated Speech Recognition
  • Errors
  • Explosions
  • Grammars
  • Language
  • Morphology (Linguistics)
  • Natural Language Processing
  • Natural Languages
  • Precision
  • Reasoning
  • Recognition
  • Terrorists
  • Test Sets
  • Text Processing

Fields of Study

  • Computer science
  • Engineering

Readers

  • Computational Linguistics
  • Systems Analysis and Design
  • Team-Based Human-Centered Cognitive Task Decision Making and Information Performance.

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Machine Translation