Handling Ill-Formed Input: Session Introduction,

Abstract

This session consists of papers by Jensen and Heidorn; Marsh; and Granger et al. The paper by Jensen and Heidorn presents a particular heuristic for dealing with unparsable input. Since they have separate explicit heuristics for specific ungrammatical forms, a significant proportion of unparsable input in their system will be relatively ill-formed. Marsh's paper describes a technique for filling in material omitted from fragmentary inputs. Both syntactic information and domain-specific constraints on semantic classes are used. The paper by Granger et al. reports on NOMAD, a system for taking cryptic, errorful naval ship-to-shore messages and generating well-formed versions. The paper describes the methods used for processing unknown words, fragments, missing punctuation, and tense errors.

Document Details

Document Type
Technical Report
Publication Date
Feb 01, 1983
Accession Number
ADP001172

Entities

People

  • Ralph M. Weischedel

Organizations

  • University of Delaware

Tags

DTIC Thesaurus Topics

  • California
  • Computer Languages
  • Formal Languages
  • Language
  • Materials
  • Natural Language Processing
  • Natural Languages

Readers

  • Computational Linguistics