PROTEUS (PROtotype TExt Understanding System) and PUNDIT (Prolog UNDerstander of Integrated Text): Research in Text Understanding at the Department of Computer Science, New York University and System Development Corporation--A Burroughs Company.

Abstract

We are engaged in the development of systems capable of analyzing short narrative messages dealing with a limited domain and extracting the information contained in the narrative. These systems are initially being applied to messages describing equipment failure. This work is a joint effort of New York University and the System Development Corp. for the DARPA Strategic Computing Program. Our aim is to create a system reliable enough for use in an operational environment. This is a formidable task, both because the texts are unedited (and so contain various errors) and because the complexity of any real domain precludes us from assembling a complete collection of the relationships and domain knowledge relevant to understanding texts in the domain. Our basic approach to increasing reliability will be to bring to bear on the analysis task as many different types of constraints as possible. These include constraints related to syntax, semantics, domain knowledge, and discourse structure. In order to be able to capture the detailed knowledge about the domain that is needed for correct message analyses, we are initially limiting ourselves to messages about one particular piece of equipment (the starting air compressor); if we are successful in this narrow domain, we intend to gradually broaden our system.

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 1986
Accession Number
ADA173891

Entities

People

  • Lynette Hirschman
  • Ralph David Grishman

Organizations

  • New York University

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Air Compressors
  • Analyzers
  • Command Centers
  • Compressors
  • Computer Programming
  • Computer Science
  • Computers
  • Corporations
  • Grammars
  • Language
  • Lisp Programming Language
  • Message Processing
  • Natural Languages
  • New York
  • Semantics
  • Symbolic Programming
  • Teamwork

Readers

  • Artificial Intelligence
  • Systems Analysis and Design
  • Technical Research and Report Writing