Coping with Ambiguity and Unknown Words through Probabilistic Models

Abstract

From spring 1990 through fall 1991, we performed a battery of small experiments to test the effectiveness of supplementing knowledge-based techniques with probabilistic models. This paper reports our experiments in predicting parts of speech of highly ambiguous words, predicting the intended interpretation of an utterance when more than one interpretation satisfies all known syntactic and semantic constraints, and learning case frame information for verbs from example uses. From these experiments, we are convinced that probabilistic models based on annotated corpora can effectively reduce the ambiguity in processing text and can be used to acquire lexical information from a corpus, by supplementing knowledge-based techniques. Based on the results of those experiments, we have constructed a new natural language system (PLUM)for extracting data from text, e.g., newswire text.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 01, 1993
Accession Number
ADA574703

Entities

People

  • Jeff Palmucci
  • Lance Ramshaw
  • Marie Meteer
  • Ralph Weischedel
  • Richard Schwartz

Organizations

  • BBN Technologies

Tags

Communities of Interest

  • Human Systems
  • Materials and Manufacturing Processes
  • Weapons Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Ambiguity
  • Artificial Intelligence
  • Computational Linguistics
  • Computational Science
  • Computer Science
  • Databases
  • Dictionaries
  • Grammars
  • Language
  • Linguistics
  • Markov Models
  • Models
  • Natural Language Processing
  • Natural Languages
  • Probabilistic Models
  • Probability

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Instructional Design and Training Evaluation.