Semantic Tagging using a Probabilistic Context Free Grammar

Abstract

This paper describes a statistical model for extraction of events at the sentence level, or "semantic tagging", typically the first level of processing in Information Extraction systems. We illustrate the approach using a management succession task, tagging sentences with three slots involved in each succession event: the post, person coming into the post, and person leaving the post. The approach requires very limited resources: a part-of-speech tagger; a morphological analyzer; and a set of training examples that have been labeled with the three slots and the indicator (verb or noun) used to express the event. Training on 560 sentences, and testing on 356 sentences, shows the accuracy of the approach is 77.5% (if partial slot matches are deemed incorrect) or 87.8% (if partial slot matches are deemed correct).

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1998
Accession Number
ADA458893

Entities

People

  • Michael D. Collins
  • Scott R. Miller

Organizations

  • University of Pennsylvania

Tags

Communities of Interest

  • Autonomy
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Analyzers
  • Artificial Intelligence
  • Automated Speech Recognition
  • Communication Systems
  • Computational Science
  • Context Free Grammars
  • Grammars
  • Language
  • Linguistics
  • Machine Learning
  • Models
  • Natural Languages
  • Probabilistic Models
  • Probability
  • Probability Distributions
  • Recognition
  • Standards

Readers

  • Computational Linguistics
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Machine Translation