Generalizing Semantic Role Annotations Across Syntactically Similar Verbs

Abstract

Large corpora of parsed sentences with semantic role labels (e.g. PropBank) provide training data for use in the creation of high-performance automatic semantic role labeling systems. Despite the size of these corpora, individual verbs (or rolesets) often have only a handful of instances in these corpora, and only a fraction of English verbs have even a single annotation. In this paper, we describe an approach for dealing with this sparse data problem, enabling accurate semantic role labeling for novel verbs (rolesets) with only a single training example. Our approach involves the identification of syntactically similar verbs found in Prop-Bank, the alignment of arguments in their corresponding rolesets, and the use of their corresponding annotations in Prop-Bank as surrogate training data.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 01, 2007
Accession Number
ADA470421

Entities

People

  • Andrew S. Gordon
  • Reid Swanson

Organizations

  • University of Southern California

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Automatic
  • Computational Linguistics
  • Computational Science
  • Experimental Design
  • Frequency
  • Identification
  • Information Operations
  • Language
  • Linguistics
  • Machine Learning
  • Military Research
  • Natural Language Processing
  • Natural Languages
  • Precision
  • Supervised Machine Learning
  • Training

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Neural Network Machine Learning.