One Sense Per Collocation

Abstract

Previous work [Gale, Church and Yarowsky, 1992] showed that with high probability a polysemous word has one sense per discourse. In this paper we show that for certain definitions of collocation, a polysemous word exhibits essentially only one sense per collocation. We test this empirical hypothesis for several definitions of sense and collocation, and discover that it holds with 90-99% accuracy for binary ambiguities. We utilize this property in a disambiguation algorithm that achieves precision of 92% using combined models of very local context.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1993
Accession Number
ADA458621

Entities

People

  • David Yarowsky

Organizations

  • University of Pennsylvania

Tags

Communities of Interest

  • Human Systems
  • Materials and Manufacturing Processes
  • Weapons Technologies

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Ambiguity
  • Character Recognition
  • Data Sets
  • English Language
  • Foreign Languages
  • Frequency
  • Information Retrieval
  • Information Science
  • Language
  • Linguistics
  • Neural Networks
  • Optical Character Recognition
  • Precision
  • Probability
  • Probability Distributions

Fields of Study

  • Computer science

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Materials Science (Mechanical Engineering).
  • Regression Analysis.