Confidence Links Between Name Entities in Disparate Documents

Abstract

The invention relates to cross-document entity co-reference systems in which naturally occurring entity mentions in a document corpus are analyzed and transformed into name clusters that represent global entities. In a first aspect of the invention, a name variation module analyzes naturally occurring names of entities extracted from the document corpus and provides an initial set of equivalent names that could refer to the same real world entity. In a second aspect of the invention a disambiguation module takes the initial set of equivalent names and uses an agglomerative clustering algorithm to disambiguate the potentially co-referent named entities.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 03, 2013
Accession Number
ADA599551

Entities

People

  • Alex Baron
  • Elizabeth M. Boschee
  • Marjorie R. Freedman
  • Ralph M. Weischedel

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Clustering
  • Computers
  • Computing Devices
  • Databases
  • Electronic Mail
  • Information Retrieval
  • Instructions
  • Inventions
  • Language
  • Law
  • Natural Language Processing
  • Natural Languages
  • Patents
  • United Nations
  • United States

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Theoretical Analysis.