Extrinsic Evaluation of Automated Information Extraction Programs

Abstract

Information extraction (IE) plays a vital role in Natural Language Processing and also serves as the foundation for computational visualization of information. Information can be converted into a user-defined, ontology-friendly format much faster and more efficiently by automating IE. Programs like General Architecture for Text Engineering (GATE) and Automap allow entities such as name, date, location, and organization to be extracted from a corpus written in a natural language. Verbs and other parts of speech can be extracted using these programs as well. The extracted information can then be formatted into a computer-readable language for visualization and populating a database for use by the fusion community to provide actionable intelligence for the Warfighter. This technical note documents the results of the comparison of the IE tools offered by GATE versus those in Automap.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
May 01, 2010
Accession Number
ADA533074

Entities

People

  • Frank Small
  • William Tanenbaum

Organizations

  • United States Army Research Laboratory

Tags

Communities of Interest

  • Human Systems

DTIC Thesaurus Topics

  • Computers
  • Databases
  • Engineering
  • Extraction
  • Graphical User Interface
  • Information Science
  • Language
  • Military Research
  • Natural Language Processing
  • Natural Languages
  • Ontologies
  • Precision
  • Preprocessing
  • Test And Evaluation
  • User Interface

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Geospatial Intelligence and Artificial Intelligence Analytics
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Machine Translation