Description of the SAIC DX System as Used for MUC-6

Abstract

This is a very young project, operational for only a few months. The focus of the effort is dataextraction, the identification of instances of data-classes in "commercial" text --e.g., newspapers, technical reports, business correspondence, intelligence briefs. Instances of data-classes in text are phrases which identify factual content, such as names of people or organizations, products, financial amounts, quantities, and so forth. A number of applications would be very well served simply with automated and accurate identification of the data-classes that occurred in their texts of interest, with interpretations left to experts. Such applications include extraction of bibliographic information, document indexing, competitive analyses based on open sources. technical information retrieval, foreign technology and political assessments, tracking financial and other resource transactions in the written media, and various types of link analyses based on text correlations.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 1995
Accession Number
ADA633960

Entities

People

  • Lance A. Miller

Organizations

  • Leidos

Tags

Communities of Interest

  • C4I
  • Human Systems

DTIC Thesaurus Topics

  • Abstracts
  • Classification
  • Computer Programming
  • Corporations
  • Databases
  • Debugging
  • Detection
  • Extraction
  • Foreign Technology
  • Fuzzy Logic
  • Grammars
  • Identification
  • Language
  • Money
  • Pattern Recognition
  • Programming Languages
  • Recognition

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Gender and Food Studies
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - DoD AI Strategy
  • AI & ML - Information Retrieval