Fractals Text Mining Using Bibliometrics and Database Tomography

Abstract

Database Tomography (DT) is a textual database analysis system with two major components: (1) algorithms that extract multi-word phrase frequencies and phrase proximities (the physical closeness of multi-word technical phrases) from any type of large textual database, and (2) the interpretive capabilities of the expert human analyst, which those algorithms augment. DT was used to obtain technical intelligence from a Fractals database derived from the Science Citation Index (SCI) and Social Science Citation Index (SSCI). Phrase-frequency analysis by technical domain experts identified the pervasive technical themes of the Fractals database, and phrase-proximity analysis identified the relationships among those themes. Bibliometric analysis of the Fractals literature supplemented the DT results with publication and citation data by author, journal, and institution.
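
The two algorithmic ideas named in the abstract, phrase frequency and phrase proximity, can be illustrated with a minimal Python sketch. This is not the DT implementation, which the abstract does not specify. The function names, the tokenizer, the bigram phrase length, the five-word window, and the sample text are all assumptions chosen for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def phrase_frequencies(tokens, n=2):
    """Count every contiguous n-word phrase in the token list."""
    return Counter(" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def phrase_proximities(tokens, theme, n=2, window=5):
    """Count n-word phrases that occur within `window` words of a theme phrase.

    Candidate phrases that overlap the theme occurrence itself are skipped.
    The window size is a hypothetical parameter, not a value from the report.
    """
    theme_tokens = theme.lower().split()
    k = len(theme_tokens)
    counts = Counter()
    for i in range(len(tokens) - k + 1):
        if tokens[i:i + k] != theme_tokens:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + k + window)
        for j in range(lo, hi - n + 1):
            # Keep only phrases lying entirely before or entirely after the theme.
            if j + n <= i or j >= i + k:
                counts[" ".join(tokens[j:j + n])] += 1
    return counts


# Illustrative sample text (invented for this sketch, not taken from the database).
sample = (
    "Fractal dimension estimates depend on scale. "
    "The fractal dimension of a coastline is measured by box counting. "
    "Box counting gives the fractal dimension."
)
tokens = tokenize(sample)
freq = phrase_frequencies(tokens)
prox = phrase_proximities(tokens, "fractal dimension")
print(freq.most_common(3))
print(prox.most_common(3))
```

In this sketch the frequency counts stand in for candidate themes, and the proximity counts show which phrases tend to appear near a chosen theme. Interpreting either list is left to the human analyst, as the abstract describes.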

Document Details

Document Type: Technical Report
Publication Date: Jul 01, 2003
Accession Number: ADA416675

Entities

People

  • Guido Malpohl
  • Michael F. Schlesinger
  • Ronald Neil Kostoff

Organizations

  • Office of Naval Research

Tags

Communities of Interest

  • Advanced Electronics
  • Air Platforms
  • Biomedical
  • Energy and Power Technologies
  • Weapons Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Computational Science
  • Computer Science
  • Databases
  • Fluid Dynamics
  • Fluid Flow
  • Geography
  • Information Processing
  • Information Retrieval
  • Information Science
  • Military Research
  • Physical Theories
  • Physics Laboratories
  • Social Sciences
  • Spreadsheet Software
  • Technical Intelligence
  • Text Mining

Readers

  • Computational Linguistics
  • Library and Information Science
  • Wave Propagation and Nonlinear Chaotic Dynamics