Science and Technology Text Mining: Electric Power Sources

Abstract

Database Tomography (DT) is a textual database analysis system consisting of two major components: 1) algorithms for extracting multi-word phrase frequencies and phrase proximities (physical closeness of the multi-word technical phrases) from any type of large textual database, to augment 2) interpretative capabilities of the expert human analyst. DT was used to derive technical intelligence from a Power Sources database derived from the Science Citation Index (SCI). Phrase frequency analysis by the technical domain experts provided the pervasive technical themes of the Power Sources database, and the phrase proximity analysis provided the relationships among the pervasive technical themes. Bibliometric analysis of the Power Sources literature supplemented the DT results with author/ journal/ institution/ country publication and citation data.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 2004
Accession Number
ADA421789

Entities

People

  • George Karypis
  • James A. Humenik
  • Kirstin M. Pfeil
  • Rene Tshiteya
  • Ronald Neil Kostoff

Organizations

  • Office of Naval Research

Tags

DTIC Thesaurus Topics

  • Chemical Synthesis
  • Chemistry
  • Climate Change
  • Electric Power
  • Energy
  • Energy Production
  • Energy Storage
  • Energy Transfer
  • Environmental Protection
  • Heat Transfer
  • Information Science
  • Material Degradation Processes
  • Materials
  • Materials Laboratories
  • Materials Processing
  • Materials Science
  • Natural Language Processing

Readers

  • Computational Linguistics
  • Library and Information Science