AUTOMATIC INDEXING FROM MACHINE READABLE ABSTRACTS OF SCIENTIFIC DOCUMENTS

Abstract

The state of the art of machine indexing is reported. Various proposed machine indexing methods are reviewed and evaluated. Methods are described for comparing machine indexing with human indexing, and for comparing machine indexing systems with one another. Possible approaches to solving various problems in machine indexing are indicated. The report describes the design of the Formal Autoindexing of Scientific Texts (FAST) system. Characteristics of Uniterm co-ordinate indexes are investigated, and generalizations to scientific indexes are made. Laws for the formation of words in the indexing language are derived and verified. The operational principles of the FAST system and test results for various system components are reported. Indexes produced by the FAST method are compared with those produced by human indexers for inter-indexer and intra-indexer consistency. A method for the formal evaluation of indexes, based on an information theory approach, is presented and applied to the FAST and conventional indexes. It is concluded that the FAST system can produce Uniterm co-ordinate indexes that meet users' requirements, and that it can do so better and faster than human indexers.
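
The abstract does not say how inter-indexer consistency is scored, so the following is only an illustrative sketch of one common way to compare two sets of index terms: the number of terms both indexers assigned, divided by the number of distinct terms either assigned. The function name `indexer_consistency` and the sample term lists are hypothetical and are not taken from the report.

```python
def indexer_consistency(terms_a, terms_b):
    """Overlap ratio between two sets of index terms (assumed measure).

    Returns |A ∩ B| / |A ∪ B| after normalizing case and whitespace.
    This is an illustrative choice, not necessarily the report's method.
    """
    a = {t.strip().lower() for t in terms_a}
    b = {t.strip().lower() for t in terms_b}
    union = a | b
    if not union:
        return 1.0  # two empty indexes trivially agree
    return len(a & b) / len(union)


# Hypothetical term assignments for one abstract.
human = ["Indexing", "Information retrieval", "Abstracts"]
machine = ["indexing", "abstracts", "Computers"]

score = indexer_consistency(human, machine)
print(f"consistency = {score:.2f}")
```

Applying the same function to two term lists produced by one indexer at different times would give an intra-indexer figure under the same assumption.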

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 1965
Accession Number
AD0481148

Entities

People

  • Pranas Zunde

Tags

Communities of Interest

  • Advanced Electronics
  • Biomedical
  • C4I
  • Space
  • Weapons Technologies

DTIC Thesaurus Topics

  • Air Force
  • Chemical Synthesis
  • Chemistry
  • Computational Science
  • Computer Programs
  • Computers
  • Data Processing
  • Data Science
  • Databases
  • Information Processing
  • Information Retrieval
  • Information Science
  • Information Systems
  • Language
  • Linguistics
  • Mathematical Models
  • Surveys

Readers

  • Library and Information Science
  • Manufacturing Engineering
  • Theoretical Analysis