HIGH-SPEED DOCUMENT PERUSAL

Abstract

This report includes: A SAMPLING PROCEDURE FOR CLUSTERING SIMILAR DOCUMENTS, by C. T. Abraham. Dec 61. TIME ESTIMATION IN BOOLEAN INDEX SEARCHING, by E. Wong. Dec 61. AN ENGLISH-LIKE EXTENSION OF AN APPLIED PREDICATE CALCULUS, by H. Bohnert. Feb 62. AFOSR support led to the accomplishment of five results concerning the possibility of high-speed document perusal. These are: derivation of a formula for the average time to search an index; algorithms for translating English-like sentences into logic-like sentences; development of efficient techniques for grouping similar texts; implementation of a high-speed automatic dictionary lookup procedure; and, construction of computer programs for constructing representative abstracts and index terms. These constitute some first steps toward experimentally demonstrating the feasibility of a cooperative man-machine system for high-quality, high-speed perusal of large document collections; also, toward solving some of the basic logico-linguistic problems in the way of more completely automatic, high-quality perusal. (Author)

Document Details

Document Type
Technical Report
Publication Date
May 01, 1962
Accession Number
AD0285255

Entities

People

  • C. T. Abraham
  • M. Kochen

Organizations

  • IBM Thomas J. Watson Research Center

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Automatic
  • Calculus
  • Clustering
  • Computer Programs
  • Computers
  • Construction
  • Dictionaries
  • Human-Machine Systems
  • Index Terms
  • Indexes
  • Sampling

Readers

  • Computer Engineering
  • Database Systems and Applications
  • Statistical Inference