Phrase Dictionary Distribution Analysis and Growth Prediction Report.

Abstract

The report describes a study of the DDC Phrase Glossary. It includes a computer program to tabulate word frequencies for blocks of phrases of optional sizes. On the basis of these distributions, empirical and statistical analyses are made including two prediction models. Two-word distributions are also included. Based upon the available distributions, a two-word Phrase Glossary size of 320,000 two-word phrases was determined. Also included are analyses of various techniques, such as suffix truncation, imbedded phrases, and query effectiveness. Comparisons are made of the DDC system to other plain language machine retrieval systems. (Author)

Document Details

Document Type
Technical Report
Publication Date
Apr 26, 1974
Accession Number
AD0780957

Entities

People

  • D. J. Stewart
  • J. G. Fisher
  • J. H. Waite
  • R. Boehm
  • S. D. Epstein

Tags

DTIC Thesaurus Topics

  • Computer Programs
  • Computers
  • Computing-Related Activities
  • Data Science
  • Dictionaries
  • Frequency
  • Information Science
  • Interdisciplinary Science
  • Language
  • Mathematical Analysis
  • Mathematics
  • Statistical Analysis
  • Statistics
  • Truncation

Readers

  • Approximation Theory.
  • Computational Linguistics
  • Computer Science.