DATA FILE SIZE AND ITS RELATION TO THE BAYESIAN EFFECTIVENESS OF AN INFORMATION RETRIEVAL SYSTEM

Abstract

A simple Bayesian measure of system effectiveness for information retrieval systems is proposed. The measure combines the recall and precision ratios of an information system with the utility structure of the system user. Using the measure, it is possible to show that effective systems are possible only under a very narrow set of conditions. In particular, it is shown that using present state-of-the-art indexing, it is not possible to have effective systems with file sizes much in excess of 100,000 documents.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 1965
Accession Number
AD0618311

Entities

People

  • Ugo O. Gagliardi

Tags

Communities of Interest

  • C4I

DTIC Thesaurus Topics

  • Abstracts
  • Air Force
  • Availability
  • Computer Programming
  • Computer Programs
  • Computers
  • Contractors
  • Contracts
  • Government Procurement
  • Governments
  • Indexes
  • Information Retrieval
  • Information Systems
  • Precision
  • Probability
  • Security
  • Statistical Analysis

Fields of Study

  • Computer science

Readers

  • Computer Science.
  • Neural Network Machine Learning.
  • Regression Analysis.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Information Retrieval