Document Retrieval Systems,

Abstract

Some of the mathematical properties of a term matching information retrieval system are investigated. It is shown that the common retrieval method of using a query vector, a matching function, and a threshold is equivalent to retrieving documents by requiring that a specific mathematical combination of the over and under indexing errors between the query vector and document index vector is bounded. Furthermore, the over and under indexing error set provides a sample space for a probabilistic description of the retrieval process. Using this approach an explicit form of the expected recall ratio is derived. (Author)

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1972
Accession Number
AD0737042

Entities

People

  • Edward Abraham Mark

Organizations

  • University of Illinois Urbana–Champaign

Tags

DTIC Thesaurus Topics

  • Information Retrieval

Readers

  • Approximation Theory.
  • Database Systems and Applications

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Information Retrieval
  • Space