Effective and Efficient Automatic Database Selection

Abstract

We examine a class of database selection algorithms that require only document frequency information. The CORI algorithm is an instance of this class of algorithms. In previous work, we showed that CORI is more effective than (g)GLOSS when evaluated against a relevance-based standard. In this paper, we introduce a family of other algorithms in this class and examine components of these algorithms and of the CORI algorithm to begin identifying the factors responsible for their performance. We establish that the class of algorithms studied here is more effective and efficient than (g)GLOSS and is applicable to a wider variety of operational environments. In particular, this methodology is completely decoupled from the database indexing technology so is as useful in heterogeneous environments as in homogeneous environments.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1999
Accession Number
ADA478116

Entities

People

  • Allison L. Powell
  • James C. French
  • Jamie Callan

Organizations

  • University of Virginia

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Automatic
  • Availability
  • Classification
  • Computers
  • Contracts
  • Databases
  • Environment
  • Frequency
  • Information Operations
  • Instructions
  • Massachusetts
  • Monitoring
  • Security
  • Standards

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Medical Imaging.
  • Systems Analysis and Design