Experiments with ClueWeb09: Relevance Feedback and Web Tracks

Abstract

In this paper, we report on our TREC experiments with the ClueWeb09 document collection. We participated in the relevance feedback and web tracks. While our phase 1 relevance feedback run's performance was good, our other relevance feedback and web track submissions' performances were lacking. We suspect this performance difference is caused by the Category B document subset of the ClueWeb09 collection having a higher prior probability of relevance than the rest of the collection. Future work will involve a more detailed error analysis of our experiments.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2009
Accession Number
ADA517764

Entities

People

  • Charles L. Clarke
  • Gordon V. Cormack
  • Mark D. Smucker

Organizations

  • University of Waterloo

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Computer Programming
  • Computer Science
  • Contracts
  • Education
  • Electronic Mail
  • Feedback
  • Information Operations
  • Instructions
  • Language
  • Maryland
  • Probability
  • Standards
  • United States
  • Universities

Fields of Study

  • Computer science

Readers

  • Information Retrieval
  • Regression Analysis.