Microsoft Research at TREC 2009. Web and Relevance Feedback Tracks

Abstract

We took part in the Web and Relevance Feedback tracks, using the ClueWeb09 corpus. To process the corpus, we developed a parallel processing pipeline that avoids generating an inverted file. We describe the components of the parallel architecture and the pipeline, explain how we ran the TREC experiments, and present effectiveness results.

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2009
Accession Number
ADA517733

Entities

People

  • Dennis Fetterly
  • Emine Yilmaz
  • Marc Najork
  • Nick Craswell
  • Stephen Robertson

Organizations

  • Microsoft Research

Tags

Communities of Interest

  • Ground and Sea Platforms
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Collapse
  • Data Processing
  • Extraction
  • Fault Tolerance
  • Feedback
  • Frequency
  • Information Operations
  • Infrastructure
  • Judgment
  • Language
  • Parallel Computing
  • Parallel Processing
  • Standards
  • Training
  • Validation

Readers

  • Database Systems and Applications
  • Information Retrieval
  • Robotics and Automation