External Query Expansion in the Blogosphere

Abstract

We describe the participation of the University of Amsterdam's ILPS group in the blog track at TREC 2008. We mainly explored different ways of using external corpora to expand the original query. In the blog post retrieval task we did not succeed in improving over a simple baseline (equal weights for both the expanded and original query). Obtaining optimal weights for the original and the expanded query remains a subject of investigation. In the blog distillation task we tried to improve over our (strong) baseline using external expansion, but due to differences in the run setup, comparing these runs is hard. Compared to a simpler baseline, we see an improvement for the run using external expansion on the combination of news, Wikipedia and blog posts.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2008
Accession Number
ADA512721

Entities

People

  • Maarten De Rijke
  • Wouter Weerkamp

Organizations

  • University of Amsterdam

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Base Lines
  • Distillation
  • Governments
  • Indicators
  • Information Operations
  • Instructions
  • Netherlands
  • Online Communications
  • Polarity
  • Precision
  • Probability
  • Scientific Research
  • Standards
  • Universities

Readers

  • Information Retrieval
  • Systems Analysis and Design