UMass at TREC 2008 Blog Distillation Task

Abstract

This paper presents the work done for the TREC 2008 blog distillation task. We introduce two new methods based on blog site search using resource selection which was the framework we used for the TREC 2007 blog distillation task. One is a new factor that penalizes the topical diversity of a blog. The other is a query expansion technique. We compare the methods to strong baselines.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2008
Accession Number
ADA512727

Entities

People

  • Jangwon Seo
  • W. Bruce Croft

Organizations

  • University of Massachusetts Amherst

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Base Lines
  • Distillation
  • Frequency
  • Information Operations
  • Information Retrieval
  • Instructions
  • Judgment
  • Language
  • Massachusetts
  • Materials
  • Materials Processing
  • Sampling
  • Standards
  • Statistical Samples

Fields of Study

  • Computer science

Readers

  • Information Retrieval
  • Regression Analysis.