From Blogs to News: Identifying Hot Topics in the Blogosphere

Abstract

We describe the participation of the University of Amsterdam's ILPS group in the blog track at TREC 2009. We focus on the top stories identification task, and take an approach that does not require the headlines of top stories to be known beforehand. We explore the feasibility of a so-called blogs to news approach: given a date and a set of blog posts, identify the main topics for that date. This approach is more general than just finding top stories, but it can still be applied to the task of headline ranking. Results show that this general approach, applied to the task at hand, is among the top performing approaches in this year's TREC.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2009
Accession Number
ADA517750

Entities

People

  • Maarten De Rijke
  • Manos Tsagkias
  • Wouter Weerkamp

Organizations

  • University of Amsterdam

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Data Science
  • Detection
  • Distillation
  • Governments
  • Identification
  • Information Operations
  • Information Science
  • Intelligent Systems
  • Language
  • New York
  • Online Communications
  • Scientific Research
  • Standards
  • Statistical Sampling
  • Statistics

Fields of Study

  • Computer science

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Strategic Security Studies
  • Technical Research and Report Writing.