THUIR at TREC 2009 Web Track: Finding Relevant and Diverse Results for Large Scale Web Search

Abstract

This is the eighth year that the IR group of Tsinghua University (THUIR) has participated in TREC. This year we focus on the Web track, which contains two tasks: ad hoc and diversity. For the ad hoc task, we improved the efficiency of our distributed retrieval system, TMiner, to handle terabytes of Web data. We then carried out three studies: page quality estimation, ranking feature analysis, and model comparison. For the diversity task, we proposed several new approaches to search strategy, user intention detection, and duplicate elimination. To mine users' intentions, we proposed and compared two strategies: 'searching + content-based diversity', which is a form of result clustering, and 'user-based diverse intention prediction + searching', which is a form of query expansion.

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2009
Accession Number
ADA517807

Entities

People

  • Bo Zhou
  • F. Chen
  • J. W. Miao
  • Mei Zhang
  • Q. L. Xing
  • R. W. Chen
  • S. P. Ma
  • Tong Zhu
  • Y. F. Xue
  • Yu Jin
  • Yueqiang Liu
  • Z. C. Li

Organizations

  • Tsinghua University

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Clustering
  • Compression
  • Compression Ratio
  • Computer Science
  • Detection
  • Electronic Mail
  • Elimination
  • Filtration
  • Frequency
  • Information Science
  • Models
  • Operating Systems
  • Probabilistic Models
  • Probability
  • Training

Fields of Study

  • Computer science

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments
  • Database Systems and Applications
  • Information Retrieval