THUIR at TREC 2009 Web Track: Finding Relevant and Diverse Results for Large Scale Web Search
Abstract
This is the eighth year that the IR group of Tsinghua University (THUIR) has participated in TREC. This year we focus on the Web track, which contains two tasks: ad hoc and diversity. For the ad hoc task, we improved the efficiency of our distributed retrieval system, TMiner, so that it can handle terabytes of Web data. We then carried out three studies: page quality estimation, ranking feature analysis, and model comparison. For the diversity task, we proposed several new approaches to searching strategy, user intention detection, and duplicate elimination. To mine users' intentions, we proposed and compared two strategies: 'searching + content-based diversity', which is a form of result clustering, and 'user-based diverse intention prediction + searching', which falls under query expansion.
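As a rough illustration of what a 'searching + content-based diversity' step with duplicate elimination might look like, the sketch below re-ranks an already retrieved result list greedily. It is not the THUIR method, which the abstract does not specify: the MMR-style scoring, the Jaccard token similarity, the function names `jaccard` and `diversify`, the parameters `trade_off` and `dup_threshold`, and the sample data are all assumptions made for this example.

```python
def jaccard(a: set, b: set) -> float:
    """Jaccard similarity between two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def diversify(results, k=3, trade_off=0.5, dup_threshold=0.9):
    """Greedy MMR-style re-ranking (hypothetical helper, not from the report).

    results: list of (doc_id, relevance_score, text) tuples.
    Returns up to k doc_ids, trading relevance against similarity to the
    documents already chosen and skipping near-duplicates.
    """
    # Represent each document by its set of lowercased whitespace tokens.
    candidates = [(doc_id, score, set(text.lower().split()))
                  for doc_id, score, text in results]
    selected = []
    while candidates and len(selected) < k:
        best, best_value = None, float("-inf")
        for cand in candidates:
            _, score, tokens = cand
            # Similarity to the closest document already selected.
            max_sim = max((jaccard(tokens, s[2]) for s in selected), default=0.0)
            if max_sim >= dup_threshold:
                continue  # treat as a near-duplicate and never select it
            value = trade_off * score - (1 - trade_off) * max_sim
            if value > best_value:
                best, best_value = cand, value
        if best is None:
            break  # only near-duplicates remain
        selected.append(best)
        candidates.remove(best)
    return [doc_id for doc_id, _, _ in selected]


# Made-up results for an ambiguous query ("jaguar"); scores are illustrative.
results = [
    ("d1", 0.90, "jaguar car dealer prices"),
    ("d2", 0.85, "jaguar car dealer prices"),        # exact duplicate of d1
    ("d3", 0.60, "jaguar animal habitat rainforest"),
    ("d4", 0.80, "jaguar car dealer reviews"),
]
print(diversify(results, k=3))
```

In this toy run the duplicate `d2` is never selected, and the lower-scored but topically different `d3` is promoted above `d4`. The alternative strategy named in the abstract, predicting diverse intents first and then searching, would instead issue several expanded queries and merge their result lists.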
Document Details
- Document Type: Technical Report
- Publication Date: Nov 01, 2009
- Accession Number: ADA517807
Entities
People
- Bo Zhou
- F. Chen
- J. W. Miao
- Mei Zhang
- Q. L. Xing
- R. W. Chen
- S. P. Ma
- Tong Zhu
- Y. F. Xue
- Yu Jin
- Yueqiang Liu
- Z. C. Li
Organizations
- Tsinghua University