ICTNET at Web Track TREC2014
Abstract
An ad-hoc task in TREC investigates the performance of systems that search a static set of documents using previously- unseen topics. This year, the ClueWeb12 [1] dataset are used. The overall goal of the risk - sensitive task is to explore algorithms and evaluation methods for systems that try to jointly maximize an average effectiveness measure across queries, while minimizing effectiveness losses with respect to a provided baseline. Two baselines from different IR systems are supplied this year in order to understand the nature of risk- reward tradeoffs achievable by a system that can adapt to different baselines. The rest of this paper is organized as follows. In Section 2, we discuss the processing of ClueWeb 12, derived data and external resources. In Section 3, the BM25 model with term proximity , the diversification method and the results fusion strategy are introduced. We report experimental results and the corresponding re-ranking strategy in Section 4. Finally, our work is concluded in Section 5.
Document Details
- Document Type
- Technical Report
- Publication Date
- Nov 01, 2014
- Accession Number
- ADA618656
Entities
People
- Feng Guan
- Man Du
- Xiaoming Yu
- Xipeng Li
- Xueqi Cheng
- Yuanhai Xue
- Yue Liu
Organizations
- Chinese Academy of Sciences