ICTNET at Web Track TREC2014

Abstract

An ad-hoc task in TREC investigates the performance of systems that search a static set of documents using previously unseen topics. This year, the ClueWeb12 [1] dataset is used. The overall goal of the risk-sensitive task is to explore algorithms and evaluation methods for systems that try to jointly maximize an average effectiveness measure across queries while minimizing effectiveness losses with respect to a provided baseline. Two baselines from different IR systems are supplied this year in order to understand the nature of the risk-reward tradeoffs achievable by a system that can adapt to different baselines. The rest of this paper is organized as follows. In Section 2, we discuss the processing of ClueWeb12, derived data, and external resources. In Section 3, we introduce the BM25 model with term proximity, the diversification method, and the results fusion strategy. We report experimental results and the corresponding re-ranking strategy in Section 4. Finally, we conclude our work in Section 5.

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2014
Accession Number
ADA618656

Entities

People

  • Feng Guan
  • Man Du
  • Xiaoming Yu
  • Xipeng Li
  • Xueqi Cheng
  • Yuanhai Xue
  • Yue Liu

Organizations

  • Chinese Academy of Sciences

Tags

DTIC Thesaurus Topics

  • Abstracts
  • African Americans
  • Algorithms
  • Availability
  • Classification
  • Contracts
  • Data Processing
  • Data Science
  • Frequency
  • Information Operations
  • Instructions
  • Maryland
  • Monitoring
  • Platforms
  • Recognition
  • Standards

Fields of Study

  • Computer science

Readers

  • Information Retrieval
  • Systems Analysis and Design
  • Team-Based Human-Centered Cognitive Task Decision Making and Information Performance