Combination of Evidence for Effective Web Search

Abstract

In this paper we describe Carnegie Mellon University's submission to the TREC 2010 Web Track. Our baseline run combines several methods, of which the spam prior and the mixture model proved the most effective. We also experimented with expansion over the Wikipedia corpus and found that picking the right Wikipedia articles for expansion can improve performance substantially. Furthermore, we ran preliminary experiments combining expansion over the Wikipedia corpus with expansion over the top-ranked web pages.

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2010
Accession Number
ADA546564

Entities

People

  • Dong Nguyen
  • Jamie Callan

Organizations

  • Carnegie Mellon University

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Acquisition
  • Base Lines
  • Collapse
  • Composite Materials
  • Electronic Mail
  • Human-Machine Interaction
  • Information Operations
  • Language
  • Natural Language Processing
  • Precision
  • Standards
  • Universities

Fields of Study

  • Computer science

Readers

  • Information Retrieval
  • Software Engineering