Improving Passage Retrieval Using Interactive Elicition and Statistical Modeling

Abstract

The University of Maryland and Johns Hopkins University worked together in the 2004 High Accuracy Retrieval from Documents (HARD) track to explore design options for interactive passage retrieval systems. HARD assessors responded to clarification forms by (1) selected additional search terms from an automatically constructed list of potentially discriminating terms, (2) selected relevant passages from an automatically constructed list of possibly relevant passages, and (3) entered additional search terms. Query expansion based on these three types of elicited information yielded statistically significant improvements in R-precision over baselines with and without blind relevance feedback. For topics that requested passages as answers, a preliminary analysis shows that statistical models for passage extent trained on HARD 2003 data yielded a significant improvement over a replication of the University of Maryland's hard-2003 technique for passage extent determination, and the results of the new technique appear to generally be well above the median for HARD 2004 systems.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2006
Accession Number
ADA456309

Entities

People

  • Damianos Karakos
  • Daqing He
  • Dina Demner-fushman
  • Douglas W. Oard
  • Sanjeev Khudanpur

Organizations

  • University of Maryland

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Bandwidth
  • Base Lines
  • Computational Science
  • Computer Science
  • Feedback
  • Hidden Markov Models
  • Information Retrieval
  • Judgment
  • Language
  • Linguistics
  • Machine Learning
  • Markov Chains
  • Markov Models
  • Models
  • Precision
  • Probability
  • Universities

Readers

  • Brain and Cognitive Science; Experimental Psychology; Cognitive Neuroscience
  • Information Retrieval
  • Systems Analysis and Design