Evaluating the Quality of Query Refinement Suggestions in Information Retrieval

Abstract

Automatic suggestion of alternative terms to refine a user's query is an effective technique for helping users quickly narrow down to their specific information need. However, evaluating the effectiveness of these suggestions has remained quite subjective, with the vast majority of past work relying on expensive user studies. In this work, we look at this problem from the IR perspective. We propose two objective measures that evaluate the quality of Query Refinement (QR) suggestions, based on the degree to which the documents retrieved by the QR suggestions, when used as queries, capture the overall sub-topical structure underlying the topic of the original query. The first measure, known as Maximum Matching Averaged Mean Average Precision (MM-AMAP), requires labeled documents for the sub-topics underlying the query's topic. The second measure, which we call Distinctness and MAP based F1 (DMAP-F1), requires only labeled documents that are relevant to the original query. We also define a series of simple QR suggestion techniques, each of which is intuitively better than the previous ones, and evaluate them using our measures on the TDT3 and TDT4 corpora. Our experiments show that our evaluation metrics numerically capture our intuitive expectations of performance, thus informally validating our measures. Further, we show that the second metric, DMAP-F1, which does not require sub-topic judgments, yields consistent results and is statistically highly correlated with the first metric. This will allow us to perform extensive evaluations of the quality of QR suggestion techniques on standard TREC collections in the future.
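
The abstract names the measures but does not define them, so the following is only an illustrative sketch. `average_precision` is the standard IR definition. `max_matching_avg_ap` is a hypothetical reading of the "maximum matching" idea: pair each QR suggestion with at most one sub-topic so that the summed average precision is maximal, then average over sub-topics. The function names, the brute-force search, and the normalization are assumptions made for illustration, not the report's actual formulation.

```python
from itertools import permutations


def average_precision(ranked, relevant):
    """Standard average precision of one ranked list against a set of relevant doc ids."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank  # precision at this relevant document's rank
    return total / len(relevant)


def max_matching_avg_ap(suggestion_rankings, subtopic_relevant):
    """Hypothetical sketch: best one-to-one pairing of suggestions with sub-topics.

    suggestion_rankings: one ranked list of doc ids per QR suggestion.
    subtopic_relevant:   one set of relevant doc ids per sub-topic.
    Returns the best total AP over all one-to-one pairings, divided by the
    number of sub-topics (an assumed normalization).
    """
    n_sugg = len(suggestion_rankings)
    n_sub = len(subtopic_relevant)
    if n_sugg == 0 or n_sub == 0:
        return 0.0
    # ap[i][j] = AP of suggestion i's ranking judged against sub-topic j
    ap = [[average_precision(r, rel) for rel in subtopic_relevant]
          for r in suggestion_rankings]
    # Brute-force search over pairings; fine for a handful of suggestions only.
    if n_sugg <= n_sub:
        totals = (sum(ap[i][j] for i, j in enumerate(p))
                  for p in permutations(range(n_sub), n_sugg))
    else:
        totals = (sum(ap[i][j] for j, i in enumerate(p))
                  for p in permutations(range(n_sugg), n_sub))
    return max(totals) / n_sub


if __name__ == "__main__":
    # Toy data (invented for illustration): two suggestions, two sub-topics.
    rankings = [["d1", "d2", "d3"], ["d4", "d1", "d5"]]
    subtopics = [{"d1", "d2"}, {"d4"}]
    print(max_matching_avg_ap(rankings, subtopics))
```

For more than a handful of suggestions, the brute-force search would normally be replaced by a polynomial-time assignment algorithm; it is kept here only to make the pairing step explicit.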

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2006
Accession Number
ADA454796

Entities

People

  • Chirag Shah
  • Ramesh Nallapati

Organizations

  • University of Massachusetts Amherst

Tags

Communities of Interest

  • Air Platforms
  • Biomedical

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Automated Speech Recognition
  • Automobiles
  • Clustering
  • Computer Science
  • Computers
  • Feedback
  • Information Retrieval
  • Judgment
  • Language
  • New York
  • Precision
  • Standards
  • Test And Evaluation
  • Test Sets
  • United States

Fields of Study

  • Computer science

Readers

  • Information Retrieval
  • Instructional Design and Training Evaluation
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Information Retrieval