IBM's Statistical Question Answering System - TREC-11

Abstract

In this paper, we document our efforts to extend our statistical question answering system for TREC-11. We incorporated a web search feature, novel extensions of statistical machine translation, and the extraction of lexical patterns for exact answers from a supervised corpus. Without modifying our base set of thirty-one categories, we achieved a confidence weighted score of 0.455 and an accuracy of 29%. We improved our model for selecting exact answers by requiring exact answers in the training corpus; this resulted in a 7% gain on TREC-11 but a much larger gain of 46% on TREC-10.
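
The abstract reports two figures, a confidence weighted score and an accuracy. The sketch below shows how the two differ. It assumes the usual TREC-11 definition of the confidence weighted score: answers are sorted from most to least confident, and the score is the average, over every rank i, of the fraction of correct answers in the first i positions. The function names and the toy judgments are illustrative only and are not taken from the report.

```python
def confidence_weighted_score(judgments):
    """Confidence weighted score for answers ordered most-confident first.

    judgments: list of booleans, True where the answer at that rank is correct.
    Assumed definition: (1/N) * sum over ranks i of (correct in first i) / i.
    """
    if not judgments:
        raise ValueError("at least one judged answer is required")
    correct_so_far = 0
    total = 0.0
    for rank, is_correct in enumerate(judgments, start=1):
        if is_correct:
            correct_so_far += 1
        total += correct_so_far / rank  # precision at this rank
    return total / len(judgments)


def accuracy(judgments):
    """Plain fraction of correct answers, independent of ordering."""
    if not judgments:
        raise ValueError("at least one judged answer is required")
    return sum(1 for j in judgments if j) / len(judgments)


# Toy data (hypothetical): the same two correct answers, ranked well vs. badly.
well_ranked = [True, False, True, False]
badly_ranked = [False, False, True, True]

cws_well = confidence_weighted_score(well_ranked)
cws_bad = confidence_weighted_score(badly_ranked)
acc_well = accuracy(well_ranked)
acc_bad = accuracy(badly_ranked)
```

Both toy orderings have the same accuracy, but the ordering that places correct answers first earns the higher confidence weighted score. This is how a system's confidence weighted score can exceed its raw accuracy when its confidence estimates are informative.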

Document Details

Document Type: Technical Report
Publication Date: Jan 01, 2006
Accession Number: ADA456310

Entities

People

  • Abraham Ittycheriah
  • Salim Roukos

Organizations

  • IBM Thomas J. Watson Research Center

Tags

Communities of Interest

  • Air Platforms

DTIC Thesaurus Topics

  • Abstracts
  • Accuracy
  • Distribution Functions
  • Feedback
  • Iberian Peninsula
  • Information Operations
  • Judgment
  • Machine Translation
  • Mathematics
  • Natural Language Processing
  • Probability
  • Rejection
  • Test And Evaluation
  • Test Sets
  • Training
  • Translations

Fields of Study

  • Computer science

Readers

  • Computational Linguistics

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Information Retrieval
  • AI & ML - Machine Translation