Extracting Answers from the Web Using Knowledge Annotation and Knowledge Mining Techniques

Abstract

Aranea is a question answering system that extracts answers from the World Wide Web using knowledge annotation and knowledge mining techniques. Knowledge annotation, which utilizes semistructured database techniques, is effective for answering large classes of commonly occurring questions. Knowledge mining, which utilizes statistical techniques, can leverage the massive amounts of data available on the Web to overcome many natural language processing challenges. Aranea integrates these two different paradigms of question answering into a single framework. For the TREC evaluation, we also explored the problem of answer projection, or finding supporting documents for our Web-derived answers from the AQUAINT corpus.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2006
Accession Number
ADA456267

Entities

People

  • Aaron Fernandes
  • Boris Katz
  • Gregory Marton
  • Jimmy Lin
  • Stefanie Tellex

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Air Force Research Laboratories
  • Algorithms
  • Artificial Intelligence
  • Biographies
  • Databases
  • Engineering
  • Information Retrieval
  • Information Systems
  • Language
  • Linguistics
  • Natural Language Processing
  • Natural Language Understanding
  • Natural Languages
  • New York
  • Test And Evaluation
  • Websites
  • World Wide Web

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Distributed Systems and Data Platform Development

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval