Evaluating Question-Answering Techniques in Chinese

Abstract

An important first step in developing a cross-lingual question answering system is to understand whether techniques developed with English text will also work with other languages, such as Chinese. The Marsha Chinese question answering system described in this paper uses techniques similar to those used in the English systems developed for TREC. Marsha consists of three main components: the query processing module, the Hanquery search engine, and the answer extraction module. It also contains some specific techniques dealing with Chinese language characteristics, such as word segmentation and ordinals processing. Evaluation of the system is done using a method based on the TREC question-answering track. The results of the evaluation show that the performance of Marsha is comparable to some English question answering systems in TREC 8 track. An English language version of Marsha further indicates that the heuristics used are applicable to the English question answering task.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2001
Accession Number
ADA458581

Entities

People

  • W. Bruce Croft
  • Xiaoyan Li

Organizations

  • University of Massachusetts Amherst

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Applied Computer Science
  • Artificial Intelligence
  • Chinese Language
  • Computer Languages
  • Computer Science
  • Computer Vision
  • English Language
  • Extraction
  • Information Retrieval
  • Language
  • Natural Language Processing
  • Natural Languages
  • Probabilistic Models
  • Recognition
  • Test And Evaluation
  • United States

Readers

  • Computational Linguistics
  • Information Retrieval