Automatic Web Searching and Categorizing Using Query Expansion and Focusing

Abstract

We are in the process of build a prototype system that improves precision and recall rates for web search using query expansion and focusing techniques. We use linguistic analysis and co-occurrence information to analyze syntactic structures of the users' queries to improve search results. One standard method of improving internet search is through query expansion. The major query expansion techniques add terms using (i) lexical semantic relations and (ii) relevance feed back. The lexical semantic relations in WordNet have been used widely as a main lexical resource for approach (i). Past research results indicate that using WordNet did not significantly improve information retrieval effectiveness. Our query expansion system also uses WordNet in a query expansion stage. However, instead of just adding all related terms from WordNet (synonyms, hypernyms, hyponyms, etc.) directly into user's queries, our system selects only useful additional terms. This selection process uses syntactic analysis combined with collocation and co-occurrence information from a large corpus collected from our domain of interest (IT).

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 10, 2003
Accession Number
ADA409512

Entities

People

  • Sumali Conlon

Organizations

  • University of Mississippi

Tags

Communities of Interest

  • Materials and Manufacturing Processes
  • Space

DTIC Thesaurus Topics

  • Abstracts
  • Automated Text Summarization
  • Automatic
  • Business Administration
  • Computers
  • Computing Devices
  • Decision Support Systems
  • Information Processing
  • Information Retrieval
  • Information Systems
  • Internet
  • Language
  • Natural Language Processing
  • Natural Languages
  • Networks
  • Standards

Fields of Study

  • Computer science

Readers

  • Computational Linguistics

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Machine Translation