University of Waterloo at TREC 2008 Blog Track

Abstract

This paper reports the University of Waterloo's participation in the opinion and polarity tasks of the Blog track. The proposed method uses a lexicon built from several linguistic resources. The opinion-discriminating ability of each subjective lexical unit was estimated using the Kullback-Leibler divergence (KLD). Document scores were calculated from the KLD scores of subjective words occurring within fixed-size windows around instances of query terms. The system also identified phrases in topic titles by matching them to Wikipedia titles. The results show that both the KLD-based scores of subjective lexical units and the Wikipedia-matched phrases improve opinion retrieval performance.
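
The window-based scoring described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulas, so everything specific here is an assumption: the per-term score is taken to be the term's pointwise contribution to the KLD between an opinionated document set and the whole collection, p(w|O) * log(p(w|O) / p(w|C)); the window size of 5 tokens is arbitrary; and the function names `kld_scores` and `window_score` are hypothetical. It is not the report's implementation.

```python
import math
from collections import Counter


def kld_scores(opinion_docs, all_docs, lexicon):
    """Assumed per-term score: p(w|O) * log(p(w|O) / p(w|C)).

    opinion_docs and all_docs are lists of token lists; lexicon is a set of
    subjective words. Terms unseen in either set are skipped (no smoothing).
    """
    op_counts = Counter(t for d in opinion_docs for t in d)
    all_counts = Counter(t for d in all_docs for t in d)
    op_total = sum(op_counts.values())
    all_total = sum(all_counts.values())
    scores = {}
    for w in lexicon:
        if op_counts[w] == 0 or all_counts[w] == 0:
            continue
        p_op = op_counts[w] / op_total
        p_all = all_counts[w] / all_total
        scores[w] = p_op * math.log(p_op / p_all)
    return scores


def window_score(doc_tokens, query_terms, scores, window=5):
    """Sum the scores of subjective words within `window` tokens of any
    query-term occurrence. Each token position is counted at most once,
    even when windows overlap (an assumption of this sketch)."""
    counted = set()
    for i, tok in enumerate(doc_tokens):
        if tok in query_terms:
            lo = max(0, i - window)
            hi = min(len(doc_tokens), i + window + 1)
            counted.update(j for j in range(lo, hi) if doc_tokens[j] in scores)
    return sum(scores[doc_tokens[j]] for j in counted)


# Toy data, invented for illustration only.
opinion_docs = [["great", "phone"], ["awful", "phone"]]
all_docs = opinion_docs + [["phone", "spec", "great", "spec"]]
scores = kld_scores(opinion_docs, all_docs, {"great", "awful", "terrible"})
doc = ["the", "phone", "is", "awful", "but", "x", "x", "x", "x", "x", "awful"]
near_query = window_score(doc, {"phone"}, scores, window=2)
```

In this toy run only the first "awful" falls inside the two-token window around "phone", so the second occurrence does not contribute to the document score.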

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2008
Accession Number
ADA512717

Entities

People

  • Olga Vechtomova

Organizations

  • University of Waterloo

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Business Intelligence
  • Engineering
  • European Union
  • Frequency
  • Information Operations
  • Information Retrieval
  • Information Theory
  • Judgment
  • Language
  • Natural Languages
  • Online Communications
  • Polarity
  • Probability
  • Probability Distributions
  • Standards
  • Universities

Readers

  • Computational Linguistics
  • Information Retrieval
  • Regression Analysis