UTDallas at TREC 2008 Blog Track

Abstract

This paper describes our participation in the TREC 2008 Blog track. Our system consists of three components: data preprocessing, topic retrieval, and opinion finding. For the topic retrieval task, we applied the Lemur IR toolkit and used various query expansion techniques. For the opinion finding and polarization task, we employed a feature-based classification approach. Re-ranking was then performed using a linear combination of the opinion score and the topic relevance score. Our system achieved reasonable performance in this evaluation.
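
The re-ranking step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the interpolation weight, the score ranges, or any normalization, so the `weight` parameter, the assumption that both scores lie in [0, 1], and the names `ScoredPost` and `rerank` are all hypothetical and chosen only for illustration.

```python
from typing import NamedTuple


class ScoredPost(NamedTuple):
    doc_id: str
    relevance: float  # topic relevance score (assumed to be scaled to [0, 1])
    opinion: float    # opinion classifier score (assumed to be scaled to [0, 1])


def rerank(posts, weight=0.5):
    """Sort posts by weight * opinion + (1 - weight) * relevance, highest first.

    `weight` is a hypothetical interpolation parameter; the abstract does not
    state the value used in the actual system.
    """
    def combined(post):
        return weight * post.opinion + (1.0 - weight) * post.relevance

    return sorted(posts, key=combined, reverse=True)


posts = [
    ScoredPost("a", relevance=0.9, opinion=0.1),
    ScoredPost("b", relevance=0.6, opinion=0.8),
    ScoredPost("c", relevance=0.3, opinion=0.2),
]
ranked = [p.doc_id for p in rerank(posts, weight=0.5)]
print(ranked)
```

With `weight=0.0` the ordering falls back to topic relevance alone, and with `weight=1.0` it depends only on the opinion score, so a single parameter controls how strongly opinion evidence reshuffles the retrieved posts.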

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2008
Accession Number
ADA512725

Entities

People

  • Bin Li
  • Feifan Liu
  • Yang Liu

Organizations

  • University of Texas at Dallas

Tags

Communities of Interest

  • Autonomy

DTIC Thesaurus Topics

  • Abstracts
  • Classification
  • Computer Science
  • Data Processing
  • Feature Extraction
  • Filters
  • Filtration
  • Information Operations
  • Information Retrieval
  • Information Science
  • Language
  • Machine Learning
  • Online Communications
  • Polarity
  • Preprocessing
  • Probability
  • Statistics

Fields of Study

  • Computer science

Readers

  • Computer Vision
  • Information Retrieval