UGent Participation in the Microblog Track 2012

Abstract

In this paper, we describe the search system developed at Ghent University for the TREC 2012 Microblog Track, which ranks Twitter messages ("tweets") from a fixed corpus in response to a set of search requests. Our system ranks the tweets using a logistic regression classifier trained on data from the Microblog Track 2011. The features used to train the classifier include local tweet features as well as query expansion and tweet expansion features based on external Web data, which appear to significantly improve results.
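
The ranking approach summarized in the abstract can be illustrated with a minimal sketch. This record does not give the report's actual feature definitions, training procedure, or data, so everything below is an assumption for illustration only: the three feature columns (a local match score, a query expansion score, a tweet expansion score), the toy values, and the use of plain batch gradient descent to fit the logistic regression model. Candidate tweets are then sorted by predicted relevance probability.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def score(weights, bias, x):
    # Predicted probability that a tweet with feature vector x is relevant.
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def train_logistic_regression(features, labels, lr=0.5, epochs=500):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n = len(features)
    weights = [0.0] * len(features[0])
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, y in zip(features, labels):
            err = score(weights, bias, x) - y
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def rank_tweets(candidates, weights, bias):
    """Sort (tweet_id, feature_vector) pairs by predicted relevance, best first."""
    scored = [(tid, score(weights, bias, x)) for tid, x in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical training rows (stand-ins for judged query-tweet pairs):
# [local_match, query_expansion_match, tweet_expansion_match]
train_X = [[0.9, 0.8, 0.7], [0.8, 0.6, 0.9], [0.1, 0.2, 0.0], [0.2, 0.0, 0.1]]
train_y = [1, 1, 0, 0]  # 1 = judged relevant, 0 = judged non-relevant

weights, bias = train_logistic_regression(train_X, train_y)

# Hypothetical candidate tweets for a new query.
candidates = [
    ("t1", [0.1, 0.1, 0.2]),
    ("t2", [0.9, 0.7, 0.8]),
    ("t3", [0.5, 0.4, 0.3]),
]
ranking = rank_tweets(candidates, weights, bias)
for tweet_id, probability in ranking:
    print(tweet_id, round(probability, 3))
```

In this sketch the classifier's output probability doubles as the ranking score, so no separate retrieval function is needed once the features are computed. A real system would replace the toy rows with features extracted from judged query-tweet pairs.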

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2012
Accession Number
ADA581312

Entities

People

  • Chris Develder
  • Joannes Deleu
  • Piet Demeester
  • Thomas Demeester
  • Thong H. Duc

Organizations

  • Ghent University

Tags

Communities of Interest

  • Autonomy
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Applied Computer Science
  • Artificial Intelligence Computing
  • Classification
  • Data Sets
  • Information Systems
  • Judgment
  • Language
  • Machine Learning
  • Neural Networks
  • Online Communications
  • Social Media
  • Standards
  • Training
  • Universities
  • Unsupervised Machine Learning
  • Vocabulary

Fields of Study

  • Computer science

Readers

  • Information Retrieval
  • Neural Network Machine Learning