Entropy Based Classifier Combination for Sentence Segmentation

Abstract

We describe recent extensions to our previous work, where we explored the use of individual classifiers, namely, boosting and maximum entropy models for sentence segmentation. In this paper we extend the set of classification methods with support vector machine (SVM). We propose a new dynamic entropy-based classifier combination approach to combine these classifiers, and compare it with the traditional classifier combination techniques, namely, voting, linear regression and logistic regression. Furthermore, we also investigate the combination of hidden event language models with the output of the proposed classifier combination, and the output of individual classifiers. Experimental studies conducted on the Mandarin TDT4 broadcast news database shows that the SVM classifier as an individual classifier improves over our previous best system. However, the proposed entropy-based classifier combination approach shows the best improvement in F-Measure of 1% absolute, and the voting approach shows the best reduction in NIST error rate of 2.7% absolute when compared to the previous best system.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2007
Accession Number
ADA463040

Entities

People

  • D. Hakkani-tur
  • E. Shriberg
  • Jason M. Fung
  • M. Magimai-doss
  • N. Mirghafori
  • O. Cetin

Organizations

  • International Computer Science Institute

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies
  • Human Systems

DTIC Thesaurus Topics

  • Artificial Intelligence Software
  • Automated Speech Recognition
  • Boundaries
  • Classification
  • Computational Science
  • Computer Languages
  • Computer Science
  • Data Sets
  • Hidden Markov Models
  • Language
  • Machine Learning
  • Machine Translation
  • Markov Models
  • Natural Language Processing
  • Probability
  • Recognition
  • Test Sets

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Computational Modeling and Simulation

Technology Areas

  • AI & ML