Multi-Conditional Learning: Generative/Discriminative Training for Clustering and Classification

Abstract

This paper multi-conditional learning (MCL) a training criterion based on a product of multiple conditional likelihoods. When combining the traditional conditional probability of "label given input" with a generative probability of "input given label" the later acts as a surprisingly effective regularizer. When applied to models with latent variables, MCL combines the structure-discovery capabilities of generative topic models, such as latent Dirichlet allocation and the exponential family harmonium, with the accuracy and robustness of discriminative classifiers, such as logistic regression and conditional random fields. We present results on several standard text data sets showing significant reductions in classification error due to MCL regularization, and substantial gains in precision and recall due to the latent structure discovered under MCL.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2006
Accession Number
ADA449621

Entities

People

  • Andrew McCallum
  • Chris Pal
  • Greg Druck
  • Xuerui Wang

Organizations

  • University of Massachusetts Amherst

Tags

Communities of Interest

  • Biomedical

DTIC Thesaurus Topics

  • Artificial Intelligence Software
  • Bayesian Networks
  • Classification
  • Computational Science
  • Computer Languages
  • Computer Science
  • Data Sets
  • Information Retrieval
  • Information Science
  • Learning
  • Monte Carlo Method
  • Network Science
  • Neural Networks
  • Probabilistic Models
  • Probability
  • Random Variables
  • Training

Fields of Study

  • Computer science

Readers

  • Speech Processing/Speech Recognition.
  • Statistical inference.
  • ballistics.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Neural Networks