Memoized Online Variational Inference for Dirichlet Process Mixture Models

Abstract

Variational inference algorithms provide the most effective framework for largescale training of Bayesian nonparametric models. Stochastic online approaches are promising, but are sensitive to the chosen learning rate and often converge to poor local optima. We present a new algorithm, memorized online variational inference, which scales to very large (yet finite) datasets while avoiding the complexities of stochastic gradient. Our algorithm maintains finite-dimensional sufficient statistics from batches of the full dataset, requiring some additional memory but still scaling to millions of examples. Exploiting nested families of variational bounds for infinite nonparametric models, we develop principled birth and merge moves allowing non-local optimization. Births adaptively add components to the model to escape local optima, while merges remove redundancy and improve speed. Using Dirichlet process mixture models for image clustering and denoising, we demonstrate major improvements in robustness and accuracy.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 27, 2014
Accession Number
ADA621678

Entities

People

  • Erik B. Sudderth
  • Michael C. Hughes

Organizations

  • Brown University

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Clustering
  • Computational Complexity
  • Computer Science
  • Computing-Related Activities
  • Covariance
  • Data Science
  • Information Science
  • Iterations
  • Learning
  • Models
  • Monte Carlo Method
  • Optimization
  • Sensitivity
  • Statistics
  • Training
  • Truncation

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Finite Element Method (FEM) for solving Partial Differential Equations (PDEs)
  • Mathematical Modeling and Probability Theory.

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Neural Networks