Online Dictionary Learning for Sparse Coding

Abstract

Sparse coding-that is, modelling data vectors as sparse linear combinations of basis elements-is widely used in machine learning, neuroscience, signal processing, and statistics. This paper focuses on learning the basis set, also called dictionary, to adapt it to specific data, an approach that has recently proven to be very effective for signal reconstruction and classification in the audio and image processing domains. This paper proposes a new online optimization algorithm for dictionary learning, based on stochastic approximations, which scales up gracefully to large datasets with millions of training samples. A proof of convergence is presented, along with experiments with natural images demonstrating that it leads to faster performance and better dictionaries than classical batch algorithms for both small and large datasets.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 2009
Accession Number
ADA513243

Entities

People

  • Francis Bach
  • Guillermo Sapiro
  • Jean Ponce
  • Julien Mairal

Organizations

  • University of Minnesota

Tags

Communities of Interest

  • Energy and Power Technologies
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Computations
  • Computer Programming
  • Convex Sets
  • Dictionaries
  • Image Processing
  • Image Restoration
  • Learning
  • Machine Learning
  • Optimization
  • Perturbation Theory
  • Probability
  • Probability Distributions
  • Random Variables
  • Signal Processing
  • Statistics
  • Test Sets

Fields of Study

  • Computer science

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Computational Linguistics
  • Computer Vision.

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms