Diffusion Kernels on Statistical Manifolds

Abstract

A family of kernels for statistical learning is introduced that exploits the geometric structure of statistical models. The kernels are based on the heat equation on the Riemannian manifold defined by the Fisher information metric associated with a statistical family, and generalize the Gaussian kernel of Euclidean space. As an important special case, kernels based on the geometry of multinomial families are derived, leading to kernel-based learning algorithms that apply naturally to discrete data. Bounds on covering numbers and Rademacher averages for the kernels are proved using bounds on the eigenvalues of the Laplacian on Riemannian manifolds. Experimental results are presented form document classification, for which the use of multinomial geometry is natural and well motivated, and improvements are obtained over the standard use of Gaussian or linear kernels, which have been the standard for text classification.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 16, 2004
Accession Number
ADA461122

Entities

People

  • Guy Lebanon
  • John D. Lafferty

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Autonomy
  • C4I

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence
  • Data Mining
  • Differential Equations
  • Differential Geometry
  • Equations
  • Geometric Forms
  • Geometry
  • Information Processing
  • Information Retrieval
  • Information Science
  • Machine Learning
  • Network Science
  • Probability
  • Statistical Analysis
  • Supervised Machine Learning
  • Two Dimensional

Fields of Study

  • Computer science

Readers

  • Computer Vision.
  • Linear Algebra
  • Regression Analysis.

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms
  • Space