Minimax-Optimal Rates for Sparse Additive Models over Kernel Classes via Convex Programming

Abstract

Sparse additive models are families of d-variate functions that have the additive decomposition f* = Sum, j included in S, f(j)* , where S is a unknown subset of cardinality s much less than d. We consider the case where each component function f(j)* lies in a reproducing kernel Hilbert space, and analyze a simple kernel-based convex program for estimating the unknown function f*. Working within a high-dimensional framework that allows both the dimension d and sparsity s to scale, we derive convergence rates in the L2(P) and L2(Pn) norms. These rates consist of two terms: a subset selection term of the order (s log d)/n , corresponding to the difficulty of finding the unknown s-sized subset, and an estimation error term of the order sV(n)(sup 2), where V(n)(sup 2) is the optimal rate for estimating an univariate function within the RKHS. We complement these achievable results by deriving minimax lower bounds on the L2(P) error, thereby showing that our method is optimal up to constant factors for sub-linear sparsity s = o(d). Thus, we obtain optimal minimax rates for many interesting classes of sparse additive models, including polynomials, splines, nite-rank kernel classes, as well as Sobolev smoothness classes.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 01, 2010
Accession Number
ADA528415

Entities

People

  • Bin Yu
  • Garvesh Raskutti
  • Martin J. Wainwright

Organizations

  • University of California, Berkeley

Tags

Communities of Interest

  • C4I

DTIC Thesaurus Topics

  • Additives (Chemicals)
  • Algorithms
  • Computer Programming
  • Construction
  • Convergence
  • Convex Programming
  • Decomposition
  • Eigenvalues
  • Equations
  • Errors
  • Estimators
  • Hilbert Space
  • Inequalities
  • Kernel Functions
  • Polynomials
  • Probability
  • Random Variables

Fields of Study

  • Computer science

Readers

  • Linear Algebra
  • Operations Research
  • Statistical inference.

Technology Areas

  • Space