Minimax-Optimal Rates for Sparse Additive Models over Kernel Classes via Convex Programming
Abstract
Sparse additive models are families of d-variate functions that have the additive decomposition f* = Sum, j included in S, f(j)* , where S is a unknown subset of cardinality s much less than d. We consider the case where each component function f(j)* lies in a reproducing kernel Hilbert space, and analyze a simple kernel-based convex program for estimating the unknown function f*. Working within a high-dimensional framework that allows both the dimension d and sparsity s to scale, we derive convergence rates in the L2(P) and L2(Pn) norms. These rates consist of two terms: a subset selection term of the order (s log d)/n , corresponding to the difficulty of finding the unknown s-sized subset, and an estimation error term of the order sV(n)(sup 2), where V(n)(sup 2) is the optimal rate for estimating an univariate function within the RKHS. We complement these achievable results by deriving minimax lower bounds on the L2(P) error, thereby showing that our method is optimal up to constant factors for sub-linear sparsity s = o(d). Thus, we obtain optimal minimax rates for many interesting classes of sparse additive models, including polynomials, splines, nite-rank kernel classes, as well as Sobolev smoothness classes.
Document Details
- Document Type
- Technical Report
- Publication Date
- Aug 01, 2010
- Accession Number
- ADA528415
Entities
People
- Bin Yu
- Garvesh Raskutti
- Martin J. Wainwright
Organizations
- University of California, Berkeley