Sparsity and Smoothness Via the Fused Lasso

Abstract

The lasso penalizes a least squares regression by the sum of the absolute values (L1-norm) of the coefficients. The form of this penalty encourages sparse solutions (with many coefficients equal to 0). We propose the ‘fused lasso’, a generalization that is designed for problems with features that can be ordered in some meaningful way. The fused lasso penalizes the L1-norm of both the coefficients and their successive differences. Thus it encourages sparsity of the coefficients and also sparsity of their differences—i.e. local constancy of the coefficient profile. The fused lasso is especially useful when the number of features p is much greater than N, the sample size. The technique is also extended to the ‘hinge’ loss function that underlies the support vector classifier. We illustrate the methods on examples from protein mass spectroscopy and gene expression data.

Document Details

Document Type
Pub Defense Publication
Publication Date
Dec 13, 2004
Source ID
10.1111/j.1467-9868.2005.00490.x

Entities

People

  • Ji Zhu
  • Keith Knight
  • Michael Saunders
  • Robert Tibshirani
  • Saharon Rosset

Organizations

  • International Business Machines Corporation (Armonk, NY)
  • National Institutes of Health
  • National Science Foundation
  • Office of Naval Research
  • Stanford University
  • University of Michigan
  • University of Toronto

Tags

Fields of Study

  • Computer science

Readers

  • Computer Vision.
  • Operations Research
  • Regression Analysis.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Learning Algorithms