Column Subset Selection, Matrix Factorization, and Eigenvalue Optimization

Abstract

Given a fixed matrix, the problem of column subset selection requests a column submatrix that has favorable spectral properties. Most research from the algorithms and numerical linear algebra communities focuses on a variant called rank-revealing QR, which seeks a well-conditioned collection of columns that spans the (numerical) range of the matrix. The functional analysis literature contains another strand of work on column selection whose algorithmic implications have not been explored. In particular, a celebrated result of Bourgain and Tzafriri demonstrates that each matrix with normalized columns contains a large column submatrix that is exceptionally well conditioned. Unfortunately, standard proofs of this result cannot be regarded as algorithmic. This paper presents a randomized, polynomial-time algorithm that produces the submatrix promised by Bourgain and Tzafriri. The method involves random sampling of columns, followed by a matrix factorization that exposes the well-conditioned subset of columns. This factorization, which is due to Grothendieck, is regarded as a central tool in modern functional analysis. The primary novelty in this work is an algorithm, based on eigenvalue minimization, for constructing the Grothendieck factorization. These ideas also result in a novel approximation algorithm for the (infinity, 1) norm of a matrix, which is generally NP-hard to compute exactly. As an added bonus, this work reveals a surprising connection between matrix factorization and the famous MAXCUT semidefinite program.
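To make the (infinity, 1) norm mentioned in the abstract concrete, here is a minimal sketch. It assumes the standard definition: the maximum of the l1 norm of Ax over all x with entries bounded by 1 in absolute value. Because that maximum is attained at a vector of signs, a brute-force search over all sign vectors gives the exact value. This is not the report's approximation algorithm. It is an exponential-time baseline that illustrates why exact computation becomes impractical as the number of columns grows. The function name `inf_to_one_norm_bruteforce` is hypothetical.

```python
import itertools

import numpy as np


def inf_to_one_norm_bruteforce(A):
    """Exact (infinity, 1) norm of A by enumerating every sign vector.

    Illustrative baseline only (an assumption, not the report's method):
    the cost is 2**n matrix-vector products for a matrix with n columns.
    """
    A = np.asarray(A, dtype=float)
    n = A.shape[1]
    best = 0.0
    # A convex function on the cube [-1, 1]^n is maximized at a vertex,
    # so it suffices to try the 2**n vectors with entries +1 or -1.
    for signs in itertools.product((-1.0, 1.0), repeat=n):
        value = np.abs(A @ np.array(signs)).sum()  # l1 norm of A x
        best = max(best, value)
    return float(best)


if __name__ == "__main__":
    A = np.array([[1.0, 2.0], [3.0, -4.0]])
    print(inf_to_one_norm_bruteforce(A))
```

The enumeration visits 2^n sign vectors, so it is only usable for a handful of columns. That limitation is the motivation for polynomial-time approximation algorithms of the kind the abstract describes.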

Document Details

Document Type
Technical Report
Publication Date
Jul 01, 2008
Accession Number
ADA633832

Entities

People

  • Joel A. Tropp

Organizations

  • California Institute of Technology

Tags

DTIC Thesaurus Topics

  • Algebra
  • Algorithms
  • Banach Space
  • Computational Science
  • Computer Science
  • Eigenvalues
  • Eigenvectors
  • Functional Analysis
  • Harmonic Analysis
  • Linear Algebra
  • Mathematics
  • Optimization
  • Polynomials
  • Probability
  • Random Variables
  • Sampling
  • Standards

Fields of Study

  • Computer science

Readers

  • Linear Algebra
  • Systems Analysis and Design