Auto-tuning the Matrix Powers Kernel with SEJITS

Abstract

The matrix powers kernel, used in communication-avoiding Krylov subspace methods, requires runtime auto-tuning for best performance. We demonstrate how the SEJITS (Selective Embedded Just-In- Time Specialization) approach can be used to deliver a high-performance and performance-portable implementation of the matrix powers kernel to application authors, while separating their high-level concerns from those of auto-tuner implementers involving low-level optimizations. The benefits of delivering this kernel in the form of a specializer, rather than a traditional library, are discussed. Performance of the matrix powers kernel specializer is evaluated in the context of a communication-avoiding conjugate gradient (CA-CG) solver, which compares favorably to traditional CG.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: May 11, 2012
Accession Number: ADA561678

Entities

People

Jeffrey Morlan

Organizations

University of California, Berkeley

Auto-tuning the Matrix Powers Kernel with SEJITS

Abstract

Document Details

Entities

People

Organizations

Tags

DTIC Thesaurus Topics

Readers