Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision Accelerators

Abstract

The emergence of deep learning as a leading computational workload for machine learning tasks on large-scale cloud infrastructure installations has led to plethora of accelerator hardware releases. However, the reduced precision and range of the floating-point numbers on these new platforms makes it a non-trivial task to leverage these unprecedented advances in computational power for numerical linear algebra operations that come with a guarantee of robust error bounds. In order to address these concerns, we present a number of strategies that can be used to increase the accuracy of limited-precision iterative refinement. By limited precision, we mean 16-bit floating-point formats implemented in modern hardware accelerators and are not necessarily compliant with the IEEE half-precision specification. We include the explanation of a broader context and connections to established IEEE floating-point standards and existing high-performance computing (HPC) benchmarks. We also present a new formulation of LU factorization that we call signed square root LU which produces more numerically balanced Land U factors which directly address the problems of limited range of the low-precision storage formats. The experimental results indicate that it is possible to recover substantial amounts of the accuracy in the system solution that would otherwise be lost. Previously, this could only be achieved by using iterative refinement based on single-precision floating-point arithmetic. The discussion will also explore the numerical stability issues that are important for robust linear solvers on these new hardware platforms.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2019
Accession Number
AD1112737

Entities

People

  • Ichitaro Yamazaki
  • Jack Dongarra
  • Piotr Luszczek

Organizations

  • University of Tennessee system

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Arithmetic
  • Artificial Intelligence
  • Artificial Intelligence Software
  • Computations
  • Computer Programming
  • Computers
  • Deep Learning
  • Floating Point Operations
  • High Performance Computing
  • Linear Algebra
  • Linear Systems
  • Machine Learning
  • Neural Networks
  • Numbers
  • Parallel Computing
  • Parallel Processing
  • Precision
  • Square Roots
  • Standards
  • United States

Readers

  • Computational Fluid Dynamics (CFD)
  • Computer Programming and Software Development.
  • Distributed Systems and Data Platform Development

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Neural Networks