Efficient Reuse Distance Analysis of Multicore Scaling for Loop-Based Parallel Programs

Abstract

Reuse Distance (RD) analysis is a powerful memory analysis tool that can potentially help architects study multicore processor scaling. One key obstacle, however, is that multicore RD analysis requires measuring Concurrent Reuse Distance (CRD) and Private-LRU-stack Reuse Distance (PRD) profiles across thread-interleaved memory reference streams. Sensitivity to memory interleaving makes CRD and PRD profiles architecture dependent, preventing them from analyzing different processor configurations. For loop-based parallel programs, CRD and PRD profiles shift coherently across RD values with core count scaling because interleaving threads are symmetric. Simple techniques can predict such shifting, making the analysis of numerous multicore configurations from a small set of CRD and PRD profiles feasible. Given the ubiquity of parallel loops, such techniques will be extremely valuable for studying future large multicore designs.

Document Details

Document Type
Pub Defense Publication
Publication Date
Feb 01, 2013
Source ID
10.1145/2427631.2427632

Entities

People

  • Donald Yeung
  • Meng-ju Wu

Organizations

  • Defense Advanced Research Projects Agency
  • Division of Computing and Communication Foundations
  • University of Maryland

Tags

Fields of Study

  • Computer science

Readers

  • Approximation Theory.
  • Parallel and Distributed Computing.
  • Systems Analysis and Design