Efficient Reuse Distance Analysis of Multicore Scaling for Loop-Based Parallel Programs
Abstract
Reuse Distance (RD) analysis is a powerful memory analysis tool that can potentially help architects study multicore processor scaling. One key obstacle, however, is that multicore RD analysis requires measuring Concurrent Reuse Distance (CRD) and Private-LRU-stack Reuse Distance (PRD) profiles across thread-interleaved memory reference streams. Sensitivity to memory interleaving makes CRD and PRD profiles architecture dependent, preventing them from analyzing different processor configurations. For loop-based parallel programs, CRD and PRD profiles shift coherently across RD values with core count scaling because interleaving threads are symmetric. Simple techniques can predict such shifting, making the analysis of numerous multicore configurations from a small set of CRD and PRD profiles feasible. Given the ubiquity of parallel loops, such techniques will be extremely valuable for studying future large multicore designs.
Document Details
- Document Type
- Pub Defense Publication
- Publication Date
- Feb 01, 2013
- Source ID
- 10.1145/2427631.2427632
Entities
People
- Donald Yeung
- Meng-ju Wu
Organizations
- Defense Advanced Research Projects Agency
- Division of Computing and Communication Foundations
- University of Maryland