Graphics Processing Unit (GPU) Performance on an N-Body Problem
Abstract
This study evaluates the performance of clusters of Nvidia graphics processing units (GPUs) on an N-body problem derived from the computation of vector potentials. Two clusters are used. The first is a 2-node Intel Xeon system with a single Tesla S870 system cross-connected to each node. The second is a 20-node Opteron system with one Quadro FX 5600 GPU per node. The results show a significant increase in performance when GPUs accelerate the computation. With 16 GPUs and a sufficiently large problem, an estimated 3 teraflops is achieved.
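For readers unfamiliar with this class of problem, the following is a minimal sketch of a direct-sum vector potential calculation. The abstract does not state the report's actual formulation, so everything here is an assumption: the textbook magnetostatic form A(r_i) = (mu0 / 4 pi) * sum_j (I_j dl_j) / |r_i - r_j|, and the hypothetical names `vector_potential`, `targets`, `sources`, and `current_elements`. It is written in plain Python for readability and is not the report's GPU code.

```python
import math

# Magnetic constant divided by 4*pi, in T*m/A (textbook value, assumed here).
MU0_OVER_4PI = 1e-7


def vector_potential(targets, sources, current_elements):
    """Direct-sum vector potential at each target point.

    targets          : list of (x, y, z) evaluation points
    sources          : list of (x, y, z) source positions
    current_elements : list of (Ix, Iy, Iz) current elements I*dl, one per source

    Cost is O(len(targets) * len(sources)), the all-pairs pattern typical
    of N-body problems. Names and formulation are illustrative assumptions.
    """
    result = []
    for tx, ty, tz in targets:
        ax = ay = az = 0.0
        for (sx, sy, sz), (cx, cy, cz) in zip(sources, current_elements):
            dist = math.sqrt((tx - sx) ** 2 + (ty - sy) ** 2 + (tz - sz) ** 2)
            if dist == 0.0:
                continue  # skip a source that coincides with the target
            inv = 1.0 / dist
            ax += cx * inv
            ay += cy * inv
            az += cz * inv
        result.append((MU0_OVER_4PI * ax, MU0_OVER_4PI * ay, MU0_OVER_4PI * az))
    return result


if __name__ == "__main__":
    # One current element along z at the origin, evaluated at two points.
    pts = [(1.0, 0.0, 0.0), (0.0, 0.0, 2.0)]
    print(vector_potential(pts, [(0.0, 0.0, 0.0)], [(0.0, 0.0, 1.0)]))
```

Each target's sum is independent of the others, which is why all-pairs computations of this kind are commonly mapped onto GPUs with one thread per target point.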
Document Details
- Document Type: Technical Report
- Publication Date: Aug 01, 2009
- Accession Number: ADA512706
Entities
People
- Pat Collins
Organizations
- Lockheed Martin