Hybrid MPI-OpenMP versus MPI Implementations: A Case Study

Abstract

In this paper we explore the performance of a hybrid, or mixed-mode (MPI-OpenMP), parallel C++ implementation versus a direct MPI implementation. This case-study provides sufficient amount of detail so it can be used as a point of departure for further research or as an educational resource for additional code development regarding the study of mixed-mode versus direct MPI implementations. The hardware test-bed was a 64-processor cluster featuring 16 multi-core nodes with four cores per node. The algorithm being benchmarked is a parallel cyclic convolution algorithm with no inter-node communication that tightly matches our particular cluster architecture. In this particular case-study a time-domain-based cyclic convolution algorithm was used in each parallel subsection. Time domain-based implementations are slower than frequency domain-based implementations, but give the exact integer result when performing very large, purely integer, cyclic convolution.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 25, 2014
Accession Number
ADA624980

Entities

People

  • Felix Nevarez
  • Marvi Teixeira
  • Osvaldo Mangual
  • Reynaldo Lopez

Organizations

  • Polytechnic University of Puerto Rico

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Case Studies
  • Convolution
  • Equations
  • Frequency Domain
  • Jet Propulsion
  • Literature
  • Numbers
  • Operating Systems
  • Parallel Computing
  • Parallel Processing
  • Parallel Processors
  • Permutations
  • Puerto Rico
  • Scalability
  • Time Domain
  • Universities

Fields of Study

  • Computer science

Readers

  • Parallel and Distributed Computing.