A survey of pipelined workflow scheduling

Abstract

A large class of applications need to execute the same workflow on different datasets of identical size. Efficient execution of such applications necessitates intelligent distribution of the application components and tasks on a parallel machine, and the execution can be orchestrated by utilizing task, data, pipelined, and/or replicated parallelism. The scheduling problem that encompasses all of these techniques is called pipelined workflow scheduling , and it has been widely studied in the last decade. Multiple models and algorithms have flourished to tackle various programming paradigms, constraints, machine behaviors, or optimization goals. This article surveys the field by summing up and structuring known results and approaches.

Document Details

Document Type
Pub Defense Publication
Publication Date
Aug 01, 2013
Source ID
10.1145/2501654.2501664

Entities

People

  • Anne Benoit
  • Erik Saule
  • Umit Catalyurek
  • Yves Robert

Organizations

  • Agence Nationale de la Recherche
  • Air Force Research Laboratory
  • Division of Computer and Network Systems
  • Ohio State University
  • United States Department of Energy
  • École Normale Supérieure de Lyon

Tags

Fields of Study

  • Computer science
  • Engineering

Readers

  • Distributed Systems and Data Platform Development
  • Parallel and Distributed Computing.
  • Theoretical Analysis.