Evaluation of Task Assignment Policies for Supercomputing Servers: The Case for Load Unbalancing and Fairness

Abstract

While the MPP is still the most common architecture in supercomputer centers today, a simpler and cheaper machine configuration is growing increasingly common. This alternative setup may be described simply as a collection of multiprocessors or a distributed server system. This collection of multiprocessors is fed by a single common stream of jobs, where each job is dispatched to exactly one of the multiprocessor machines for processing. The biggest question which arises in such distributed server systems is what is a good policy for assigning jobs to host machines. Many task assignment policies have been proposed, but not systematically evaluated under supercomputing workloads. In this paper we start by comparing existing task assignment policies using a trace-driven simulation under supercomputing workloads. We use analysis to validate our results and to provide intuition. We find that while the performance of supercomputing servers varies widely with the task assignment policy, none of the above policies perform as well as we would like. We observe that all task assignment policies proposed thus far aim to balance load among the hosts. We propose a policy which purposely unbalances load among the hosts, yet, counter-to-intuition, is also fair in that it achieves the same expected slowdown for all jobs - thus no jobs are biased against. We evaluate this policy again using both trace-driven simulation and analysis. We find that the performance of the load unbalancing policy is significantly better than the best of those policies which balance load.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 2000
Accession Number
ADA377091

Entities

People

  • Bianca Schroeder
  • Mor Harchol-balter

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Computer Science
  • Computers
  • Distributed Computing
  • Dynamic Loads
  • Intervals
  • Measurement
  • Multiprocessors
  • Operating Systems
  • Probability
  • Scheduling (Production)
  • Simulations
  • Simulators
  • Supercomputers
  • Test And Evaluation
  • Workload
  • Workshops

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Economics
  • Occupational Health and Safety.