The Statistical Properties of Host Load (Extended Version)

Abstract

Understanding how host load changes over time is instrumental in predicting the execution time of tasks or jobs, such as in dynamic load balancing and distributed soft real-time systems. To improve this understanding, we collected week-long, 1 Hz resolution traces of the Digital Unix 5 second exponential load average on over 35 different machines including production and research cluster machines, compute servers, and desktop workstations. Separate sets of traces were collected at two different times of the year. The traces capture all of the dynamic load information available to user-level programs on these machines. We present a detailed statistical analysis of these traces here, including summary statistics, distributions, and time series analysis results. Two significant new results are that load is self-similar and that it displays epochal behavior. All of the traces exhibit a high degree of self-similarity with Hurst parameters ranging from 0.73 to 0.99, strongly biased toward the top of that range. The traces also display epochal behavior in that the local frequency content of the load signal remains quite stable for long periods of time (150-450 seconds mean) and changes abruptly at epoch boundaries. Despite these complex behaviors, we have found that relatively simple linear models are sufficient for short-range host load prediction.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 1999
Accession Number
ADA363679

Entities

People

  • Peter A. Dinda

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Computer Science
  • Computers
  • Data Science
  • Dynamic Loads
  • Frequency
  • Frequency Domain
  • Information Science
  • Load Distribution
  • Normal Distribution
  • Operating Systems
  • Simulations
  • Stationary Processes
  • Statistical Analysis
  • Statistics
  • Stochastic Processes
  • Time Domain
  • Time Series Analysis

Readers

  • Computer Science.
  • Mathematical Modeling and Probability Theory.
  • Systems Analysis and Design