Data Mining Meets Performance Evaluation: Fast Algorithms for Modeling Bursty Traffic

Abstract

Network, web, and disk I/O traffic are usually bursty, self-similar [9, 3, 5, 6] and therefore can not be modeled adequately with Poisson arrivals[9]. However, we do want to model these types of traffic and to generate realistic traces, because of obvious applications for disk scheduling, network management, web server design. Previous models (like fractional Brownian motion, ARFIMA etc) tried to capture the burstiness . However the proposed models either require too many parameters to fit and/or require prohibitively large (quadratic) time to generate large traces. We propose a simple, parsimonious method, the b-model , which solves both problems: It requires just one parameter (b), and it can easily generate large traces. In addition, it has many more attractive properties: (a)With our proposed estimation algorithm, it requires just a single pass over the actual trace to estimate b. For example, a one-day-long disk trace in milliseconds contains about 86Mb data points and requires about 3 minutes for model fitting and 5 minutes for generation. (b) The resulting synthetic traces are very realistic: our experiments on real disk and web traces show that our synthetic traces match the real ones very well in terms of queuing behavior.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 2001
Accession Number
ADA461275

Entities

People

  • Christos Faloutsos
  • Mengzhi Wang
  • Ngai H. Chan
  • Spiros Paradimitriou
  • Tara Madhyastha

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Computations
  • Computer Science
  • Computers
  • Construction
  • Data Mining
  • Data Science
  • Data Sets
  • Demographic Cohorts
  • Demography
  • Engineering
  • Information Science
  • Networks
  • Simulators
  • Test And Evaluation
  • Time Intervals
  • Workload

Fields of Study

  • Computer science

Readers

  • Computational Modeling and Simulation
  • Computer Networking
  • Wave Propagation and Nonlinear Chaotic Dynamics.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Neural Networks