On the Relevance of Communication Costs of Rollback-Recovery Protocols,

Abstract

Communication overhead has been traditionally the primary metric for evaluating rollback-recovery protocols. This paper reexamines the prominence of this metric in light of the recent increases in processor and network speeds. We introduce a new recovery algorithm for a family of rollback-recovery protocols based on logging. The new algorithm incurs a higher communication overhead during recovery than previous algorithms, but it requires less access to stable storage and imposes no restrictions on the execution of live processes. Experimental results show that the new algorithm performs better than one that is optimized for low communication overhead. These results suggest that in modern environments, latency in accessing stable storage and intrusion of a particular algorithm on the execution of live processes are more important than the number of messages exchanged during recovery. (KAR) P. 3

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 01, 1995
Accession Number
ADA296396

Entities

People

  • E. N. Elnozahy

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Asynchronous Systems
  • Computer Science
  • Computers
  • Damage Detection
  • Detection
  • Distributed Computing
  • Fault Tolerance
  • Intrusion
  • Prototypes
  • Recovery
  • Test And Evaluation

Fields of Study

  • Computer science

Readers

  • Economics
  • Parallel and Distributed Computing.