Tradeoffs in Implementing Primary-Backup Protocols.

Abstract

One way to implement a fault-tolerant service is by using multiple servers that fail independently. The state of the service is replicated and distributed among these servers, and updates are coordinated so that even when a subset of the servers fail, the service remains available. A common approach to structuring such replicated services is to designate one server as the primary and all the others as backups. Clients make requests by sending messages only to the primary. If the primary fails, then a failover occurs and one of the backups takes over. This service architecture is commonly called the primary backup or the primary copy approach. In most such primary-backup protocols, when the primary receives a client request, it informs the backups about the request, and then responds to the client. Informally, this primary-backup protocol is non- blocking if the primary does not wait for an acknowledgement from the backups before it sends the response; otherwise, it is blocking. Most of the existing protocols are blocking as non-blocking protocols cannot be constructed for some kinds of failures. However, we show that non-blocking protocols can be constructed for most of the process and communication failures that are expected to occur in the primary-backup systems of the future.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 01, 1992
Accession Number
ADA257465

Entities

People

  • Keith Marzullo
  • Navin Budhiraja

Organizations

  • Cornell University

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Backup Systems
  • Computer Science
  • Computers
  • Damage Detection
  • Detectors
  • Frequency
  • Local Area Networks
  • Networks
  • Step Functions
  • Transport Protocols
  • Universities

Fields of Study

  • Computer science

Readers

  • Computer Networking
  • Public Financial Management and Budgeting