Backward Error Recovery in Redundant Disk Arrays

Abstract

Redundant disk arrays are single fault tolerant, incorporating a layer of error handling not found in nonredundant disk systems. Recovery from these errors is complex, due in part to the large number of erroneous states the system may reach. The established approach to error recovery in disk systems is to transition directly from an erroneous state to completion. This technique, known as forward error recovery, relies upon the context in which an error occurs to determine the steps required to reach completion, which implies forward error recovery is design specific. Forward error recovery requires the enumeration of all erroneous states the system may reach and the construction of a forward path from each erroneous state. We propose a method of error recovery which does not rely upon the enumeration of erroneous states or the context in which errors occur. When an error is encountered, we advocate mechanized recovery to an error-free state from which an operation may be retried. Using a form of backward error recovery, we are able to manage the complexity of error recovery in redundant disk arrays without sacrificing performance.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 27, 1994
Accession Number
ADA286064

Entities

People

  • Garth A. Gibson
  • William V. Courtright Ii

Organizations

  • Carnegie Mellon University

Tags

DTIC Thesaurus Topics

  • Algorithms
  • Computer Architecture
  • Computer Programming
  • Computer Science
  • Computers
  • Computing System Architectures
  • Data Storage Systems
  • Data Visualization
  • Databases
  • Distributed Computing
  • Engineering
  • Fault Tolerance
  • Fault Tolerant Computing
  • Language
  • Operating Systems
  • Reliability
  • Word Processors

Readers

  • Computer Programming and Software Development.
  • Manufacturing Engineering.
  • Systems Analysis and Design