Adaptive Fault-Resistant Systems

Abstract

The combined effects of faults and resource failures, wide swings in service demand, and situation-dependent user requirements, stress a computer's ability to satisfy its service expectations. This is an especially significant problem in distributed system that employ unreliable communications, and whose components may operate in different and perhaps harsh physical data and usage environments. One of the goals for adaptive design is to allow flexible use of available resources to cover a much wider range of different kinds of environmental variables than could be covered by a fixed, worst-case design. The research focused on the tasks of: (1) developing a theory of adaptive fault- resistant systems and general principles of architectural design; (2) developing specific architectural design techniques; and (3) demonstrating adaptive designs. Three mechanisms were investigated: (1) Adaptive Distributed Recovery Blocks (ADRBs), a multiple-mode scheme for error detection and recovery, useful for both hardware and software faults; (2) adaptive fault tolerance for hybrid faults, an economical technique for tolerating both simple and complex fault types; and (3) adaptive distributed thread integrity, a technique for detecting and repairing thread breaks in a wide range of operating environments using the Alpha programming model. Adaptive distributed systems, Recovery blocks, Fault tolerance, Anomaly management.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 01, 1994
Accession Number
ADA286406

Entities

People

  • E. D. Jensen
  • Ira Greenberg
  • Jack Goldberg
  • Kane Kim
  • Raymond Clark

Organizations

  • SRI International

Tags

Communities of Interest

  • C4I
  • Energy and Power Technologies
  • Engineered Resilient Systems
  • Human Systems

DTIC Thesaurus Topics

  • Acceptability
  • Accuracy
  • Adaptive Control Systems
  • Application Software
  • Case Studies
  • Command And Control
  • Computer Networks
  • Computer Programming
  • Computers
  • Data Processing
  • Databases
  • Detection
  • Detectors
  • Local Area Networks
  • Operating Systems
  • Probability
  • Standards

Fields of Study

  • Computer science
  • Engineering

Readers

  • Fault Tolerant Diagnosis of Black and White Balloon Isolation Tests Using ¥.
  • Parallel and Distributed Computing.
  • Systems Analysis and Design