Design and Analysis of Fault-Tolerant Distributed Real-Time Computer Systems
Abstract
The objective of this project was to contribute to the establishment of the scientific foundation for designing fault-tolerant distributed computer systems. Main results obtained from this research project are as follows. (1) Identification of critical research issues and some promising research directions in real-time fault-tolerant distributed computing; (2) A skeleton of the foundation for realizing system-level fault tolerance, which includes among others the DRB (distributed recovery block) scheme, the DCONV (distributed conversation) scheme, the PTC (programmer-transparent coordination) scheme, a TB (temporary blackout) handling scheme, and the complementary relationship among the schemes; These schemes enable the computer system to detect and recover from both hardware and software faults without missing the deadlines for processing important data and delivering outputs to the controlled object/environment; (3) A preliminary structure of a model of real-time distributed computation; (4) A theoretical investigation into the efficiency and diagnostic power of basic processor-level diagnosis approaches in diagnosing hypercubes conducted; (5) An enhancement of three of the real-time computer network testbeds established in the UCI DREAM (Distributed Real-time Ever Available Microcomputing) Laboratory made.
Document Details
- Document Type
- Technical Report
- Publication Date
- Jul 25, 1991
- Accession Number
- ADA239521
Entities
People
- K. H. Kim
Organizations
- University of California, Irvine