Fault-tolerance model for multiprocessor real-time systems

Sheng Tzong Cheng, Chia Mei Chen, Satish K. Tripathi

Research output: Contribution to journalConference article

2 Citations (Scopus)

Abstract

System reliability is an important aspect of real-time systems, because the result of a real-time application may be valid only if the application functions correctly and its timing constraints are satisfied. There are two kinds of faults, hardware and software faults, and the paper considers hardware transient faults. Full replication or full hardware redundancy can achieve a high degree of reliability; however, it wastes lots of resources. For most real-time systems, such schemes might not be available and hence reliability estimation becomes essential. We propose an analytic model for system reliability estimation based on the Markov chain and investigate the accuracy of the estimated reliability. The results show that the proposed model obtains good estimation in various simulated real-time systems.

Original languageEnglish
Pages (from-to)457-477
Number of pages21
JournalJournal of Computer and System Sciences
Volume61
Issue number3
DOIs
Publication statusPublished - 2000 Dec
Event17th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems - Seattle, WA, USA
Duration: 1998 Jun 11998 Jun 4

Fingerprint

Multiprocessor Systems
Fault tolerance
Real time systems
Fault Tolerance
Reliability Estimation
Real-time
Hardware
System Reliability
Fault
Transient Faults
Model
Replication
Redundancy
Timing
Markov chain
Markov processes
Valid
Computer systems
Resources
Software

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Networks and Communications
  • Computational Theory and Mathematics
  • Applied Mathematics

Cite this

Cheng, Sheng Tzong ; Chen, Chia Mei ; Tripathi, Satish K. / Fault-tolerance model for multiprocessor real-time systems. In: Journal of Computer and System Sciences. 2000 ; Vol. 61, No. 3. pp. 457-477.
@article{f397da9267fa49a1818432076bdd1724,
title = "Fault-tolerance model for multiprocessor real-time systems",
abstract = "System reliability is an important aspect of real-time systems, because the result of a real-time application may be valid only if the application functions correctly and its timing constraints are satisfied. There are two kinds of faults, hardware and software faults, and the paper considers hardware transient faults. Full replication or full hardware redundancy can achieve a high degree of reliability; however, it wastes lots of resources. For most real-time systems, such schemes might not be available and hence reliability estimation becomes essential. We propose an analytic model for system reliability estimation based on the Markov chain and investigate the accuracy of the estimated reliability. The results show that the proposed model obtains good estimation in various simulated real-time systems.",
author = "Cheng, {Sheng Tzong} and Chen, {Chia Mei} and Tripathi, {Satish K.}",
year = "2000",
month = "12",
doi = "10.1006/jcss.2000.1704",
language = "English",
volume = "61",
pages = "457--477",
journal = "Journal of Computer and System Sciences",
issn = "0022-0000",
publisher = "Academic Press Inc.",
number = "3",

}

Fault-tolerance model for multiprocessor real-time systems. / Cheng, Sheng Tzong; Chen, Chia Mei; Tripathi, Satish K.

In: Journal of Computer and System Sciences, Vol. 61, No. 3, 12.2000, p. 457-477.

Research output: Contribution to journalConference article

TY - JOUR

T1 - Fault-tolerance model for multiprocessor real-time systems

AU - Cheng, Sheng Tzong

AU - Chen, Chia Mei

AU - Tripathi, Satish K.

PY - 2000/12

Y1 - 2000/12

N2 - System reliability is an important aspect of real-time systems, because the result of a real-time application may be valid only if the application functions correctly and its timing constraints are satisfied. There are two kinds of faults, hardware and software faults, and the paper considers hardware transient faults. Full replication or full hardware redundancy can achieve a high degree of reliability; however, it wastes lots of resources. For most real-time systems, such schemes might not be available and hence reliability estimation becomes essential. We propose an analytic model for system reliability estimation based on the Markov chain and investigate the accuracy of the estimated reliability. The results show that the proposed model obtains good estimation in various simulated real-time systems.

AB - System reliability is an important aspect of real-time systems, because the result of a real-time application may be valid only if the application functions correctly and its timing constraints are satisfied. There are two kinds of faults, hardware and software faults, and the paper considers hardware transient faults. Full replication or full hardware redundancy can achieve a high degree of reliability; however, it wastes lots of resources. For most real-time systems, such schemes might not be available and hence reliability estimation becomes essential. We propose an analytic model for system reliability estimation based on the Markov chain and investigate the accuracy of the estimated reliability. The results show that the proposed model obtains good estimation in various simulated real-time systems.

UR - http://www.scopus.com/inward/record.url?scp=0034506175&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034506175&partnerID=8YFLogxK

U2 - 10.1006/jcss.2000.1704

DO - 10.1006/jcss.2000.1704

M3 - Conference article

AN - SCOPUS:0034506175

VL - 61

SP - 457

EP - 477

JO - Journal of Computer and System Sciences

JF - Journal of Computer and System Sciences

SN - 0022-0000

IS - 3

ER -