TY - GEN
T1 - Fault-tolerant parallel processing with real-time error detection and recovery
AU - Chen, Chung Ho
AU - Somani, Arun K.
N1 - Publisher Copyright:
© 1992 IEEE.
PY - 1992
Y1 - 1992
N2 - The performance of parallel processing for real-time application is very sensitive to the reliability of the system. This paper presents a unique error recovery mechanism based on new cache states, verified and non-verified, to detect and recover errors produced by the processor or cache memory or both due to transient faults. The proposed scheme remedies the insufficiency of the error-correcting code when facing with processor transient fault. This cache-based recovery metkod not only recovers errors in a local cache memory but also prevents the propagation of errors to other caches. We show that this new error recovery scheme can be easily integrated with existing cache coherency protocols.
AB - The performance of parallel processing for real-time application is very sensitive to the reliability of the system. This paper presents a unique error recovery mechanism based on new cache states, verified and non-verified, to detect and recover errors produced by the processor or cache memory or both due to transient faults. The proposed scheme remedies the insufficiency of the error-correcting code when facing with processor transient fault. This cache-based recovery metkod not only recovers errors in a local cache memory but also prevents the propagation of errors to other caches. We show that this new error recovery scheme can be easily integrated with existing cache coherency protocols.
UR - http://www.scopus.com/inward/record.url?scp=85064613979&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85064613979&partnerID=8YFLogxK
U2 - 10.1109/ACSSC.1992.269071
DO - 10.1109/ACSSC.1992.269071
M3 - Conference contribution
AN - SCOPUS:85064613979
T3 - Conference Record - Asilomar Conference on Signals, Systems and Computers
SP - 994
EP - 998
BT - Conference Record of the 26th Asilomar Conference on Signals, Systems and Computers, ACSSC 1992
PB - IEEE Computer Society
T2 - 26th Asilomar Conference on Signals, Systems and Computers, ACSSC 1992
Y2 - 26 October 1992 through 28 October 1992
ER -