TY - GEN
T1 - A low-latency checkpointing scheme for mobile computing systems
AU - Li, Guohui
AU - Shu, Lih-Chyun
PY - 2005/12/1
Y1 - 2005/12/1
N2 - Fault-tolerant mobile computing systems have different requirements and restrictions, not taken into account by conventional distributed systems. This paper presents a co-ordinated checkpointing scheme which reduces the delay involved in a global checkpointing process for mobile systems. A piggyback technique is used to track and record the checkpoint dependency information among processes during normal message transmission. During checkpointing, a concurrent checkpointing technique is designed to use the pre-recorded process dependency information to minimize process blocking time by sending checkpoint requests to dependent processes at once, hence saving the time to trace the dependency tree. Our checkpoint algorithm forces a minimum number of processes to take checkpoints. Via probability-based analysis, we show that our scheme can significantly reduce the latency associated with checkpoint request propagation, compared to traditional coordinated checkpointing approach.
AB - Fault-tolerant mobile computing systems have different requirements and restrictions, not taken into account by conventional distributed systems. This paper presents a co-ordinated checkpointing scheme which reduces the delay involved in a global checkpointing process for mobile systems. A piggyback technique is used to track and record the checkpoint dependency information among processes during normal message transmission. During checkpointing, a concurrent checkpointing technique is designed to use the pre-recorded process dependency information to minimize process blocking time by sending checkpoint requests to dependent processes at once, hence saving the time to trace the dependency tree. Our checkpoint algorithm forces a minimum number of processes to take checkpoints. Via probability-based analysis, we show that our scheme can significantly reduce the latency associated with checkpoint request propagation, compared to traditional coordinated checkpointing approach.
UR - http://www.scopus.com/inward/record.url?scp=34248507212&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34248507212&partnerID=8YFLogxK
U2 - 10.1109/COMPSAC.2005.26
DO - 10.1109/COMPSAC.2005.26
M3 - Conference contribution
AN - SCOPUS:34248507212
SN - 0769522092
SN - 9780769522098
T3 - Proceedings - International Computer Software and Applications Conference
SP - 491
EP - 496
BT - Proceedings of the 29th Annual International Computer Software and Applications Conference - Workshops and Fast Abstracts, COMPSAC 2005
T2 - 29th Annual International Computer Software and Applications Conference, COMPSAC 2005
Y2 - 26 July 2005 through 28 July 2005
ER -