Process recovery in heterogeneous systems

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Heterogeneous computing environments, where computers may have different instruction set architectures, data representations, and operating systems, complicate checkpointing and recovery of processes. This paper descdbes an approach to recovery and ah implementation, PREACHES, that provides portable checkpointing of single-process applications in heterogeneous systems using checkpoint propagation. The checkpoint propagation mechanism creates machine-dependent checkpoints for different architectures in the heterogeneous environment. A process is restored on a specific machine with the checkpoint that is appropdate for the architecture. An implementation of PREACHES has been evaluated on a heterogeneous network of workstations, including Sun, HP, and Pentium machines. The experimental results show that PREACHES achieves efficient checkpointing anal rapid recovery.

Original languageEnglish
Pages (from-to)126-138
Number of pages13
JournalIEEE Transactions on Computers
Volume52
Issue number2
DOIs
Publication statusPublished - 2003 Feb 1

Fingerprint

Checkpoint
Heterogeneous Systems
Checkpointing
Recovery
Computer workstations
Heterogeneous networks
Propagation
Network of Workstations
Heterogeneous Computing
Sun
Heterogeneous Environment
Heterogeneous Networks
Operating Systems
Dependent
Experimental Results
Architecture

All Science Journal Classification (ASJC) codes

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
  • Computational Theory and Mathematics

Cite this

@article{013de2c130fb472a933ad13785a15331,
title = "Process recovery in heterogeneous systems",
abstract = "Heterogeneous computing environments, where computers may have different instruction set architectures, data representations, and operating systems, complicate checkpointing and recovery of processes. This paper descdbes an approach to recovery and ah implementation, PREACHES, that provides portable checkpointing of single-process applications in heterogeneous systems using checkpoint propagation. The checkpoint propagation mechanism creates machine-dependent checkpoints for different architectures in the heterogeneous environment. A process is restored on a specific machine with the checkpoint that is appropdate for the architecture. An implementation of PREACHES has been evaluated on a heterogeneous network of workstations, including Sun, HP, and Pentium machines. The experimental results show that PREACHES achieves efficient checkpointing anal rapid recovery.",
author = "Kuo-Feng Ssu and Fuchs, {W. Kent} and Hewijin-Christine Jiau",
year = "2003",
month = "2",
day = "1",
doi = "10.1109/TC.2003.1176981",
language = "English",
volume = "52",
pages = "126--138",
journal = "IEEE Transactions on Computers",
issn = "0018-9340",
publisher = "IEEE Computer Society",
number = "2",

}

Process recovery in heterogeneous systems. / Ssu, Kuo-Feng; Fuchs, W. Kent; Jiau, Hewijin-Christine.

In: IEEE Transactions on Computers, Vol. 52, No. 2, 01.02.2003, p. 126-138.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Process recovery in heterogeneous systems

AU - Ssu, Kuo-Feng

AU - Fuchs, W. Kent

AU - Jiau, Hewijin-Christine

PY - 2003/2/1

Y1 - 2003/2/1

N2 - Heterogeneous computing environments, where computers may have different instruction set architectures, data representations, and operating systems, complicate checkpointing and recovery of processes. This paper descdbes an approach to recovery and ah implementation, PREACHES, that provides portable checkpointing of single-process applications in heterogeneous systems using checkpoint propagation. The checkpoint propagation mechanism creates machine-dependent checkpoints for different architectures in the heterogeneous environment. A process is restored on a specific machine with the checkpoint that is appropdate for the architecture. An implementation of PREACHES has been evaluated on a heterogeneous network of workstations, including Sun, HP, and Pentium machines. The experimental results show that PREACHES achieves efficient checkpointing anal rapid recovery.

AB - Heterogeneous computing environments, where computers may have different instruction set architectures, data representations, and operating systems, complicate checkpointing and recovery of processes. This paper descdbes an approach to recovery and ah implementation, PREACHES, that provides portable checkpointing of single-process applications in heterogeneous systems using checkpoint propagation. The checkpoint propagation mechanism creates machine-dependent checkpoints for different architectures in the heterogeneous environment. A process is restored on a specific machine with the checkpoint that is appropdate for the architecture. An implementation of PREACHES has been evaluated on a heterogeneous network of workstations, including Sun, HP, and Pentium machines. The experimental results show that PREACHES achieves efficient checkpointing anal rapid recovery.

UR - http://www.scopus.com/inward/record.url?scp=0037325654&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0037325654&partnerID=8YFLogxK

U2 - 10.1109/TC.2003.1176981

DO - 10.1109/TC.2003.1176981

M3 - Article

VL - 52

SP - 126

EP - 138

JO - IEEE Transactions on Computers

JF - IEEE Transactions on Computers

SN - 0018-9340

IS - 2

ER -