Reliability and cost optimization in distributed computing systems

Chung Chi Hsieh, Yi Che Hsieh

Research output: Contribution to journalArticlepeer-review

81 Citations (Scopus)

Abstract

The reliability of the communication network and its processing units and the strategy of task allocation are essential in determining the system reliability of a distributed computing system. Reliability of such systems can be improved by endowing resource redundancy or the use of highly reliable components. In this paper, we develop a relationship between system cost and hardware redundancy levels, assuming cycle-free distributed computing systems. Based on the derived relationship, we propose a hybrid heuristic which combines genetic algorithms and the steepest decent method to seek the optimal task allocation and hardware redundancy policies such that system cost is minimized. The purpose of this paper is to develop the optimal task allocation and hardware redundancy policies for a cycle-free distributed computing system with hardware redundancy so that system cost during the period of task execution is minimized.

Original languageEnglish
Pages (from-to)1103-1119
Number of pages17
JournalComputers and Operations Research
Volume30
Issue number8
DOIs
Publication statusPublished - 2003 Jul

All Science Journal Classification (ASJC) codes

  • General Computer Science
  • Modelling and Simulation
  • Management Science and Operations Research

Fingerprint

Dive into the research topics of 'Reliability and cost optimization in distributed computing systems'. Together they form a unique fingerprint.

Cite this