4.7 Article

Reliability of a Distributed Computing System With Performance Sharing

Journal

IEEE TRANSACTIONS ON RELIABILITY
Volume 71, Issue 4, Pages 1555-1566

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TR.2021.3111031

Keywords

Reliability; Task analysis; Distributed computing; Computational modeling; Optimization; Resource management; Computers; Common bus performance sharing; distributed computing systems; optimization; system reliability; universal generating function (UGF)

Funding

  1. National Natural Science Foundation of China [71971176, 71725001, 71910107002]
  2. Applied Basic Research Program of Sichuan Province [2020YJ0027]
  3. Fundamental Research Funds for the Central Universities [JBK2103010]

Ask authors/readers for more resources

Existing research has focused on improving the reliability of distributed computing systems, without considering the performance sharing mechanism. This study proposes a reliability model for evaluating distributed computing systems with performance sharing, and formulates an optimization model for deriving the optimal performance sharing policy.
Existing research has been concentrated on improving the reliability of a distributed computing system through optimizing tasks allocation, providing software redundancy and providing hardware redundancy. None of these works considered the performance sharing mechanism in a distributed computing system. Different from other performance sharing systems whose reliability can be calculated directly, the reliability evaluation of a distributed computing system with performance sharing is more challenging since the reliability depends on the task execution time of each processor after performance sharing. This research considers a distributed computing system with performance sharing mechanism such that the computing power can be redistributed among different processors in the system. A reliability model is proposed to evaluate the distributed computing system with performance sharing. An optimization model is formulated to derive the optimal performance sharing policy such that the system reliability can be maximized. Both analytic examples and numerical examples are carried out to illustrate the proposed model and algorithm.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available