4.7 Review

Toward a Smart Cloud: A Review of Fault-Tolerance Methods in Cloud Systems

Journal

IEEE TRANSACTIONS ON SERVICES COMPUTING
Volume 14, Issue 2, Pages 589-605

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TSC.2018.2816644

Keywords

Cloud computing; Fault tolerance; Fault tolerant systems; Software as a service; Computer architecture; Cloud computing; fault-tolerance; reliability; availability; smart cloud; machine learning; artificial intelligence

Ask authors/readers for more resources

This paper provides a comprehensive survey of fault tolerance methods for cloud computing, including ReActive Methods (RAMs), PRoactive Methods (PRMs), and ReSilient Methods (RSMs). Machine Learning and Artificial Intelligence have played a significant role in optimizing recovery time in RSM domain. Current issues and challenges in cloud fault tolerance are also discussed to identify potential areas for future research.
This paper presents a comprehensive survey of the state-of-the-art work on fault tolerance methods proposed for cloud computing. The survey classifies fault-tolerance methods into three categories: 1) ReActive Methods (RAMs); 2) PRoactive Methods (PRMs); and 3) ReSilient Methods (RSMs). RAMs allow the system to enter into a fault status and then try to recover the system. PRMs tend to prevent the system from entering a fault status by implementing mechanisms that enable them to avoid errors before they affect the system. On the other hand, recently emerging RSMs aim to minimize the amount of time it takes for a system to recover from a fault. Machine Learning and Artificial Intelligence have played an active role in RSM domain in such a way that the recovery time is mapped to a function to be optimized (i.e., by converging the recovery time to a fraction of milliseconds). As the system learns to deal with new faults, the recovery time will become shorter. In addition, current issues and challenges in cloud fault tolerance are also discussed to identify promising areas for future research.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available