4.7 Article

FarSpot: Optimizing Monetary Cost for HPC Applications in the Cloud Spot Market

Journal

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS
Volume 33, Issue 11, Pages 2955-2967

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPDS.2021.3134644

Keywords

Costs; Predictive models; Pricing; Task analysis; Cloud computing; Prediction algorithms; Fault tolerant systems; Cloud computing; spot market; price prediction; ensemble models

Funding

  1. National Natural Science Foundation of China [62172282, 61802260, 62072311, 61972259, 62122056, U2001212]
  2. Guangdong Basic and Applied Basic Research Foundation [2020B1515120028, 2019B151502055]
  3. Guangdong NSF [2019A1515012053]
  4. Shenzhen Science and Technology Foundation [JCYJ20210324094402008, JCYJ20210324093212034]
  5. Tencent Rhinoceros Birds Scientific Research Foundation for Young Teachers of Shenzhen University

Ask authors/readers for more resources

This article presents FarSpot, an optimization framework for HPC applications in the latest cloud spot market that aims to minimize application cost while ensuring performance constraints. FarSpot utilizes accurate long-term price prediction and a cost-aware deadline assignment algorithm to reduce the monetary cost of HPC applications.
Recently, we have witnessed many HPC applications developed and hosted in the cloud, which can benefit from the elastic and diversified resources on the cloud, while on the other hand confronting high costs for executing the long-running HPC applications. Although public clouds such as Amazon EC2 offer spot instances with dynamic and usually low prices compared to on-demand ones, the spot prices can vary significantly and sometimes can even be more expensive than on-demand prices of the same type. Previous work on reducing the monetary cost for HPC applications using spot instances focused on designing fault tolerance techniques or selecting appropriate instance types/bid prices to make good usage of the low spot prices. However, with the recent update of spot pricing model on Amazon EC2, these work may become either inefficient or invalid. In this article, we present FarSpot which is an optimization framework for HPC applications in the latest cloud spot market with the goal of minimizing application cost while ensuring performance constraints. FarSpot provides accurate long-term price prediction for a wide range of spot instance types using ensemble-based learning method. It further incorporates a cost-aware deadline assignment algorithm to distribute application deadline to each task according to spot price changes. With the assigned subdeadline of each task, FarSpot dynamically migrates tasks among spot instances to reduce execution cost. Evaluation results using real HPC benchmark show that 1) the prediction error of FarSpot is very low (below 3%), 2) FarSpot reduced the monetary cost by 32% on average compared to state-of-the-art algorithms, and 3) FarSpot satisfies the user-specified deadline constraints at all time.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available