4.5 Article

Tuning configuration of apache spark on public clouds by combining multi-objective optimization and performance prediction model

Journal

JOURNAL OF SYSTEMS AND SOFTWARE
Volume 180, Issue -, Pages -

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.jss.2021.111028

Keywords

Spark; Configuration tuning; Multi-objective optimization

Funding

  1. National Natural Science Foundation of China [62072342, 61672392]

Ask authors/readers for more resources

Choosing the right configuration for Spark in the public cloud to ensure efficient periodic jobs is challenging due to a large configuration space. The AB-MOEA/D algorithm combines multi-objective optimization and performance prediction to automatically search for optimal configurations, significantly outperforming previous work in terms of execution time and cost.
Choosing the right configuration for Spark deployed in the public cloud to ensure the efficient running of periodic jobs is hard, because there can be a huge configuration space to explore which is composed of numerous performance-related parameters in different dimensions (e.g., application-level and cloud-level). Choosing poorly will not only significantly degrade performance but may also lead to greater overhead. However, automatically searching for the optimal configuration of various applications to trade-off performance and cost is challenging. To address this issue, we propose a new optimal configuration search algorithm named AB-MOEA/D by combining multi-objective optimization algorithm and performance prediction model. AB-MOEA/D uses a decomposition-based multi-objective optimization algorithm to find the configuration with the objective of minimizing the execution time and cost, where the performance model constructed on the Adaboost algorithm is used to evaluate the fitness of each candidate configuration. Besides, we also present the configuration automatic tuning system with AB-MOEA/D as the optimization engine. The experimental results on six benchmarks with five data sets show that AB-MOEA/D significantly outperforms the previous work in terms of execution time and cost, with average improvements of approximately 35 and 40 percent. (C) 2021 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available