期刊
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS
卷 26, 期 5, 页码 2737-2755出版社
SPRINGER
DOI: 10.1007/s10586-022-03767-0
关键词
Big data processing framework; Configuration parameter; High dimensional black-box optimization; Bayesian optimization
To achieve better performance, big data processing frameworks usually have a large number of performance-critical configuration parameters. Manually configuring these parameters is time-consuming, so there is a need for automatic tuning. This paper proposes a black-box approach, ATConf, to automatically tune the configuration parameters for BDPFs. Experimental results show that ATConf can reduce the execution time by 46.52% compared to the default configuration.
To support various application scenarios, big data processing frameworks (BDPFs) such as Spark usually provide users with a large number of performance-critical configuration parameters. Since manually configuring is both labor-intensive and time-consuming, automatically tuning configurations parameters for BDPFs to achieve better performance has been an urgent need. To simultaneously address the corresponding challenges such as high dimensional configuration space, we propose ATConf-a new black-box approach of automatically tuning the internal and external configuration parameters for BDPFs. Experimental results based on our local distributed Spark cluster show that the best execution time achieved by ATConf is as much as 46.52% less than the default configuration. Besides, compared with the four baselines, ATConf is able to further reduce the relative execution time over default by at least 4.10% under the same constraint of observation times.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据