期刊
PROCEEDINGS OF THE VLDB ENDOWMENT
卷 14, 期 12, 页码 3190-3193出版社
ASSOC COMPUTING MACHINERY
DOI: 10.14778/3476311.3476405
关键词
-
资金
- NSF of China [61925205, 61632016]
- Beijing National Research Center for Information Science and Technology
- Huawei
- TAL education
This article introduces the application of machine learning techniques in optimizing databases, categorizing database tasks into three types of problems and using different machine learning models to solve them. These problems include NP-hard problems, regression problems, and prediction problems, corresponding to different database optimization tasks.
Machine learning techniques have been proposed to optimize the databases. For example, traditional empirical database optimization techniques (e.g., cost estimation, join order selection, knob tuning, index and view advisor) cannot meet the high-performance requirement for large-scale database instances, various applications and diversified users, especially on the cloud. Fortunately, machine learning based techniques can alleviate this problem by judiciously selecting optimization strategy. In this tutorial, we categorize database tasks into three typical problems that can be optimized by different machine learning models, including NP-hard problems (e.g., knob space exploration, index/view selection, partition-key recommendation for offline optimization; query rewrite, join order selection for online optimization), regression problems (e.g., cost/cardinality estimation, index/view benefit estimation, query latency prediction), and prediction problems (e.g., query workload prediction). We review existing machine learning based techniques to address these problems and provide research challenges.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据