4.4 Article Proceedings Paper

Machine Learning for Databases

Journal

PROCEEDINGS OF THE VLDB ENDOWMENT
Volume 14, Issue 12, Pages 3190-3193

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.14778/3476311.3476405

Keywords

-

Funding

  1. NSF of China [61925205, 61632016]
  2. Beijing National Research Center for Information Science and Technology
  3. Huawei
  4. TAL education

Ask authors/readers for more resources

This article introduces the application of machine learning techniques in optimizing databases, categorizing database tasks into three types of problems and using different machine learning models to solve them. These problems include NP-hard problems, regression problems, and prediction problems, corresponding to different database optimization tasks.
Machine learning techniques have been proposed to optimize the databases. For example, traditional empirical database optimization techniques (e.g., cost estimation, join order selection, knob tuning, index and view advisor) cannot meet the high-performance requirement for large-scale database instances, various applications and diversified users, especially on the cloud. Fortunately, machine learning based techniques can alleviate this problem by judiciously selecting optimization strategy. In this tutorial, we categorize database tasks into three typical problems that can be optimized by different machine learning models, including NP-hard problems (e.g., knob space exploration, index/view selection, partition-key recommendation for offline optimization; query rewrite, join order selection for online optimization), regression problems (e.g., cost/cardinality estimation, index/view benefit estimation, query latency prediction), and prediction problems (e.g., query workload prediction). We review existing machine learning based techniques to address these problems and provide research challenges.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available