4.7 Article

An automatic clustering technique for query plan recommendation

Journal

INFORMATION SCIENCES
Volume 545, Issue -, Pages 620-632

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2020.09.037

Keywords

Query processing; Incremental DBSCAN clustering; NSGA-II; Cluster validity indices; Multi-objective optimization (MOO)

Ask authors/readers for more resources

The query optimizer is responsible for identifying the most efficient Query Execution Plans (QEP's) and reusing previously generated plans is an efficient technique for query processing. To improve accuracy, queries are rewritten and converted into vectors. A multi-objective automatic query plan recommendation method has been introduced, optimizing cluster validity indices.
The query optimizer is responsible for identifying the most efficient Query Execution Plans (QEP's). The distributed database relations may be kept in several places. These results in a dramatic increase in the number of alternative query' plans. The query optimizer cannot exhaustively explore the alternative query plans in a vast search space at reasonable computational costs. Henceforth, reusing the previously generated plans instead of generating new plans for new queries is an efficient technique for query processing. To improve the accuracy of clustering, we've rewritten the queries to standardize their structures. Furthermore, TF representation schema has been used to convert the queries into vectors. In this paper, we've introduced a multi-objective automatic query plan recommendation method, a combination of incremental DBSCAN and NSGA-II. The quality of the results of incremental DBSCAN has been influenced by Minpts (minimum points) and Eps (epsilon). Two cluster validity indices, Dunn index and Davies-Bouldin index, have simultaneously been optimized to calculate the goodness of an answer. Comparative results have been shown against the incremental DBSCAN and K-means regarding an external cluster validity index, namely, the ARI. By comparing different types of query workloads, we've found that the introduced method outperforms the other well-known approaches. (C) 2020 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available