4.2 Article

LSShare: an efficient multiple query optimization system in the cloud

Journal

DISTRIBUTED AND PARALLEL DATABASES
Volume 32, Issue 4, Pages 583-605

Publisher

SPRINGER
DOI: 10.1007/s10619-014-7150-1

Keywords

Multiple query optimization; Query processing; SQL-rewriting; Subexpression identification

Funding

  1. NSFC [61202025, 61373031, 61373156, 61272099, 61261160502]
  2. Program for Changjiang Scholars and Innovative Research Team in University of China (PCSIRT) [IRT1158]
  3. Scientific Innovation Act of STCSM [13511504200]
  4. Singapore NRF [CREATE E2S2]
  5. EU FP7 CLIMBER project [PIRSES-GA-2012-318939]
  6. State High-Tech Development Plan [2013AA01A601]
  7. Microsoft Research Asia (the Urban Informatics Research Grant)
  8. STCSM [12ZR1414900]

Ask authors/readers for more resources

Multiple query optimization (MQO) in the cloud has become a promising research direction due to the popularity of cloud computing, which runs massive data analysis queries (jobs) routinely. These CPU/IO intensive analysis queries are complex and time-consuming but share common components. It is challenging to detect, share and reuse the common components among thousands of SQL-like queries. Previous solutions to MQO, heuristic or genetic based, are not appropriate for the large growing query set situation. In this paper, we develop a sharing system called LSShare using our proposed Lineage-Signature approach. By LSShare, we can efficiently solve the MQO problem in a recurring query set situation in the cloud. Our system has been prototyped in a distributed system built for massive data analysis based on Alibaba's cloud computing platform (http://www.alibaba.com/). Experimental results on real data sets demonstrate the efficiency and effectiveness of the proposed approach.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available