4.5 Article

An efficient parallel indexing structure for multi-dimensional big data using spark

Journal

JOURNAL OF SUPERCOMPUTING
Volume 77, Issue 10, Pages 11187-11214

Publisher

SPRINGER
DOI: 10.1007/s11227-021-03718-3

Keywords

Multi-Dimensional Data; Big Data; Indexing; Parallel Processing; Spark

Ask authors/readers for more resources

With the increasing daily production of data, indexing, storing, and retrieving huge amounts of data have become common problems. This paper proposes a new indexing structure called ParISSS for multi-dimensional big data, which outperforms other indexing systems.
With the increasing daily production of data in recent years, indexing, storing and retrieving huge amounts of data have become a common problem, especially for multi-dimensional big data. Although R-tree has proved to be efficient for indexing multi-dimensional big data, the R-tree suffers from the curse of dimensionality problem. Many researchers continue to use the R-tree in their studies as it is the most famous tree-like structure for indexing multi-dimensional data. However, with increasing numbers of dimensions in multi-dimensional data the performance of R-Tree will decrease. This paper proposes a new indexing structure called Parallel Indexing System Structure based on Spark (ParISSS), which is an efficient system for indexing multi-dimensional big data, to overcome these problems. ParISSS introduces six types of computing nodes, the reception-node is used to insert and index data, the normal-node is used to store indexed data, the resolution-node is used to distribute a reception-index to a normal-node, the representative-node is used to receive queries from the user, and the reply-node and check-node are used to send the results to the user. We also introduced BR*-tree structure to improve the storing and searching processes. We present an extensive experimental evaluation of our system, comparing several indexing systems. The experimental results show that ParISSS outperforms other indexing systems.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available