Journal
JOURNAL OF SUPERCOMPUTING
Volume 77, Issue 10, Pages 11187-11214Publisher
SPRINGER
DOI: 10.1007/s11227-021-03718-3
Keywords
Multi-Dimensional Data; Big Data; Indexing; Parallel Processing; Spark
Ask authors/readers for more resources
With the increasing daily production of data, indexing, storing, and retrieving huge amounts of data have become common problems. This paper proposes a new indexing structure called ParISSS for multi-dimensional big data, which outperforms other indexing systems.
With the increasing daily production of data in recent years, indexing, storing and retrieving huge amounts of data have become a common problem, especially for multi-dimensional big data. Although R-tree has proved to be efficient for indexing multi-dimensional big data, the R-tree suffers from the curse of dimensionality problem. Many researchers continue to use the R-tree in their studies as it is the most famous tree-like structure for indexing multi-dimensional data. However, with increasing numbers of dimensions in multi-dimensional data the performance of R-Tree will decrease. This paper proposes a new indexing structure called Parallel Indexing System Structure based on Spark (ParISSS), which is an efficient system for indexing multi-dimensional big data, to overcome these problems. ParISSS introduces six types of computing nodes, the reception-node is used to insert and index data, the normal-node is used to store indexed data, the resolution-node is used to distribute a reception-index to a normal-node, the representative-node is used to receive queries from the user, and the reply-node and check-node are used to send the results to the user. We also introduced BR*-tree structure to improve the storing and searching processes. We present an extensive experimental evaluation of our system, comparing several indexing systems. The experimental results show that ParISSS outperforms other indexing systems.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available