4.5 Article

In-situ visual exploration over big raw data

期刊

INFORMATION SYSTEMS
卷 95, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.is.2020.101616

关键词

Visual analytics; Progressive & adaptive indexes; User-driven incremental processing; Interactive indexing; RawVis; In-situ query processing; Big data visualization

资金

  1. European Union (European Social Fund - ESF) through the Operational Programme Human Resources Development, Education and Lifelong Learning 2014-2020'' in the context of the project PLOIGIA: Navigation and Visual Analytics on Raw Datasets'' [MIS 5005089]

向作者/读者索取更多资源

The paper presents the RawVis framework for efficient query processing on large raw data files in interactive exploration and analytics scenarios, which is built on the lightweight in-memory tile-based index VALINOR. Experimental results demonstrate that the RawVis method outperforms existing solutions in terms of response time, disk accesses, and memory consumption, being significantly faster and requiring less memory resources, particularly during exploration scenarios.
Data exploration and visual analytics systems are of great importance in Open Science scenarios, where less tech-savvy researchers wish to access and visually explore big raw data files (e.g., json, csv) generated by scientific experiments using commodity hardware and without being overwhelmed in the tedious processes of data loading, indexing and query optimization. In this paper, we present our work for enabling efficient query processing on large raw data files for interactive visual exploration scenarios and analytics. We introduce a framework, named RawVis, built on top of a lightweight in-memory tile-based index, VALINOR, that is constructed on-the-fly given the first user query over a raw file and progressively adapted based on the user interaction. We evaluate the performance of a prototype implementation compared to three other alternatives and show that our method outperforms in terms of response time, disk accesses and memory consumption. Particularly during an exploration scenario, the proposed method in most cases is about 5-10x faster compared to existing solutions, and requires significantly less memory resources. (C) 2020 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据