Article

HiePaCo: Scalable Hierarchical Exploration in Abstract Parallel Coordinates Under Budget Constraints

Journal

BIG DATA RESEARCH
Volume 17, Pages 1-17

Publisher

ELSEVIER
DOI: 10.1016/j.bdr.2019.07.001

Keywords

Interactive visualization; Big data; Large-scale visualization; Parallel coordinates; Hierarchical aggregation; Multi-scale visualization

Funding

  1. French Investissement d'Avenir Program (Big Data - Cloud Computing topic) [PIAO18062-645401, PIAO17298-398711]


In exploratory visualization systems, interactions let users manipulate a visual representation and thereby gain insight into its supporting data. The responsiveness of these interactions is crucial, but achieving it on common hardware becomes increasingly difficult with the ever-growing size of datasets. Moreover, representing a large dataset is itself challenging: screen space is limited and, past a certain size, the number of items exceeds the number of available pixels or renders the representation unhelpful. The focus of this paper is on multidimensional data and parallel coordinates. For the system to be scalable, we propose a multiscale representation based on hierarchical aggregation on the client side and distributed computing on a horizontally scalable infrastructure on the server side. Multiscale visualization builds on several levels of abstraction to provide interactive and incremental changes in the level of detail. Horizontal scalability refers to the ability to increase the resources of the computing infrastructure by connecting additional computers. This paper presents: (1) a graph-based formalism for describing multiscale representations of parallel coordinates and their interactions and (2) a client-server system with a focus+context representation for multiscale parallel coordinates and distributed computation on a remote data-intensive infrastructure. We leverage the proposed formalism to describe several design possibilities for common interactions in parallel coordinates, hierarchical navigation, and editing. We illustrate the scalability and usage of the representation in a real-world case. Performance experiments demonstrate that, on a 15-computer cluster, the prototype system can scale to billion-item datasets while preserving interactivity for analysis. © 2019 Elsevier Inc. All rights reserved.
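
As a rough illustration of the idea of hierarchical aggregation under a display budget, the following minimal sketch bins the values of a single axis into nested intervals and picks the finest level of detail whose number of non-empty bins fits the budget. It is not the paper's formalism or implementation: the function names `aggregate_axis` and `finest_level_within_budget`, the dyadic (halving) hierarchy, and the `budget` and `max_depth` parameters are all assumptions made for this example.

```python
from collections import Counter


def aggregate_axis(values, lo, hi, depth):
    """Count items per interval when [lo, hi] is split into 2**depth equal bins.

    Hypothetical helper: a dyadic hierarchy is assumed for illustration only.
    """
    bins = 2 ** depth
    width = (hi - lo) / bins
    counts = Counter()
    for v in values:
        # Clamp the index so that v == hi falls into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


def finest_level_within_budget(values, lo, hi, budget, max_depth=10):
    """Return the deepest level whose non-empty bin count stays within budget.

    Assumes budget >= 1. Bins are nested, so the number of non-empty bins
    never decreases with depth and the search can stop at the first overflow.
    """
    best = 0
    for depth in range(max_depth + 1):
        if len(aggregate_axis(values, lo, hi, depth)) <= budget:
            best = depth
        else:
            break
    return best


values = [0.1, 0.2, 0.6, 0.9]
level = finest_level_within_budget(values, 0.0, 1.0, budget=3)
print(level, dict(aggregate_axis(values, 0.0, 1.0, level)))
```

In this sketch the budget stands in for the number of aggregates the client can afford to draw; a real system would apply the same kind of choice per axis and per focus region rather than once for a whole axis.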

