4.6 Article

Getting over High-Dimensionality: How Multidimensional Projection Methods Can Assist Data Science

Journal

APPLIED SCIENCES-BASEL
Volume 12, Issue 13, Pages -

Publisher

MDPI
DOI: 10.3390/app12136799

Keywords

high-dimensional data; dimensionality reduction; multidimensional scaling; artificial intelligence; information visualization

Funding

  1. Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior-Brasil (CAPES) [001]

Ask authors/readers for more resources

Exploring and analyzing multidimensional data is complex and requires sophisticated tools. Multidimensional projection techniques are powerful tools for transforming multidimensional data into visual information. Integrating these methods into data sciences frameworks can enhance visual analytics.
The exploration and analysis of multidimensional data can be pretty complex tasks, requiring sophisticated tools able to transform large amounts of data bearing multiple parameters into helpful information. Multidimensional projection techniques figure as powerful tools for transforming multidimensional data into visual information according to similarity features. Integrating this class of methods into a framework devoted to data sciences can contribute to generating more expressive means of visual analytics. Although the Principal Component Analysis (PCA) is a well-known method in this context, it is not the only one, and, sometimes, its abilities and limitations are not adequately discussed or taken into consideration by users. Therefore, knowing in-depth multidimensional projection techniques, their strengths, and the possible distortions they can create is of significant importance for researchers developing knowledge-discovery systems. This research presents a comprehensive overview of current state-of-the-art multidimensional projection techniques and shows example codes in Python and R languages, all available on the internet. The survey segment discusses the different types of techniques applied to multidimensional projection tasks from their background, application processes, capabilities, and limitations, opening the internal processes of the methods and demystifying their concepts. We also illustrate two problems, from a genetic experiment (supervised) and text mining (non-supervised), presenting solutions through multidimensional projection application. Finally, we brought elements that reverberate the competitiveness of multidimensional projection techniques towards high-dimension data visualization, commonly needed in data sciences solutions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available