☆ 4.7 Article

A Survey of Unsupervised Generative Models for Exploratory Data Analysis and Representation Learning

ACM COMPUTING SURVEYS (2021)

期刊

ACM COMPUTING SURVEYS

卷 54, 期 5, 页码 -

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3450963

关键词

Blind source separation; manifold learning; neural networks; exploratory data analysis; representation learning; explainable machine learning; unsupervised deep learning

类别

Computer Science, Theory & Methods

资金

EC within the H2020 Program under project MOSAICrOWN
Italian Ministry of Research within the PRIN program under project HOPE
Universita degli Studi di Milano under project AI4FAO
JPMorgan Chase Co
EC within the H2020 Program under project MARSAL

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In recent years, the rise of big data has made it more challenging to uncover hidden structures within messy and high-dimensional datasets. Exploratory data analysis and unsupervised generative learning models play a crucial role in discovering significant features and patterns in data. Researchers can leverage these methods for data exploration and learning representations through study and practice.

For more than a century, the methods for data representation and the exploration of the intrinsic structures of data have developed remarkably and consist of supervised and unsupervised methods. However, recent years have witnessed the flourishing of big data, where typical dataset dimensions are high and the data can come in messy, incomplete, unlabeled, or corrupted forms. Consequently, discovering the hidden structure buried inside such data becomes highly challenging. From this perspective, exploratory data analysis plays a substantial role in learning the hidden structures that encompass the significant features of the data in an ordered manner by extracting patterns and testing hypotheses to identify anomalies. Unsupervised generative learning models are a class of machine learning models characterized by their potential to reduce the dimensionality, discover the exploratory factors, and learn representations without any predefined labels; moreover, such models can generate the data from the reduced factors' domain. The beginner researchers can find in this survey the recent unsupervised generative learning models for the purpose of data exploration and learning representations; specifically, this article covers three families of methods based on their usage in the era of big data: blind source separation, manifold learning, and neural networks, from shallow to deep architectures.

A Survey of Unsupervised Generative Models for Exploratory Data Analysis and Representation Learning

期刊

ACM COMPUTING SURVEYS

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A Survey of Unsupervised Generative Models for Exploratory Data Analysis and Representation Learning

期刊

ACM COMPUTING SURVEYS

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文