☆ 4.6 Article

Persistent-homology-based machine learning: a survey and a comparative study

ARTIFICIAL INTELLIGENCE REVIEW (2022)

期刊

ARTIFICIAL INTELLIGENCE REVIEW

卷 55, 期 7, 页码 5169-5213

出版社

SPRINGER

DOI: 10.1007/s10462-022-10146-z

关键词

Persistent homology; Machine learning; Persistent diagram; Persistent barcode; Kernel; Feature extraction

类别

Computer Science, Artificial Intelligence

资金

Nanyang Technological University [M4081840, M4081842]
Data Science and Artificial Intelligence Research@NTU [M4082115]
Singapore Ministry of Education [RG109/19, MOE2018-T2-1-033, MOE-T2EP20120-0013]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper presents a systematic review of persistent homology (PH) and PH-based machine learning models from a computational perspective. It discusses the recent development of mathematical models, tools, and applications of PH. The paper also compares different types of simplicial complexes, feature extractions, and machine learning models in protein secondary structure classification.

A suitable feature representation that can both preserve the data intrinsic information and reduce data complexity and dimensionality is key to the performance of machine learning models. Deeply rooted in algebraic topology, persistent homology (PH) provides a delicate balance between data simplification and intrinsic structure characterization, and has been applied to various areas successfully. However, the combination of PH and machine learning has been hindered greatly by three challenges, namely topological representation of data, PH-based distance measurements or metrics, and PH-based feature representation. With the development of topological data analysis, progresses have been made on all these three problems, but widely scattered in different literatures. In this paper, we provide a systematical review of PH and PH-based supervised and unsupervised models from a computational perspective. Our emphasizes are the recent development of mathematical models and tools, including PH software and PH-based functions, feature representations, kernels, and similarity models. Essentially, this paper can work as a roadmap for the practical application of PH-based machine learning tools. Further, we compare between two types of simplicial complexes (alpha and Vietrois-Rips complexes), two types of feature extractions (barcode statistics and binned features), and three types of machine learning models (support vector machines, tree-based models, and neural networks), and investigate their impacts on the protein secondary structure classification.

Persistent-homology-based machine learning: a survey and a comparative study

期刊

ARTIFICIAL INTELLIGENCE REVIEW

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Persistent-homology-based machine learning: a survey and a comparative study

期刊

ARTIFICIAL INTELLIGENCE REVIEW

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文