4.8 Article

Topological feature engineering for machine learning based halide perovskite materials design

Journal

NPJ COMPUTATIONAL MATERIALS
Volume 8, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41524-022-00883-8

Keywords

-

Funding

  1. Nanyang Technological University [M4081842.110]
  2. Singapore Ministry of Education Academic Research fund [RG109/19, MOE-T2EP50120-0004, MOE-T2EP20120-0013]
  3. National Research Foundation (NRF), Singapore under its NRF Investigatorship [NRF-NRFI2018-04]

Ask authors/readers for more resources

Accelerated materials development with machine learning and high throughput experimentation is crucial for addressing energy challenges. In the field of perovskite materials design, learning models based on persistent functions offer improved accuracy and performance comparable to deep learning models. The multiscale simplicial complex approach provides a precise representation for structures and interactions, enhancing transferability to machine learning models. Advanced geometrical and topological invariants are efficient feature engineering approaches that greatly improve the performance of learning models for molecular data analysis.
Accelerated materials development with machine learning (ML) assisted screening and high throughput experimentation for new photovoltaic materials holds the key to addressing our grand energy challenges. Data-driven ML is envisaged as a decisive enabler for new perovskite materials discovery. However, its full potential can be severely curtailed by poorly represented molecular descriptors (or fingerprints). Optimal descriptors are essential for establishing effective mathematical representations of quantitative structure-property relationships. Here we reveal that our persistent functions (PFs) based learning models offer significant accuracy advantages over traditional descriptor based models in organic-inorganic halide perovskite (OIHP) materials design and have similar performance as deep learning models. Our multiscale simplicial complex approach not only provides a more precise representation for OIHP structures and underlying interactions, but also has better transferability to ML models. Our results demonstrate that advanced geometrical and topological invariants are highly efficient feature engineering approaches that can markedly improve the performance of learning models for molecular data analysis. Further, new structure-property relationships can be established between our invariants and bandgaps. We anticipate that our molecular representations and featurization models will transcend the limitations of conventional approaches and lead to breakthroughs in perovskite materials design and discovery.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available