4.7 Article

An Automated Machine Learning architecture for the accelerated prediction of Metal-Organic Frameworks performance in energy and environmental applications

Journal

MICROPOROUS AND MESOPOROUS MATERIALS
Volume 300, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.micromeso.2020.110160

Keywords

Automated machine learning; Metal-organic frameworks; Carbon dioxide; Methane; Environment; Energy

Funding

  1. European Research Council (ERC) under the European Union's Seventh Framework Programme (FP/2007-2013) [617393]

Ask authors/readers for more resources

Due to their exceptional host-guest properties, Metal-Organic Frameworks (MOFs) are promising materials for storage of various gases with environmental and technological interest. Molecular modeling and simulations are invaluable tools, extensively used over the last two decades for the study of various properties of MOFs. In particular, Monte Carlo simulation techniques have been employed for the study of the gas uptake capacity of several MOFs at a wide range of different thermodynamic conditions. Despite the accurate predictions of molecular simulations, the accurate characterization and the high-throughput screening of the enormous number of MOFs that can be potentially synthesized by combining various structural building blocks is beyond present computer capabilities. In this work, we propose and demonstrate the use of an alternative approach, namely one based on an Automated Machine Learning (AutoML) architecture that is capable of training machine learning and statistical predictive models for MOFs' chemical properties and estimate their predictive performance with confidence intervals. The architecture tries numerous combinations of different machine learning (ML) algorithms, tunes their hyper-parameters, and conservatively estimates performance of the final model. We demonstrate that it correctly estimates performance even with few samples (<100) and that it provides improved predictions over trying a single standard method, like Random Forests. The AutoML pipeline democratizes ML to non-expert material-science practitioners that may not know which algorithms to use on a given problem, how to tune them, and how to correctly estimate their predictive performance, dramatically improving productivity and avoiding common analysis pitfalls. A demonstration on the prediction of the carbon dioxide and methane uptake at various thermodynamic conditions is used as a showcase sharable at https://app.jadbio.com/share/86477fd7-d467-464d-ac41-febb047544b.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available