4.6 Article

Audacity of huge: overcoming challenges of data scarcity and data quality for machine learning in computational materials discovery

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Chemistry, Physical

Deciphering Cryptic Behavior in Bimetallic Transition-Metal Complexes with Machine Learning

Michael G. Taylor et al.

Summary: The study demonstrates a data-driven approach to uncovering structure-property relationships in heterobimetallic complexes, developing multiple linear regression and kernel ridge regression models to predict oxidation potentials and metal-metal bonding. Key features analysis reveals fundamental atomic contributions influencing complex behavior, providing guidance for rational bimetallic design and suggesting transferability of properties between periods.

JOURNAL OF PHYSICAL CHEMISTRY LETTERS (2021)

Article Chemistry, Multidisciplinary

Using Machine Learning and Data Mining to Leverage Community Knowledge for the Engineering of Stable Metal-Organic Frameworks

Aditya Nandy et al.

Summary: Researchers have successfully extracted and analyzed key aspects of MOF stability from a large number of published reports using natural language processing and image analysis, enabling faster predictions of stability using machine learning models and structure-property relationships encoding. This allows for more efficient identification of strategies to engineer increased stability into typically unstable MOFs targeted for catalytic applications, with the expectation of accelerating the discovery of stable and practical MOF materials for a wide range of applications.

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY (2021)

Review Multidisciplinary Sciences

Opportunities and challenges of text mining in materials research

Olga Kononova et al.

Summary: Research publications serve as the primary repository of scientific knowledge, but their unstructured format poses obstacles to large-scale analysis. Recent advances in natural language processing have provided tools for information extraction, but challenges arise when applied to scientific text with technical terminology. Text mining methodology in materials science is still in its early stages, with efforts focused on understanding the application of TM and NLP in this field.

ISCIENCE (2021)

Article Chemistry, Physical

A Ranked-Orbital Approach to Select Active Spaces for High-Throughput Multireference Computation

Daniel S. King et al.

Summary: High-throughput computation has been widely applied in the field of chemistry in the past decade, but the active space selection in multireference wave function methods has been a technical challenge. A ranked-orbital approach is proposed to standardize multireference methods for high-throughput computation, allowing for meaningful comparison of different active space selection schemes. A new orbital ranking scheme, the approximate pair coefficient (APC) method, is introduced and shown to perform well in test systems.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2021)

Article Chemistry, Multidisciplinary

Discovering Relationships between OSDAs and Zeolites through Data Mining and Generative Neural Networks

Zach Jensen et al.

Summary: Organic structure directing agents (OSDAs) play a crucial role in synthesizing micro- and mesoporous materials, especially zeolites, but their interaction mechanisms with zeolite frameworks are not well understood. A data-driven approach was used to establish OSDA-zeolite relationships through a comprehensive database of 5,663 synthesis routes. Structural features of OSDAs were analyzed using WHIM descriptors to relate them to different types of zeolites, and a generative neural network was adapted to suggest new OSDA candidates for specific zeolite structures and gel chemistry.

ACS CENTRAL SCIENCE (2021)

Review Chemistry, Multidisciplinary

Computational Discovery of Transition-metal Complexes: From High-throughput Screening to Machine Learning

Aditya Nandy et al.

Summary: This review focuses on the techniques that enable high-throughput search of transition-metal chemical space, covering the development and limitations of traditional computational chemistry along with leveraging experimental data sources. It discusses how advances in statistical modeling, artificial intelligence, and automation accelerate the discovery of lead compounds and design rules in transition-metal chemistry.

CHEMICAL REVIEWS (2021)

Article Chemistry, Physical

Inverse Design of Materials That Exhibit the Magnetocaloric Effect by Text-Mining of the Scientific Literature and Generative Deep Learning

Callum J. Court et al.

Summary: This study presents a magnetic-materials discovery pipeline using natural language processing, machine learning, and generative models to predict ferromagnetic compounds in the Heusler alloy family. A database of 2910 magnetocaloric compounds was created from scientific literature, and predictive models were applied to identify six potential materials for solid-state refrigeration applications.

CHEMISTRY OF MATERIALS (2021)

Article Chemistry, Medicinal

ChemDataExtractor 2.0: Autopopulated Ontologies for Materials Science

Juraj Mavracic et al.

Summary: The article introduces a framework for automated populating ontologies, enabling direct extraction of a larger group of properties linked by a semantic network. Exploiting data-rich sources, a new model concept is presented for data extraction of chemical and physical properties. With automatically generated parsers for data extraction and forward-looking interdependency resolution, the power of the approach is illustrated through automatic extraction of a crystallographic hierarchy.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2021)

Article Chemistry, Medicinal

Materials Precursor Score: Modeling Chemists' Intuition for the Synthetic Accessibility of Porous Organic Cage Precursors

Steven Bennett et al.

Summary: The computational discovery of new materials is hindered by the difficulty in transitioning from prediction to synthesis. By considering the ease of synthesis of material precursors, a model was developed to bias towards precursors that are easier to synthesize, accelerating experimental realization and material development.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2021)

Article Chemistry, Physical

Machine-Learned Energy Functionals for Multiconfigurational Wave Functions

Daniel S. King et al.

Summary: MC-DDFMs are methods aiming to correct the energy of multiconfigurational wave functions using a machine-learned functional of some wave function feature, such as density or on-top density. These methods show promise for the development of MC-PDFT functionals and are able to achieve near-benchmark performance on systems not used for training, indicating a degree of active-space independence.

JOURNAL OF PHYSICAL CHEMISTRY LETTERS (2021)

Article Chemistry, Multidisciplinary

Using collective knowledge to assign oxidation states of metal cations in metal-organic frameworks

Kevin Maik Jablonka et al.

Summary: Understanding the oxidation state of metal centres in compounds and materials is crucial for understanding their chemical bonding and properties. By using a machine-learning model trained on chemist-assigned chemical names in the Cambridge Structural Database, oxidation states can be automatically assigned to metal ions in metal-organic frameworks, with high accuracy and reliability. This approach demonstrates the ability to detect incorrect assignments in the database, showcasing how collective knowledge can be harnessed through machine learning.

NATURE CHEMISTRY (2021)

Article Physics, Multidisciplinary

Learning the Exchange-Correlation Functional from Nature with Fully Differentiable Density Functional Theory

M. F. Kasim et al.

Summary: Training neural networks to replace the exchange-correlation functional within a fully differentiable three-dimensional Kohn-Sham density functional theory framework can greatly improve simulation accuracy with only a few experimental data points.

PHYSICAL REVIEW LETTERS (2021)

Article Multidisciplinary Sciences

Discovery of complex oxides via automated experiments and data science

Lusann Yang et al.

Summary: This study utilizes high-throughput optical property measurements to identify novel materials in three-cation metal oxide composition spaces with emergent properties. By developing data models and conducting experiments, a Co-Ta-Sn substitutional alloy oxide with tunable transparency, catalytic activity, and stability in strong acid electrolytes was discovered. Close coupling of data validation to experiment design was crucial in accelerating scientific discovery.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

Article Materials Science, Multidisciplinary

Machine learning the quantum-chemical properties of metal-organic frameworks for accelerated materials discovery

Andrew S. Rosen et al.

Summary: This study introduces a Quantum MOF (QMOF) database, which contains computed quantum-chemical properties for over 14,000 experimentally synthesized MOFs. Machine learning models trained on this database can efficiently discover MOFs with targeted properties, with the successful prediction of MOFs with low band gaps being highlighted.

MATTER (2021)

Review Chemistry, Multidisciplinary

Replacing hybrid density functional theory: motivation and recent advances

Benjamin G. Janesko

Summary: Density functional theory (DFT) is a widely-used approximation method for electronic structure in chemistry, physics, and materials science, but its accuracy is constrained by tradeoffs between over-delocalization and under-binding. This review discusses these tradeoffs and introduces six modern approaches to improve accuracy while keeping computational cost relatively low. Practical suggestions are provided for users to address and mitigate the impact of these tradeoffs on their simulations.

CHEMICAL SOCIETY REVIEWS (2021)

Article Chemistry, Multidisciplinary

A data-driven perspective on the colours of metal-organic frameworks

Kevin Maik Jablonka et al.

Summary: This study explores building a model to predict the color of compounds based on subjective and categorical human color assignments, while also identifying inadequacies in current reporting mechanisms and proposing an alternative approach.

CHEMICAL SCIENCE (2021)

Review Chemistry, Multidisciplinary

A Design-to-Device Pipeline for Data-Driven Materials Discovery

Jacqueline M. Cole

ACCOUNTS OF CHEMICAL RESEARCH (2020)

Article Chemistry, Medicinal

ImageDataExtractor: A Tool To Extract and Quantify Data from Microscopy Images

Karim T. Mukaddem et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2020)

Article Chemistry, Medicinal

Inorganic Materials Synthesis Planning with Literature-Trained Neural Networks

Edward Kim et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2020)

Article Chemistry, Physical

Efficient Treatment of Large Active Spaces through Multi-GPU Parallel Implementation of Direct Configuration Interaction

B. Scott Fales et al.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2020)

Article Chemistry, Physical

Automation of Active Space Selection for Multireference Methods via Machine Learning on Chemical Bond Dissociation

WooSeok Jeong et al.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2020)

Article Chemistry, Physical

Simple Automatized Tool for Exchange-Correlation Functional Fitting

Artem Mitrofanov et al.

JOURNAL OF PHYSICAL CHEMISTRY A (2020)

Article Multidisciplinary Sciences

Does repeat synthesis in materials chemistry obey a power law?

Mayank Agrawal et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2020)

Article Chemistry, Physical

Artificial Neural Networks Applied as Molecular Wave Function Solvers

Peng-Jian Yang et al.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2020)

Article Chemistry, Physical

Seeing Is Believing: Experimental Spin States from Machine Learning Model Structure Predictions

Michael G. Taylor et al.

JOURNAL OF PHYSICAL CHEMISTRY A (2020)

Article Chemistry, Physical

Machine-Learning Coupled Cluster Properties through a Density Tensor Representation

Benjamin G. Peyton et al.

JOURNAL OF PHYSICAL CHEMISTRY A (2020)

Article Chemistry, Physical

Completing density functional theory by machine learning hidden messages from molecules

Ryo Nagai et al.

NPJ COMPUTATIONAL MATERIALS (2020)

Review Chemistry, Multidisciplinary

Big-Data Science in Porous Materials: Materials Genomics and Machine Learning

Kevin Maik Jablonka et al.

CHEMICAL REVIEWS (2020)

Article Chemistry, Physical

A critical examination of compound stability predictions from machine-learned formation energies

Christopher J. Bartel et al.

NPJ COMPUTATIONAL MATERIALS (2020)

Article Chemistry, Physical

Data-Driven Approaches Can Overcome the Cost-Accuracy Trade-Off in Multireference Diagnostics

Chenru Duan et al.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2020)

Article Chemistry, Physical

Semi-supervised Machine Learning Enables the Robust Detection of Multireference Character at Low Cost

Chenru Duan et al.

JOURNAL OF PHYSICAL CHEMISTRY LETTERS (2020)

Article Multidisciplinary Sciences

Understanding the diversity of the metal-organic framework ecosystem

Seyed Mohamad Moosavi et al.

NATURE COMMUNICATIONS (2020)

Article Chemistry, Physical

Automatic Selection of Active Orbitals from Generalized Valence Bond Orbitals

Jingxiang Zou et al.

JOURNAL OF PHYSICAL CHEMISTRY A (2020)

Article Chemistry, Physical

Rapid Detection of Strong Correlation with Machine Learning for Transition-Metal Complex High-Throughput Screening

Fang Liu et al.

JOURNAL OF PHYSICAL CHEMISTRY LETTERS (2020)

Article Chemistry, Multidisciplinary

Deep-neural-network solution of the electronic Schrodinger equation

Jan Hermann et al.

NATURE CHEMISTRY (2020)

Article Multidisciplinary Sciences

Quantum chemical accuracy from density functional approximations via machine learning

Mihail Bogojeski et al.

NATURE COMMUNICATIONS (2020)

Article Chemistry, Physical

Materials Informatics with PoreBlazer v4.0 and the CSD MOF Database

Lev Sarkisov et al.

CHEMISTRY OF MATERIALS (2020)

Article Chemistry, Medicinal

tmQM Dataset-Quantum Geometries and Properties of 86k Transition Metal Complexes

David Balcells et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2020)

Article Computer Science, Artificial Intelligence

Prediction of water stability of metal-organic frameworks using machine learning

Rohit Batra et al.

NATURE MACHINE INTELLIGENCE (2020)

Review Chemistry, Multidisciplinary

Computational Ligand Descriptors for Catalyst Design

Derek J. Durand et al.

CHEMICAL REVIEWS (2019)

Article Multidisciplinary Sciences

Capturing chemical intuition in synthesis of metal-organic frameworks

Seyed Mohamad Moosavi et al.

NATURE COMMUNICATIONS (2019)

Article Nanoscience & Nanotechnology

Torsion Angle Effect on the Activation of UiO Metal-Organic Frameworks

Ghada Ayoub et al.

ACS APPLIED MATERIALS & INTERFACES (2019)

Article Chemistry, Multidisciplinary

A Machine Learning Approach to Zeolite Synthesis Enabled by Automatic Literature Data Extraction

Zach Jensen et al.

ACS CENTRAL SCIENCE (2019)

Article Chemistry, Medicinal

Named Entity Recognition and Normalization Applied to Large-Scale Information Extraction from the Materials Science Literature

L. Weston et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2019)

Article Chemistry, Physical

Data-Driven Acceleration of the Coupled-Cluster Singles and Doubles Iterative Solver

Jacob Townsend et al.

JOURNAL OF PHYSICAL CHEMISTRY LETTERS (2019)

Article Multidisciplinary Sciences

Unsupervised word embeddings capture latent knowledge from materials science literature

Vahe Tshitoyan et al.

NATURE (2019)

Article Chemistry, Multidisciplinary

Identification Schemes for Metal-Organic Frameworks To Enable Rapid Search and Cheminformatics Analysis

Benjamin J. Bucior et al.

CRYSTAL GROWTH & DESIGN (2019)

Article Thermodynamics

Advances, Updates, and Analytics for the Computation-Ready, Experimental Metal-Organic Framework Database: CoRE MOF 2019

Yongchul G. Chung et al.

JOURNAL OF CHEMICAL AND ENGINEERING DATA (2019)

Article Multidisciplinary Sciences

Unifying machine learning and quantum chemistry with a deep neural network for molecular wavefunctions

K. T. Schuett et al.

NATURE COMMUNICATIONS (2019)

Review Chemistry, Multidisciplinary

Computational Approach to Molecular Catalysis by 3d Transition Metals: Challenges and Opportunities

Konstantinos D. Vogiatzis et al.

CHEMICAL REVIEWS (2019)

Article Chemistry, Multidisciplinary

Exploring density functional subspaces with genetic algorithms

Michael Gastegger et al.

MONATSHEFTE FUR CHEMIE (2019)

Article Multidisciplinary Sciences

Prediction of higher-selectivity catalysts by computer-driven worlflow and machine learning

Andrew F. Zahrt et al.

SCIENCE (2019)

Article Multidisciplinary Sciences

Data-driven design of metal-organic frameworks for wet flue gas CO2 capture

Peter G. Boyd et al.

NATURE (2019)

Article Nanoscience & Nanotechnology

Autonomous Molecular Design: Then and Now

Tanja Dimitrov et al.

ACS APPLIED MATERIALS & INTERFACES (2019)

Article Chemistry, Medicinal

Rational Density Functional Selection Using Game Theory

Suzanne McAnanama-Brereton et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2018)

Article Chemistry, Medicinal

Text Mining Metal-Organic Framework Papers

Sanghoon Park et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2018)

Review Chemistry, Multidisciplinary

Predictive and mechanistic multivariate linear regression models for reaction development

Celine B. Santiago et al.

CHEMICAL SCIENCE (2018)

Article Chemistry, Physical

Machine Learning Configuration Interaction

J. P. Coe

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2018)

Article Multidisciplinary Sciences

Bypassing the Kohn-Sham equations with machine learning

Felix Brockherde et al.

NATURE COMMUNICATIONS (2017)

Article Multidisciplinary Sciences

ANI-1, A data set of 20 million calculated off-equilibrium conformations for organic molecules

Justin S. Smith et al.

SCIENTIFIC DATA (2017)

Article Chemistry, Physical

Virtual screening of inorganic materials synthesis parameters with deep learning

Edward Kim et al.

NPJ COMPUTATIONAL MATERIALS (2017)

Article Chemistry, Medicinal

ChemDataExtractor: A Toolkit for Automated Extraction of Chemical Information from the Scientific Literature

Matthew C. Swain et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2016)

Article Chemistry, Physical

Automated Selection of Active Orbital Spaces

Christopher J. Stein et al.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2016)

Article Multidisciplinary Sciences

Machine-learning-assisted materials discovery using failed experiments

Paul Raccuglia et al.

NATURE (2016)

Article Chemistry, Multidisciplinary

The Cambridge Structural Database

Colin R. Groom et al.

ACTA CRYSTALLOGRAPHICA SECTION B-STRUCTURAL SCIENCE CRYSTAL ENGINEERING AND MATERIALS (2016)

Article Computer Science, Artificial Intelligence

Random forests

L Breiman

MACHINE LEARNING (2001)