4.7 Review

Metrics for Benchmarking and Uncertainty Quantification: Quality, Applicability, and Best Practices for Machine Learning in Chemistry

Journal

TRENDS IN CHEMISTRY
Volume 3, Issue 2, Pages 146-156

Publisher

CELL PRESS
DOI: 10.1016/j.trechm.2020.12.004

Keywords

-

Funding

  1. NSF CAREER program [OAC-1751161]
  2. NSF Big Data Spokes program [IIS-1761990]

Ask authors/readers for more resources

This review highlights two important issues to consider when applying machine learning in the chemical and materials domain: statistical loss function metrics for model validation and benchmarking, and uncertainty quantification of predictions. These topics are often overlooked by chemists, but are crucial for comparing model performance and developing successful machine learning applications in chemistry.
This review aims to draw attention to two issues of concern when we set out to make machine learning work in the chemical and materials domain, that is, statistical loss function metrics for the validation and benchmarking of data-derived models, and the uncertainty quantification of predictions made by them. They are often overlooked or underappreciated topics as chemists typically only have limited training in statistics. Aside from helping to assess the quality, reliability, and applicability of a given model, these metrics are also key to comparing the performance of different models and thus for developing guidelines and best practices for the successful application of machine learning in chemistry.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available