Journal
SCIENCE
Volume 360, Issue 6385, Pages 186-190Publisher
AMER ASSOC ADVANCEMENT SCIENCE
DOI: 10.1126/science.aar5169
Keywords
-
Categories
Funding
- Princeton University
- Camille Dreyfus Teacher-Scholar Award
- Amgen Young Investigator Award
Ask authors/readers for more resources
Machine learning methods are becoming integral to scientific inquiry in numerous disciplines. We demonstrated that machine learning can be used to predict the performance of a synthetic reaction in multidimensional chemical space using data obtained via high-throughput experimentation. We created scripts to compute and extract atomic, molecular, and vibrational descriptors for the components of a palladium-catalyzed Buchwald-Hartwig cross-coupling of aryl halides with 4-methylaniline in the presence of various potentially inhibitory additives. Using these descriptors as inputs and reaction yield as output, we showed that a random forest algorithm provides significantly improved predictive performance over linear regression analysis. The random forest model was also successfully applied to sparse training sets and out-of-sample prediction, suggesting its value in facilitating adoption of synthetic methodology.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available