4.7 Article

FAPD: An Astringency Threshold and Astringency Type Prediction Database for Flavonoid Compounds Based on Machine Learning

期刊

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jafc.2c08822

关键词

astringency threshold; astringency type; flavonoid compounds; machine learning (ML); flavonoid astringency prediction database (FAPD)

向作者/读者索取更多资源

A Flavonoid Astringency Prediction Database (FAPD) was developed using machine learning to predict the astringency threshold and type of flavonoid compounds based on molecular fingerprint similarities and clustering analysis. Regression models and the best models were used to successfully predict the thresholds and types, and they were verified using t-SNE. This research provides a new approach to study the molecular structure-flavor property relationship of food components.
Astringency is a puckering or velvety sensation mainly derived from flavonoid compounds in food. The traditional experimental approach for astringent compound discovery was labor-intensive and cost-consuming, while machine learning (ML) can greatly accelerate this procedure. Herein, we propose the Flavonoid Astringency Prediction Database (FAPD) based on ML. First, the Molecular Fingerprint Similarities (MFSs) and thresholds of flavonoid compounds were hierarchically clustering analyzed. For the astringency threshold prediction, four regressions models (i.e., Gaussian Process Regression (GPR), Support Vector Regression (SVR), Random Forest (RF), and Gradient Boosted Decision Tree (GBDT)) were established, and the best model was RF which was interpreted by the SHapley Additive exPlanations (SHAP) approach. For the astringency type prediction, six and Stochastic Gradient Descent (SGD)) were established, and the best model was SGD. Furthermore, over 1200 natural flavonoid compounds were discovered and built into the customized FAPD. In FAPD, the astringency thresholds were achieved by RF; the astringency types were distinguished by SGD, and the real and predicted astringency types were verified by t-Distributed Stochastic Neighbor Embedding (t-SNE). Therefore, ML models can be used to predict the astringency threshold and astringency type of flavonoid compounds, which provides a new paradigm to research the molecular structure-flavor property relationship of food components.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据