4.5 Article

Combining In Vivo Data with In Silico Predictions for Modeling Hepatic Steatosis by Using Stratified Bagging and Conformal Prediction

Journal

CHEMICAL RESEARCH IN TOXICOLOGY
Volume 34, Issue 2, Pages 656-668

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.chemrestox.0c00511

Keywords

-

Funding

  1. European Union [681002]
  2. Austrian Science Fund (FWF) [P 29712, W1232 - MolTag]

Ask authors/readers for more resources

In this study, reliable in silico models were developed for predicting hepatic steatosis by combining various descriptor combinations, leading to improved model performance. Both meta-classifiers and the conformal prediction framework were found to be useful for handling imbalanced toxicity data sets.
Hepatic steatosis (fatty liver) is a severe liver disease induced by the excessive accumulation of fatty acids in hepatocytes. In this study, we developed reliable in silico models for predicting hepatic steatosis on the basis of an in vivo data set of 1041 compounds measured in rodent studies with repeated oral exposure. The imbalanced nature of the data set (1:8, with the steatotic compounds belonging to the minority class) required the use of meta-classifiers-bagging with stratified under-sampling and Mondrian conformal prediction-on top of the base classifier Test Set random forest. One major goal was the investigation of the influence of different descriptor combinations on model perform- ance (tested by predicting an external validation set): physicochemical descriptors (RDKit), ToxPrint features, as well as predictions from in silico nuclear receptor and transporter models. All models based upon descriptor combinations including physicochemical features led to reasonable balanced accuracies (BAs between 0.65 and 0.69 for the respective models). Combining physicochemical features with transporter predictions and further with ToxPrint features gave the best performing model (BAs up to 0.7 and efficiencies of 0.82). 'Whereas both meta-classifiers proved useful for this highly imbalanced toxicity data set, the conformal prediction framework also guarantees the error level and thus might be favored for future studies in the field of predictive toxicology.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available