☆ 4.7 Article

Comparison of Random Forest and Pipeline Pilot Naive Bayes in Prospective QSAR Predictions

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2012)

Journal

JOURNAL OF CHEMICAL INFORMATION AND MODELING

Volume 52, Issue 3, Pages 792-803

Publisher

AMER CHEMICAL SOC

DOI: 10.1021/ci200615h

Keywords

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Random forest is currently considered one of the best QSAR methods available in terms of accuracy of prediction. However, it is computationally intensive. Naive Bayes is a simple, robust classification method. The Laplacian-modified Naive Bayes implementation is the preferred QSAR method in the widely used commercial chemoinformatics platform Pipeline Pilot. We made a comparison of the ability of Pipeline Pilot Naive Bayes (PLPNB) and random forest to make accurate predictions on 18 large, diverse in-house QSAR data sets. These include on-target and ADME-related activities. These data sets were set up as classification problems with either binary or multicategory activities. We used a time-split method of dividing training and test sets, as we feel this is a realistic way of simulating prospective prediction. PLPNB is computationally efficient. However, random forest predictions are at least as good and in many cases significantly better than those of PLPNB on our data sets. PLPNB performs better with ECFP4 and ECFP6 descriptors, which are native to Pipeline Pilot, and more poorly with other descriptors we tried.

Comparison of Random Forest and Pipeline Pilot Naive Bayes in Prospective QSAR Predictions

Journal

JOURNAL OF CHEMICAL INFORMATION AND MODELING

Publisher

AMER CHEMICAL SOC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Comparison of Random Forest and Pipeline Pilot Naive Bayes in Prospective QSAR Predictions

Journal

JOURNAL OF CHEMICAL INFORMATION AND MODELING

Publisher

AMER CHEMICAL SOC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper