4.3 Article

Filter feature selectors in the development of binary QSAR models

Journal

SAR AND QSAR IN ENVIRONMENTAL RESEARCH
Volume 30, Issue 5, Pages 313-345

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/1062936X.2019.1588160

Keywords

Feature selection; QSAR; dimensionality reduction; molecular activity prediction

Funding

  1. Spanish Ministry of Science and Innovation [TIN2015-66108-P]

Ask authors/readers for more resources

The application of machine learning methods to the construction of quantitative structure-activity relationship models is a complex computational problem in which dimensionality reduction of the representation of the molecular structure plays a fundamental role in predicting a target activity. The feature selection pre-processing approach has been indicated to be effective in dimensionality reduction for building simpler and more understandable models. In this paper, a performance comparative study of 13 state-of-the-art feature selection filter methods is conducted. Structure-activity relationship models are constructed using three widely used classifiers and a diverse collection of datasets. The comparative study utilizes robust statistical tests to compare the algorithms. According to the experimental results, there are substantial differences in performance among the evaluated feature selection methods. The methods that exhibit the best performance are correlation-based feature selection, fast clustering-based feature selection and the set cover method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available