4.7 Article

Different approaches to encode and model 3D information in a MIA-QSAR perspective

Journal

Publisher

ELSEVIER
DOI: 10.1016/j.chemolab.2021.104286

Keywords

3D information; MIA-QSAR; 2D image projections; Molecular slice images; Molecule-in-a-box face images

Funding

  1. Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior (CAPES) [001]
  2. Conselho Nacional de Desenvolvimento Cientifico e Tecnologico (CNPq) [301371/2017-2]
  3. Fundacao de Amparoa Pesquisa do Estado de Minas Gerais (FAPEMIG)

Ask authors/readers for more resources

This study explores different approaches for encoding three-dimensional information into MIA-QSAR descriptors and successfully applies them to three datasets of antiviral compounds. The models established show satisfactory performance in both internal and external validation, with correlation statistical results similar to previous studies.
Tridimensional information is a fundamental aspect for modelling and explaining biological/physicochemical properties. In this sense, the goal of this study was to explore different approaches for encoding this type of information into MIA-QSAR (Multivariate Image Analysis applied to Quantitative Structure-Activity Relationships) descriptors and to effectively model these new features. Originally, MIA-QSAR is a technique based on the treatment of 2D images of molecules. The approaches explored in this work were: (I) the use of 2D image projections of computationally optimized molecular geometries as a source of 3D information for a powerful machine learning method (support vector machine applied to regression); (II) the use of slice images obtained from the optimized molecules placed inside a theoretical box as a source of 3D descriptors for a multi-way regression method (trilinear PLS); and (III) the use of images viewed from different faces of the above box as an alternative source of 3D MIA-QSAR descriptors. These strategies were applied in three different data sets comprising antiHCV, anti-SARS-CoV, and anti-HIV compounds. Satisfactory parameters for both internal and external validation were achieved in all three models, and the statistical results of correlation were at least similar to those earlier reported for these series of compounds. Nevertheless, the risk of chance correlation could not be excluded as demonstrated by y-randomization tests. Whereas the traditional MIA-QSAR method, that uses perfectly congruent, non-optimized geometries of pharmacophoric substructures as images, is more efficient than 3D MIAQSAR, the latter uses tridimensional digital objects as descriptors for the first time in QSAR for regression purposes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available