4.7 Article

A deep autoencoder framework for discovery of metastable ensembles in biomacromolecules

Journal

JOURNAL OF CHEMICAL PHYSICS
Volume 155, Issue 11, Pages -

Publisher

AIP Publishing
DOI: 10.1063/5.0059965

Keywords

-

Funding

  1. TIFR Centre for Interdisciplinary Sciences, India
  2. Department of Atomic Energy, Government of India [RTI 4007]
  3. Ramanujan Fellowship
  4. Department of Science and Technology (DST) of India [CRG/2019/001219]
  5. TIFR

Ask authors/readers for more resources

This study introduces a method to map the conformational landscape of biomacromolecules into a latent space by using auto-encoders, identifying key metastable states and revealing conformational heterogeneities. The derived latent space variables were found to be optimal compared to traditional methods, and were able to predict folding behavior in different systems.
Biomacromolecules manifest dynamic conformational fluctuation and involve mutual interconversion among metastable states. A robust mapping of their conformational landscape often requires the low-dimensional projection of the conformational ensemble along optimized collective variables (CVs). However, the traditional choice for the CV is often limited by user-intuition and prior knowledge about the system, and this lacks a rigorous assessment of their optimality over other candidate CVs. To address this issue, we propose an approach in which we first choose the possible combinations of inter-residue C alpha-distances within a given macromolecule as a set of input CVs. Subsequently, we derive a non-linear combination of latent space embedded CVs via auto-encoding the unbiased molecular dynamics simulation trajectories within the framework of the feed-forward neural network. We demonstrate the ability of the derived latent space variables in elucidating the conformational landscape in four hierarchically complex systems. The latent space CVs identify key metastable states of a bead-in-a-spring polymer. The combination of the adopted dimensional reduction technique with a Markov state model, built on the derived latent space, reveals multiple spatially and kinetically well-resolved metastable conformations for GB1 beta-hairpin. A quantitative comparison based on the variational approach-based scoring of the auto-encoder-derived latent space CVs with the ones obtained via independent component analysis (principal component analysis or time-structured independent component analysis) confirms the optimality of the former. As a practical application, the auto-encoder-derived CVs were found to predict the reinforced folding of a Trp-cage mini-protein in aqueous osmolyte solution. Finally, the protocol was able to decipher the conformational heterogeneities involved in a complex metalloenzyme, namely, cytochrome P450. Published under an exclusive license by AIP Publishing. https://doi.org/10.1063/5.0059965

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available