4.4 Article

Sequence-structure relationship study in all-α transmembrane proteins using an unsupervised learning approach

Journal

AMINO ACIDS
Volume 47, Issue 11, Pages 2303-2322

Publisher

SPRINGER WIEN
DOI: 10.1007/s00726-015-2010-5

Keywords

Transmembrane protein; Learning approach; Sequence-structure relationship; Protein structure; Artificial neural network; Hybrid protein model; Structural alphabet; Classification

Funding

  1. Ministry of Research (France)
  2. University Paris Diderot, Sorbonne, Paris Cite (France)
  3. National Institute for Blood Transfusion (INTS, France)
  4. National Institute for Health and Medical Research (INSERM, France)
  5. labex GR-Ex
  6. National Institute for Agricultural Research (INRA, France)
  7. National Center for Scientific Research (CNRS)

Ask authors/readers for more resources

Transmembrane proteins (TMPs) are major drug targets, but the knowledge of their precise topology structure remains highly limited compared with globular proteins. In spite of the difficulties in obtaining their structures, an important effort has been made these last years to increase their number from an experimental and computational point of view. In view of this emerging challenge, the development of computational methods to extract knowledge from these data is crucial for the better understanding of their functions and in improving the quality of structural models. Here, we revisit an efficient unsupervised learning procedure, called Hybrid Protein Model (HPM), which is applied to the analysis of transmembrane proteins belonging to the all-alpha structural class. HPM method is an original classification procedure that efficiently combines sequence and structure learning. The procedure was initially applied to the analysis of globular proteins. In the present case, HPM classifies a set of overlapping protein fragments, extracted from a non-redundant databank of TMP 3D structure. After fine-tuning of the learning parameters, the optimal classification results in 65 clusters. They represent at best similar relationships between sequence and local structure properties of TMPs. Interestingly, HPM distinguishes among the resulting clusters two helical regions with distinct hydrophobic patterns. This underlines the complexity of the topology of these proteins. The HPM classification enlightens unusual relationship between amino acids in TMP fragments, which can be useful to elaborate new amino acids substitution matrices. Finally, two challenging applications are described: the first one aims at annotating protein functions (channel or not), the second one intends to assess the quality of the structures (X-ray or models) via a new scoring function deduced from the HPM classification.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available