4.6 Article

A new hybrid coding for protein secondary structure prediction based on primary structure similarity

Journal

GENE
Volume 618, Issue -, Pages 8-13

Publisher

ELSEVIER
DOI: 10.1016/j.gene.2017.03.011

Keywords

Hybrid code; Protein secondary structure prediction; Protein primary structure; Support vector machine

Funding

  1. National Natural Science Foundation of China [11671009]
  2. Zhejiang Provincial Natural Science Foundation of China [LY14A010032]
  3. Zhejiang Province Key Science and Technology Innovation Team Project [2013TD18]
  4. Project of 521 Excellent Talent of Zhejiang Sci-Tech University

Ask authors/readers for more resources

The coding pattern of protein can greatly affect the prediction accuracy of protein secondary structure. In this paper, a novel hybrid coding method based on the physicochemical properties of amino acids and tendency factors is proposed for the prediction of protein secondary structure. The principal component analysis (PCA) is first applied to the physicochemical properties of amino acids to construct a 3-bit-code, and then the 3 tendency factors of amino acids are calculated to generate another 3-bit-code. Two 3-bit-codes are fused to form a novel hybrid 6-bit-code. Furthermore, we make a geometry-based similarity comparison of the protein primary structure between the reference set and the test set before the secondary structure prediction. We finally use the support vector machine (SVM) to predict those amino acids which are not detected by the primary structure similarity comparison. Experimental results show that our method achieves a satisfactory improvement in accuracy in the prediction of protein secondary structure. (C) 2017 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available