4.5 Article

Tertiary structure predictions on a comprehensive benchmark of medium to large size proteins

Journal

BIOPHYSICAL JOURNAL
Volume 87, Issue 4, Pages 2647-2655

Publisher

CELL PRESS
DOI: 10.1529/biophysj.104.045385

Keywords

-

Categories

Funding

  1. NIGMS NIH HHS [R01 GM037408, R01 GM048835, GM-48835, GM-37408] Funding Source: Medline

Ask authors/readers for more resources

We evaluate tertiary structure predictions on medium to large size proteins by TASSER, a new algorithm that assembles protein structures through rearranging the rigid fragments from threading templates guided by a reduced C(alpha) and side-chain based potential consistent with threading based tertiary restraints. Predictions were generated for 745 proteins 201-300 residues in length that cover the Protein Data Bank (PDB) at the level of 35% sequence identity. With homologous proteins excluded, in 365 cases, the templates identified by our threading program, PROSPECTOR_3, have a root-mean-square deviation (RMSD) to native <6.5 angstrom, with >70% alignment coverage. After TASSER assembly, in 408 cases the best of the top five full-length models has a RMSD <6.5 angstrom. Among the 745 targets are 18 membrane proteins, with one-third having a predicted RMSD <5.5 Angstrom. For all representative proteins less than or equal to 300 residues that have corresponding multiple NMR structures in the Protein Data Bank, approximate to20% of the models generated by TASSER are closer to the NMR structure centroid than the farthest individual NMR model. These results suggest that reasonable structure predictions for nonhomologous large size proteins can be automatically generated on a proteomic scale, and the application of this approach to structural as well as functional genomics represent promising applications of TASSER.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available