4.3 Article

SSALN: An alignment algorithm using structure-dependent substitution matrices and gap penalties learned from structurally aligned protein pairs

Journal

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS
Volume 62, Issue 4, Pages 881-891

Publisher

WILEY
DOI: 10.1002/prot.20854

Keywords

alignment accuracy; secondary structure; relative solvent accessibility; position-specific gap penalties; structurally aligned protein pairs

Funding

  1. NIGMS NIH HHS [GM067823] Funding Source: Medline

Ask authors/readers for more resources

In template-based modeling of protein structures, the generation of the alignment between the target and the template is a critical step that significantly affects the accuracy of the final model. This paper proposes an alignment algorithm SSALN that learns substitution matrices and position-specific gap penalties from a database of structurally aligned protein pairs. In addition to the amino acid sequence information, secondary structure and solvent accessibility information of a position are used to derive substitution scores and position-specific gap penalties. In a test set of CASP5 targets, SSALN outperforms sequence alignment methods such as a Smith-Waterman algorithm with BLOSUM50 and PSI BLAST. SSALN also generates better alignments than PSI BLAST in the CASP6 test set. LOOPP server prediction based on an SSALN alignment is ranked the best for target T0280_1 in CASP6. SSALN is also compared with several threading methods and sequence alignment methods on the ProSup benchmark. SSALN has the highest alignment accuracy among the methods compared. On the Fischer's benchmark, SSALN performs better than CLUSTALW and GenTHREADER, and generates more alignments with accuracy > 50%, > 60% or > 70% than FUGUE, but fewer alignments with accuracy > 80% than FUGUE. All the supplemental materials can be found at http://www.cs.cornell. edu/-similar to jianq/research.htm.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available