Conformationally selective multidimensional chemical shift ranges in proteins from a PACSY database purged using intrinsic quality criteria

JOURNAL OF BIOMOLECULAR NMR (2016)

Journal

JOURNAL OF BIOMOLECULAR NMR

Volume 64, Issue 2, Pages 115-130

Publisher

SPRINGER

DOI: 10.1007/s10858-016-0013-5

Keywords

Protein chemical shift; Databases; Protein secondary structure; Data mining; PIQC; PACSY; PLUQin; SQAT

Categories

Biochemistry & Molecular Biology; Spectroscopy

Funding

Brandeis University
NIH [GM066976]

Abstract

We have determined refined multidimensional chemical shift ranges for intra-residue correlations (¹³C-¹³C, ¹⁵N-¹³C, etc.) in proteins, which can be used to gain type-assignment and/or secondary-structure information from experimental NMR spectra. The chemical-shift ranges are the result of a statistical analysis of the PACSY database of >3000 proteins with 3D structures (1,200,207 ¹³C chemical shifts and >3 million chemical shifts in total); these data were originally derived from the Biological Magnetic Resonance Data Bank. Using relatively simple non-parametric statistics to find peak maxima in the distributions of helix, sheet, coil and turn chemical shifts, and without the use of limited hand-picked data sets, we show that ~94 % of the ¹³C NMR data and almost all ¹⁵N data are quite accurately referenced and assigned, with smaller standard deviations (0.2 and 0.8 ppm, respectively) than recognized previously. On the other hand, approximately 6 % of the ¹³C chemical shift data in the PACSY database are shown to be clearly misreferenced, mostly by ca. -2.4 ppm. The removal of the misreferenced data and other outliers by this purging by intrinsic quality criteria (PIQC) allows for reliable identification of secondary maxima in the two-dimensional chemical-shift distributions already pre-separated by secondary structure. We demonstrate that some of these correspond to specific regions in the Ramachandran plot, including left-handed helix dihedral angles, reflect unusual hydrogen bonding, or are due to the influence of a following proline residue. With appropriate smoothing, significantly more tightly defined chemical shift ranges are obtained for each amino acid type in the different secondary structures.
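The idea of detecting misreferenced entries from the data themselves can be illustrated with a minimal sketch. This is not the authors' PIQC procedure: it assumes that each entry is compared against a table of reference peak maxima and that a systematic offset shows up as the median deviation. The table `REFERENCE_MAXIMA`, the function names and the 1.0 ppm tolerance are all hypothetical, and the ppm values are placeholders rather than results from the paper.

```python
from statistics import median

# Hypothetical reference peak maxima in ppm, keyed by (residue, atom).
# Placeholder values for illustration only; not taken from the paper.
REFERENCE_MAXIMA = {
    ("ALA", "CA"): 53.0,
    ("ALA", "CB"): 19.0,
    ("GLY", "CA"): 45.0,
    ("SER", "CB"): 64.0,
}


def estimate_offset(shifts, reference=REFERENCE_MAXIMA):
    """Return the median deviation (ppm) of observed shifts from the reference."""
    deviations = [
        observed - reference[key]
        for key, observed in shifts.items()
        if key in reference
    ]
    if not deviations:
        raise ValueError("no shifts overlap with the reference table")
    return median(deviations)


def is_misreferenced(shifts, tolerance=1.0, reference=REFERENCE_MAXIMA):
    """Flag an entry whose median deviation exceeds the assumed tolerance (ppm)."""
    return abs(estimate_offset(shifts, reference)) > tolerance


# Synthetic entry shifted uniformly by -2.4 ppm, the typical offset cited above.
shifted_entry = {key: value - 2.4 for key, value in REFERENCE_MAXIMA.items()}
print(round(estimate_offset(shifted_entry), 2), is_misreferenced(shifted_entry))
```

The median is used here because it is a simple non-parametric estimate that a few genuine outliers do not distort; a real analysis would need reference maxima separated by secondary structure, as the abstract describes.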
These chemical shift ranges, which may be defined at any statistical threshold, can be used for amino-acid type assignment and secondary-structure analysis of chemical shifts from intra-residue cross peaks by inspection or by using a provided command-line Python script (PLUQin), which should be useful in protein structure determination. The refined chemical shift distributions are utilized in a simple quality test (SQAT) that should be applied to new protein NMR data before deposition in a databank, and they could benefit many other chemical-shift based tools.
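A range-based type assignment of the kind described can be sketched as a simple lookup. The following is an assumption-laden illustration, not the PLUQin script or its interface: the ranges are treated as rectangles in the (Cα, Cβ) plane, and the `RANGES` table, its ppm limits and the function `candidates` are hypothetical placeholders.

```python
# Hypothetical rectangular (CA, CB) ranges in ppm, keyed by
# (residue type, secondary structure). Placeholder values only;
# they are not the ranges reported in the paper.
RANGES = {
    ("ALA", "helix"): ((54.0, 56.0), (17.0, 19.5)),
    ("ALA", "sheet"): ((49.5, 52.5), (19.5, 23.5)),
    ("SER", "helix"): ((60.0, 63.0), (62.0, 64.0)),
}


def candidates(ca, cb, ranges=RANGES):
    """List every (residue, structure) whose range contains the cross peak."""
    return [
        label
        for label, ((ca_lo, ca_hi), (cb_lo, cb_hi)) in ranges.items()
        if ca_lo <= ca <= ca_hi and cb_lo <= cb <= cb_hi
    ]


print(candidates(55.0, 18.0))
```

A peak that falls in several ranges returns several candidates, and a peak outside all of them returns an empty list, which is how a statistical threshold on the ranges would translate into more or less ambiguous assignments.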
