4.7 Article

Improved Assessment of Globularity of Protein Structures and the Ellipsoid Profile of the Biological Assemblies from the PDB

期刊

BIOMOLECULES
卷 13, 期 2, 页码 -

出版社

MDPI
DOI: 10.3390/biom13020385

关键词

bioinformatics; biological assembly; bounding ellipsoid; globularity; kernel density; nearest neighbor search; protein complex; protein domain; principal component analysis

向作者/读者索取更多资源

This paper presents an update to the ellipsoid profile algorithm (EP) which measures the globularity of protein structures without calculating molecular surfaces. The algorithm compares the volume of the voxelized representation of the atoms with the volume of voxels that can fit inside an enclosing ellipsoid. The updated EP algorithm includes an improved outlier detection subroutine based on principal component analysis, which enhances its performance in distinguishing between dense and disordered regions of molecules.
In this paper, we present an update to the ellipsoid profile algorithm (EP), a simple technique for the measurement of the globularity of protein structures without the calculation of molecular surfaces. The globularity property is understood in this context as the ability of the molecule to fill a minimum volume enclosing ellipsoid (MVEE) that approximates its assumed globular shape. The more of the interior of this ellipsoid is occupied by the atoms of the protein, the better are its globularity metrics. These metrics are derived from the comparison of the volume of the voxelized representation of the atoms and the volume of all voxels that can fit inside that ellipsoid (a uniform unit angstrom cube lattice). The so-called ellipsoid profile shows how the globularity changes with the distance from the center. Two of its values, the so-called ellipsoid indexes, are used to classify the structure as globular, semi-globular or non-globular. Here, we enhance the workflow of the EP algorithm via an improved outlier detection subroutine based on principal component analysis. It is capable of robust distinguishing between the dense parts of the molecules and, for example, disordered chain fragments fully exposed to the solvent. The PCA-based method replaces the current approach based on kernel density estimation. The improved EP algorithm was tested on 2124 representatives of domain superfamilies from SCOP 2.08. The second part of this work is dedicated to the survey of globularity of 3594 representatives of biological assemblies from molecules currently deposited in the PDB and analyzed by the 3DComplex database (monomers and complexes up to 60 chains).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据