Article

Protein bioinformatics and mixtures of bivariate von Mises distributions for angular data

Journal

BIOMETRICS
Volume 63, Issue 2, Pages 505-512

Publisher

BLACKWELL PUBLISHING
DOI: 10.1111/j.1541-0420.2006.00682.x

Keywords

bivariate angular data; bivariate circular mixture; directional statistics; distribution on torus; myoglobin; protein conformational angles; Ramachandran plots

A fundamental problem in bioinformatics is to characterize the secondary structure of a protein, which has traditionally been carried out by examining a scatterplot (Ramachandran plot) of the conformational angles. We examine two natural bivariate von Mises distributions, referred to as the Sine and Cosine models, which have five parameters and, for concentrated data, tend to a bivariate normal distribution. These are analyzed and their main properties derived. Conditions on the parameters are established which result in bimodal behavior for the joint density and the marginal distribution, and we note an interesting situation in which the joint density is bimodal but the marginal distributions are unimodal. We carry out comparisons of the two models, and it is seen that the Cosine model may be preferred. Mixture distributions of the Cosine model are fitted to two representative protein datasets using the expectation maximization algorithm, which results in an objective partition of the scatterplot into a number of components. Our results are consistent with empirical observations; new insights are discussed.
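
The abstract does not state the two densities, so the following is only a minimal sketch of the forms in which the Sine and Cosine models are commonly written in the directional-statistics literature, with normalizing constants omitted. The function names and the parameter names (mu, nu for the mean angles; k1, k2 for the concentrations; lam and k3 for the dependence terms) are assumptions chosen for illustration, not notation taken from this page.

```python
import math

def sine_log_kernel(phi, psi, mu, nu, k1, k2, lam):
    """Unnormalized log-density of the Sine model (assumed common form).

    Five parameters: mean angles mu, nu; concentrations k1, k2;
    dependence parameter lam. Angles are in radians.
    """
    return (k1 * math.cos(phi - mu)
            + k2 * math.cos(psi - nu)
            + lam * math.sin(phi - mu) * math.sin(psi - nu))

def cosine_log_kernel(phi, psi, mu, nu, k1, k2, k3):
    """Unnormalized log-density of the Cosine model (assumed common form).

    Five parameters: mean angles mu, nu; concentrations k1, k2;
    dependence parameter k3. Angles are in radians.
    """
    return (k1 * math.cos(phi - mu)
            + k2 * math.cos(psi - nu)
            - k3 * math.cos((phi - mu) - (psi - nu)))

# Hypothetical parameter values, used only to exercise the functions.
at_mean_sine = sine_log_kernel(0.3, -1.0, 0.3, -1.0, 2.0, 3.0, 0.5)
at_mean_cosine = cosine_log_kernel(0.3, -1.0, 0.3, -1.0, 2.0, 3.0, 0.5)
```

Both kernels are periodic in each angle, which is what makes them densities on the torus rather than on the plane. A mixture fit such as the one described above would additionally need the normalizing constants, which involve Bessel-function series and are not sketched here.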
