Article

False positive reduction in protein-protein interaction predictions using gene ontology annotations

Journal

BMC Bioinformatics
Volume 8

Publisher

BMC
DOI: 10.1186/1471-2105-8-262

Background: Many crucial cellular operations, such as metabolism, signalling, and regulation, are based on protein-protein interactions. However, robust protein-protein interaction information is lacking. One reason is poor agreement between experimental findings and computational sets, which in turn stems from the large number of false positive predictions produced by computational approaches. Reducing false positive predictions and enhancing the true positive fraction of computationally predicted protein-protein interaction datasets on the basis of highly confident experimental results has not been adequately investigated.
Results: Gene Ontology (GO) annotations were used to reduce false positive protein-protein interaction (PPI) pairs resulting from computational predictions. Using experimentally obtained PPI pairs as a training dataset, eight top-ranking keywords were extracted from GO molecular function annotations. The sensitivity of these keywords is 64.21% in the yeast experimental dataset and 80.83% in the worm experimental dataset. The specificities, a measure of recovery power, of these keywords applied to four predicted PPI datasets for each studied organism are 48.32% and 46.49% (averaged over the four datasets) in yeast and worm, respectively. Based on the eight top-ranking keywords and the co-localization of interacting proteins, a set of two knowledge rules was deduced and applied to remove false positive protein pairs. The 'strength', a measure of the improvement provided by the rules, was defined based on the signal-to-noise ratio and used to measure the applicability of the knowledge rules to the predicted PPI datasets. Depending on the PPI-predicting method employed, the strength varies between two and ten-fold that of randomly removing protein pairs from the datasets.
Conclusion: Gene Ontology annotations, together with the deduced knowledge rules, can be used to partially remove falsely predicted PPI pairs. Removing false positives from predicted datasets increases their true positive fractions, improves the robustness of the predicted pairs compared with random protein pairing, and ultimately results in better overlap with experimental results.
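
As a rough illustration of the filtering idea described in the abstract, the sketch below keeps a predicted pair only if both proteins carry a keyword in their GO molecular function annotations and the two proteins share a GO cellular component term. The abstract does not list the eight keywords or state the two rules exactly, so the keyword set, the rule logic, the function names, and the toy annotations here are all assumptions made for illustration, not the authors' implementation.

```python
from typing import Dict, Iterable, List, Set, Tuple

Pair = Tuple[str, str]

# Hypothetical keywords: the abstract mentions eight top-ranking keywords
# but does not name them, so these two are placeholders.
KEYWORDS: Set[str] = {"binding", "kinase"}


def has_keyword(mf_terms: Iterable[str], keywords: Set[str]) -> bool:
    """True if any molecular function term contains any keyword."""
    return any(k in term.lower() for term in mf_terms for k in keywords)


def passes_rules(pair: Pair,
                 mf: Dict[str, List[str]],
                 cc: Dict[str, Set[str]],
                 keywords: Set[str] = KEYWORDS) -> bool:
    """Assumed rules: keyword match for both proteins AND co-localization."""
    a, b = pair
    keyword_ok = (has_keyword(mf.get(a, []), keywords)
                  and has_keyword(mf.get(b, []), keywords))
    colocalized = bool(cc.get(a, set()) & cc.get(b, set()))
    return keyword_ok and colocalized


def filter_pairs(pairs: List[Pair],
                 mf: Dict[str, List[str]],
                 cc: Dict[str, Set[str]]) -> List[Pair]:
    """Drop predicted pairs that fail the assumed rules."""
    return [p for p in pairs if passes_rules(p, mf, cc)]


def retained_fraction(pairs: List[Pair],
                      mf: Dict[str, List[str]],
                      cc: Dict[str, Set[str]]) -> float:
    """Fraction of pairs kept by the rules (0.0 for an empty list)."""
    return len(filter_pairs(pairs, mf, cc)) / len(pairs) if pairs else 0.0


# Toy annotations, invented for this example only.
mf = {
    "P1": ["protein kinase activity"],
    "P2": ["ATP binding"],
    "P3": ["structural constituent of ribosome"],
    "P4": ["DNA binding"],
}
cc = {
    "P1": {"nucleus"},
    "P2": {"nucleus", "cytoplasm"},
    "P3": {"cytoplasm"},
    "P4": {"mitochondrion"},
}
predicted = [("P1", "P2"), ("P2", "P3"), ("P1", "P4")]

kept = filter_pairs(predicted, mf, cc)
```

Applied to a set of experimentally confirmed pairs, `retained_fraction` would correspond to a sensitivity-style measure (the share of true interactions that survive the rules); applied to a predicted set, it shows how aggressively the rules prune that set.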
