4.4 Article

Visualizing the Protein Sequence Universe

期刊

出版社

WILEY
DOI: 10.1002/cpe.3072

关键词

MapReduce; data-enabled life sciences; sequence similarity; computational bioinfor-matics; protein annotation; protein sequence universe; PSU; COG; UniProt; UniRef; DELSA; multidimensional scaling; data visualization; BLAST; Azure; Sammon; Twister; Hadoop; Needleman-Wunsch; Hive; MPI; EM

资金

  1. NSF [DBI: 0969929, 0910818]
  2. NIH [5 RC2 HG 005806-02]
  3. NIH (NIGMS) [R01 GM-076680-04]
  4. NIH (NIDDK) [U01-DK-089571, U01-DK-072473]
  5. Division of Computing and Communication Foundations
  6. Direct For Computer & Info Scie & Enginr [0910818] Funding Source: National Science Foundation

向作者/读者索取更多资源

Modern biology is experiencing a rapid increase in data volumes that challenges our analytical skills and existing cyberinfrastructure. Exponential expansion of the protein sequence universe (PSU), the protein sequence space, together with the costs and complexities of manual curation creates a major bottleneck in life sciences research. Existing resources lack scalable visualization tools that are instrumental for functional annotation. Here, we describe a new visualization tool using multidimensional scaling to create a 3D embedding of the protein space. The advantages of the proposed PSU method include the ability to scale to large numbers of sequences, integrate different similarity measures with other functional and experimental data, and facilitate protein annotation. We applied the method to visualize the prokaryotic PSU using sequence alignment scores. As an annotation example, we used the interpolation approach to map the set of annotated archaeal proteins into the prokaryotic PSU. Transdisciplinary approaches akin to the one described in this paper are urgently needed to quickly and efficiently translate the influx of new data into tangible innovations and groundbreaking discoveries. Copyright (c) 2013 John Wiley & Sons, Ltd.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据