Article

UniqueProt: creating representative protein sequence sets

Journal

NUCLEIC ACIDS RESEARCH
Volume 31, Issue 13, Pages 3789-3791

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkg620

Funding

  1. NIGMS NIH HHS [R01 GM063029, R01-GM63029-01] Funding Source: Medline
  2. NLM NIH HHS [R01 LM007329, 1-R01-LM07329-01] Funding Source: Medline

UniqueProt is a practical, easy-to-use web service for creating representative, unbiased data sets of protein sequences. It finds the largest possible representative sets with a simple greedy algorithm that uses the HSSP-value to establish sequence similarity. UniqueProt is not a true clustering program: its 'representatives' do not sit at the centres of well-defined clusters, because the definition of such clusters is problem-specific. Overall, UniqueProt is a reasonably fast solution to bias in data sets. The service is accessible at http://cubic.bioc.columbia.edu/services/uniqueprot; a command-line version for Linux can be downloaded from the same web site.
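The greedy selection described in the abstract can be illustrated with a minimal sketch. This is not the UniqueProt implementation: the abstract does not specify the order in which sequences are considered, so the sketch simply walks the input in the order given. The function name `greedy_representatives`, the `hssp_values` lookup table of precomputed pairwise scores, and the default `threshold` of 0 are all hypothetical, and pairs missing from the table are assumed to be dissimilar.

```python
from typing import Dict, FrozenSet, Iterable, List


def greedy_representatives(
    ids: Iterable[str],
    hssp_values: Dict[FrozenSet[str], float],
    threshold: float = 0.0,
) -> List[str]:
    """Keep a sequence only if it is not similar to any sequence already kept.

    `hssp_values` maps an unordered pair of sequence ids to a precomputed
    similarity score (hypothetical input; computing the score is out of scope).
    A pair counts as similar when its score is above `threshold`.
    """
    kept: List[str] = []
    for candidate in ids:
        # Pairs absent from the table are treated as dissimilar (assumption).
        is_redundant = any(
            hssp_values.get(frozenset((candidate, other)), float("-inf")) > threshold
            for other in kept
        )
        if not is_redundant:
            kept.append(candidate)
    return kept


# Toy scores: A and B are similar to each other, C is unrelated to both.
scores = {
    frozenset(("A", "B")): 12.0,
    frozenset(("A", "C")): -8.0,
    frozenset(("B", "C")): -5.0,
}
selected = greedy_representatives(["A", "B", "C"], scores, threshold=0.0)
print(selected)  # prints ['A', 'C']
```

In the result, no two kept sequences score above the threshold against each other, which is the sense in which such a set is unbiased. The kept sequences are not cluster centres, which matches the abstract's remark that this is not clustering. A first-come greedy pass like this one depends on input order and is not guaranteed to return the largest possible set.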