Journal
BMC RESEARCH NOTES
Volume 15, Issue 1, Pages -
Publisher
SPRINGERNATURE
DOI: 10.1186/s13104-022-06129-6
Keywords
Clustering; Genomics; Proteomics; Bayesian; Autoclass; Machine learning
Funding
- French ministry of research
This article presents AutoClassWeb, an online tool that provides an easy-to-use web interface to the AutoClass Bayesian clustering algorithm for classifying and further analyzing genes or proteins in genomics and proteomics.
Objective: Data clustering is a common exploration step in the omics era, notably in genomics and proteomics, where many genes or proteins can be quantified from one or more experiments. Bayesian clustering is a powerful unsupervised algorithm that can classify several thousand genes or proteins. AutoClass C, its original implementation, handles missing data and automatically determines the best number of clusters, but it is not user-friendly. Results: We developed an online tool called AutoClassWeb, which provides a simple, easy-to-use web interface for Bayesian clustering with AutoClass. Input data are entered as TSV files and quality controlled. Results are provided in formats that ease further analysis with spreadsheet programs or with programming languages such as Python or R. AutoClassWeb is implemented in Python and is published under the 3-Clause BSD license. The source code is available at https://github.com/pierrepo/autoclassweb along with detailed documentation.
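To make the idea of quality-controlled TSV input concrete, here is a minimal sketch of the kind of checks such a step might perform. It is not AutoClassWeb's actual code: the function name `check_tsv`, the specific rules (unique column names, a consistent column count, numeric or empty values after an identifier column) and the sample data are all assumptions made for illustration. Empty cells are accepted because the abstract states that AutoClass handles missing data.

```python
import csv
import io


def check_tsv(text):
    """Return a list of problems found in tab-separated text (empty if none).

    Hypothetical checks for illustration only; the real tool may differ.
    """
    rows = list(csv.reader(io.StringIO(text), delimiter="\t"))
    if len(rows) < 2:
        return ["expected a header line and at least one data line"]

    problems = []
    header = rows[0]
    if len(set(header)) != len(header):
        problems.append("duplicate column names in header")

    for line_no, row in enumerate(rows[1:], start=2):
        if len(row) != len(header):
            problems.append(
                f"line {line_no}: expected {len(header)} columns, found {len(row)}"
            )
            continue
        # Assumption: the first column is a gene/protein identifier and all
        # other columns hold numbers. An empty cell is a missing value.
        for name, value in zip(header[1:], row[1:]):
            if value.strip() == "":
                continue
            try:
                float(value)
            except ValueError:
                problems.append(
                    f"line {line_no}, column {name!r}: {value!r} is not a number"
                )
    return problems


# Made-up sample: one missing value (allowed) and one non-numeric value.
example = "gene\tt0\tt1\ngene1\t0.12\t-1.5\ngene2\t\t0.7\ngene3\tabc\t0.3\n"
issues = check_tsv(example)
for issue in issues:
    print(issue)
```

Returning a list of messages instead of raising on the first error lets a web form report every problem in an uploaded file at once.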