☆ 4.5 Article

Emergent unsupervised clustering paradigms with potential application to bioinformatics

FRONTIERS IN BIOSCIENCE-LANDMARK (2008)

Journal

FRONTIERS IN BIOSCIENCE-LANDMARK

Volume 13, Pages 677-690

Publisher

FRONTIERS IN BIOSCIENCE INC

DOI: 10.2741/2711

Keywords

clustering; feature selection; model order selection; semisupervised learning; confounding effects; data fusion; information bottleneck; stability criteria; hierarchical clustering; review

Categories

Biochemistry & Molecular Biology; Cell Biology

Abstract

In recent years, there has been a great upsurge in the application of data clustering, statistical classification, and related machine learning techniques to the field of molecular biology, in particular analysis of DNA microarray expression data. Clustering methods can be used to group co-expressed genes, shedding light on gene function and co-regulation. Alternatively, they can group samples or conditions to identify phenotypical groups, disease subgroups, or to help identify disease pathways. A rich variety of unsupervised techniques have been applied, including partitional, hierarchical, graph-based, model-based, and biclustering methods. While a number of machine learning problems and tools have found mainstream applications in bioinformatics, in this article we identify some challenging problems which, though clearly relevant to bioinformatics, have not been extensively investigated in this domain. These include i) unsupervised clustering with unsupervised feature selection, ii) semisupervised learning, iii) unsupervised learning (and supervised learning) in the presence of confounding variables, and iv) stability of clustering solutions. We review recent methods which address these problems and take the position that these methods are well-suited to addressing some common scenarios that occur in bioinformatics.
