4.7 Article

Clustering biological data with SOMs: On topology preservation in non-linear dimensional reduction

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 40, 期 9, 页码 3841-3845

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2012.12.074

关键词

Clustering; Bioinformatics; Dimensional reduction; Topology preservation

资金

  1. National Scientific and Technical Research Council [PIP 1122008, PIP 2072008, PIP 1142009]
  2. National Agency for the Promotion of Science and Technology [PICT 2008 00100, PICT 2008 01940, PAE 37122, PAE-PICT 00052]
  3. National Institute for Agricultural Technology of Argentina [PE 243542]

向作者/读者索取更多资源

Dimensional reduction is a widely used technique for exploratory analysis of large volume of data. In biological datasets, each object is described by a large number of variables (or dimensions) and it is crucial to perform their analyses in a smaller space, to extract useful information. Kohonen self-organizing maps (SOMs) have been recently proposed in systems biology as a useful tool for exploratory analysis, data integration and discovery of new relationships in *omics datasets. SOMs have been traditionally used for clustering in several data mining problems, mainly due to their ability to preserve input data topology and reduce a high dimensional input space into a 2-D map. In spite of this, the above-mentioned dimensional reduction can lead to counterintuitive results. Sometimes, maps having almost the same size, trained on the same dataset, and with identical learning algorithms and parameters, may find different clusters. However, one would expect that small changes in map sizes or another training condition would not result in an abrupt different location of any of the grouped patterns. The aim of this work is to analyze and explain this issue through a real case study involving transcriptomic and metabolomic data, since it might have an important impact when interpreting clustering results over a biological dataset. (C) 2012 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据