4.4 Article Proceedings Paper

Multidimensional partitioning and bi-partitioning: analysis and application to gene expression data sets

Journal

INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS
Volume 85, Issue 3-4, Pages 475-485

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/00207160701210158

Keywords

data mining dimension reduction; graph laplacian; microarray; singular value decomposition; tumour classification

Ask authors/readers for more resources

Eigenvectors and, more generally, singular vectors, have proved to be useful tools for data mining and dimension reduction. Spectral clustering and reordering algorithms have been designed and implemented in many disciplines, and they can be motivated from several different standpoints. Here we give a general, unified derivation from an applied linear algebra perspective. We use a variational approach that has the benefit of (a) naturally introducing an appropriate scaling, (b) allowing for a solution in any desired dimension, and (c) dealing with both the clustering and bi-clustering issues in the same framework. The motivation and analysis is then backed up with examples involving two large data sets from modern, high-throughput, experimental cell biology. Here, the objects of interest are genes and tissue samples, and the experimental data represents gene activity. We show that looking beyond the dominant, or Fiedler, direction reveals important information.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available