4.3 Article

SpectraMiner, an interactive data mining and visualization software for single particle mass spectroscopy: A laboratory test case

期刊

INTERNATIONAL JOURNAL OF MASS SPECTROMETRY
卷 258, 期 1-3, 页码 58-73

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.ijms.2006.06.015

关键词

single particle mass spectrometer; data classification; data visualization

向作者/读者索取更多资源

Single particle mass spectrometers are sophisticated instruments designed to measure the sizes and compositions of a wide range of individual particles in situ, in real-time. They characterize hundreds of thousands or millions of particles, generating vast amounts of rich and complex data, the proper mining of which requires dedicated state of the art tools. The analysis of individual particle mass spectra is particularly difficult because of their high dimensionality-each data point, representing a single particle, includes the 450 mass spectral peak intensities, particle size, and time of detection. The first step is to organize the data; a process typically accomplished by grouping particles of similar attributes. Since the common assumption is that the data should be reduced to become manageable, they are typically classified into a small number of clusters (similar to 10), each of which is represented by an average/representative spectrum. Our approach is quite different. We have developed a data mining and visualization software package we call SpectraMiner that makes it possible to handle hundreds of clusters, limiting loss of information and thus overcoming the boundaries set by traditional statistical data analysis approaches. Data, which often include over 1 million particle spectra, are organized using K-mean clustering algorithm. The clusters are merged into nodes by sequentially combining similar clusters. The final structure is displayed in a hierarchical dynamical tree or circular dendogram. This interactive dendogram is the visual interface that allows for real-time data exploration and mining. Clicking on any of the clusters/nodes in the dendogram reveals the detailed information about the particles that reside at that position. At each step the scientist is in control of the level of detail and the visualization format, rapidly switching between them while running the program on a PC. Here we present a study that puts the classification aspect of SpectraMiner to the test. Twelve types of laboratory generated particles are carefully chosen to test some of the difficult aspects of single particle mass spectroscopy. We quantify the degree of particle identification and separation at a number of levels and demonstrate how the visualization tools that SpectraMiner provides can be used to refine, steer and control the data mining process. (c) 2006 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据