4.7 Article

AUTOMATED UNSUPERVISED CLASSIFICATION OF THE SLOAN DIGITAL SKY SURVEY STELLAR SPECTRA USING k-MEANS CLUSTERING

期刊

ASTROPHYSICAL JOURNAL
卷 763, 期 1, 页码 -

出版社

IOP Publishing Ltd
DOI: 10.1088/0004-637X/763/1/50

关键词

astronomical databases: miscellaneous; methods: data analysis; methods: statistical; stars: fundamental parameters; stars: general

资金

  1. Spanish Ministry for Science [AYA 2010-21887-C0404]
  2. Consolider-Ingenio Program [MICINN CSD2006-00070]
  3. Alfred P. Sloan Foundation
  4. National Science Foundation
  5. U.S. Department of Energy Office of Science
  6. University of Arizona
  7. Brazilian Participation Group
  8. Brookhaven National Laboratory
  9. University of Cambridge
  10. Carnegie Mellon University
  11. University of Florida
  12. French Participation Group
  13. German Participation Group
  14. Harvard University
  15. Instituto de Astrofisica de Canarias
  16. Michigan State/Notre Dame/JINA Participation Group
  17. Johns Hopkins University
  18. Lawrence Berkeley National Laboratory
  19. Max Planck Institute for Astrophysics
  20. Max Planck Institute for Extraterrestrial Physics
  21. New Mexico State University
  22. New York University
  23. Ohio State University
  24. Pennsylvania State University
  25. University of Portsmouth
  26. Princeton University
  27. Spanish Participation Group
  28. University of Tokyo
  29. University of Utah
  30. Vanderbilt University
  31. University of Virginia
  32. University of Washington
  33. Yale University

向作者/读者索取更多资源

Large spectroscopic surveys require automated methods of analysis. This paper explores the use of k-means clustering as a tool for automated unsupervised classification of massive stellar spectral catalogs. The classification criteria are defined by the data and the algorithm, with no prior physical framework. We work with a representative set of stellar spectra associated with the Sloan Digital Sky Survey (SDSS) SEGUE and SEGUE-2 programs, which consists of 173,390 spectra from 3800 to 9200 angstrom sampled on 3849 wavelengths. We classify the original spectra as well as the spectra with the continuum removed. The second set only contains spectral lines, and it is less dependent on uncertainties of the flux calibration. The classification of the spectra with continuum renders 16 major classes. Roughly speaking, stars are split according to their colors, with enough finesse to distinguish dwarfs from giants of the same effective temperature, but with difficulties to separate stars with different metallicities. There are classes corresponding to particular MK types, intrinsically blue stars, dust-reddened, stellar systems, and also classes collecting faulty spectra. Overall, there is no one-to-one correspondence between the classes we derive and the MK types. The classification of spectra without continuum renders 13 classes, the color separation is not so sharp, but it distinguishes stars of the same effective temperature and different metallicities. Some classes thus obtained present a fairly small range of physical parameters (200 K in effective temperature, 0.25 dex in surface gravity, and 0.35 dex in metallicity), so that the classification can be used to estimate the main physical parameters of some stars at a minimum computational cost. We also analyze the outliers of the classification. Most of them turn out to be failures of the reduction pipeline, but there are also high redshift QSOs, multiple stellar systems, dust-reddened stars, galaxies, and, finally, odd spectra whose nature we have not deciphered. The template spectra representative of the classes are publicly available in the online journal and at ftp://stars:kmeans@ftp.iac.es.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据