4.7 Article

Photometric redshift estimation of BASS DR3 quasars by machine learning

Journal

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY
Volume 509, Issue 2, Pages 2289-2303

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/mnras/stab3165

Keywords

methods: statistical; techniques: photometric; astronomical data bases: miscellaneous; galaxies: distances and redshifts; quasars: general; galaxies: photometric

Funding

  1. National Natural Science Foundation of China (NSFC) [11573019, 11803055, 11873066, 12133001, 11433005]
  2. NSFC [U1531246, U1731125, U1731243, U1731109]
  3. Chinese Academy of Sciences (CAS) [U1531246, U1731125, U1731243, U1731109]
  4. 13th Five-year Informatization Plan of Chinese Academy of Sciences [XXH13503-03-107]
  5. China Manned Space Project [CMS-CSST-2021-A06]
  6. National Astronomical Observatories of China
  7. Chinese Academy of Sciences [XDB09000000]
  8. Special Fund for Astronomy from the Ministry of Finance
  9. External Cooperation Program of the Chinese Academy of Sciences [114A11KYSB20160057]
  10. National Development and Reform Commission
  11. National Aeronautics and Space Administration
  12. Alfred P. Sloan Foundation
  13. U.S. Department of Energy Office of Science
  14. Brazilian Participation Group
  15. Carnegie Institution for Science
  16. Carnegie Mellon University
  17. Chilean Participation Group
  18. French Participation Group
  19. Harvard-Smithsonian Center for Astrophysics
  20. Instituto de Astrofisica de Canarias
  21. Johns Hopkins University
  22. Kavli Institute for the Physics and Mathematics of the Universe IPMU)/University of Tokyo
  23. Lawrence Berkeley National Laboratory
  24. Leibniz Institut fur Astrophysik Potsdam (AIP)
  25. Max-PlanckInstitut fur Astronomie (MPIA Heidelberg)
  26. Max-Planck-Institut fur Astrophysik (MPA Garching)
  27. Max-Planck-Institut fur Extraterrestrische Physik (MPE)
  28. New Mexico State University
  29. New York University
  30. University of Notre Dame
  31. Observatario Nacional/MCTI
  32. Ohio State University
  33. Pennsylvania State University
  34. Shanghai Astronomical Observatory
  35. United Kingdom Participation Group
  36. Universidad Nacional Autonoma de Mexico
  37. University of Arizona
  38. University of Colorado Boulder
  39. University of Oxford
  40. University of Portsmouth
  41. University of Utah
  42. University of Virginia
  43. University of Washington
  44. University of Wisconsin
  45. Vanderbilt University
  46. Yale University
  47. Fundamental Science Data Sharing Platform [DKA2017-12-02-07]
  48. Center for High-Performance Computing at the University of Utah

Ask authors/readers for more resources

In this study, the performance of XGBoost, CatBoost, and Random Forest algorithms in estimating photometric redshifts of quasars was compared, with CatBoost showing the best performance, especially for high-redshift quasars. The two-step model, which utilizes CatBoost as the core algorithm, outperforms the one-step model and provides more accurate redshift predictions for high-redshift quasars.
Correlating Beijing-Arizona Sky Survey (BASS) data release 3 (DR3) catalogue with the ALLWISE data base, the data from optical and infrared information are obtained. The quasars from Sloan Digital Sky Survey are taken as training and test samples while those from LAMOST are considered as external test sample. We propose two schemes to construct the redshift estimation models with XGBoost, CatBoost, and Random Forest. One scheme (namely one-step model) is to predict photometric redshifts directly based on the optimal models created by these three algorithms; the other scheme (namely two-step model) is to first classify the data into low- and high-redshift data sets, and then predict photometric redshifts of these two data sets separately. For one-step model, the performance of these three algorithms on photometric redshift estimation is compared with different training samples, and CatBoost is superior to XGBoost and Random Forest. For two-step model, the performances of these three algorithms on the classification of low and high redshift subsamples are compared, and CatBoost still shows the best performance. Therefore, CatBoost is regarded as the core algorithm of classification and regression in two-step model. In contrast to one-step model, two-step model is optimal when predicting photometric redshift of quasars, especially for high-redshift quasars. Finally, the two models are applied to predict photometric redshifts of all quasar candidates of BASS DR3. The number of high-redshift quasar candidates is 3938 (redshift >= 3.5) and 121 (redshift >= 4.5) by two-step model. The predicted result will be helpful for quasar research and follow-up observation of high-redshift quasars.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available