☆ 4.7 Article

Prediction of galaxy halo masses in SDSS DR7 via a machine learning approach

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY (2019)

Journal

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY

Volume 490, Issue 2, Pages 2367-2379

Publisher

OXFORD UNIV PRESS

DOI: 10.1093/mnras/stz2775

Keywords

galaxies: clusters: general; galaxies: groups: general; cosmology: observations; large-scale structure of Universe

Funding

NSF
Vanderbilt Advanced Computing Center for Research and Education (ACCRE)
National Science Foundation (NSF) [AST-1151650, CE170100013]
Alfred P. Sloan Foundation
National Science Foundation
U.S. Department of Energy
National Aeronautics and Space Administration
Japanese Monbukagakusho
Max Planck Society
Higher Education Funding Council for England
American Museum of Natural History
Astrophysical Institute Potsdam
University of Basel
University of Cambridge
Case Western Reserve University, University of Chicago
Drexel University
Institute for Advanced Study
Japan Participation Group
Johns Hopkins University
Joint Institute for Nuclear Astrophysics
Kavli Institute for Particle Astrophysics and Cosmology
Chinese Academy of Sciences (LAMOST)
Los Alamos National Laboratory
Max-Planck-Institute for Astronomy (MPIA)
Max-Planck-Institute for Astrophysics (MPA), New Mexico State University
Ohio State University, University of Pittsburgh, University of Portsmouth
Princeton University
United States Naval Observatory

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

We present a machine learning (ML) approach for the prediction of galaxies' dark matter halo masses which achieves an improved performance over conventional methods. We train three ML algorithms (XGBoost, random forests, and neural network) to predict halo masses using a set of synthetic galaxy catalogues that are built by populating dark matter haloes in N-body simulations with galaxies and that match both the clustering and the joint distributions of properties of galaxies in the Sloan Digital Sky Survey (SDSS). We explore the correlation of different galaxy- and group-related properties with halo mass, and extract the set of nine features that contribute the most to the prediction of halo mass. We find that mass predictions from the ML algorithms are more accurate than those from halo abundance matching (HAM) or dynamical mass estimates (DYN). Since the danger of this approach is that our training data might not accurately represent the real Universe, we explore the effect of testing the model on synthetic catalogues built with different assumptions than the ones used in the training phase. We test a variety of models with different ways of populating dark matter haloes, such as adding velocity bias for satellite galaxies. We determine that, though training and testing on different data can lead to systematic errors in predicted masses, the ML approach still yields substantially better masses than either HAM or DYN. Finally, we apply the trained model to a galaxy and group catalogue from the SDSS DR7 and present the resulting halo masses.

Prediction of galaxy halo masses in SDSS DR7 via a machine learning approach

Journal

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Prediction of galaxy halo masses in SDSS DR7 via a machine learning approach

Journal

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper