4.7 Article

Painting galaxies into dark matter haloes using machine learning

Journal

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY
Volume 478, Issue 3, Pages 3410-3422

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/mnras/sty1169

Keywords

galaxies: evolution; cosmology: theory; large-scale structure of Universe

Funding

  1. South African Research Chairs Initiative
  2. South African National Research Foundation
  3. Simons Foundation's Center for Computational Astrophysics
  4. Distinguished Visitor Program at Space Telescope Science Institute
  5. UWC's Office [e77f814]
  6. STFC [ST/R000972/1] Funding Source: UKRI

Ask authors/readers for more resources

We develop amachine learning (ML) framework to populate large darkmatter-only simulations with baryonic galaxies. Our ML framework takes input halo properties including halo mass, environment, spin, and recent growth history, and outputs central galaxy and halo baryonic properties, including stellar mass (M-*), star formation rate (SFR), metallicity (Z), neutral (HI), and molecular (H-2) hydrogen mass. We apply this to the MUFASA cosmological hydrodynamic simulation, and show that it recovers the mean trends of output quantities with halo mass highly accurately, including following the sharp drop in SFR and gas in quenched massive galaxies. However, the scatter around the mean relations is underpredicted. Examining galaxies individually, at z = 0, the stellar mass and metallicity are accurately recovered (sigma less than or similar to 0.2 dex), but SFR and HI show larger scatter (sigma greater than or similar to 0.3 dex); these values improve somewhat at z = 1 and 2. Remarkably, ML quantitatively recovers second parameter trends in galaxy properties, e.g. that galaxies with higher gas content and lower metallicity have higher SFR at a given M*. Testing various ML algorithms, we find that none perform significantly better than the others, nor does ensembling improve performance, likely because none of the algorithms reproduce the large observed scatter around the mean properties. For the random forest algorithm, we find that halo mass and nearby (similar to 200 kpc) environment are the most important predictive variables followed by growth history, while halo spin and similar to Mpc-scale environment are not important. Finally, we study the impact of additionally inputting key baryonic properties M*, SFR and Z, as would be available e.g. from an equilibrium model, and show that particularly providing the SFR enables HI to be recovered substantially more accurately.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available