4.7 Article

Mapping circumgalactic medium observations to theory using machine learning

Journal

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY
Volume 525, Issue 1, Pages 1167-1181

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/mnras/stad2266

Keywords

galaxies: evolution; galaxies: general; galaxies: haloes; quasars: absorption lines

Ask authors/readers for more resources

We propose a random forest (RF) framework for predicting physical conditions of the circumgalactic medium (CGM) from quasar absorption line observables. The RF models are trained on synthetic absorbers from the simba cosmological simulation to capture the complex relationship between CGM observables and underlying gas conditions. The models accurately reproduce the true values with normalized transverse standard deviations ranging from 0.32 to 0.54 dex in temperature, 0.49 to 0.53 dex in metallicity, and 0.50 to 0.54 dex in overdensity.
We present a random forest (RF) framework for predicting circumgalactic medium (CGM) physical conditions from quasar absorption line observables, trained on a sample of Voigt profile-fit synthetic absorbers from the simba cosmological simulation. Traditionally, extracting physical conditions from CGM absorber observations involves simplifying assumptions such as uniform single-phase clouds, but by using a cosmological simulation we bypass such assumptions to better capture the complex relationship between CGM observables and underlying gas conditions. We train RF models on synthetic spectra for H i and selected metal lines around galaxies across a range of star formation rates, stellar masses, and impact parameters, to predict absorber overdensities, temperatures, and metallicities. The models reproduce the true values from simba well, with normalized transverse standard deviations of 0.50-0.54 dex in overdensity, 0.32-0.54 dex in temperature, and 0.49-0.53 dex in metallicity predicted from metal lines (not H i), across all ions. Examining the feature importance, the RF indicates that the overdensity is most informed by the absorber column density, the temperature is driven by the line width, and the metallicity is most sensitive to the specific star formation rate. Alternatively examining feature importance by removing one observable at a time, the overdensity and metallicity appear to be more driven by the impact parameter. We introduce a normalizing flow approach in order to ensure the scatter in the true physical conditions is accurately spanned by the network. The trained models are available online.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available