We present a random forest (RF) framework for predicting circumgalactic medium (CGM) physical conditions from quasar absorption line observables, trained on a sample of Voigt profile-fit synthetic absorbers from the Simba cosmological simulation. Traditionally, extracting physical conditions from CGM absorber observations involves simplifying assumptions such as uniform single-phase clouds, but by using a cosmological simulation we bypass such assumptions to better capture the complex relationship between CGM observables and underlying gas conditions. We train RF models on synthetic spectra for H I and selected metal lines around galaxies across a range of star formation rates, stellar masses, and impact parameters, to predict absorber overdensities, temperatures, and metallicities. The models reproduce the true values from Simba well, with normalized transverse standard deviations of 0.50–0.54 dex in overdensity, 0.32–0.54 dex in temperature, and 0.49–0.53 dex in metallicity predicted from metal lines (not H I), across all ions. The RF feature importances indicate that the overdensity is most informed by the absorber column density, the temperature is driven by the line width, and the metallicity is most sensitive to the specific star formation rate. When feature importance is instead assessed by removing one observable at a time, the overdensity and metallicity appear to be driven more by the impact parameter. We introduce a normalizing flow approach to ensure that the scatter in the true physical conditions is accurately spanned by the network. The trained models are available online.
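The two feature-importance measures mentioned above (the forest's built-in importances and the change in accuracy when one observable is removed) can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes scikit-learn, uses purely synthetic toy data rather than Simba absorbers, and the feature names (`log_N`, `b`, `log_sSFR`, `impact_param`) and the toy target are hypothetical placeholders.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 2000

# Hypothetical observables (placeholders, not the paper's feature set).
feature_names = ["log_N", "b", "log_sSFR", "impact_param"]
X = rng.normal(size=(n, len(feature_names)))

# Toy target standing in for a physical condition such as overdensity:
# driven mainly by the first feature, weakly by the last, plus noise.
y = 1.0 * X[:, 0] + 0.3 * X[:, 3] + 0.1 * rng.normal(size=n)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)


def fit_and_score(cols):
    """Train an RF on the given feature columns; return model and test R^2."""
    rf = RandomForestRegressor(n_estimators=100, random_state=0)
    rf.fit(X_train[:, cols], y_train)
    return rf, rf.score(X_test[:, cols], y_test)


all_cols = list(range(len(feature_names)))
rf_full, r2_full = fit_and_score(all_cols)

# Measure 1: impurity-based importances reported by the trained forest.
builtin_importance = dict(zip(feature_names, rf_full.feature_importances_))

# Measure 2: retrain without each feature and record the drop in test R^2.
drop_one_importance = {}
for i, name in enumerate(feature_names):
    kept = [c for c in all_cols if c != i]
    _, r2_without = fit_and_score(kept)
    drop_one_importance[name] = r2_full - r2_without

for name in feature_names:
    print(f"{name:>13s}  built-in={builtin_importance[name]:.3f}  "
          f"drop-one dR2={drop_one_importance[name]:.3f}")
```

In this toy setup the features are independent, so both measures pick out the same dominant feature. With correlated observables the two measures need not agree, because a removed feature's information can be partly recovered from the remaining ones.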
CITATION STYLE
Appleby, S., Davé, R., Sorini, D., Lovell, C. C., & Lo, K. (2023). Mapping circumgalactic medium observations to theory using machine learning. Monthly Notices of the Royal Astronomical Society, 525(1), 1167–1181. https://doi.org/10.1093/mnras/stad2266