A first look at creating mock catalogs with machine learning techniques

N/ACitations
Citations of this article
26Readers
Mendeley users who have this article in their library.

Abstract

We investigate machine learning (ML) techniques for predicting the number of galaxies (Ngal) that occupy a halo, given the halo's properties. These types of mappings are crucial for constructing the mock galaxy catalogs necessary for analyses of large-scale structure. The ML techniques proposed here distinguish themselves from traditional halo occupation distribution (HOD) modeling as they do not assume a prescribed relationship between halo properties and Ngal. In addition, our ML approaches are only dependent on parent halo properties (like HOD methods), which are advantageous over subhalo-based approaches as identifying subhalos correctly is difficult. We test two algorithms: support vector machines (SVM) and k-nearest-neighbor (kNN) regression. We take galaxies and halos from the Millennium simulation and predict Ngal by training our algorithms on the following six halo properties: number of particles, M200, σv , v max, half-mass radius, and spin. For Millennium, our predicted N gal values have a mean-squared error (MSE) of ∼0.16 for both SVM and kNN. Our predictions match the overall distribution of halos reasonably well and the galaxy correlation function at large scales to ∼5%-10%. In addition, we demonstrate a feature selection algorithm to isolate the halo parameters that are most predictive, a useful technique for understanding the mapping between halo properties and Ngal. Lastly, we investigate these ML-based approaches in making mock catalogs for different galaxy subpopulations (e.g., blue, red, high Mstar, low Mstar). Given its non-parametric nature as well as its powerful predictive and feature selection capabilities, ML offers an interesting alternative for creating mock catalogs. © 2013. The American Astronomical Society. All rights reserved.

Cite

CITATION STYLE

APA

Xu, X., Ho, S., Trac, H., Schneider, J., Poczos, B., & Ntampaka, M. (2013). A first look at creating mock catalogs with machine learning techniques. Astrophysical Journal, 772(2). https://doi.org/10.1088/0004-637X/772/2/147

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free