4.7 Article

A FIRST LOOK AT CREATING MOCK CATALOGS WITH MACHINE LEARNING TECHNIQUES

期刊

ASTROPHYSICAL JOURNAL
卷 772, 期 2, 页码 -

出版社

IOP Publishing Ltd
DOI: 10.1088/0004-637X/772/2/147

关键词

galaxies: halos; large-scale structure of universe; methods: numerical

资金

  1. McWilliams Center for Cosmology Postdoctoral Fellowship by the Bruce and Astrid McWilliams Center for Cosmology
  2. NSF [AST-1109730]
  3. M. Hildred Blewett Fellowship of the American Physical Society

向作者/读者索取更多资源

We investigate machine learning (ML) techniques for predicting the number of galaxies (N-gal) that occupy a halo, given the halo's properties. These types of mappings are crucial for constructing the mock galaxy catalogs necessary for analyses of large-scale structure. The ML techniques proposed here distinguish themselves from traditional halo occupation distribution (HOD) modeling as they do not assume a prescribed relationship between halo properties and N-gal. In addition, our ML approaches are only dependent on parent halo properties (like HOD methods), which are advantageous over subhalo-based approaches as identifying subhalos correctly is difficult. We test two algorithms: support vector machines (SVM) and k-nearest-neighbor (kNN) regression. We take galaxies and halos from the Millennium simulation and predict N-gal by training our algorithms on the following six halo properties: number of particles, M-200, sigma(v), v(max), half-mass radius, and spin. For Millennium, our predicted Ngal values have a mean-squared error (MSE) of similar to 0.16 for both SVM and kNN. Our predictions match the overall distribution of halos reasonably well and the galaxy correlation function at large scales to similar to 5%-10%. In addition, we demonstrate a feature selection algorithm to isolate the halo parameters that are most predictive, a useful technique for understanding the mapping between halo properties and N-gal. Lastly, we investigate these ML-based approaches in making mock catalogs for different galaxy subpopulations (e. g., blue, red, high M-star, low M-star). Given its non-parametric nature as well as its powerful predictive and feature selection capabilities, ML offers an interesting alternative for creating mock catalogs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据