This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

BayeSED-GALAXIES I. Performance test for simultaneous photometric redshift and stellar population parameter estimation of galaxies in the CSST wide-field multiband imaging survey

Yunkun Han Yunnan Observatories, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China Center for Astronomical Mega-Science, Chinese Academy of Sciences, 20A Datun Road, Chaoyang District, Beijing, 100012, P. R. China Key Laboratory for the Structure and Evolution of Celestial Objects, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China International Centre of Supernovae, Yunnan Key Laboratory, Kunming 650216, P. R. China Lulu Fan Deep Space Exploration Laboratory / Department of Astronomy, University of Science and Technology of China, Hefei 230026, China School of Astronomy and Space Science, University of Science and Technology of China, Hefei 230026, China Xian Zhong Zheng Purple Mountain Observatory, Chinese Academy of Sciences, 10 Yuanhua Road, Nanjing 210023, China Jin-Ming Bai Yunnan Observatories, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China Center for Astronomical Mega-Science, Chinese Academy of Sciences, 20A Datun Road, Chaoyang District, Beijing, 100012, P. R. China Key Laboratory for the Structure and Evolution of Celestial Objects, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China Zhanwen Han Yunnan Observatories, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China Center for Astronomical Mega-Science, Chinese Academy of Sciences, 20A Datun Road, Chaoyang District, Beijing, 100012, P. R. China Key Laboratory for the Structure and Evolution of Celestial Objects, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China International Centre of Supernovae, Yunnan Key Laboratory, Kunming 650216, P. R. China
Abstract

The forthcoming CSST wide-field multiband imaging survey will produce seven-band photometric spectral energy distributions (SEDs) for billions of galaxies. The effective extraction of astronomical information from these massive datasets of SEDs relies on the techniques of both SED synthesis (or modeling) and analysis (or fitting). We evaluate the performance of the latest version of BayeSED code combined with SED models with increasing complexity for simultaneously determining the photometric redshifts and stellar population parameters of galaxies in this survey. By using an empirical statistics-based mock galaxy sample without SED modeling errors, we show finding that the random observational errors in photometries are more important sources of errors than the parameter degeneracies and Bayesian analysis method and tool. By using a Horizon-AGN hydrodynamical simulation-based mock galaxy sample with SED modeling errors about the star formation histories (SFHs) and dust attenuation laws (DALs), the simple typical assumptions lead to significantly worse parameter estimation with CSST photometries only.The SED models with more flexible (or complicated) forms of SFH/DAL do not necessarily lead to better estimation of redshift and stellar population parameters. We discuss the selection of the best SED model by means of Bayesian model comparison in different surveys. Our results reveal that the Bayesian model comparison with Bayesian evidence may favor SED models with different complexities when using photometries from different surveys. Meanwhile, the SED model with the largest Bayesian evidence tends to give the best performance of parameter estimation, which is more clear for photometries with larger discriminative power.

galaxies: fundamental parameters — galaxies: stellar content — galaxies: statistics — methods: data analysis — methods: statistical
journal: ApJS

1 Introduction

Understanding the complex ecosystem of stars, interstellar gas and dust, and supermassive black holes in galaxies is one of the most important challenges in modern astrophysics (National Academies of Sciences, Engineering, and Medicine, 2021). The new generation of space and ground telescopes and the corresponding large surveys will provide vast amounts of multi-band data for understanding the cosmic ecosystems and all the complex physical processes involved. For example, the James Webb Space Telescope (JWST, Rieke et al., 2005; Gardner et al., 2006; Beichman et al., 2012) is able to detect the earliest stages of galaxies from infrared at unprecedented depths, and is expected to provide decisive observations of the first generation of stars and galaxies (Beichman et al., 2012; Robertson, 2022). Meanwhile, forthcoming deep and wide field surveys with the Chinese Space Station Telescope (CSST, Zhan, 2011, 2018, 2021), the Euclid Space Telescope (Laureijs et al., 2011; Joachimi, 2016), the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST, Ivezić et al., 2019; Breivik et al., 2022), and the Nancy Grace Roman Space Telescope (NGRST, Green et al., 2012; Dore et al., 2019) will provide multi-band photometric and spectroscopic information for billions of galaxies. Especially, the CSST wide-field multiband imaging survey is set to image approximately 17,500 square degrees of the sky using NUV, u, g, r, i, z, and y bands in about 10 years of orbital time, which aims to achieve a 5σ5\sigma limiting magnitude of 26 (AB mag) or higher for point sources in the g and r bands. How to effectively and reliably measure the redshift and the properties of various physical components of galaxies from the obtained huge amount of photometric spectral energy distributions (SEDs) data has become an urgent task to be done. A new generation of SED synthesis and analysis methods and tools are strongly demanded to effectively extract physical information from those massive datasets of observational SEDs.

The SED synthesis and analysis of galaxies are two aspects that are both opposite and unified in nature. The reliability and efficiency of the SED synthesis and analysis methods and tools will directly determine the reliability and efficiency of physical information extraction from the massive multi-wavelength datasets. In terms of SED synthesis of galaxies, the evolutionary synthesis technique of stellar population has become the core method from the pioneering works of Tinsley & Gunn (1976); Tinsley (1978). Nowadays, the stellar population synthesis models of BC03 (Bruzual & Charlot, 2003), M05 (Maraston, 2005), FSPS (Conroy et al., 2009), BPASS (Eldridge & Stanway, 2009), among others are widely used in the study of the formation and evolution of galaxies. However, in the SED synthesis models of galaxies, many important uncertainties remain in almost all the model ingredients (Conroy et al., 2009, 2010; Conroy & Gunn, 2010; Conroy, 2013), such as the initial stellar mass function (IMF) (Padoan et al., 1997; Hoversten & Glazebrook, 2008; van Dokkum, 2008; Bastian et al., 2010; Cappellari et al., 2012; Ferreras et al., 2013; Gennaro et al., 2018), the physics of stellar evolution (Thomas & Maraston, 2003; Zhang et al., 2005; Maraston et al., 2006; Han et al., 2007; Marigo et al., 2008; Bertelli et al., 2008; Brott et al., 2011; Hernández-Pérez & Bruzual, 2013), stellar spectral libraries (Coelho, 2009; Choi et al., 2019; Knowles et al., 2019; Coelho et al., 2020; Knowles et al., 2021; Yan et al., 2019), the complex star formation and metallicity enrichment histories (SFHs and MEHs) (Debsarma et al., 2016; Iyer et al., 2019; Carnall et al., 2019; Leja et al., 2019; Aufort et al., 2020; Wang & Lilly, 2020; Iyer et al., 2020; Côté et al., 2016; Maiolino & Mannucci, 2019; Valentini et al., 2019), the reprocessing by the interstellar gas and dust (Draine, 2003, 2010; Galliano et al., 2018; Kewley et al., 2019; Salim & Narayanan, 2020; Tacconi et al., 2020), and the possible contribution from active galactic nuclei (AGNs) at the center of galaxies (Antonucci, 1993, 2012; Netzer, 2015; Hickox & Alexander, 2018; Brown et al., 2019a, b; Lyu & Rieke, 2022). Different choices of these model ingredients will lead to very different estimation of the redshifts and physical parameters of galaxies, as well as different and even conflicting conclusions about the formation and evolution of galaxies. Therefore, the proper selection of these model ingredients is an essential step in any SED analysis work of galaxies (Han & Han, 2019; Han et al., 2020).

In terms of SED analysis of galaxies, the Bayesian method has been widely adopted in the last decade. For example, the widely used and actively developing SED fitting codes, such as MAGPHYS (da Cunha et al., 2008), CIGALE (Noll et al., 2009; Boquien et al., 2019), GalMC (Acquaviva et al., 2011), BayeSED (Han & Han, 2012, 2014), BEAGLE (Chevallard & Charlot, 2016), Prospector (Leja et al., 2017), BAGPIPES (Carnall et al., 2018), and ProSpect (Robotham et al., 2020) are all based on the Bayesian methods. Besides, a long list of new SED fitting codes, such as MCSED (Bowman et al., 2020), piXedfit (Abdurro’uf et al., 2021), gsf (Morishita, 2022), and Lightning (Doore et al., 2023) among others, are build along this way. The application of Bayesian methods implies that the SED analysis of galaxies is considered as a more general Bayesian inference problem instead of the previous Chi-square minimization-based optimization problem known as SED fitting. For the parameter estimation of a give SED model, the Bayesian approach provides the complete posterior probability distribution of parameters as the solution to the SED analysis problem, which is computationally more demanding but allows a more formal and simultaneous estimation of parameter values and their uncertainties. More importantly, for the selection of model ingredients, the Bayesian approach also provides the very useful Bayesian evidence which can be considered as a quantified Occam’s razor for effective model selection.

A noteworthy difference among the Bayesian SED analysis tools is that the earlier tools (e.g. MAGPHYS and CIGALE) are based on irregular or regular grid search, while the newer generation of tools (e.g. GalMC and BayeSED) are based on more advanced random sampling techniques such as Markov Chain Monte Carlo (MCMC, Sharma, 2017; Hogg & Foreman-Mackey, 2018) and Nested Sampling (NS, Skilling, 2006; Buchner, 2021; Ashton et al., 2022). The advantage of the grid-based Bayesian approach is that an SED library with regular or irregular model grids can be built in advance for only once. Besides, the prior probabilities can be set more freely during this procedure. Then it can be used in the analysis of a large sample of galaxies of any size without the generation of new SEDs. However, the size of SED library needs to be very large to allow a reasonable parameter estimation for all galaxies in the sample, especially in the case with regular grids where the number of required grids will increase dramatically with the number of free parameters. In contrast, a sampling-based Bayesian approach allows for a more detailed and efficient sampling of the parameter space for each galaxy and allows for a finer reconstruction of the posterior, leading to more reliable parameter estimates. However, the theoretical SED synthesis needs to be done in real-time and repeated many times, which could be very computationally expensive for the analysis of very large samples of galaxies. Fortunately, a much more efficient synthesis SED models can be achieved with the help of machine learning techniques. For example, in Han & Han (2014) we have employed the artificial neural network (ANN) and K-nearest neighbor (KNN) searching techniques to speed up the sampling-based Bayesian approach. The combination of sampling-based Bayesian inference and machine learning techniques enables the detailed Bayesian SED analysis of very large samples of galaxies (Han et al., 2019). Although the training phase of machine-learning-based SED synthesis method could be very time-consuming, especially for very complex SED models with many free parameters and the accurate synthesis of high-resolution SED, it is very promising with more advanced training techniques (Alsing et al., 2020; Gilda et al., 2021; Qiu & Kang, 2022; Hahn & Melchior, 2022) and worthwhile to carry out further exploration in this direction.

For the study of galaxy formation and evolution, the ideal SED synthesis and analysis tool should be able to simultaneously account for the contributions of stars, interstellar gas and dust, and AGN components, and to provide accurate and efficient estimates of the redshift and the physical properties of all components. However, in practice, it is very difficult, if not impossible, to fully satisfy all of these requirements. Therefore, a good SED synthesis and analysis tool should attempt to achieve a reasonable balance among these requirements as much as possible. This is what we are trying to achieve during the development of the BayeSED code (Han & Han, 2012, 2014, 2019; Han et al., 2019, 2020). In this work, we will rigorously test the performance of the latest version of BayeSED code combined with SED models with increasing complexity for simultaneous photometric redshift and stellar population parameter estimation of galaxies, so as to be ready for the analysis of the forthcoming massive datasets from the CSST wide-field multiband imaging survey and others.

We begin in §2 by introducing the methods we have employed for the generation of empirical statistics-based (§2.1) and hydrodynamical simulation-based (§2.2) mock catalog of galaxies, observational error modeling (§2.3) and the selection of samples (§2.4) that will be used for the performance test. In §3, we briefly describe the Bayesian approach of photometric SED analysis methods, including parameter estimation (§3.1) and model selection (§3.2). Especially, in §3.3, we will introduce some runtime parameters of MultiNest algorithm which is the core engine of BayeSED. They need to be properly tuned to improve the performance of BayeSED. We present the results of performance test for the case without SED modeling errors using an empirical statistics-based mock galaxy sample in §4. In §5, by employing the simplest SED model, we present the results of performance test for the case with SED modeling errors about the SFH and DAL of galaxies using a Horizon-AGN hydrodynamical simulation-based mock galaxy sample for CSST-like imaging survey. In §6, we discuss the effectiveness of more flexible (or complex) forms of SFH and DAL of galaxies for improving the performance of simultaneous redshift and stellar parameter estimation in CSST-like (§6.1), CSST+Euclid-like (§6.2), and COSMOS-like (§6.3) surveys with increasing discriminative power, respectively. Especially, we also discuss the relation between the metrics of the quality of parameter estimation and Bayesian evidence, as well as how they depend on the different surveys. Finally, a summary of our results and conclusions is presented in §7.

Throughout this work, we assume a flat Λ\Lambda-cold dark matter cosmology with H0=70H_{0}=70 km s-1 Mpc-1, Ωm=0.3\Omega_{m}=0.3, ΩΛ=0.7\Omega_{\Lambda}=0.7 (Spergel et al., 2003, WMAP-5), and the Chabrier (2003) IMF. All the presented magnitudes are in the AB system (Oke, 1974).

2 Bayesian photometric SED synthesis with BayeSED

The SED synthesis (or modeling) module is an essential part in any Bayesian SED fitting code. In BayeSED-V3, we have added more functions for SED synthesis, especially for the simulation of mock observation of galaxies in a Bayesian way. This is not just crucial for the current work, but also lays the foundation for future applications of machine learning and simulation-based Bayesian inference methods in Bayesian SED fitting (Hahn & Melchior, 2022; Hahn et al., 2022). For this work, we use the empirical statistics-based (§2.1) and hydrodynamical simulation-based (§2.2) methods to generate mock photometric catalog, add noise with a simple magnitude limit-based approach (2.3), and select a sample (2.4) similar to previous works for the performance test in the next two sections. In the following, we introduce them in more detail.

2.1 Empirical statistics-based photometric mock catalog

The first method to generate mock photometric catalog is built by randomly draw samples from the parameter space of a particular SED model while under the constraints of some empirical statistical properties of galaxies. The sampling is performed with the same nested sampler MultiNest as in the Bayesian SED analysis mode of BayeSED. A sample from this catalog will be used in §4 to test the performance of redshift and stellar population parameter estimation in the case where the SED modeling is perfect, since exactly the same SED modeling method will be used in the Bayesian SED analysis of it.

2.1.1 SED modeling

As in Han & Han (2019), the SED of a galaxy is modeled as the luminosity of starlight from stellar populations of varying ages and metallicities, transmitted through the Interstellar Medium (ISM) and the Intergalactic Medium (IGM) to the observer. Specifically, the luminosity emitted at wavelength λ\lambda by a galaxy with age=tage=t can be given as:

Lλ(t)\displaystyle L_{\lambda}(t) =0tdtψ(tt)Sλ[t,Z(tt)]Tλism(t,t)\displaystyle=\int_{0}^{t}{\rm d}\hbox{$t^{\prime}$}\,\psi(t-\hbox{$t^{\prime}$})\,S_{\lambda}[\hbox{$t^{\prime}$},{Z}(t-\hbox{$t^{\prime}$})]\,T^{\textsc{ism}}_{\lambda}(t,t^{\prime}) (1)
=Tλism0tdtψ(tt)Sλ[t,Z(tt)],\displaystyle=T^{\textsc{ism}}_{\lambda}\,\int_{0}^{t}{\rm d}\hbox{$t^{\prime}$}\,\psi(t-\hbox{$t^{\prime}$})\,S_{\lambda}[\hbox{$t^{\prime}$},{Z}(t-\hbox{$t^{\prime}$})]\,, (2)

where ψ(tt)\psi(t-\hbox{$t^{\prime}$}) is the star formation history (SFH) describing the SFR as a function of the time ttt-\hbox{$t^{\prime}$}, and Sλ[t,Z(tt)]S_{\lambda}[\hbox{$t^{\prime}$},{Z}(t-t^{\prime})] is the luminosity emitted per unit wavelength per unit mass by a simple stellar population (SSP) of age tt^{\prime} and metallicity Z(tt){Z}(t-t^{\prime}).

Tλism(t,t)T^{\textsc{ism}}_{\lambda}(t,t^{\prime}) is the transmission function of the ISM (Charlot & Longhetti, 2001), which is contributed by two components:

Tλism(t,t)=Tλ+(t,t)Tλ0(t,t),T^{\textsc{ism}}_{\lambda}(t,t^{\prime})=T^{+}_{\lambda}(t,t^{\prime})T^{0}_{\lambda}(t,t^{\prime}), (3)

where Tλ+(t,t)T^{+}_{\lambda}(t,t^{\prime}) and Tλ0(t,t)T^{0}_{\lambda}(t,t^{\prime}) are the transmission functions of the ionized gas and the neutral ISM, respectively. The transmission through ionized gas can be modeled with photoionization code such as CLOUDY. However, we set Tλ+(t,t)=1T^{+}_{\lambda}(t,t^{\prime})=1 in this work to be consistent with the hydrodynamical-simulation based catalog (§2.2). A detailed modeling of Tλ+(t,t)T^{+}_{\lambda}(t,t^{\prime}) with CLOUDY to account for the combined effects of starlight absorption, nebular line emission, ionized continuum emission, and possible emission from warm dust within HII regions will be presented in a companion paper. Meanwhile, the transmission functions of the neutral ISM is considered with a simple time-independent dust attenuation law (DAL) and uniformly applied to the whole galaxy.

Z(tt){Z}(t-t^{\prime}) is the stellar metallicity as a function of the time ttt-t^{\prime}, which describe the chemical evolution history of the galaxy. In previous works, we assume a time-independent metallicity, i.e. Z(tt)=Z0{Z}(t-t^{\prime})=Z_{0}, as in many SED fitting codes of galaxy. To properly consider the evolution of stellar metallicity, we additionally employ a linear SFH-to-metallicity mapping model(Driver et al., 2013; Robotham et al., 2020; Thorne et al., 2021; Alsing et al., 2023):

Z(tt)=(Z(tage)Zmin)1M0tψ(tt)dt+ZminZ(t-t^{\prime})=\left(Z(t_{\mathrm{age}})-Z_{\min}\right)\frac{1}{M}\int_{0}^{t}\psi(t-\hbox{$t^{\prime}$}){\rm d}\hbox{$t^{\prime}$}+Z_{\min} (4)

Generally, the main ingredients for our SED modeling of galaxies are the SSP model, SFH, CEH and DAL. In this work,as the construction of the Horizon-AGN hydrodynamical-simulation based catalog (§2.2), we use the SSP model assuming a Chabrier (2003) stellar IMF from the widely used stellar population synthesis model of Bruzual & Charlot (2003). The SFH of galaxies is typically parameterized as the exponentially declining form: SFR(t)et/τ{\rm SFR}(t)\propto e^{-t/\tau}\,(hereafter τ\tau model). The τ\tau model only describes the SFH of galaxies in a closed box without inflow of pristine gas and outflow of processed gas, where the gases are converted to stars at a rate proportional to the remaining gas and with a fixed efficiency (Schmidt, 1959; Tinsley, 1980). It is widely discussed in the literature that this simple assumption may lead to systematically biased estimation of stellar population parameters, especially for galaxies at z2z\gtrsim 2 (Lee et al., 2009, 2010; Reddy et al., 2012; Ciesla et al., 2017; Carnall et al., 2018). Therefore, some more flexible and physically inspired form of models have been suggested to improve the measurement of SFHs of galaxies and the estimation of their stellar population parameters and photometric redshift (Pacifici et al., 2012; Ciesla et al., 2017; Iyer & Gawiser, 2017; Carnall et al., 2019; Leja et al., 2019; Iyer et al., 2019; Lower et al., 2020; Suess et al., 2022).

In the present work, we employ three extension of the τ\tau model with different complexity. The first one is described as:

ψ(t)tβ×exp(t/τ),\psi(t)\propto t^{\beta}\times\exp({-t/\tau}), (5)

which is just an extended form of the delayed-τ\tau model (Lee et al., 2010). Apparently, the typical τ\tau model and delayed-τ\tau model are just two special cases of this model (hereafter β\beta-τ\tau model) with β=0\beta=0 and β=1\beta=1, respectively. The second one is the β\beta-τ\tau model combined with a quenching (or rejuvenation) component which is described as (Ciesla et al., 2016):

ψ(t){tβ×exp(t/τ), when t<=ttrunc rSFR×ψ(t=ttrunc), when t>ttrunc \psi(t)\propto\begin{cases}t^{\beta}\times\exp(-t/\tau),&\text{ when }t<=t_{\text{trunc }}\\ r_{\mathrm{SFR}}\times\psi\left(t=t_{\text{trunc}}\right),&\text{ when }t>t_{\text{trunc }}\end{cases} (6)

where ttrunct_{\text{trunc}} is the time when the star formation is quenched (rSFR<1r_{\mathrm{SFR}}<1) or rejuvenated (rSFR>1r_{\mathrm{SFR}}>1), and rSFRr_{\mathrm{SFR}} is the ratio between ψ(t>ttrunc )\psi(t>t_{\text{trunc }}) and ψ(t=ttrunc )\psi(t=t_{\text{trunc }}):

rSFR=ψ(t>ttrunc )ψ(ttrunc ).r_{\mathrm{SFR}}=\frac{\psi\left(t>t_{\text{trunc }}\right)}{\psi\left(t_{\text{trunc }}\right)}. (7)

This model (hereafter β\beta-τ\tau-r model) is a further extension of the β\beta-τ\tau model with the latter being the special case with rSFR=1r_{\mathrm{SFR}}=1. The third one is the double power-law model (Diemer et al., 2017; Carnall et al., 2018; Alsing et al., 2023) combined with a quenching (or rejuvenation) component which is described as:

ψ(t){1.0(tt)α+(tt)β, when t<=ttrunc rSFR×ψ(t=ttrunc), when t>ttrunc \psi(t)\propto\begin{cases}\frac{1.0}{{(\frac{t}{t^{*}})^{\alpha}+(\frac{t}{t^{*}})^{-\beta}}},&\text{ when }t<=t_{\text{trunc }}\\ r_{\mathrm{SFR}}\times\psi\left(t=t_{\text{trunc}}\right),&\text{ when }t>t_{\text{trunc }}\end{cases} (8)

where α\alpha and β\beta are the falling and rising slopes, respectively, and tt^{*} is related to the time at which star formation peaks, which is defined as tτtaget^{*}\equiv\tau t_{\mathrm{age}} for the age of the galaxy taget_{\mathrm{age}}. A major advantage of the double-power-law model is the decoupling of the rising and falling parts of the SFH. Therefore, this model (hereafter α\alpha-β\beta-τ\tau-r model) is even more flexible than the β\beta-τ\tau-r model.

The dust attenuation law (DAL) is another very important ingredient for the SED modeling of galaxies (Walcher et al., 2011; Conroy, 2013). When deriving the photometric redshift and physical properties of galaxies from the analysis of their photometric or spectroscopic observations, a universal DAL as a simple uniform screen is commonly assumed. However, different choices of the universal law may lead to very different estimation of photometric redshift and physical parameters of galaxies (Pforr et al., 2012, 2013; Salim & Narayanan, 2020). Especially, many studies show that the dust attenuation curve of different galaxies are very different (Kriek & Conroy, 2013; Reddy et al., 2015; Salmon et al., 2016; Salim & Boquien, 2019; Shivaei et al., 2020), and therefore there is no universal DAL as expected on theoretical grounds (Witt & Gordon, 2000; Seon & Draine, 2016; Narayanan et al., 2018; Lower et al., 2022). By a detailed study of the dust attenuation curves of about 230,000 individual galaxies in the local universe using photometric data covering from UV to IR bands, Salim et al. (2018) presented new forms of attenuation laws that are suitable for normal star-forming galaxies, high-z analogs, and quiescent galaxies (See also Noll et al., 2009). In this work, we additionally employ this new form of DAL which is parameterized as following:

Aλ,modAV=kλRV=kλ,CalRV,Cal(λ5500Å)δ+DλRV,mod,\frac{A_{\rm\lambda,{\rm mod}}}{A_{\rm V}}=\frac{k_{\lambda}}{R_{\rm V}}=\frac{k_{\lambda,{\rm Cal}}}{R_{V,{\rm Cal}}}\left(\frac{\lambda}{5500\text{\AA}}\right)^{\delta}+\frac{D_{\lambda}}{R_{V,{\rm mod}}}, (9)

where kλ,Cal/RV,Cal{k_{\lambda,{\rm Cal}}}/{R_{V,{\rm Cal}}} is the Calzetti et al. (2000) DAL with RV,Cal=4.05{R_{V,{\rm Cal}}}=4.05. The power law term with an exponent δ\delta is introduced to deviate from the slope of the Calzetti et al. (2000) DAL. RV,modR_{V,{\rm mod}} is the δ\delta-dependent ratio of total to selective extinction for the modified law. The term DλD_{\lambda} is introduced to add a UV bump. The relationship between RV,modR_{V,{\rm mod}} and δ\delta is given by:

RV,mod=RV,Cal(RV,Cal+1)(0.44/0.55)δRV,Cal.R_{V,{\rm mod}}=\frac{R_{V,{\rm Cal}}}{(R_{V,{\rm Cal}}+1)(0.44/0.55)^{\delta}-R_{V,{\rm Cal}}}. (10)

The UV bump following a Drude profile (Fitzpatrick, 1986) is represented as:

Dλ(Eb)=Ebλ2γ2[λ2(λ0)2]2+λ2γ2,D_{\lambda}(E_{\rm b})=\frac{E_{\rm b}\lambda^{2}\gamma^{2}}{[\lambda^{2}-(\lambda_{0})^{2}]^{2}+\lambda^{2}\gamma^{2}}, (11)

with the amplitude EbE_{\rm b}, fixed central wavelength λ0=0.2175μm\lambda_{0}=0.2175\mu{\rm m} and width γ=0.35μm\gamma=0.35\mu{\rm m}.

In total, we have considered six different combinations of SFH, CEH, and DAL with increasing complexity. A summary of these models, their parameters and priors are shown in Table 1.

Table 1: Summary of SED models, parameters and priors.
Model Parameter Range Prior
zz [0,4][0,4] U
log(M/M){\rm log}(M_{*}/{\rm M_{\odot}}) [8,12][8,12] U
log(age/yr){\rm log}(age/\rm{yr})ccfootnotemark: [8,10.14][8,10.14] U
AvA_{\rm v} [0,4][0,4] U
SFH=τ\tau,CEH=no log(Z0/Z){\rm log}(Z_{0}/Z_{\odot}) [2.30,0.70][-2.30,0.70] U
DAL=Cal+00aafootnotemark: log(τ/yr){\rm log}(\tau/\rm{yr}) [6,10][6,10] U
SFH=τ\tau,CEH=yes log(Z(tage)/Z){\rm log}(Z(t_{\mathrm{age}})/Z_{\odot}) [2.30,0.70][-2.30,0.70] U
DAL=Cal+00aafootnotemark: log(τ/yr){\rm log}(\tau/\rm{yr}) [6,10][6,10] U
SFH=τ\tau,CEH=yes log(Z(tage)/Z){\rm log}(Z(t_{\mathrm{age}})/Z_{\odot}) [2.30,0.70][-2.30,0.70] U
DAL=Sal+18bbfootnotemark: log(τ/yr){\rm log}(\tau/\rm{yr}) [6,10][6,10] U
EbE_{\rm b} [2,6][-2,6] U
δ\delta [1.2,0.4][-1.2,0.4] U
SFH=β\beta-τ\tau log(Z(tage)/Z){\rm log}(Z(t_{\mathrm{age}})/Z_{\odot}) [2.30,0.70][-2.30,0.70] U
CEH=yes log(τ/yr){\rm log}(\tau/\rm{yr}) [6,10][6,10] U
DAL=Sal+18bbfootnotemark: β\beta [0,1][0,1] U
EbE_{\rm b} [2,6][-2,6] U
δ\delta [1.2,0.4][-1.2,0.4] U
SFH=β\beta-τ\tau-r log(Z(tage)/Z){\rm log}(Z(t_{\mathrm{age}})/Z_{\odot}) [2.30,0.70][-2.30,0.70] U
CEH=yes log(τ/yr){\rm log}(\tau/\rm{yr}) [6,10][6,10] U
DAL=Sal+18bbfootnotemark: rSFR [1e6,1e6][1e-6,1e6] LUddfootnotemark:
ttrunc/taget_{\rm trunc}/t_{\mathrm{age}} [0,1][0,1] U
β\beta [0,1][0,1] U
EbE_{\rm b} [2,6][-2,6] U
δ\delta [1.2,0.4][-1.2,0.4] U
SFH=α\alpha-β\beta-τ\tau-r log(Z(tage)/Z){\rm log}(Z(t_{\mathrm{age}})/Z_{\odot}) [2.30,0.70][-2.30,0.70] U
CEH=yes τ=t/tage\tau=t^{*}/t_{\mathrm{age}} [0.007,1][0.007,1] U
DAL=Sal+18bbfootnotemark: α\alpha [0.01,1000][0.01,1000] LU
β\beta [0.01,1000][0.01,1000] LU
rSFR [1e6,1e6][1e-6,1e6] LUddfootnotemark:
ttrunc/taget_{\rm trunc}/t_{\mathrm{age}} [0,1][0,1] U
EbE_{\rm b} [2,6][-2,6] U
δ\delta [1.2,0.4][-1.2,0.4] U

Finally, we also include the effect of IGM absorption with the description of Madau (1995). Other other more recent consideration of IGM absorption are also available in BayeSED. However, the exploration of the effects of different choices of IGM absorption models on the redshift and stellar population parameter estimation is beyond the scope of this work, which will not change the conclusions given here.

2.1.2 Galaxy population modeling

To model the galaxy population, we need to set the joint probability distribution that characterizes the statistical properties of the galaxy population. The statistical properties of the galaxy population are the results of the complex physical procedures happened during the formation and evolution of galaxies. In this work, we employ some widely discussed empirical statistical properties of galaxies to model the galaxy population phenomenologically. It should be mentioned that there are large uncertainties in these statistical properties, and we do not attempt to use the most up-to-date results for all of them in this work. The other choices of statistical properties of galaxies will not change the conclusions of this work.

Refer to caption
Figure 1: The joint distributions of redshift and physical parameters of the empirical statistics-based mock galaxy population produced with BayeSED combined with SED models of different complexity. With the same set of empirical statistics, different SED models lead to slightly different redshift and age distribution, which is likely due to different mapping relations from free parameters to derived parameters. Meanwhile, only the two SED models with a quenching component produce a clear region of quiescent galaxies below the star-forming main sequence.

Similar to Tanaka (2015)(See also Alsing et al., 2023), we assume that the joint probability distribution of the stellar population parameters and redshift of galaxy population can be factorized as

P(𝑮,z)=P(M,z)\displaystyle P(\bm{G},z)=P(M_{*},z) (12)
×P(SFRM,z)\displaystyle\quad\times P(SFR\mid M_{*},z)
×P(AVSFR,z)\displaystyle\quad\times P(A_{\rm V}\mid SFR,z)
×P(ageM,z)\displaystyle\quad\times P(age\mid M_{*},z)
×P(ZMWM,z).\displaystyle\quad\times P(Z_{\rm MW}\mid M_{*},z).

The joint distribution of stellar mass and redshift is defined as

P(M,z)Φ(M,z)dV(z),P(M_{*},z)\propto\Phi(M_{*},z)dV(z), (13)

where Φ(M,z)\Phi(M_{*},z) is the unnormalized stellar mass function and dV(z)dV(z) is the differential comoving volume element. We employ the recent measurement of stellar mass function and its redshift evolution from Leja et al. (2020), while a WMAP-5 (Spergel et al., 2003) cosmology for the comoving volume element.

Following Tanaka (2015), we assume that P(SFRM,z)P(SFR\mid M_{*},z) can be expressed as the sum of two Gaussians to represent two distinct sequences formed by star forming and quiescent galaxies:

P(SFRM,z)(1fQ)σSFexp[12(logSFRlogSFRSF(M,z)σSF)2]+fQσQexp[12(logSFRlogSFRSF(M,z)+2σQ)2],\begin{gathered}P\left(\mathrm{SFR}\mid M_{*},z\right)\propto\\ \frac{(1-f_{\mathrm{Q}})}{\sigma_{\mathrm{SF}}}\exp\left[-\frac{1}{2}\left(\frac{\log\mathrm{SFR}-\log\mathrm{SFR}_{\mathrm{SF}}\left(M_{*},z\right)}{\sigma_{\mathrm{SF}}}\right)^{2}\right]\\ \quad+\frac{f_{\mathrm{Q}}}{\sigma_{\mathrm{Q}}}\exp\left[-\frac{1}{2}\left(\frac{\log\mathrm{SFR}-\log\mathrm{SFR}_{\mathrm{SF}}\left(M_{*},z\right)+2}{\sigma_{\mathrm{Q}}}\right)^{2}\right],\end{gathered} (14)

where SFRSF(M,z)\operatorname{SFR}_{\mathrm{SF}}\left(M_{*},z\right) is the mean SFR of star forming galaxies given by

SFRSF(M,z)=SFR(z)×M1011MMyr1,\operatorname{SFR}_{\mathrm{SF}}\left(M_{*},z\right)=\operatorname{SFR}^{*}(z)\times\frac{M_{*}}{10^{11}M_{\odot}}M_{\odot}\mathrm{yr}^{-1}, (15)

with

SFR(z)={10×(1+z)2.1(z<2)19×(1+z)1.5(z2),\operatorname{SFR}^{*}(z)=\begin{cases}10\times(1+z)^{2.1}&(z<2)\\ 19\times(1+z)^{1.5}&(z\geqslant 2)\end{cases}, (16)

and the fraction of quiescent galaxies is given as a function of stellar mass and redshift (Behroozi et al., 2013):

fQ(M,z)=[(M1010.2+0.5zM)1.3+1]1.f_{\mathrm{Q}}\left(M_{*},z\right)=\left[\left(\frac{M_{*}}{10^{10.2+0.5z}M_{\odot}}\right)^{-1.3}+1\right]^{-1}. (17)

As in Tanaka (2015), the dust attenuation is considered to positively correlate with SFR:

P(τVSFR,z)1στVexp[12(τVτVστV)2],P\left(\tau_{V}\mid\mathrm{SFR},z\right)\propto\frac{1}{{\sigma_{\tau_{V}}}}\exp\left[-\frac{1}{2}\left(\frac{\tau_{V}-\left\langle\tau_{V}\right\rangle}{\sigma_{\tau_{V}}}\right)^{2}\right], (18)

where στV=0.5\sigma_{\tau_{V}}=0.5, and

τV={0.2(SFR0<1)0.2+0.5logSFR0(SFR0>1),\left\langle\tau_{V}\right\rangle=\begin{cases}0.2&\left(\mathrm{SFR}_{0}<1\right)\\ 0.2+0.5\log\mathrm{SFR}_{0}&\left(\mathrm{SFR}_{0}>1\right)\end{cases}, (19)
SFR0=100SFRSFR(z),\mathrm{SFR}_{0}=100\frac{\mathrm{SFR}}{\mathrm{SFR}^{*}(z)}, (20)

Then, we use the relation between τV\tau_{V} and AVA_{\rm V} to obtain P(AVSFR,z)P(A_{\rm V}\mid SFR,z).

The probability of the age of a galaxy is described conditionally on the stellar mass and redshift:

P(ageM,z)exp[SFRSFR(z)],P\left(\operatorname{age}\mid M_{*},z\right)\propto\exp\left[-\frac{\langle\mathrm{SFR}\rangle}{\operatorname{SFR}^{*}(z)}\right], (21)

where

SFR=M/ age. \langle\mathrm{SFR}\rangle=M_{*}/\text{ age. } (22)

This lead to a low probability for a massive galaxy with young age, while a high probability for a low-mass galaxy with the same age.

Finally, the probability of mass weighted stellar metallicity is modeled as:

P(ZMWM,z)1σlog(ZMW)exp[12(log(ZMW)log(ZMW)σlog(ZMW))2],\begin{array}[]{l}P\left(Z_{\rm MW}\mid M_{*},z\right)\propto\\ \frac{1}{{\sigma_{\log(Z_{\rm MW})}}}\exp\left[-\frac{1}{2}\left(\frac{\log(Z_{\rm MW})-\left\langle\log(Z_{\rm MW})\right\rangle}{\sigma_{\log(Z_{\rm MW})}}\right)^{2}\right],\end{array} (23)

where σlog(ZMW)=0.1{\sigma_{\log(Z_{\rm MW})}}=0.1, and

log(ZMW)=0.40[logM10]+0.67exp(0.50z)1.04\begin{array}[]{l}\left\langle\log(Z_{\rm MW})\right\rangle=\\ 0.40\left[\log M_{*}-\right.10]+0.67\exp(-0.50z)-1.04\end{array} (24)

is the redshift dependent stellar mass and metallicity relation (Ma et al., 2016) which is predicted by using the high-resolution cosmological zoom-in simulations from the Feedback in Realistic Environment (FIRE) project (Hopkins et al., 2014).

To generate empirical statistics-based mock catalog of galaxies, we employ the MultiNest algorithm to draw samples from the joint probability distribution of the stellar population parameters and redshift of galaxy population by setting P(𝑮,z)P(\bm{G},z) in Equation 12 to be the likelihood function. Besides, to simulate a magnitude-limited sample, we can additionally set the likelihood function to be 0 when the magnitude in a given band is larger than a given value. Since the sampling points with likelihood to be 0 will be ignored by MultiNest, the obtained posterior sample can be used to buid a magnitude-limited sample of mock galaxies with some physical constraints from the empirical statistical properties of galaxies. More details about the selection of magnitude-limited sample is presented in §2.4. The mock catalog can be build with the posterior sample of redshift and all physical parameters given by MultiNest. However, this is a weighted sample (Yallup et al., 2022), which can not be directly used as a mock sample of galaxies. To build a more realistic mock sample of galaxies, we use bootstrap resampling method to obtain an unweighted sample.

In total, we have build six mock catalog of galaxies by employing SED models with different combinations of SFH, CEH, and DAL and increasing complexity as shown in Table 1, respectively. The employed priors of redshift and stellar population parameters are listed in the same Table for each model, respectively. In Figure 1, we show the joint distributions of redshift and physical parameters of the six empirical statistics-based mock galaxy population. Although the sampe set of empirical are employed, different SED models lead to slightly different distribution of parameters, especially for redshift and galaxy age. This is likely due to different mapping relations from free parameters to derived parameters. For example, different forms of SFH may lead to different relations between the age of galaxy and its recent SFR.

2.2 Hydrodynamical simulation-based photometric mock catalog

The second method to generate mock photometric catalog is based on an SED library which is built by the post-processing of galaxies from a hydrodynamical simulation. This catalog will be used in §5 to test the performance of redshift and stellar population parameter estimation in the case where the SED modeling is imperfect, since the SED modeling method employed in the Bayesian SED analysis will be very different from the one used to built it.

We start from the rest-frame spectra of galaxies which are produced using the light-cone from the cosmological hydrodynamical simulation Horizon-AGN (Dubois et al., 2014). The computation of these spectra, which accounts for the complex star formation history and metal enrichment of Horizon-AGN galaxies, and consistently includes dust attenuation, is described in details by Laigle et al. (2019) and Davidzon et al. (2019). The dust attenuation of galaxies in Horizon-AGN simulation is modelled for each stellar particle, assumed to be a SSP, by using the gas metal mass distribution as an approximation of the dust mass distribution, assuming a constant dust-to-metal mass ratio (Laigle et al., 2019). Besides, to obtain the amount of extinction at a given wavelength, the Weingartner & Draine (2001) model of Milky Way dust grain with RV=3.1R_{\rm V}=3.1 and the prominent 2175Å2175{\rm\AA}-graphite bump is employed for post-processing the simulated galaxies. As mentioned in Laigle et al. (2019), the overall attenuation curve becomes less steep and the bump tends to be reduced when summing up the contribution of all the SSPs to obtain the resulting galaxy spectrum. They also noticed that the averaged attenuation curve in Horizon-AGN simulation can not be well reproduced by either the model of Calzetti et al. (2000) or Arnouts et al. (2013). The more flexible form of DAL as given by Equation 9 is more likely to reproduce the attenuation curves of galaxies in the Horizon-AGN simulation. In order to isolate the possible differences in the convolution with filter response function, observational error modeling, and consideration of IGM absorption, we choose to convert their rest-frame spectra of mock galaxies to corresponding mock photometries with BayeSED, instead of using their virtual photometries directly111We have compared the two set of photometries careful and found that they are actually very similar, only with some differences at the very faint end.. The consideration for the effects of IGM absorption is the same as in §2.1.1.Therefore, the difference between the empirical statistics-based (§2.1) and hydrodynamical simulation-based photometric mock catalog are only driven by the different SFH, CEH, DAL of mock galaxies and their different distribution of redshift and physical parameters.

2.3 Observational error modeling

The modeling of realistic errors on the flux is crucial for a meaningful performance test of redshift and stellar population parameter estimation. Here, we introduce the method we have employed to compute flux errors of mock galaxies and perturb their fluxes accordingly. The flux error for a wavelength band ii with Nσ\sigma AB magnitude limit mlim,i=2.5log(Flim,i)+23.9m_{{\rm lim},i}=-2.5*{\rm log}(F_{\rm lim,i})+23.9 is given by:

σF,i=(Flim,i/N)2+σF,i,sys2,\sigma_{F,i}=\sqrt{(F_{\rm lim,i}/N)^{2}+\sigma^{2}_{F,i,{\rm sys}}}, (25)

where the flux limit Flim,iF_{\rm lim,i} and the systematic flux error σF,i,sys\sigma_{F,i,{\rm sys}} are in unit of μJy\mu{\rm Jy}. From the relation that magnitude error σm1.08574/SNR\sigma_{m}\approx 1.08574/SNR and signal-to-noise ratio SNR=F/σFSNR=F/\sigma_{F}, we can obtain:

σF,i,sys=σm,sysFi/1.08574.\sigma_{F,i,{\rm sys}}=\sigma_{m,{\rm sys}}*F_{i}/1.08574. (26)

As in Cao et al. (2018), we assume a systematic magnitude error σm,sys=0.02\sigma_{m,{\rm sys}}=0.02. The final mock flux is obtained by the original flux perturbed by a Gaussian noise ϵ𝒩(0,σF,i2)\epsilon\sim\mathcal{N}\left(0,\sigma_{F,i}^{2}\right). In practice, the magnitude limit may have a dispersion σm,lim,i\sigma_{m,{\rm lim},i} for galaxies with different sizes. So, the actually used magnitude limit is drawn from the Gaussian distribution 𝒩(mlim,i,σm,lim,i2)\mathcal{N}\left(m_{{\rm lim},i},\sigma_{m,{\rm lim},i}^{2}\right). In this work, we set σm,lim,i=0.1\sigma_{m,{\rm lim},i}=0.1. We have generated three sets of mock catalog for CSST-like, Euclid-like, and COSMOS-like surveys, respectively. A summary of the adopted depths in all bands of the three surveys is shown in Table 2. The response functions and modeled relation between magnitude and magnitude error are shown in the panels of Figure 2 for the 7 CSST bands, 3 Euclid bands and 26 COSMOS bands, respectively. To separate the effects of observational errors on the accuracy of parameter estimation, we also generated another two sets of mock catalog without adding observational errors. In this case (the no noise case in Figure 2), the magnitude errors are all fixed to be 0.010.01, but the photometries have not been perturbed accordingly.

Figure 2: (a) Response functions for CSST and Euclid bands. (b) The modeled relation between magnitude and magnitude error for CSST bands. Sources with SNR<1SNR<1 (i.e. σm,i>1.08574\sigma_{\rm m,i}>1.08574) are considered as non-detections. The non-detections in a wavelength band ii with Nσ\sigma flux limit Flim,iF_{\rm lim,i} and magnitude limit mlim,im_{{\rm lim},i} are represented as sources with Fi=Flim,iF_{i}=F_{\rm lim,i} and σF,i=Flim,i/N\sigma_{F,i}=-F_{\rm lim,i}/N (the flux case), or mi=mlim,im_{i}=m_{{\rm lim},i} and σm,i=1.08574/N\sigma_{\rm m,i}=-1.08574/N (the magnitude case). These conventions make sure the consistent conversion between flux data and magnitude data in the input file of BayeSED. (c), (d): Same as in (a), (b), but for COSMOS bands. For clarity, the twelve intermediate bands (IBs) and two narrow bands (NBs) are not shown.
Survey band depth
NUVNUV 24.2±0.124.2\pm 0.1
uu 24.2±0.124.2\pm 0.1
CSST-like gg 25.1±0.125.1\pm 0.1
5σ5\sigma depthbbfootnotemark: rr 24.8±0.124.8\pm 0.1
ii 24.6±0.124.6\pm 0.1
zz 24.1±0.124.1\pm 0.1
yy 23.2±0.123.2\pm 0.1
Euclid-like JJ 24.0±0.124.0\pm 0.1
5σ5\sigma depthaafootnotemark: HH 24.0±0.124.0\pm 0.1
YY 24.0±0.124.0\pm 0.1
uu 26.0±0.126.0\pm 0.1
BB 26.4±0.126.4\pm 0.1
VV 25.6±0.125.6\pm 0.1
rr 25.9±0.125.9\pm 0.1
i+i^{+} 25.6±0.125.6\pm 0.1
z++z^{++} 25.3±0.125.3\pm 0.1
YY 24.7±0.124.7\pm 0.1
COSMOS-like JJ 24.3±0.124.3\pm 0.1
5σ5\sigma depth aafootnotemark: HH 24.0±0.124.0\pm 0.1
KsK_{\rm s} 24.1±0.124.1\pm 0.1
IBIB 2425±0.124-25\pm 0.1
NB711NB711 24.5±0.124.5\pm 0.1
NB816NB816 24.6±0.124.6\pm 0.1
3.6μm3.6\mu{\rm m} 24.9±0.124.9\pm 0.1
4.5μm4.5\mu{\rm m} 24.9±0.124.9\pm 0.1
Table 2: A summary of the adopted depths in all bands for CSST-like, Euclid-like, and COSMOS-like mock observations.

2.4 Sample selection

To test the performance of BayeSED, we selected two sets of samples of galaxies with Ks<24.7K_{s}<24.7 (Laigle et al., 2019) and i+<25i+<25 (Cao et al., 2018)222In Cao et al. (2018) and Zhou et al. (2022a, b), only the high-quality sources with the SNR10SNR\geq 10 in g or i band were selected to test the performance of photometric redshift estimation. This selection is not used in this work to obtain a more full picture for the performance of the CSST wide-field multiband imaging survey. from the empirical statistics-based mock catalog (§2.1) and the hydrodynamical simulation-based mock catalog (§2.2), respectively. The first set of samples are obtained directly with BayeSED combined with SED models with different complexity by using the method present in §2.1.2. The second sample is selected from the Horizon-AGN hydrodynamical simulation-based photometric catalogs for COSMOS-like configuration333https://www.horizon-simulation.org/PHOTOCAT/HorizonAGN_LAIGLE-DAVIDZON+2019_COSMOS_v1.6.fits which contains 789,354789,354 galaxies. We find that a sample with 10,00010,000 galaxies is large enough to obtain stable results for the performance tests as presented in §4 and §5. The redshift and magnitude distributions of the two samples are presented in Figure 3. When employing different SED models with different complexity and the same set of empirical statistics, the empirical statistics-based samples show some differences, especially the redshift distribution. This is likely due to the different mapping relations from physical parameters to photometries and from free parameters to derived parameters for different SED models. Generally, the hydrodynamical simulation-based sample is consistent with the empirical statistics-based samples. We attribute the difference between the two set of samples to the different modeling of the SFH, CEH and DAL of galaxies and their different distribution of redshift and physical parameters.

Refer to caption
Refer to caption
Refer to caption
Figure 3: The comparison of redshift and magnitude distributions of the empirical statistics-based and the Horizon-AGN hydrodynamical simulation-based mock galaxy samples.

3 Bayesian photometric SED analysis with BayeSED

The general method for the application of Bayesian inference to photometric SED analysis of galaxies is the same as in Han & Han (2014, 2019). In this section, we introduce some special aspects of Bayesian parameter estimation (§3.1) and model selection (§3.2) which are relevant to the current work.

3.1 Bayesian parameter estimation

For the Bayesian analysis of the mock data generated in the last section, we employ the same SED modeling procedure and setting of priors for free parameters as in §2.1.1, while the commonly used Gaussian form of likelihood function is employed. The performance of this Bayesian analysis, including its speed and quality, is crucial for the analysis of large sample of galaxies in the big data era. We need some metrics to quantify the performance of parameter estimation which is the main subject of this work.

While the speed of parameter estimation can be easily quantified by the running time, some metrics for the quality of parameter estimation are required. Similar to Acquaviva et al. (2015), we use three metrics to quantify the quality of parameter estimation. Bias, which characterizes the median separation between the predicted and the true values, is defined as:

BIA=Median(Δx),BIA={\rm Median}(\Delta x), (27)

while the precision, which describes the scatter between predicted and the true values, is defined as:

σNMAD=1.4826Median(|Δx|),\sigma_{\rm NMAD}=1.4826*{\rm Median}(\lvert\Delta x\rvert), (28)

where Δx=(xphotxtrue)|/(1+xtrue)\Delta x=(x_{\mathrm{phot}}-x_{\mathrm{true}})|/(1+x_{\mathrm{true}}) for redshift (Ilbert et al., 2009; Dahlen et al., 2013; Salvato et al., 2018), and Δx=(xphotxtrue)/(xtruemaxxtruemin)\Delta x=(x_{\mathrm{phot}}-x_{\mathrm{true}})/(x_{\mathrm{true}}^{\mathrm{max}}-x_{\mathrm{true}}^{\mathrm{min}}) for other parameters. The median-base definition makes them to be less sensitive to outliers (sources with unexpectedly large errors). The fraction of outliers is defined as:

OLF=1Nobj×N(|Δx|>0.15).OLF=\frac{1}{N_{\mathrm{obj}}}\times N(|\Delta x|>0.15). (29)

3.2 Bayesian model selection

An important advantage of nested sampling-based algorithm, such as MultiNest, over MCMC-based method is the ability to carry out a simultaneous parameter estimation and model selection. While the main subject of this work is parameter estimation, it is also interesting to explore the effects of sampling parameters (namingly, nlivenlive and efrefr) of MultiNest on the computation of Bayesian evidence, the quantity which is crucial for Bayesian model selection.

In Han & Han (2019), we presented a mathematical framework to discriminate the different assumptions about SSP, DAL and SFH in the SED modeling of galaxies based on the Bayesian evidence for a sample of galaxies. In this work, since the SSP model employed in the generation of mock data is the same as that employed in their Bayesian analysis, we do not need to consider the different choices of SSP. So, the problem is significantly simplified. In this work, we focus on the computation of the Bayesian evidence for the SED modeling of a sample of galaxies with SSP, SFH, and DAL all being assumed to be universal (i.e. 𝑴(ssp0,sfh0,dal0){{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})}-like model (See Section 5.1 of Han & Han, 2019).) The sample Bayesian evidence in this case (as Equation 33 of Han & Han (2019)) is:

p(𝒅1,𝒅2,,𝒅N|𝑴(ssp0,sfh0,dal0),𝑰)=\displaystyle p(\bm{d}_{1},\bm{d}_{2},\dots,\bm{d}_{N}|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})=
g=1Np(𝒅g|𝜽g,𝑴(ssp0,sfh0,dal0),𝑰)\displaystyle\prod\limits_{g=1}^{N}\int p(\bm{d}_{g}|{\bm{\theta}_{g}},{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})
p(𝜽g|𝑴(ssp0,sfh0,dal0),𝑰)d𝜽g\displaystyle p({\bm{\theta}_{g}}|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I}){\mathrm{d}\bm{\theta}_{g}}
=g=1Np(𝒅g|𝑴(ssp0,sfh0,dal0),𝑰).\displaystyle=\prod\limits_{g=1}^{N}p(\bm{d}_{g}|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I}). (30)

Although the detailed SFH and DAL of different galaxies can vary significantly, the sample Bayesian evidence computed in this manner remains valuable for identifying the most efficient combination of SFH and DAL for analyzing a vast sample of galaxies, such as the one provided by the CSST wide-field imaging survey.

In practice, the natural logarithmic of Bayesian evidence is commonly used for Bayesian model selection. Therefore, Equation 30 can be rewritten as:

ln(BE)ln(p(𝒅1,𝒅2,,𝒅N|𝑴(ssp0,sfh0,dal0),𝑰))\displaystyle\ln(BE)\equiv\ln(p(\bm{d}_{1},\bm{d}_{2},\dots,\bm{d}_{N}|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I}))
=g=1Nln(p(𝒅g|𝑴(ssp0,sfh0,dal0),𝑰)),\displaystyle=\sum\limits_{g=1}^{N}\ln(p(\bm{d}_{g}|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})), (31)

where ln(p(𝒅g|𝑴(ssp0,sfh0,dal0),𝑰))\ln(p(\bm{d}_{g}|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})), the Bayesian evidence for an individual galaxy, can be directly obtained in BayeSED with MultiNest. However, the individual Bayesian evidences estimated with MultiNest contain errors. A more strict Bayesian model selection should consider the effects of error propagation. In our case, the error of the sample Bayesian evidence ln(BE)\ln(BE) is simply the sum of errors for individual galaxies which is provided by MultiNest as well.

The minimum χ2\chi^{2} method is also widely used for model selection. For the case with Gaussian observational errors, there is only a constant difference between the minimum χ2\chi^{2} and the natural logarithmic of maximum likelihood. The sample maximum likelihood (as Equation 32 of Han & Han (2019)) is:

max(𝜽^1,𝜽^2,,𝜽^N)\displaystyle\mathcal{L}_{\rm max}(\hat{\bm{\theta}}_{1},\hat{\bm{\theta}}_{2},\dots,\hat{\bm{\theta}}_{N})
max𝜽1,𝜽2,,𝜽N[p(𝒅1,𝒅2,,𝒅N|𝜽1,𝜽2,,𝜽N,\displaystyle\equiv{\mathop{\rm max}\limits_{\bm{\theta}_{1},\bm{\theta}_{2},\dots,\bm{\theta}_{N}}}[p(\bm{d}_{1},\bm{d}_{2},\dots,\bm{d}_{N}|{\bm{\theta}_{1},\bm{\theta}_{2},\dots,\bm{\theta}_{N}},
𝑴(ssp0,sfh0,dal0),𝑰)]\displaystyle{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})]
=g=1Nmax𝜽g[p(𝒅g|𝜽g,𝑴(ssp0,sfh0,dal0),𝑰)].\displaystyle=\prod\limits_{g=1}^{N}{\mathop{\rm max}\limits_{\bm{\theta}_{g}}}[p(\bm{d}_{g}|{\bm{\theta}_{g}},{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})]. (32)

Then, the natural logarithmic of sample maximum likelihood is:

ln(ML)ln(max(𝜽^1,𝜽^2,,𝜽^N))\displaystyle\ln(ML)\equiv\ln(\mathcal{L}_{\rm max}(\hat{\bm{\theta}}_{1},\hat{\bm{\theta}}_{2},\dots,\hat{\bm{\theta}}_{N}))
=g=1Nln(max𝜽g[p(𝒅g|𝜽g,𝑴(ssp0,sfh0,dal0),𝑰)]),\displaystyle=\sum\limits_{g=1}^{N}\ln({\mathop{\rm max}\limits_{\bm{\theta}_{g}}}[p(\bm{d}_{g}|{\bm{\theta}_{g}},{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})]), (33)

where ln(max𝜽g[p(𝒅g|𝜽g,𝑴(ssp0,sfh0,dal0),𝑰)])\ln({\mathop{\rm max}\limits_{\bm{\theta}_{g}}}[p(\bm{d}_{g}|{\bm{\theta}_{g}},{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})]), the natural logarithmic of maximum likelihood for an individual galaxy, can be directly obtained in BayeSED with MultiNest. Similar to the model selection with Bayesian evidence, only the difference of ln(ML)\ln(ML) between different models is useful for the model selection. Therefore, the model selection with ln(ML)\ln(ML) is equivalent to that with minimum χ2\chi^{2}. In §6, we will discuss the difference between the two model selection methods.

3.3 Runtime parameters of MultiNest algorithm

As the Bayesian inference engine of BayeSED, MultiNest has some runtime parameters. The values of these runtime parameters have very important effects on the performance of BayeSED for redshift and stellar population parameter estimation of galaxies. Here, we briefly introduce the meaning of these runtime parameters of MultiNest algorithm.

Nested sampling (NS) (Skilling, 2004, 2006), as a Monte Carlo (MC) method primarily designed for the efficient computation of the Bayesian evidence, allows posterior inference as a by-product at the same time. So, it provides a way to carry out simultaneous Bayesian parameter estimation and model selection. As an algorithm built on the NS framework, MultiNest (Feroz & Hobson, 2008; Feroz et al., 2009) is special for its efficiency in sampling from posteriors that may contain several modes and/or degeneracies. It has been improved further by the implementation of importance nested sampling (INS) (Cameron & Pettitt, 2014; Feroz et al., 2019) to increase the efficiency for evidence computation. In the latest version of BayeSED, the V3.12 version of MultiNest, which includes the implementation of INS, is employed.

Similar to most nested sampling algorithms, MultiNest explores the posterior distribution by maintaining a fixed number (See also Higson et al., 2019; Speagle, 2020, for new methods using variable number) of samples drawn from the prior distribution, called live points, and iteratively replaces the point with the lowest likelihood value (the dead point) with another point drawn from the prior but has a higher value of likelihood. While there are many runtime parameters of MultiNest which can be set in BayeSED, only two of them are of particular importance. They largely determined the accuracy and computational cost for the running of MultiNest algorithm and therefore BayeSED. The first one is the total number of live points (nlivenlive), which determines the effective sampling resolution. The second one is the target sampling efficiency (efrefr), which determines the ratio of points accepted to those sampled. Generally, the larger nlivenlive and lower efrefr lead to more accurate posteriors and evidence values but higher computational cost. The optimal value of nlivenlive and efrefr should be problem-dependent, although efrefr equals to 0.80.8 and 0.30.3 are recommended by the authors of MultiNest for parameter estimation and evidence evalutaion, respectively.

In this work, we will explore the effects of nlivenlive and efrefr on the estimation of photometric redshift and stellar population parameters. The results are presented in §4.1 and §4.2, respectively.

4 Results of performance tests using empirical statistics-based mock galaxy sample

In this section, we present the results of performance tests of photometric redshift and stellar population parameter estimation by using empirical statistics-based mock galaxy sample for CSST wide-field multiband imaging survey. Since the SED model employed in the Bayesian parameter estimation is exactly the same as that used in the generation of mock observations, the error of parameter estimation is mainly contributed by the random error in the data, parameter degeneracies, the stochastic nature of the employed MultiNest sampling algorithm and other potential errors in the BayeSED code. To separate out the effects of the random photometric error in the data, we will consider the two cases with and without adding random noise to the photometric data. Besides, to find out the optimal run parameters, we have considered six different choices of the target sampling efficiency (efrefr) and eight choices of the number of live points (nlivenlive) for the MultiNest sampling algorithm. Furthermore, we compare the performance of different SED models with increasing complexity in terms of running time and quality of parameter estimation.

Figure 4: Performance test with empirical statistics-based mock galaxy sample for the photometric redshift estimation of galaxies in the CSST wide-field multiband imaging survey. (a) The results for only the simplest SED model (SFH=τ\tau,-CEH,DAL=Cal+00) employed in this work. We have considered six different choices of efrefr (target sampling efficiency) as shown by different symbols. For each efrefr, we have considered eight cases with the number of live points (nlivenlive), which determines the effective sampling resolution, equals to 1010, 1515, 2020, 2525, 5050, 100100, 200200, and 400400, respectively. The relations between the computation time (in sec/obj/cpu, by employing one core of a 2.2 GHz cpu) and the performance metrics σNMAD\sigma_{\rm NMAD} (top panels), BIA (middle panels), OLF (bottom panels) are shown respectively. The results for the two cases with (left panels) and without (right panels) observational noise in the mock data are shown respectively. In general, larger value of nlivenlive and smaller value of efrefr lead to better quality of redshift estimation, but with the cost of longer running time. (b) The results for six different SED models with increasing complexity. Only the results with efr=0.1efr=0.1 are shown. In general, more complicated SED models require longer running time. They also lead to worse quality of photometric redshift estimation, which is likely due to more severe parameter degeneracies. Actually, for the last four more complicated SED models with the DAL of Salim et al. (2018), the number of free parameter is greater than the number of photometric data points (7 for CSST imaging survey), as shown in Table 1.

4.1 Photometric redshift estimation

The results of performance tests for photometric redshift estimation are shown in Figure 4. In Figure 4(a), we show the results for only the simplest SED model (SFH=τ\tau,-CEH,DAL=Cal+00) employed in this work. As shown in the top right panel of this figure, in the case without noise, there is a clear anti-correlation between the computation time (or the sampling resolution nlivenlive) and the error σNMAD\sigma_{\rm NMAD} (defined in Equation 28) of photometric redshift estimation. Meanwhile, a smaller efrefr makes the anti-correlation converge faster with the increasing of nlivenlive. There is a clear lower limit for the value of σNMAD\sigma_{\rm NMAD}, which is about 0.0060.006. As shown in the top left panel of Figure 4(a), in the case with noise, the error of photometric redshift estimation does not always decrease with the sampling resolution nlivenlive. When we set efr=0.1efr=0.1, the lowest error (0.056\lesssim 0.056) of redshift estimation is obtained when nlivenlive is about 2525.When nlive>25nlive>25, the error of redshift estimation start to increase with nlivenlive and finally converge to 0.058\sim 0.058. This is most likely due to the overfitting to the noise added to the mock data.

The middle two panels of Figure 4(a) show the relation between the computation time (or nlivenlive) and the bias (defined in Equation 27) of photometric redshift estimation. In the case with noise, the relation between the computation time (or nlivenlive) and bias has almost the opposite profile of that of the error σNMAD\sigma_{\rm NMAD}. However, the bias of photometric redshift estimation is generally very small, which is almost zero in the noise-free case.

The bottom two panels of Figure 4(a) show the relation between the computation time (or nlivenlive) and the fraction of outliers OLF (defined in Equation 29) of photometric redshift estimation. Similar to that for σNMAD\sigma_{\rm NMAD}, in the noise-free case, there is a clear anti-correlation between the computation time (or nlivenlive) and OLF. In this case, the lower limit for the value of OLF is about 0.0020.002. In the case with noise, the relation between the computation time (or nlivenlive) and OLF has the same profile as that of the error σNMAD\sigma_{\rm NMAD}. When we set efr=0.1efr=0.1, the lowest OLF (0.215\lesssim 0.215) of redshift estimation is also obtained when nlivenlive is about 2525. When nlive>25nlive>25, the OLF of redshift estimation start to increase with nlivenlive and finally converge to 0.225\sim 0.225.

In Figure 4(b), we show the results for all of the six SED models with increasing complexity. Here, only the results with efr=0.1efr=0.1 are shown. In the case without noise, as shown in the top right panel of this figure, the error σNMAD\sigma_{\rm NMAD} of photometric redshift estimation tends to converge to a larger value when more complicated SED model is employed. This is not strange, since more complicated SED models have more free parameters and thus suffer from more severe parameter degeneracies. Besides, more complicated SED models apparently require longer running time. The bias of redshift estimation is always very small no matter which SED is employed. In general, when more complicated SED model is employed, the OLF of redshift estimation also increases significantly, and decreases much slower with the increasing of nlivenlive.

In the case with noise, as shown in the left panels of Figure 4(b), the results are a little more complicated. For the first three simplest SED models, the error σNMAD\sigma_{\rm NMAD} of photometric redshift estimation apparently increases with the increasing of model complexity. However, when more complicated forms of SFH is considered, σNMAD\sigma_{\rm NMAD} start to decreases with the increasing of model complexity, although not very significantly. Meanwhile, the most complicated SED model (SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18) lead to the smallest absolute value of bias, although the bias is actually very small in all cases. The situation for OLF is somewhat similar to that of σNMAD\sigma_{\rm NMAD}. No matter which SED model is employed, when nlive25nlive\gtrsim 25, both σNMAD\sigma_{\rm NMAD} and OLF of redshift estimation start to increase, and then slowly decrease to a stable value.

4.2 Stellar population parameter estimation

In this subsection, we show the results of performance tests for the photometric stellar population parameter estimation. While the estimates of many stellar population parameter are available, we only show the results for stellar mass and SFR, which are two of the most important physical parameters for the study of the formation and evolution of galaxies.

Figure 5: As in Figure 4, but for the stellar mass estimation. The quality of stellar mass estimation, in terms of σNMAD\sigma_{\rm NMAD}, BIA and OLF, is worse than that of redshift estimation and more sensitive to the selection of SED models. In the case with noise, for the most complicated SED model (SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18) used in this work, the σNMAD\sigma_{\rm NMAD}, bias and OLF of stellar mass estimation increases significantly when nlive>100nlive>100. This should be a clear indication of overfitting to the noise in the data. In general, more complicated SED models lead to worse quality of stellar mass estimation.

4.2.1 Stellar mass

The results of performance tests for stellar mass estimation are shown in Figure 5. In Figure 5(a), we show the results for only the simplest SED model (SFH=τ\tau,-CEH,DAL=Cal+00) employed in this work. As shown in the top right panel of this figure, in the case without noise, there is also a clear anti-correlation between the computation time (or the sampling resolution nlivenlive) and the error σNMAD\sigma_{\rm NMAD} of photometric stellar mass estimation. The behavior of error σNMAD\sigma_{\rm NMAD} of stellar mass with respect to the change of efr is similar to that of photometric redshift. There is also a clear lower limit for the value of σNMAD\sigma_{\rm NMAD}, which is about 0.040.04. As shown in the top left panel of Figure 5(a), in the case with noise, the error of stellar mass estimation does not always decrease with the sampling resolution nlivenlive as well. When we set efr=0.1efr=0.1, the lowest error (0.1130\sim 0.1130) of stellar mass estimation is also obtained when nlivenlive is about 2525. When nlive>25nlive>25, the error of stellar mass estimation only slightly increase with nlivenlive. The error of stellar mass is about two times larger than that of photometric redshift estimation. The bias of stellar mass estimation is also larger, but still very small when comparing with σNMAD\sigma_{\rm NMAD}. In the noise-free case, there is also a clear anti-correlation between the computation time (or nlivenlive) and the OLF of stellar mass estimation, where the lower limit for the value of OLF is about 0.030.03. In the case with noise, when we set efr=0.1efr=0.1, the lowest OLF (0.285\lesssim 0.285) of stellar mass estimation is also obtained when nlivenlive is about 2525. When nlive>25nlive>25, the OLF of stellar mass estimation only slightly increase with nlivenlive as well. The OLF of stellar mass estimation is slightly larger than that of photometric redshift estimation.

In Figure 5(b), we show the results for all of the six SED models with increasing complexity, where only the results with efr=0.1efr=0.1 are shown. In the two cases with or without noise, as shown in the top right panel of this figure, the error σNMAD\sigma_{\rm NMAD} of photometric stellar mass estimation tends to converge to a larger value when more complicated SED model is employed. The same is true for the OLF of photometric stellar mass estimation. The behavior of bias is somewhat different, but it is generally very small when comparing with σNMAD\sigma_{\rm NMAD}. Besides, in the case with noise, for the most complicated SED model (SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18) used in this work, the σNMAD\sigma_{\rm NMAD}, bias and OLF of stellar mass estimation increases significantly when nlive>100nlive>100. This should be a very clear indication of overfitting to the noise in the data. In general, more complicated SED models lead to worse quality of stellar mass estimation.

Figure 6: As in Figure 4, but for the star-formation rate estimation. The quality of star-formation rate estimation, in terms of σNMAD\sigma_{\rm NMAD}, BIA and OLF, is slightly better than that of stellar mass estimation, but even more sensitive to the selection of SED models. In the case with noise, for the four most complicated SED models used in this work, the σNMAD\sigma_{\rm NMAD}, bias and OLF of SFR estimation significantly increases with nlivenlive. This is another even more clear indication of overfitting to the noise in the data. In general, more complicated SED models lead to worse quality of SFR estimation.

4.2.2 Star-formation rate

The results of performance tests for SFR estimation are shown in Figure 6. In Figure 6(a), we show the results for only the simplest SED model (SFH=τ\tau,-CEH,DAL=Cal+00) employed in this work. Similar to the results for photometric redshift and stellar mass estimation, in the case without noise, there is also a clear anti-correlation between the computation time (or the sampling resolution nlivenlive) and the error σNMAD\sigma_{\rm NMAD} of photometric stellar mass estimation. The behavior of error σNMAD\sigma_{\rm NMAD} of SFR with respect to the change of efr is similar to that of photometric redshift and stellar mass. There is also a clear lower limit for the value of σNMAD\sigma_{\rm NMAD}, which is about 0.020.02. As shown in the top left panel of Figure 5(a), in the case with noise, the error of SFR estimation increases apparently when the sampling resolution nlive25nlive\gtrsim 25. Generally, the error of SFR estimation is slightly smaller than that of stellar mass estimation. The bias of SFR estimation is also slightly smaller, and ignorable with respect to σNMAD\sigma_{\rm NMAD}. In the noise-free case, the relation between the computation time (or nlivenlive) and the OLF of SFR estimation is somewhat different from that of photometric redshift and stellar mass. Even with nlive=500nlive=500, the OLF of SFR estimation still does not seem to converge. The lower limit for the value of OLF seems near 0.10.1. In the case with noise, the OLF of SFR estimation converge much faster to about 0.2550.255 when we set efr=0.1efr=0.1. This is slightly smaller than that of stellar mass estimation.

In Figure 6(b), we show the results for all of the six SED models with increasing complexity, where only the results with efr=0.1efr=0.1 are shown. In the two cases with or without noise, as shown in the top right panel of this figure, the error σNMAD\sigma_{\rm NMAD} of SFR estimation tends to converge to a larger value when more complicated SED model is employed, and is more sensitive to the selection of SED model than that of stellar mass. The same is true for the OLF of SFR estimation. The behavior of bias is somewhat different, but it is generally very small when comparing with σNMAD\sigma_{\rm NMAD}. Besides, in the case with noise, for the four most complicated SED models used in this work, the σNMAD\sigma_{\rm NMAD}, bias and OLF of SFR estimation significantly increases with nlivenlive. This is another even more clear indication of overfitting to the noise in the data. In general, more complicated SED models lead to worse quality of SFR estimation.

4.3 Computation of Bayesian evidence

In this subsection, we present the results of performance test for the computation of Bayesian evidence, a quantity which is crucial for Bayesian model selection.

In Figure 7(a), we show the results for only the simplest SED model (SFH=τ\tau,-CEH,DAL=Cal+00) employed in this work. As shown in the top and middle panels of this figure, the Bayesian evidence computed with importance sampling is more stable than that without importance sampling, especially in the case with noise. So, hereafter and especially in §5 and 6, all mentioned Bayesian evidences are computed with importance sampling. The value of Bayesian evidence increases with the number of live points (nlivenlive) which determines the effective sampling resolution. In all cases, it eventually converges to a stable value when nlivenlive is very large, while a smaller sampling efficiency (efrefr) leads to faster convergence rate. A good balance between the speed and quality of Bayesian evidence estimation can be achieved when the MultiNest runtime parameters efrefr equals to 0.10.1 and nlivenlive equals to 5050.

On the other hand, as shown in the bottom panels of Figure 7(a), the error of Bayesian evidence decreases with nlivenlive, while a larger sampling efficiency (efrefr) also leads to faster convergence rate. However, unlike the value of Bayesian evidence, the error of Bayesian evidence converges slower with the increasing of nlivenlive in all cases. As a result, if we set efr=0.1efr=0.1 and nlive=50nlive=50, the error of Bayesian evidence would be overestimated. A much larger value of nlivenlive seems required to obtain a more reliable estimation for the error of Bayesian evidence with MultiNest, which would be very computationally expansive and not suitable for the analysis of massive photometric data. However, in practice, this may not be a serious issue, since an overestimated error of Bayesian evidence only leads to a more conservative conclusion about model comparison. We just need to keep this in mind.

In Figure 7(b), we show the results for all SED models with increasing complexity, where only the results with efr=0.1efr=0.1 are shown. Although the data used for the computation of Bayesian evidence is different for different SED models, the value of Bayesian evidence clearly decreases with the increasing of the complexity of SED model. This is reasonable. Since the same SED model is employed for the generation of mock data and their Bayesian analysis, the mock data can always be interpreted well. However, a more complicated SED model is penalized for being distributed over a larger space, of which only a smaller fraction is useful for the given mock data.

Figure 7: Performance test of Bayesian evidence estimation with empirical statistics-based mock galaxy sample for CSST imaging survey. The relations between the computation time (in sec/obj by employing a single 2.2 GHz cpu core) and the value of the natural logarithm of Bayesian evidence (as computed with Equation 31) and its error (bottom panels) for the whole galaxy sample are shown. The results for two versions of Bayesian evidence with (top panels) or without (middle panels) importance sampling have been shown. (a) The results for only the simplest SED model (SFH=τ\tau,-CEH,DAL=Cal+00) employed in this work. We have considered six different choices of efrefr (target sampling efficiency) as shown by different symbols. The results for seven cases with nlivenlive equals to 1515, 2020, 2525, 5050, 100100, 200200, and 400400 are shown respectively. The left panels show the results with noisy data while the right panels show the results with noise-free data. (b) The results for six different SED models with increasing complexity. Only the results with efr=0.1efr=0.1 are shown. The value of Bayesian evidence clearly decreases with the increasing of the complexity of the SED model.

5 Results of performance tests using hydrodynamical simulation-based mock galaxy sample

In this section, we present the results of performance tests of photometric redshift and stellar population parameter estimation by using hydrodynamical simulation-based mock galaxy sample for CSST-like imaging survey. Only the results obtained with the simplest SED model are shown. In §6, we will discuss the effect of more flexible SFH and DAL for CSST-like, CSST+Euclid-like and COSMOS-like surveys, respectively.

As mentioned in §2.2, the generation of this mock galaxy sample accounts for the complex SFH and metal enrichment of Horizon-AGN galaxies, and consistently includes dust attenuation. However, for the Bayesian analysis of this more theoretical mock galaxy sample, we firstly employ the widely used assumptions about SFH (exponentially declining), metal enrichment history (constant but free), and dust attenuation (uniform foreground dust screen with Calzetti et al. (2000) DAL). The results in this section will help us to quantify the systematic errors resulting from these simplified assumptions. Besides, as mentioned in §2.4, galaxies in the mock sample used here are selected with Ks<24.7K_{s}<24.7 and i+<25i_{+}<25. In the literature, it is quite common to exclude some pathological cases with a kind of χ2\chi^{2} selection (Davidzon et al., 2017; Caputi et al., 2015; Laigle et al., 2019) before presenting the results of performance test. However, no such cut was made here because the pathological cases are precisely what we want to investigate.

Figure 8: The results of photometric redshift estimation with (righ panels) and without (left panels) noise, and for the analysis of the empirical statistics-based (top panels) and hydrodynamical simulation-based (bottom panels) mock galaxy sample, respectively. The error from imperfect SED modeling will only present for the analysis of the hydrodynamical simulation-based mock galaxy sample. The photometric redshifts (zphotz_{\rm phot}) are estimated by employing the τ\tau model of SFH without consideration of metallicity evolution and the Calzetti et al. (2000) model of DAL. The red solid line indicate the identity while the red dotted lines indicate the outlier limits, i.e. |zphotztrue|/(1+ztrue)>0.15\left|z_{\mathrm{phot}}-z_{\mathrm{true}}\right|/(1+z_{\mathrm{true}})>0.15. Here, we show the results obtained with the MultiNest runtime parameters efrefr equals to 0.10.1 and nlivenlive equals to 5050. In general, the observational noise is the more important source of error for the photometric redshift estimation of galaxies, and the contribution from imperfect SED modeling is also very important.
Figure 9: As Figure 8, but for the stellar mass. In general, the observational noise is the more important source of error for the photometric stellar mass estimation of galaxies, and the contribution from imperfect SED modeling is almost comparable.
Figure 10: As Figure 8, but for the SFR. In general, the imperfect SED modeling is more important source of error for the photometric SFR estimation of galaxies, and the contribution from imperfect SED modeling is also very important. Especially, in the case without noise, it is clear that the simple τ\tau model of SFH and Calzetti et al. (2000) form of DAL lead to severely biased estimation of SFR for the Horizon-AGN simulation based mock sample of galaxies.

5.1 Photometric redshift estimation

In Figure 8, we investigate the performance of BayeSED combined with the simplest SED model to estimate the photometric redshifts of hydrodynamical simulation-based mock galaxy sample for CSST-like imaging survey. Aa a reference, the panel a of this figure show the ideal case without observational noise and SED modeling errors. So, in this case, the effects of parameter degeneracies, the stochastic nature of MultiNest sampling algorithm and other potential errors in the BayeSED code are the sources of error. As shown clearly, the total error from all of these sources is very small. By comparing panels a and b of this figure, with only the observational noise added, the σNMAD\sigma_{\rm NMAD} of photometric redshift estimation increases by eight times and the OLF increases by more than forty times. The bias also increases, but is ignorable with respect to σNMAD\sigma_{\rm NMAD}. By comparing panels a and c of this figure, with only the error from the imperfect SED modeling added, the σNMAD\sigma_{\rm NMAD} of photometric redshift estimation increases by more than three times and the OLF decreases slightly. Besides, there are some additional systematic patterns in the relation between true and estimated values of photometric redshift. The bias also increases and is comparable to σ/rmNMAD\sigma_{/rmNMAD}. In general, the observational noise is more important source of error for the photometric redshift estimation of galaxies, although the other one is also very important.

Finally, as shown in panel d of Figure 8, when all sources of error are included, the σNMAD\sigma_{\rm NMAD} of photometric redshift estimation increases to 0.0970.097, the OLF increases to 0.2640.264, and the bias becomes 0.0030.003. The systematic patterns shown in panel c seems being hidden due to the added noise. The algorithm seems to be struggling to estimate the photometric redshifts correctly by only using the seven-band photometries from CSST-like imaging survey, especially for galaxies with redshift larger than one.

5.2 Stellar population parameter estimation

In Figure 9, we investigate the performance of BayeSED combined with the simplest SED model to estimate the stellar population parameters of hydrodynamical simulation-based mock galaxies for CSST-like imaging survey. Aa a reference, the panel a of this figure show the ideal case without observational noise and SED modeling errors. In this ideal case, the σNMAD\sigma_{\rm NMAD}, bias and OLF of stellar mass estimation are 0.0470.047, 0.0050.005 and 0.0520.052, respectively. As shown in panel b, with only the observational noise added, the results become 0.1150.115, 0.020.02 and 0.2830.283, respectively. As shown in panel c, with only the error from the imperfect SED modeling added, the results become 0.1030.103, 0.0030.003 and 0.1410.141, respectively. By comparing panels a and c, the performance of stellar mass estimation is severely affected by the simplified assumptions in the SED modeling. However, by comparing panels b and c, the observational noise is more important source of error for the photometric stellar mass estimation of galaxies, although the other one is also very important. Finally, as shown in panel d of this figure, when all sources of error are included, the σNMAD\sigma_{\rm NMAD} of photometric stellar mass estimation increases to 0.1350.135, the bias increases to 0.0340.034, and the OLF increases to 0.3410.341. The algorithm seems to be even more struggling to estimate the photometric stellar mass correctly by only using the seven-band photometries from CSST-like imaging survey.

Similarly, Figure 10 shows the performance of BayeSED combined with the simplest SED model to estimate the SFR of hydrodynamical simulation-based mock galaxy sample for CSST-like imaging survey. Comparing with the results for stellar mass estimation in Figure 9, the photometric SFR estimation is even more severely affected by the simplified assumptions in the SED modeling. Actually, by comparing panels b and c of this figure, it is clear that the error from the imperfect SED modeling is more important source of error for the photometric SFR estimation of galaxies, although the other one is also very important. Finally, it becomes even more struggling to estimate the photometric SFR correctly by only using the seven-band photometries from CSST-like imaging survey.

6 Discussion

By comparing the results of performance tests for simultaneous photometric redshift and stellar parameters estimation using empirical statistics-based mock galaxy sample (§4) and hydrodynamical simulation-based mock galaxy sample(§5), especially those presented in Figures 8, 9 and 10, it is clear that the simple typical assumptions about the SFH and DAL of galaxies have severe impact on the performance of photometric parameter estimation of galaxies for CSST-like imaging survey. It is not very surprising, since the SFHs and MEHs of galaxies in the cosmic hydrodynamical simulation, such as Horizon-AGN (Volonteri et al., 2016; Beckmann et al., 2017a; Kaviraj et al., 2017; Beckmann et al., 2017b), are much more complex and diverse (See also Iyer et al., 2020) than the simple assumptions that have been employed in the previous Bayesian analysis of photometric mock data.

In this section, we will discuss the effects of more flexible forms of SFH and DAL on the performance of simultaneous photometric redshift and stellar population parameter estimation of galaxies. As in Han & Han (2012, 2014, 2019) (See also Salmon et al., 2016; Dries et al., 2016, 2018; Lawler & Acquaviva, 2021), we mainly employ the Bayesian model comparison method to compare six different combinations of these model ingredients with increasing complexity (See Table 1 for details). In addition to the CSST-like survey (§6.1) where only the photometries from seven broad-bands are available, we also discuss the results obtained by using mock data for CSST+Euclid-like (§6.2) and COSMOS-like surveys (§6.3) with increasing discriminative power, respectively.

6.1 Effects of more flexible SFH and DAL for CSST-like survey

survey noise SFH DAL ln(BE) ln(ML) parameter NMAD BIA OLF
zphotz_{\rm phot} 0.024 0.011 0.003
CSST - τ\tau,-CEH Cal+00 -88042±\pm5936 127197 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.103 0.003 0.141
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.140 -0.117 0.436
zphotz_{\rm phot} 0.023 0.010 0.002
CSST - τ\tau,+CEH Cal+00 -74128±\pm5973 144380 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.092 -0.010 0.141
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.148 -0.141 0.491
zphotz_{\rm phot} 0.016 0.014 0.004
CSST - τ\tau,+CEH Sal+18 58878±\pm5823 282267 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.086 0.041 0.170
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.153 -0.019 0.390
zphotz_{\rm phot} 0.015 0.014 0.004
CSST - β\beta-τ\tau,+CEH Sal+18 56343±\pm5780 283410 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.079 0.041 0.150
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.133 0 0.370
zphotz_{\rm phot} 0.017 0.011 0.006
CSST - β\beta-τ\tau-r,+CEH Sal+18 34213±\pm5867 275861 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.097 0.052 0.227
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.257 0.045 0.579
zphotz_{\rm phot} 0.016 0.010 0.005
CSST - α\alpha-β\beta-τ\tau-r,+CEH Sal+18 33044±\pm5800 275886 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.098 0.051 0.239
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.214 0.065 0.544
zphotz_{\rm phot} 0.097 0.003 0.264
CSST + τ\tau,-CEH Cal+00 -18871±\pm2649 49807 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.135 -0.034 0.341
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.157 0.011 0.371
zphotz_{\rm phot} 0.096 0.005 0.264
CSST + τ\tau,+CEH Cal+00 -18661±\pm2649 49469 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.135 -0.034 0.344
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.155 0 0.366
zphotz_{\rm phot} 0.098 0.012 0.263
CSST + τ\tau,+CEH Sal+18 -26098±\pm2956 54493 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.158 -0.023 0.387
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.201 0 0.468
zphotz_{\rm phot} 0.099 0.011 0.263
CSST + β\beta-τ\tau,+CEH Sal+18 -28141±\pm2904 54484 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.161 -0.029 0.396
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.194 0.028 0.472
zphotz_{\rm phot} 0.100 0.013 0.268
CSST + β\beta-τ\tau-r,+CEH Sal+18 -35048±\pm3020 54444 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.171 -0.037 0.415
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.232 0.041 0.544
zphotz_{\rm phot} 0.104 0.007 0.290
CSST + α\alpha-β\beta-τ\tau-r,+CEH Sal+18 -33983±\pm2869 54440 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.177 -0.052 0.430
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.224 0.080 0.570
Table 3: A summary of Bayesian evidences (BE), maximum likelihoods (ML) and the metrics of the quality of parameter estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for CSST-like survey by employing six SED models with increasing complexity in the form of SFH and DAL, as well as for the cases with and without noise.

In Table 3, we present a summary of the Bayesian evidences, maximum likelihoods and metrics of the quality of parameter estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for CSST-like survey by employing six different combinations of SFH and DAL with increasing complexity, as well as for the cases with and without noise. The same results are also shown more clearly in Figure 11.

Refer to caption
Figure 11: The Bayesian evidences (BE), maximum likelihood (ML) and metrics of photometric redshift (red star), stellar mass (blue square) and SFR (green circle) estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for CSST-like survey by employing six SED models (0:“SFH=τ\tau,-CEH,DAL=Cal+18”, 1:“SFH=τ\tau,+CEH,DAL=Cal+18”, 2:“SFH=τ\tau,+CEH,DAL=Sal+18”, 3:“SFH=β\beta-τ\tau,+CEH,DAL=Sal+18”, 4:“SFH=β\beta-τ\tau-r,+CEH,DAL=Sal+18”, 5:“SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18”) with increasing complexity in the forms of SFH and DAL , as well as for the cases with (bottom panels) and without noise (top panels). In the case without noise, the simplest model “SFH=τ\tau,-CEH,DAL=Cal+18” has the lowest Bayesian evidence of ln(BE)=88042±5936{\rm ln}(BE)=-88042\pm 5936. Meanwhile, the SED models “SFH=τ\tau,+CEH,DAL=Sal+18” and “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” which are neither the simplest nor the most complex models have the largest Bayesian evidences of ln(BE)=58878±5823{\rm ln}(BE)=58878\pm 5823 and ln(BE)=56343±5780{\rm ln}(BE)=56343\pm 5780 which are comparable within error bar. Interestingly, the same two models also give the highest quality parameter estimates. It is worth to mention that the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to similar results. Contrary to the case without noise, in the case with noise, the two simplest SED models “SFH=τ\tau,-CEH,DAL=Cal+00” and “SFH=τ\tau,+CEH,DAL=Cal+00” have the largest Bayesian evidences of ln(BE)=18871±2649{\rm ln}(BE)=-18871\pm 2649 and ln(BE)=18661±2649{\rm ln}(BE)=-18661\pm 2649 which are comparable within error bar. It is very interesting to notice that the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to exactly the opposite results, where the more complex (or flexible) models are always more favored. Similar to the case without noise, the two simplest SED models also give the highest quality parameter estimates. For photometric redshift, stellar mass and SFR, their accurate estimation becomes increasingly difficult, while the latter two are also more sensitive to the selection of SED models. Generally, the quality of parameter estimates is closely related to the level of Bayesian evidence, which is especially clear in the more realistic case with noise. Actually, the quality of parameter estimation, especially that of stellar mass and SFR estimation, significantly decrease with the increasing of SED model complexity, which is similar to the case with perfect SED modeling as shown in Figures 4, 5 and 6, and should be caused by more severe parameter degeneracies suffered by the more flexible SED model. It is clear that, in the more realistic case with noise, the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) is not consistent with the measurements of the quality of parameter estimation. Since the direct measurements of the metrics such as NMAD, BIA and OLF are usually unavailable, the Bayesian model comparison with Bayesian evidence can be used to find the best SED model which is not only the most efficient but also give the best parameter estimation.

6.1.1 Model comparison

In the case without noise, as shown in the top left panel of Figure 11, the simplest model “SFH=τ\tau,-CEH,DAL=Cal+18” has the lowest Bayesian evidence of ln(BE)=88042±5936{\rm ln}(BE)=-88042\pm 5936. With the additional consideration of metallicity evolution, the Bayesian evidence of the model “SFH=τ\tau,+CEH,DAL=Cal+18” increases to ln(BE)=74128±5973{\rm ln}(BE)=-74128\pm 5973. Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH=τ\tau,+CEH,DAL=Sal+18” increases significantly to ln(BE)=58878±5823{\rm ln}(BE)=58878\pm 5823. Apparently, the DAL of Salim et al. (2018) is a much better choice than that of Calzetti et al. (2000) for the hydrodynamical simulation-based mock galaxy sampe in CSST-like survey. Furthermore, by employing a more complicated β\beta-τ\tau form of SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” seems decreases a little to ln(BE)=56343±5780{\rm ln}(BE)=56343\pm 5780. Actually, the latter two SED models (“SFH=τ\tau,+CEH,DAL=Sal+18” and “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18”) have the largest Bayesian evidences which are comparable within error bar. They are neither the simplest nor the most complex models. With a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau-r,+CEH,DAL=Sal+18” obviously decreases to ln(BE)=34213±5867{\rm ln}(BE)=34213\pm 5867. It seems that, while the rejuvenation or rapid quenching events may happen in some galaxies, this additional component of SFH is not very effective for most of the galaxies in the sample. Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18” seems decreases a little to ln(BE)=33044±5800{\rm ln}(BE)=33044\pm 5800. However, the latter two SED models are actually comparable within error bar. On the other hand, it is worth to mention that the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to similar results.

In the case with noise, as shown in the bottom left panel of Figure 11, the two simplest SED models “SFH=τ\tau,-CEH,DAL=Cal+00” and “SFH=τ\tau,+CEH,DAL=Cal+00” have the largest Bayesian evidences of ln(BE)=18871±2649{\rm ln}(BE)=-18871\pm 2649 and ln(BE)=18661±2649{\rm ln}(BE)=-18661\pm 2649, respectively. Although the latter which has additionally considered the metallicity evolution seems better, their Bayesian evidences are actually comparable within error bar. Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH=τ\tau,+CEH,DAL=Sal+18” decreases significantly to ln(BE)=26098±2956{\rm ln}(BE)=-26098\pm 2956, exactly the opposite of the situation without noise. It is likely that the more complicated form of DAL does not give a better fit to the noisier data. By employing a more complicated β\beta-τ\tau form of SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” seems decreases further to ln(BE)=28141±2904{\rm ln}(BE)=-28141\pm 2904, although comparable with the former within error bar. With a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau-r,+CEH,DAL=Sal+18” obviously decreases to ln(BE)=35048±3020{\rm ln}(BE)=-35048\pm 3020, which is similar to the case without noise. Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18” seems increases a little to ln(BE)=33983±2869{\rm ln}(BE)=-33983\pm 2869, although comparable with the former within error bar. On the other hand, it is very interesting to notice that the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to exactly the opposite results, where the more complex (or flexible) models are always more favored.

6.1.2 Parameter estimation

In the right three panels of Figure 11, we show the three metrics of the quality of photometric redshift, stellar mass and SFR estimation for different SED models, respectively. In general, in the case without noise, the SED models “SFH=τ\tau,+CEH,DAL=Sal+18” and “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” which are just the two with largest Bayesian evidence give the highest quality parameter estimates. In the case with noise, the two simplest SED models “SFH=τ\tau,-CEH,DAL=Cal+00” and “SFH=τ\tau,+CEH,DAL=Cal+00” which are also the two with the largest Bayesian evidences give the highest quality parameter estimates. In the following, we discuss these results in more detail.

Figure 12: The results of photometric redshift estimation from the Bayesian analysis of the hydrodynamical simulation-based mock data for CSST-like imaging survey by employing the two SED models with the largest Bayesian evidence. (a) By comparing with the results in panel c of Figure 8, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the σNMAD\sigma_{\rm NMAD} of photometric redshift estimation is obviously reduced, although the bias and OLF are slightly increased. Meanwhile, the systematic patterns in the former results are also largely reduced. (c) By additionally employing a more complicated β\beta-τ\tau form of SFH, the σNMAD\sigma_{\rm NMAD} of photometric redshift estimation is only slightly reduced while the bias and OLF are exactly the same. (b, d) In the case with noise, the best two models are quite similar in the quality of photometric redshift estimation. Besides, there are two clear branches of outliers caused by the mis-identification of Lyman and Balmer break features.

The detailed results of photometric redshift estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 12. In the case without noise, by comparing the results in panel c of Figure 8 with that in panel a of Figure 12, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the σNMAD\sigma_{\rm NMAD} of photometric redshift estimation is obviously reduced while the bias and OLF are only slightly increased. Meanwhile, the systematic patterns in the former results are also largely reduced. However, as shown in panel c of Figure 12, by additionally employing a more complicated β\beta-τ\tau form of SFH, the σNMAD\sigma_{\rm NMAD} of photometric redshift estimation is only slightly reduced while the bias and OLF are exactly the same. Besides, as shown in Table 3 and Figure 11, the other two even more complicated forms of SFH lead to similar quality of photometric redshift estimation. In the case with noise, the best two models are quite similar in the quality of photometric redshift estimation. Besides, with the increasing of the complexity of SED models, the quality of photometric redshift estimation tend to decrease, although not very significantly.

Figure 13: As in Figure 12, but for the stellar mass. (a) By comparing with the results in panel c of Figure 9, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the σNMAD\sigma_{\rm NMAD} of photometric stellar mass estimation is reduced, although the bias and OLF are somewhat increased. (c) By additionally employing a more complicated β\beta-τ\tau form of SFH, the quality of photometric stellar mass estimation increase further. (b, d) In the case with noise, the best two models are exactly the same in the quality of photometric stellar mass estimation.

The detailed results of photometric stellar mass estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 13. In the case without noise, by comparing the results in panel c of Figure 9 with that in panel a of Figure 13, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the σNMAD\sigma_{\rm NMAD} of photometric stellar mass estimation is reduced, although the bias and OLF are somewhat increased. By additionally employing a more complicated β\beta-τ\tau form of SFH, as shown in panel c of Figure 13, the quality of photometric stellar mass estimation increase further. However, as shown in Table 3 and Figure 11, with the increasing of the complexity of SED models, the quality of photometric stellar mass stimation decreases obviously. In the case with noise, the best two models are exactly the same in the quality of photometric stellar mass estimation. Meanwhile, with the increasing of the complexity of SED models, the quality of photometric stellar mass stimation decreases even more obviously.

Figure 14: As in Figure 12, but for the SFR. (a) By comparing with the results in panel c of Figure 10 with that in panel a of Figure 14, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the systematic bias and OLF of photometric SFR are largely reduced, although the σNMAD\sigma_{\rm NMAD} is slightly increased. (c) By additionally employing a more complicated β\beta-τ\tau form of SFH the quality of photometric SFR estimation increase to the best. (b, d) In the case with noise, the best two models are also very similar in the quality of photometric stellar mass estimation.

The detailed results of photometric SFR estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 14. In the case without noise, by comparing with the results in panel c of Figure 10 with that in panel a of Figure 14, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the systematic bias and OLF of photometric redshift estimation are largely reduced while the σNMAD\sigma_{\rm NMAD} is slightly increased. By additionally employing a more complicated β\beta-τ\tau form of SFH, as shown in panel c of Figure 14, the quality of photometric SFR estimation increase to the best. However, as shown in Table 3 and Figure 11, with a additional quenching (or rejuvenation) component, the β\beta-τ\tau-r form of SFH lead to a much worse quality of photometric SFR estimation. Finally, the most complicated α\alpha-β\beta-τ\tau-r form of SFH lead to a slightly better SFR estimation. In the case with noise, the best two models are also very similar in the quality of photometric stellar mass estimation. Meanwhile, with the increasing of the complexity of SED models, the quality of photometric SFR stimation increases significantly.

For photometric redshift, stellar mass and SFR, their accurate estimation becomes increasingly difficult. Besides, the latter two are also more sensitive to the selection of SED models. Generally, the quality of parameter estimates is closely related to the level of Bayesian evidence, which is especially clear in the more realistic case with noise. Meanwhile, the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}), where the more complex (or flexible) models are always more favored, is not consistent with the measurements of the quality of parameter estimation. In practice, the direct measurements of the quality of parameter estimation as indicated by NMAD, BIA and OLF are usually unavailable. So, the Bayesian model comparison with Bayesian evidence can be used to find the best SED model which is not only the most efficient but also give the best parameter estimation.

Refer to caption
Figure 15: An example of 1D and 2D posterior probability distribution functions of free parameters obtained for the Bayesian analysis of a mock galaxy with CSST-like (grey), CSST+Euclid-like (red), and COSMOS-like (blue) photometric data, respectively. The contours show the 1σ1\sigma, 2σ2\sigma, and 3σ3\sigma confidence regions, while the red dash lines show the ground truth values of each parameter. It is clear that the parameters are more tightly constrained and some degeneracies between them have been broken when using datasets with increasing discriminative powers.

6.2 Effects of more flexible SFH and DAL for CSST+Euclid-like survey

The results of both model comparison and parameter estimation are strongly dependent on the used datasets which may have very different discriminative powers. In Figure 15, we show an example of 1D and 2D posterior probability distribution functions (PDFs) of free parameters obtained from the Bayesian analysis of the photometric data of a mock galaxy in the CSST-like, CSST-like+Euclid-like, and COSMOS-like surveys, respectively. It is clear that different datasets lead to very different PDFs, due to their very different discriminative powers. In this section and §6.3, we discuss the effects of more flexible SFH and DAL for CSST+Euclid-like and COSMOS-like surveys, respectively. The addition of Euclid data extends the wavelength coverage of the data to the longer NIR band than with CSST-only data, which should be useful for enhancing the discriminative power of model comparison and the quality of the parameter estimation.

survey noise SFH DAL ln(BE) ln(ML) parameter NMAD BIA OLF
zphotz_{\rm phot} 0.020 0.002 0.001
CSST+Euclid - τ\tau,-CEH Cal+00 -243527±\pm6307 -3717 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.055 -0.023 0.045
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.097 -0.088 0.299
zphotz_{\rm phot} 0.021 0.001 0.001
CSST+Euclid - τ\tau,+CEH Cal+00 -216613±\pm6390 30195 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.057 -0.014 0.037
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.094 -0.108 0.342
zphotz_{\rm phot} 0.013 0.010 0.005
CSST+Euclid - τ\tau,+CEH Sal+18 72092±\pm6475 340038 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.029 -0.015 0.009
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.081 -0.046 0.218
zphotz_{\rm phot} 0.012 0.010 0.004
CSST+Euclid - β\beta-τ\tau,+CEH Sal+18 70340±\pm6409 339916 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.027 -0.010 0.007
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.065 -0.021 0.195
zphotz_{\rm phot} 0.017 0.012 0.010
CSST+Euclid - β\beta-τ\tau-r,+CEH Sal+18 25728±\pm6566 315401 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.032 -0.005 0.017
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.087 0.006 0.302
zphotz_{\rm phot} 0.016 0.012 0.016
CSST+Euclid - α\alpha-β\beta-τ\tau-r,+CEH Sal+18 19560±\pm6454 305020 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.035 -0.007 0.030
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.087 0.011 0.286
zphotz_{\rm phot} 0.065 -0.003 0.119
CSST+Euclid + τ\tau,-CEH Cal+00 -31783±\pm3314 58861 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.058 -0.030 0.083
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.164 -0.020 0.379
zphotz_{\rm phot} 0.064 -0.001 0.118
CSST+Euclid + τ\tau,+CEH Cal+00 -31347±\pm3302 58766 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.058 -0.027 0.080
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.171 -0.027 0.392
zphotz_{\rm phot} 0.067 0.016 0.129
CSST+Euclid + τ\tau,+CEH Sal+18 -38081±\pm3529 64421 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.061 -0.016 0.103
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.186 0.011 0.449
zphotz_{\rm phot} 0.068 0.015 0.130
CSST+Euclid + β\beta-τ\tau,+CEH Sal+18 -41164±\pm3437 64349 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.063 -0.017 0.112
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.177 0.042 0.476
zphotz_{\rm phot} 0.069 0.017 0.141
CSST+Euclid + β\beta-τ\tau-r,+CEH Sal+18 -49796±\pm3561 63873 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.065 -0.023 0.127
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.213 0.059 0.571
zphotz_{\rm phot} 0.068 0.016 0.136
CSST+Euclid + α\alpha-β\beta-τ\tau-r,+CEH Sal+18 -49608±\pm3426 64065 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.073 -0.025 0.169
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.181 0.084 0.560
Table 4: As in Table 3, but for CSST+Euclid-like survey.

In Table 4, we present a summary of the Bayesian evidences, maximum likelihoods and metrics of the quality of parameter estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for CSST+Euclid-like survey by employing six different combinations of SFH and DAL with increasing complexity, as well as for the cases with and without noise. The same results are also shown more clearly in Figure 16.

Refer to caption
Figure 16: The Bayesian evidences (BE), maximum likelihood (ML) and metrics of photometric redshift (red star), stellar mass (blue square) and SFR (green circle) estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for CSST+Euclid-like survey by employing six SED models (0:“SFH=τ\tau,-CEH,DAL=Cal+18”, 1:“SFH=τ\tau,+CEH,DAL=Cal+18”, 2:“SFH=τ\tau,+CEH,DAL=Sal+18”, 3:“SFH=β\beta-τ\tau,+CEH,DAL=Sal+18”, 4:“SFH=β\beta-τ\tau-r,+CEH,DAL=Sal+18”, 5:“SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18”) with increasing complexity in the forms of SFH and DAL , as well as for the cases with (bottom panels) and without noise (top panels). In the case without noise, the simplest model “SFH=τ\tau,-CEH,DAL=Cal+18” has the lowest Bayesian evidence of ln(BE)=243527±6307{\rm ln}(BE)=-243527\pm 6307. Meanwhile, the SED models “SFH=τ\tau,+CEH,DAL=Sal+18” and “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” which are neither the simplest nor the most complex models have the largest Bayesian evidences of ln(BE)=72092±6475{\rm ln}(BE)=72092\pm 6475 and ln(BE)=70340±6409{\rm ln}(BE)=70340\pm 6409 which are comparable within error bar. As in the case for CSST-like survey, the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to similar results. Contrary to the case without noise, in the case with noise, the two simplest SED models “SFH=τ\tau,-CEH,DAL=Cal+00” and “SFH=τ\tau,+CEH,DAL=Cal+00” have the largest Bayesian evidences of ln(BE)=31783±3314{\rm ln}(BE)=-31783\pm 3314 and ln(BE)=31347±3302{\rm ln}(BE)=-31347\pm 3302 which are comparable within error bar. However, the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to exactly the opposite results, where the more complex (or flexible) models are always more favored. Generally, the quality of parameter estimates is closely related to the level of Bayesian evidence, which is especially clear in the more realistic case with noise. The model selection with Bayesian evidence is still more consistent with the measurements of the quality of parameter estimation than that with maximum likelihood (or equivalently minimum χ2\chi^{2}). For photometric redshift, stellar mass and SFR, their accurate estimation becomes increasingly difficult, while the latter two are also more sensitive to the selection of SED models. In the case without noise, the SED models “SFH=τ\tau,+CEH,DAL=Sal+18” and “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” which are just the two with the largest Bayesian evidence give the highest quality parameter estimates. In the case with noise, the two simplest SED models “SFH=τ\tau,-CEH,DAL=Cal+00” and “SFH=τ\tau,+CEH,DAL=Cal+00” which are also the two with the largest Bayesian evidences give the highest quality parameter estimates. All of these results are very similar to that for CSST-like survey. However, the relative error of Bayesian evidences have been reduced, especially in the case without noise. Besides, the quality of parameter estimation, especially that of stellar mass estimation, has been significantly improved. Furthermore, the quality of parameter estimation, especially that of stellar mass, increases more slowly with the increasing of SED model complexity.

6.2.1 Model comparison

In the case without noise, as shown in the top left panel of Figure 16, the simplest model “SFH=τ\tau,-CEH,DAL=Cal+18” has the lowest Bayesian evidence of ln(BE)=243527±6307{\rm ln}(BE)=-243527\pm 6307. With the additional consideration of metallicity evolution, the Bayesian evidence of the model “SFH=τ\tau,+CEH,DAL=Cal+18” increases to ln(BE)=216613±6390{\rm ln}(BE)=-216613\pm 6390. Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH=τ\tau,+CEH,DAL=Sal+18” increases significantly to ln(BE)=72092±6475{\rm ln}(BE)=72092\pm 6475. Apparently, the DAL of Salim et al. (2018) is also a much better choice than that of Calzetti et al. (2000) for the hydrodynamical simulation-based mock galaxy sampe in CSST+Euclid-like survey. Furthermore, by employing a more complicated β\beta-τ\tau form of SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” seems decreases a little to ln(BE)=70340±6409{\rm ln}(BE)=70340\pm 6409. As in the case for CSST-like survey, the latter two SED models (“SFH=τ\tau,+CEH,DAL=Sal+18” and “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18”) have the largest Bayesian evidences which are comparable within error bar. With a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau-r,+CEH,DAL=Sal+18” decreases significantly to ln(BE)=25728±6566{\rm ln}(BE)=25728\pm 6566. Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18” seems decreases a little to ln(BE)=19560±6454{\rm ln}(BE)=19560\pm 6454 which is comparable with the former within error bar. On the other hand, the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to similar results.

In the case with noise, the two simplest SED models “SFH=τ\tau,-CEH,DAL=Cal+00” and “SFH=τ\tau,+CEH,DAL=Cal+00” have the largest Bayesian evidences of ln(BE)=31783±3314{\rm ln}(BE)=-31783\pm 3314 and ln(BE)=31347±3302{\rm ln}(BE)=-31347\pm 3302, respectively. Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH=τ\tau,+CEH,DAL=Sal+18” decreases significantly to ln(BE)=38081±3529{\rm ln}(BE)=-38081\pm 3529. By employing a more complicated β\beta-τ\tau form of SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” seems decreases further to ln(BE)=41164±3437{\rm ln}(BE)=-41164\pm 3437, although comparable with the former within error bar. With a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau-r,+CEH,DAL=Sal+18” obviously decreases to ln(BE)=49796±3561{\rm ln}(BE)=-49796\pm 3561, which is similar to the case without noise. Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18” seems increases a little to ln(BE)=49608±3426{\rm ln}(BE)=-49608\pm 3426, although comparable with the former within error bar. However, the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to exactly the opposite results, where the more complex (or flexible) models are always more favored.

In general, all of these results are very similar to that for CSST-like survey. The model selection with Bayesian evidence is still more consistent with the measurements of the quality of parameter estimation. However, the relative error of Bayesian evidences have been reduced, especially in the case without noise.

6.2.2 Parameter estimation

In the right three panels of Figure 16, we show the three metrics of the quality of photometric redshift, stellar mass and SFR estimation for different SED models, respectively. In general, in the case without noise, the SED models “SFH=τ\tau,+CEH,DAL=Sal+18” and “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” which are just the two with largest Bayesian evidence give the highest quality parameter estimates. In the case with noise, the two simplest SED models “SFH=τ\tau,-CEH,DAL=Cal+00” and “SFH=τ\tau,+CEH,DAL=Cal+00” which are also the two with the largest Bayesian evidences give the highest quality parameter estimates. In the following, we discuss these results in more detail.

Figure 17: The results of photometric redshift estimation from the Bayesian analysis of the hydrodynamical simulation-based mock data for CSST+Euclid-like survey by employing the two SED models with the largest Bayesian evidence. By comparing with the results for CSST-like survey in Figure 12, it is clear that the quality of photometric redshift estimation has been obviously increased in both the cases with and without noise. Especially, in the more realistic case with noise, the outliers caused by the mis-identification of Lyman and Balmer break features have been largely reduced. This suggest that, the inclusion of J, H and Y bands from Euclid is helpful for improving the photometric redshift estimation. However, the more complicated β\beta-τ\tau form of SFH is not very helpful for improving the quality of photometric redshift estimation.

The detailed results of photometric redshift estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 17. By comparing with the results for CSST-like survey in Figure 12, it is clear that the quality of photometric redshift estimation has been obviously increased in both the cases with and without noise. Especially, in the more realistic case with noise, the outliers caused by the mis-identification of Lyman and Balmer break features have been largely reduced. This suggest that, the inclusion of J, H and Y bands from Euclid is helpful for improving the photometric redshift estimation. However, the more complicated β\beta-τ\tau form of SFH is not very helpful for improving the quality of photometric redshift estimation.

Figure 18: As in Figure 17, but for the stellar mass. By comparing with the results for CSST-like survey in Figure 13, it is clear that the quality of photometric stellar mass estimation has been significantly improved in both the cases with and without noise. Apparently, the inclusion of J, H and Y bands from Euclid is crucial for a more accurate estimation of stellar mass. Besides, with the more complicated β\beta-τ\tau form of SFH, the quality of photometric stellar mass estimation is only slightly improved.

The detailed results of photometric stellar mass estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 18. By comparing with the results for CSST-like survey in Figure 13, it is clear that the quality of photometric stellar mass estimation has been significantly improved in both the cases with and without noise. Apparently, the inclusion of J, H and Y bands from Euclid is crucial for a more accurate estimation of stellar mass. With the more complicated β\beta-τ\tau form of SFH, the quality of photometric stellar mass estimation is slightly improved. However, as shown in Table 4 and Figure 16, the even more complicated forms of SFH is still not helpful for improving the quality of photometric stellar mass estimation.

Figure 19: As in Figure 17, but for the SFR. (a, c) The cases without noise. By comparing with the results for CSST-like survey in the left panes of Figure 14, it is clear that the σNMAD\sigma_{\rm NMAD} and OLF of photometric SFR estimation are largely reduced, although the bias is slightly increased. Besides, with the more complicated β\beta-τ\tau form of SFH, the quality of photometric SFR estimation is improved. (b, d) The cases with noise. By comparing with the results for CSST-like survey in the right panes of Figure 14, all of the σNMAD\sigma_{\rm NMAD}, bias and OLF of photometric SFR estimation slightly increase. This suggest that, the inclusion of J, H and Y bands from Euclid is not very helpful for improving the photometric SFR estimation. Besides, with the additional consideration of metallicity evolution, the quality of photometric SFR estimation becomes slightly worse.

The detailed results of photometric SFR estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 19. In the case without noise, by comparing with the results for CSST-like survey in Figure 14, it is clear that the σNMAD\sigma_{\rm NMAD} and OLF of photometric SFR estimation are largely reduced, although the bias is slightly increased. However, in the case with noise, all of the σNMAD\sigma_{\rm NMAD}, bias and OLF of photometric SFR estimation slightly increase. So, the inclusion of J, H and Y bands from Euclid is not very helpful for improving the photometric SFR estimation.

6.3 Effects of more flexible SFH and DAL for COSMOS-like survey

The COSMOS-like survey covers many more bands than CSST-like data. Although there is no NUV data, it extends to the longer wavelengths and includes some intermediate bands (IBs) (Laigle et al., 2016). Since the COSMOS-like mock data has much stronger discriminative power than CSST-like and CSST+Euclid-like mock data, the Bayesian evidence of different SED models should show much larger difference, and the photometric redshift and stellar population parameter estimation should be better.

survey noise SFH DAL ln(BE) ln(ML) parameter NMAD BIA OLF
zphotz_{\rm phot} 0.007 -0.002 0
COSMOS - τ\tau,-CEH Cal+00 -1710604±\pm6811 -1431790 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.030 -0.026 0.013
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.110 -0.062 0.290
zphotz_{\rm phot} 0.006 -0.002 0
COSMOS - τ\tau,+CEH Cal+00 -1499618±\pm6997 -1207887 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.032 -0.017 0.010
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.101 -0.082 0.299
zphotz_{\rm phot} 0.002 0 0
COSMOS - τ\tau,+CEH Sal+18 354581±\pm7310 681554 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.023 -0.022 0.012
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.069 -0.068 0.124
zphotz_{\rm phot} 0.002 0 0
COSMOS - β\beta-τ\tau,+CEH Sal+18 387009±\pm7273 717135 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.020 -0.011 0.006
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.047 -0.041 0.057
zphotz_{\rm phot} 0.003 0.001 0
COSMOS - β\beta-τ\tau-r,+CEH Sal+18 255069±\pm7552 613256 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.028 -0.003 0.017
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.070 -0.001 0.124
zphotz_{\rm phot} 0.003 0.001 0
COSMOS - α\alpha-β\beta-τ\tau-r,+CEH Sal+18 275129±\pm7218 612401 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.031 -0.005 0.012
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.067 -0.006 0.121
zphotz_{\rm phot} 0.017 -0.003 0.004
COSMOS + τ\tau,-CEH Cal+00 99881±\pm4881 259510 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.044 -0.020 0.006
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.127 -0.032 0.282
zphotz_{\rm phot} 0.017 -0.004 0.003
COSMOS + τ\tau,+CEH Cal+00 107009±\pm4896 268492 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.044 -0.012 0.005
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.126 -0.044 0.288
zphotz_{\rm phot} 0.015 0.002 0.003
COSMOS + τ\tau,+CEH Sal+18 131578±\pm5115 313110 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.042 -0.017 0.036
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.107 -0.021 0.211
zphotz_{\rm phot} 0.015 0.002 0.003
COSMOS + β\beta-τ\tau,+CEH Sal+18 130766±\pm5064 315205 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.037 -0.012 0.029
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.090 0.002 0.176
zphotz_{\rm phot} 0.016 0.006 0.004
COSMOS + β\beta-τ\tau-r,+CEH Sal+18 109572±\pm5116 305267 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.044 -0.021 0.052
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.102 0.063 0.321
zphotz_{\rm phot} 0.015 0.005 0.003
COSMOS + α\alpha-β\beta-τ\tau-r,+CEH Sal+18 110909±\pm5006 306387 log(M)phot{\rm log}(M_{*})_{\rm phot} 0.045 -0.020 0.066
log(SFR/[M/yr])phot{\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot} 0.097 0.055 0.304
Table 5: As in Table 3, but for COSMOS-like survey.

In Table 5, we present a summary of the Bayesian evidences, maximum likelihoods and metrics of the quality of parameter estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for COSMOS-like survey by employing six different combinations of SFH and DAL with increasing complexity, as well as for the cases with and without noise. The same results are also shown more clearly in Figure 20.

Refer to caption
Figure 20: The Bayesian evidences (BE), maximum likelihood (ML) and metrics of photometric redshift (red star), stellar mass (blue square) and SFR (green circle) estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for COSMOS-like survey by employing six SED models (0:“SFH=τ\tau,-CEH,DAL=Cal+18”, 1:“SFH=τ\tau,+CEH,DAL=Cal+18”, 2:“SFH=τ\tau,+CEH,DAL=Sal+18”, 3:“SFH=β\beta-τ\tau,+CEH,DAL=Sal+18”, 4:“SFH=β\beta-τ\tau-r,+CEH,DAL=Sal+18”, 5:“SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18”) with increasing complexity in the forms of SFH and DAL , as well as for the cases with (bottom panels) and without noise (top panels). In the case without noise, the simplest model “SFH=τ\tau,-CEH,DAL=Cal+18” has the lowest Bayesian evidence of ln(BE)=1710604±6811{\rm ln}(BE)=-1710604\pm 6811. Meanwhile, the SED model “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” which is neither the simplest nor the most complex models has the largest Bayesian evidence of ln(BE)=387009±7273{\rm ln}(BE)=387009\pm 7273, and give the highest quality parameter estimates. As in the cases for CSST-like and CSST+Euclid-like surveys, the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to similar results. In the case with noise, the simplest model “SFH=τ\tau,-CEH,DAL=Cal+18” still has the lowest Bayesian evidence of ln(BE)=99881±4881{\rm ln}(BE)=99881\pm 4881. Meanwhile, the SED models “SFH=τ\tau,+CEH,DAL=Sal+18” and “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” have the largest Bayesian evidences of ln(BE)=131578±5115{\rm ln}(BE)=131578\pm 5115 and ln(BE)=130766±5064{\rm ln}(BE)=130766\pm 5064 which are comparable within error bar. Unlike the cases for CSST-like and CSST+Euclid-like surveys, the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to similar results. The two models with the largest Bayesian evidences (or maximum likelihoods) also give the highest quality parameter estimates. In general, in the cases with and without noise, the same and more clear results of model comparison are obtained. Meanwhile, in the more realistic case with noise, the more complicated SED models are more favored than in the cases for CSST-like and CSST+Euclid-like survey. All of these are the natural results of the much stronger discriminative power of the COSMOS-like survey than the CSST-like and CSST+Euclid-like surveys.

6.3.1 Model comparison

In the case without noise, as shown in the top left panel of Figure 20, the simplest model “SFH=τ\tau,-CEH,DAL=Cal+18” has the lowest Bayesian evidence of ln(BE)=1710604±6811{\rm ln}(BE)=-1710604\pm 6811. With the additional consideration of metallicity evolution, the Bayesian evidence of the model “SFH=τ\tau,+CEH,DAL=Cal+18” increases to ln(BE)=1499618±6997{\rm ln}(BE)=-1499618\pm 6997. Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH=τ\tau,+CEH,DAL=Sal+18” increases significantly to ln(BE)=354581±7310{\rm ln}(BE)=354581\pm 7310. Apparently, the DAL of Salim et al. (2018) is also a much better choice than that of Calzetti et al. (2000) for the hydrodynamical simulation-based mock galaxy sampe in COSMOS-like survey. Furthermore, by employing a more complicated β\beta-τ\tau form of SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” increases further to ln(BE)=387009±7273{\rm ln}(BE)=387009\pm 7273. However, with a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH=β\beta-τ\tau-r,+CEH,DAL=Sal+18” significantly decreases to ln(BE)=255069±7552{\rm ln}(BE)=255069\pm 7552, which is similar to the case without noise. Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH=α\alpha-β\beta-τ\tau-r,+CEH,DAL=Sal+18” increases to ln(BE)=275129±7218{\rm ln}(BE)=275129\pm 7218. As in the cases for CSST-like and CSST+Euclid-like surveys, the model selection with maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to similar results.

In the case with noise, as shown in the bottom left panel of Figure 20, almost the same conclusions about SED model comparison are obtained, although the detailed values of Bayesian evidence are apparently different. Unlike the cases for CSST-like and CSST+Euclid-like surveys, the model selection with Bayesian evidence and maximum likelihood (or equivalently minimum χ2\chi^{2}) lead to similar results. In general, for COSMOS-like survey, we can obtain more clear results of SED model comparison. Meanwhile, in the more realistic case with noise, the more complicated SED models are more favored than in the cases for CSST-like and CSST+Euclid-like survey. This is reasonable, since COSMOS-like survey has much stronger discriminative power than CSST-like and CSST+Euclid-like surveys.

6.3.2 Parameter estimation

In the right three panels of Figure 20, we show the three metrics of the quality of photometric redshift, stellar mass and SFR estimation for different SED models, respectively. In the cases with and without noise, the same SED model “SFH=β\beta-τ\tau,+CEH,DAL=Sal+18” which is just the one with largest Bayesian evidence give the highest quality parameter estimates. In the following, we discuss these results in more detail.

Figure 21: The results of photometric redshift estimation from the Bayesian analysis of the hydrodynamical simulation-based mock data for COSMOS-like survey by employing the two SED models with the largest Bayesian evidence. By comparing with the results for CSST-like survey in Figure 12 and that for for CSST+Euclid-like survey in Figure 17, it is clear that the quality of photometric redshift estimation has been significantly increased in both the cases with and without noise. Meanwhile, in both cases, the best two SED models given identical quality of photometric redshift estimation. In the case without noise, the bias and OLF of photometric redshift estimation are almost zero while the σNMAD\sigma_{\rm NMAD} is only 0.002. Since the errors from parameter degeneracy and SED model error should be largely reduced, this suggest that the contribution of errors from the stochastic nature of MultiNest sampling algorithm and other potential errors in the BayeSED code should be less than 0.002. In the more realistic case with noise, the outliers caused by the mis-identification of Lyman and Balmer break features have been largely resolved.

The detailed results of photometric redshift estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 21. By comparing with the results for CSST-like survey in Figure 12 and that for for CSST+Euclid-like survey in Figure 17, it is clear that the quality of photometric redshift estimation has been significantly increased in both the cases with and without noise. Meanwhile, in both cases, the best two SED models given identical quality of photometric redshift estimation. In the case without noise, the bias and OLF of photometric redshift estimation are almost zero while the σNMAD\sigma_{\rm NMAD} is only 0.002. In this case, the errors from parameter degeneracy and SED model error should be largely reduced. This suggest that the contribution of errors from the stochastic nature of MultiNest sampling algorithm and other potential errors in the BayeSED code should be less than 0.002. In the more realistic case with noise, the outliers caused by the mis-identification of Lyman and Balmer break features have been largely resolved. Finally, as in the cases for CSST-like and CSST+Euclid-like surveys, the more complicated SED models are not very helpful for improving the quality of photometric redshift estimation.

Figure 22: As in Figure 21, but for the stellar mass. By comparing with the results for CSST+Euclid-like survey in Figure 18, it is clear that the quality of photometric stellar mass estimation has been improved further in both the cases with and without noise. Besides, the more complicated β\beta-τ\tau form of SFH makes the quality of photometric stellar mass estimation a little better.

The detailed results of photometric stellar mass estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 22. By comparing with the results for CSST+Euclid-like survey in Figure 18, it is clear that the quality of photometric stellar mass estimation has been improved further in both the cases with and without noise. Besides, the more complicated β\beta-τ\tau form of SFH makes the quality of photometric stellar mass estimation a little better. However, as shown in Table 5 and Figure 16, the even more complicated forms of SFH make the quality of photometric stellar mass estimation worse.

Figure 23: As in Figure 21, but for the SFR. By comparing with the results for CSST+Euclid-like survey in Figure 18, it is clear that the quality of photometric SFR estimation has been significantly improved in both the cases with and without noise. Besides, the more complicated β\beta-τ\tau form of SFH makes the quality of photometric SFR estimation much better.

The detailed results of photometric SFR estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 23. By comparing with the results for CSST+Euclid-like survey in Figure 18, it is clear that the quality of photometric SFR estimation has been significantly improved in both the cases with and without noise. Besides, the more complicated β\beta-τ\tau form of SFH makes the quality of photometric SFR estimation much better. However, as shown in 20, with a quenching (or rejuvenation) component added to the SFH, the quality of photometric SFR estimation becomes obviously worse. Finally, by employing a even more flexible double power-law form of SFH, the quality of photometric SFR estimation becomes a little better.

7 Summary and conclusion

In this work, based on the Bayesian SED synthesis and analysis techniques employed in the BayeSED-V3 code, we present a comprehensive and systematic test of its performance for simultaneous photometric redshift and stellar population parameter estimation of galaxies when combined with six SED models with increasing complexity in the form of SFH and DAL. The main purpose is to make a systematic analysis of various factors affecting the simultaneous photometric redshift and stellar population parameter estimation of galaxies in the context of Bayesian SED fitting, so as to provide clues for further improvement.

To separate the different factors which could contribute to the errors of photometric redshift and stellar population parameter estimation of galaxies, the empirical statistics-based and hydrodynamical simulation-based approaches have been employed to generate mock photometric sample of galaxies with or without noise for CSST-like, COSMOS-like and CSST+Euclid-like surveys, respectively. We compare the difference in performance of photometric parameter estimation with different run parameters of Bayesian analysis algorithm, different assumptions about the SFH and DAL of galaxies, and different observational datasets. Our main findings are as follows.

For the performance tests using empirical statistics-based mock galaxy sample with idealized SED modeling:

  1. 1.

    The performance of photometric redshift and stellar population parameter estimation, in terms of speed and quality, is sensitive to the runtime parameters (the target sampling efficiency efrefr and the number of live points nlivenlive) of MultiNest algorithm.

  2. 2.

    A good balance among the speed, quality of parameter estimation, and accuracy of model comparison can be achieved when adopting the MultiNest runtime parameters efrefr equals to 0.10.1 and nlivenlive equals to 5050.

  3. 3.

    By employing the optimized runtime parameters of MultiNest and simplest SED modeling, a speed of 2s/obj/cpu\sim 2{\rm s/obj/cpu} (10s/obj/cpu\sim 10{\rm s/obj/cpu}) can be achieved for a detailed Bayesian analysis of photometries from CSST-like survey, which is sufficient for the analysis of massive photometric data. Meanwhile, a quality of photometric redshift estimation with σNMAD=0.056\sigma_{\rm NMAD}=0.056, BIA=0.0025BIA=-0.0025, OLF=0.215OLF=0.215, a quality of photometric stellar mass estimation with σNMAD=0.113\sigma_{\rm NMAD}=0.113, BIA=0.025BIA=-0.025, OLF=0.285OLF=0.285, and a quality of photometric SFR estimation with σNMAD=0.08\sigma_{\rm NMAD}=0.08, BIA=0.01BIA=-0.01, OLF=0.255OLF=0.255 can be achieved.

  4. 4.

    With the optimized runtime parameters of MultiNest, the value of Bayesian evidence which is crucial for Bayesian model comparison can also be well estimated, although the error of Bayesian evidence tends to be overestimated, which may lead to a more conservative conclusion about model comparison.

  5. 5.

    The random observational errors in photometries are more important sources of errors than the parameter degeneracies and Bayesian analysis method and tool.

  6. 6.

    More complicated SED models apparently require longer running time. They also tend to overfit noisy photometries and lead to worse quality of photometric redshift, stellar mass and SFR estimation, which is likely due to more free parameters and more severe parameter degeneracies.

  7. 7.

    The value of Bayesian evidence clearly decreases with the increasing of the complexity of the SED model in both of the cases with and without noise.

For the performance tests using hydrodynamical simulation-based mock galaxy sample without idealized SED modeling:

  1. 1.

    The commonly used simple assumptions about the SFH and DAL of galaxies have severe impact on the quality of photometric parameter estimation of galaxies, especially for CSST-like survey with only photometries from seven broad-bands.

  2. 2.

    The performance of both Bayesian parameter estimation and model comparison highly depends on the discriminative power of the observational photometries. With more informative photometries, more clear results about SED model comparison and higher quality of photometric parameter estimation can be obtained.

  3. 3.

    While the SED model comparison with Bayesian evidence may favor SED models with very different complexities when using photometries from different surveys, the maximum likelihood (or equivalently minimum χ2\chi^{2}) tend to favor more complex models. For photometries with strong enough discriminative power, the two methods lead to more consistent results. However, for photometries without strong enough discriminative power, the two methods may lead to contradictory results. In both cases, the results of model selection with Bayesian evidence are more consistent with the measurements of the quality of parameter estimation.

  4. 4.

    In both of the cases with and without noise, the additional consideration of metallicity evolution helps to improve the quality of photometric redshift and stellar parameter estimation of galaxies, and increases the Bayesian evidence of corresponding SED model.

  5. 5.

    In the case without noise, the DAL of Salim et al. (2018) is a much better choice than that of Calzetti et al. (2000) for the hydrodynamical simulation-based mock galaxy sampe in CSST-like, CSST+Euclid-like and COSMOS-like surveys. However, in the more realistic case with noise, it is only more favored in the COSMOS-like survey with Bayesian evidence-based model selection. With maximum likelihood (or equivalently minimum χ2\chi^{2})-based model selection, it could be more favored, but lead to worse parameter estimation.

  6. 6.

    In the case without noise, the more flexible forms of SFH lead to better quality of parameter estimation and increase the Bayesian evidence of corresponding SED model. However, in the more realistic case with noise, they are only more favored in the COSMOS-like survey.

  7. 7.

    With a quenching (or rejuvenation) component added to the SFH, the quality of parameter estimation and the Bayesian evidence of corresponding SED model decrease in all cases. Although the rejuvenation or rapid quenching events may happen in some galaxies, this additional component of SFH is not very effective for most of the galaxies in the hydrodynamical simulation-based mock galaxy sample.

  8. 8.

    The quality of parameter estimation is closely related to the level of Bayesian evidence such that the SED model with largest Bayesian evidence tends to give the best quality of parameter estimation, which is more clear for photometries with larger discriminative power. By using photometries without strong enough discriminative power, the quality of parameter estimation, especially that of stellar mass and SFR estimation, tend to decrease with the increasing of SED model complexity

  9. 9.

    Since the direct measurements of the quality of parameter estimation as indicated by NMAD, BIA and OLF are usually unavailable, the Bayesian model comparison with Bayesian evidence can be used to find the best SED model which is not only the most efficient but also give the best parameter estimation.

  10. 10.

    For photometric redshift, stellar mass and SFR, their accurate estimation becomes increasingly difficult, while the latter two are also more sensitive to the selection of SED models.

  11. 11.

    For the photometric redshift estimation of galaxies in CSST-like survey, the observational noise is the more important source of error than the imperfect SED modeling. However, for the photometric stellar mass and SFR estimation of galaxies, the opposite is true.

  12. 12.

    The combination of photometries from CSST-like and Euclid-like surveys is helpful for improving the quality of photometric redshift estimation and crucial for the more accurate stellar mass estimation, but not very useful for SFR estimation.

  13. 13.

    With photometries in 26 bands from COSMOS-like surveys, by employ the same SED model, BayeSED-V3 can achieve similar quality of photometric redshift, stellar mass and SFR estimation to previous works. Besides, with photometries in 26 bands from COSMOS-like surveys, more complicated SED models tend to be more favored, which is very different from the two cases with only photometries from CSST-like (7 bands) or CSST+Euclid-like (10 bands) surveys.

We conclude that the latest version of BayeSED is capable of achieving a good balance among speed, the quality of simultaneous photometric redshift and stellar population parameter estimation of galaxies and the reliable SED model comparison. This makes it suitable for the analysis of existing and forthcoming massive photometric data of galaxies in CSST wide-field multiband imaging survey and others.

Generally, the current main bottleneck that limits the performance of the Bayesian approach for the simultaneous photometric redshift and stellar population parameter estimation of galaxies is the reliability of the SED synthesis (or modeling) procedure. Assuming a more flexible model is not a complete solution. We need a SED model that is not only more flexible but also more precisely accurate. It can be achieved by gradually adding more informative priors and physical constraints to the SED synthesis (modeling) procedure of galaxies, which is the subject of future works. The Bayesian model selection method with Bayesian evidence, a quantified Occam’s razor, is very helpful to identify the best SED model which is not only the most efficient but also give the best parameter estimation.

The results about simultaneous photometric redshift and stellar population estimation presented in this work are not yet optimal, especially those about CSST. The contributions of nebular lines and continuum emission to the SED, which may help break some parameter degeneracies, are still mising in this work. It is also worth to mention that the results of Bayesian SED model comparison and the metrics (OLF, BIA or NMAD) of parameter estimation highly depend on the selected samples. In this paper, we have chosen a relative broader sample to test the overall performance in the CSST wide-field imaging survey. For a differently selected sample which is designed for answering a more specific scientific question, the results could be different.

Finally, in addition to multi-bands photometries, we may need more informations from other forms of data, such as slitless spectroscopy and morphology parameters from the imaging to break the severe parameter degeneracies. More advanced methods may be able to take advantage of all information to give better redshift and/or stellar population parameter estimation of galaxies. These will be the subjects of future works as well.

The authors gratefully acknowledge the “PHOENIX Supercomputing Platform” jointly operated by the Binary Population Synthesis Group and the Stellar Astrophysics Group at Yunnan Observatories, Chinese Academy of Sciences. We warmly thank the Horizon-AGN team, especially C. Laigle, for making their photometric catalogs and spectra data publicly available. We thank Fengshan Liu, Ye Cao, and Xinwen Shu for helpful discussion about SNR and magnitude limit. We acknowledge support from the National Key R&D Program of China (Nos. 2021YFA1600401 and 2021YFA1600400), the National Science Foundation of China (grant nos. 11773063, 12173037, 12233008, 12233005, 12073078, 12288102) the China Manned Space Project (grant nos. CMS-CSST-2021-A02, CMS-CSST-2021-A04, CMS-CSST-2021-A06, and CMS-CSST-2021-A07), the “Light of West China” Program of Chinese Academy of Sciences, the Yunnan Ten Thousand Talents Plan Young & Elite Talents Project, the International Centre of Supernovae, Yunnan Key Laboratory (No. 202302AN360001). LF also gratefully acknowledges the support of the CAS Project for Young Scientists in Basic Research (No. YSBR-092), the Fundamental Research Funds for the Central Universities (WK3440000006), and the Cyrus Chun Ying Tang Foundations. This work made use of the following softwares: ASTROPY(Astropy Collaboration et al., 2013, 2018), MATPLOTLIB(Hunter, 2007), NUMPY(van der Walt et al., 2011), and H5PY(Collette et al., 2023).

References

  • Abdurro’uf et al. (2021) Abdurro’uf, Lin, Y.-T., Wu, P.-F., & Akiyama, M. 2021, ApJS, 254, 15, doi: 10.3847/1538-4365/abebe2
  • Acquaviva et al. (2011) Acquaviva, V., Gawiser, E., & Guaita, L. 2011, ApJ, 737, 47, doi: 10.1088/0004-637X/737/2/47
  • Acquaviva et al. (2015) Acquaviva, V., Raichoor, A., & Gawiser, E. 2015, ApJ, 804, 8, doi: 10.1088/0004-637X/804/1/8
  • Alsing et al. (2023) Alsing, J., Peiris, H., Mortlock, D., Leja, J., & Leistedt, B. 2023, ApJS, 264, 29, doi: 10.3847/1538-4365/ac9583
  • Alsing et al. (2020) Alsing, J., Peiris, H., Leja, J., et al. 2020, ApJS, 249, 5, doi: 10.3847/1538-4365/ab917f
  • Antonucci (1993) Antonucci, R. 1993, ARA&A, 31, 473, doi: 10.1146/annurev.aa.31.090193.002353
  • Antonucci (2012) —. 2012, Astronomical and Astrophysical Transactions, 27, 557. https://arxiv.org/abs/1210.2716
  • Arnouts et al. (2013) Arnouts, S., Le Floc’h, E., Chevallard, J., et al. 2013, A&A, 558, A67, doi: 10.1051/0004-6361/201321768
  • Ashton et al. (2022) Ashton, G., Bernstein, N., Buchner, J., et al. 2022, Nature Reviews Methods Primers, 2, 39, doi: 10.1038/s43586-022-00121-x
  • Astropy Collaboration et al. (2013) Astropy Collaboration, Robitaille, T. P., Tollerud, E. J., et al. 2013, A&A, 558, A33, doi: 10.1051/0004-6361/201322068
  • Astropy Collaboration et al. (2018) Astropy Collaboration, Price-Whelan, A. M., Sipőcz, B. M., et al. 2018, AJ, 156, 123, doi: 10.3847/1538-3881/aabc4f
  • Aufort et al. (2020) Aufort, G., Ciesla, L., Pudlo, P., & Buat, V. 2020, A&A, 635, A136, doi: 10.1051/0004-6361/201936788
  • Bastian et al. (2010) Bastian, N., Covey, K. R., & Meyer, M. R. 2010, ARA&A, 48, 339, doi: 10.1146/annurev-astro-082708-101642
  • Beckmann et al. (2017a) Beckmann, R. S., Devriendt, J., Slyz, A., et al. 2017a, MNRAS, 472, 949, doi: 10.1093/mnras/stx1831
  • Beckmann et al. (2017b) —. 2017b, ArXiv e-prints. https://arxiv.org/abs/1701.07838
  • Behroozi et al. (2013) Behroozi, P. S., Wechsler, R. H., & Conroy, C. 2013, ApJ, 770, 57, doi: 10.1088/0004-637X/770/1/57
  • Beichman et al. (2012) Beichman, C. A., Rieke, M., Eisenstein, D., et al. 2012, in Space Telescopes and Instrumentation 2012: Optical, Infrared, and Millimeter Wave, ed. M. C. Clampin, G. G. Fazio, H. A. MacEwen, & J. M. O. Jr., Vol. 8442, International Society for Optics and Photonics (SPIE), 84422N, doi: 10.1117/12.925447
  • Bertelli et al. (2008) Bertelli, G., Girardi, L., Marigo, P., & Nasi, E. 2008, A&A, 484, 815, doi: 10.1051/0004-6361:20079165
  • Boquien et al. (2019) Boquien, M., Burgarella, D., Roehlly, Y., et al. 2019, A&A, 622, A103, doi: 10.1051/0004-6361/201834156
  • Bowman et al. (2020) Bowman, W. P., Zeimann, G. R., Nagaraj, G., et al. 2020, ApJ, 899, 7, doi: 10.3847/1538-4357/ab9f3c
  • Breivik et al. (2022) Breivik, K., Connolly, A. J., Ford, K. E. S., et al. 2022, arXiv e-prints, arXiv:2208.02781. https://arxiv.org/abs/2208.02781
  • Brott et al. (2011) Brott, I., de Mink, S. E., Cantiello, M., et al. 2011, A&A, 530, A115, doi: 10.1051/0004-6361/201016113
  • Brown et al. (2019a) Brown, A., Nayyeri, H., Cooray, A., et al. 2019a, ApJ, 871, 87, doi: 10.3847/1538-4357/aaf73b
  • Brown et al. (2019b) Brown, M. J. I., Duncan, K. J., Landt, H., et al. 2019b, MNRAS, 489, 3351, doi: 10.1093/mnras/stz2324
  • Bruzual & Charlot (2003) Bruzual, G., & Charlot, S. 2003, MNRAS, 344, 1000, doi: 10.1046/j.1365-8711.2003.06897.x
  • Buchner (2021) Buchner, J. 2021, arXiv e-prints, arXiv:2101.09675. https://arxiv.org/abs/2101.09675
  • Calzetti et al. (2000) Calzetti, D., Armus, L., Bohlin, R. C., et al. 2000, ApJ, 533, 682, doi: 10.1086/308692
  • Cameron & Pettitt (2014) Cameron, E., & Pettitt, A. 2014, Statistical Science, 29, 397 , doi: 10.1214/13-STS465
  • Cao et al. (2018) Cao, Y., Gong, Y., Meng, X.-M., et al. 2018, MNRAS, 480, 2178, doi: 10.1093/mnras/sty1980
  • Cappellari et al. (2012) Cappellari, M., McDermid, R. M., Alatalo, K., et al. 2012, Nature, 484, 485, doi: 10.1038/nature10972
  • Caputi et al. (2015) Caputi, K. I., Ilbert, O., Laigle, C., et al. 2015, ApJ, 810, 73, doi: 10.1088/0004-637X/810/1/73
  • Carnall et al. (2019) Carnall, A. C., Leja, J., Johnson, B. D., et al. 2019, ApJ, 873, 44, doi: 10.3847/1538-4357/ab04a2
  • Carnall et al. (2018) Carnall, A. C., McLure, R. J., Dunlop, J. S., & Davé, R. 2018, MNRAS, 480, 4379, doi: 10.1093/mnras/sty2169
  • Chabrier (2003) Chabrier, G. 2003, PASP, 115, 763, doi: 10.1086/376392
  • Charlot & Longhetti (2001) Charlot, S., & Longhetti, M. 2001, MNRAS, 323, 887, doi: 10.1046/j.1365-8711.2001.04260.x
  • Chevallard & Charlot (2016) Chevallard, J., & Charlot, S. 2016, MNRAS, 462, 1415, doi: 10.1093/mnras/stw1756
  • Choi et al. (2019) Choi, J., Conroy, C., & Johnson, B. D. 2019, ApJ, 872, 136, doi: 10.3847/1538-4357/aaff67
  • Ciesla et al. (2017) Ciesla, L., Elbaz, D., & Fensch, J. 2017, A&A, 608, A41, doi: 10.1051/0004-6361/201731036
  • Ciesla et al. (2016) Ciesla, L., Boselli, A., Elbaz, D., et al. 2016, A&A, 585, A43, doi: 10.1051/0004-6361/201527107
  • Coelho (2009) Coelho, P. 2009, in American Institute of Physics Conference Series, Vol. 1111, American Institute of Physics Conference Series, ed. G. R. M. L. L. A. A. N. M. . E. B. G. Giobbi, A. Tornambe, 67–74, doi: 10.1063/1.3141624
  • Coelho et al. (2020) Coelho, P. R. T., Bruzual, G., & Charlot, S. 2020, MNRAS, 491, 2025, doi: 10.1093/mnras/stz3023
  • Collette et al. (2023) Collette, A., Kluyver, T., Caswell, T. A., et al. 2023, h5py/h5py: 3.8.0-aarch64-wheels, 3.8.0-aarch64-wheels, Zenodo, doi: 10.5281/zenodo.7568214
  • Conroy (2013) Conroy, C. 2013, ARA&A, 51, 393, doi: 10.1146/annurev-astro-082812-141017
  • Conroy & Gunn (2010) Conroy, C., & Gunn, J. E. 2010, ApJ, 712, 833, doi: 10.1088/0004-637X/712/2/833
  • Conroy et al. (2009) Conroy, C., Gunn, J. E., & White, M. 2009, ApJ, 699, 486, doi: 10.1088/0004-637X/699/1/486
  • Conroy et al. (2010) Conroy, C., White, M., & Gunn, J. E. 2010, ApJ, 708, 58, doi: 10.1088/0004-637X/708/1/58
  • Côté et al. (2016) Côté, B., Ritter, C., O’Shea, B. W., et al. 2016, ApJ, 824, 82, doi: 10.3847/0004-637X/824/2/82
  • da Cunha et al. (2008) da Cunha, E., Charlot, S., & Elbaz, D. 2008, MNRAS, 388, 1595, doi: 10.1111/j.1365-2966.2008.13535.x
  • Dahlen et al. (2013) Dahlen, T., Mobasher, B., Faber, S. M., et al. 2013, ApJ, 775, 93, doi: 10.1088/0004-637X/775/2/93
  • Davidzon et al. (2017) Davidzon, I., Ilbert, O., Laigle, C., et al. 2017, A&A, 605, A70, doi: 10.1051/0004-6361/201730419
  • Davidzon et al. (2019) Davidzon, I., Laigle, C., Capak, P. L., et al. 2019, MNRAS, 489, 4817, doi: 10.1093/mnras/stz2486
  • Debsarma et al. (2016) Debsarma, S., Chattopadhyay, T., Das, S., & Pfenniger, D. 2016, MNRAS, doi: 10.1093/mnras/stw1909
  • Diemer et al. (2017) Diemer, B., Sparre, M., Abramson, L. E., & Torrey, P. 2017, ApJ, 839, 26, doi: 10.3847/1538-4357/aa68e5
  • Doore et al. (2023) Doore, K., Monson, E. B., Eufrasio, R. T., et al. 2023, ApJS, 266, 39, doi: 10.3847/1538-4365/accc29
  • Dore et al. (2019) Dore, O., Hirata, C., Wang, Y., et al. 2019, BAAS, 51, 341. https://arxiv.org/abs/1904.01174
  • Draine (2010) Draine, B. 2010, Physics of the Interstellar and Intergalactic Medium, Princeton Series in Astrophysics (Princeton University Press). https://books.google.co.kr/books?id=FycJvKHyiwsC
  • Draine (2003) Draine, B. T. 2003, ARA&A, 41, 241, doi: 10.1146/annurev.astro.41.011802.094840
  • Dries et al. (2016) Dries, M., Trager, S. C., & Koopmans, L. V. E. 2016, MNRAS, 463, 886, doi: 10.1093/mnras/stw2049
  • Dries et al. (2018) Dries, M., Trager, S. C., Koopmans, L. V. E., Popping, G., & Somerville, R. S. 2018, MNRAS, 474, 3500, doi: 10.1093/mnras/stx2979
  • Driver et al. (2013) Driver, S. P., Robotham, A. S. G., Bland-Hawthorn, J., et al. 2013, MNRAS, 430, 2622, doi: 10.1093/mnras/sts717
  • Dubois et al. (2014) Dubois, Y., Pichon, C., Welker, C., et al. 2014, MNRAS, 444, 1453, doi: 10.1093/mnras/stu1227
  • Eldridge & Stanway (2009) Eldridge, J. J., & Stanway, E. R. 2009, MNRAS, 400, 1019, doi: 10.1111/j.1365-2966.2009.15514.x
  • Feroz & Hobson (2008) Feroz, F., & Hobson, M. P. 2008, MNRAS, 384, 449, doi: 10.1111/j.1365-2966.2007.12353.x
  • Feroz et al. (2009) Feroz, F., Hobson, M. P., & Bridges, M. 2009, MNRAS, 398, 1601, doi: 10.1111/j.1365-2966.2009.14548.x
  • Feroz et al. (2019) Feroz, F., Hobson, M. P., Cameron, E., & Pettitt, A. N. 2019, The Open Journal of Astrophysics, 2, 10, doi: 10.21105/astro.1306.2144
  • Ferreras et al. (2013) Ferreras, I., La Barbera, F., de la Rosa, I. G., et al. 2013, MNRAS, 429, L15, doi: 10.1093/mnrasl/sls014
  • Fitzpatrick (1986) Fitzpatrick, E. L. 1986, AJ, 92, 1068, doi: 10.1086/114237
  • Galliano et al. (2018) Galliano, F., Galametz, M., & Jones, A. P. 2018, ARA&A, 56, 673, doi: 10.1146/annurev-astro-081817-051900
  • Gardner et al. (2006) Gardner, J. P., Mather, J. C., Clampin, M., et al. 2006, Space Sci. Rev., 123, 485, doi: 10.1007/s11214-006-8315-7
  • Gennaro et al. (2018) Gennaro, M., Tchernyshyov, K., Brown, T. M., et al. 2018, ApJ, 855, 20, doi: 10.3847/1538-4357/aaa973
  • Gilda et al. (2021) Gilda, S., Lower, S., & Narayanan, D. 2021, ApJ, 916, 43, doi: 10.3847/1538-4357/ac0058
  • Gong et al. (2019) Gong, Y., Liu, X., Cao, Y., et al. 2019, ApJ, 883, 203, doi: 10.3847/1538-4357/ab391e
  • Green et al. (2012) Green, J., Schechter, P., Baltay, C., et al. 2012, arXiv e-prints, arXiv:1208.4012. https://arxiv.org/abs/1208.4012
  • Hahn & Melchior (2022) Hahn, C., & Melchior, P. 2022, ApJ, 938, 11, doi: 10.3847/1538-4357/ac7b84
  • Hahn et al. (2022) Hahn, C., Kwon, K. J., Tojeiro, R., et al. 2022, arXiv e-prints, arXiv:2202.01809. https://arxiv.org/abs/2202.01809
  • Han & Han (2012) Han, Y., & Han, Z. 2012, ApJ, 749, 123, doi: 10.1088/0004-637X/749/2/123
  • Han & Han (2014) —. 2014, ApJS, 215, 2, doi: 10.1088/0067-0049/215/1/2
  • Han & Han (2019) —. 2019, ApJS, 240, 3, doi: 10.3847/1538-4365/aaeffa
  • Han et al. (2019) Han, Y., Han, Z., & Fan, L. 2019, in The Art of Measuring Galaxy Physical Properties, 5, doi: 10.5281/zenodo.3553688
  • Han et al. (2020) Han, Y., Han, Z., & Fan, L. 2020, in IAU Symposium, Vol. 341, IAU Symposium, ed. M. Boquien, E. Lusso, C. Gruppioni, & P. Tissera, 143–146, doi: 10.1017/S1743921319002746
  • Han et al. (2007) Han, Z., Podsiadlowski, P., & Lynas-Gray, A. E. 2007, MNRAS, 380, 1098, doi: 10.1111/j.1365-2966.2007.12151.x
  • Hernández-Pérez & Bruzual (2013) Hernández-Pérez, F., & Bruzual, G. 2013, MNRAS, 431, 2612, doi: 10.1093/mnras/stt368
  • Hickox & Alexander (2018) Hickox, R. C., & Alexander, D. M. 2018, ARA&A, 56, 625, doi: 10.1146/annurev-astro-081817-051803
  • Higson et al. (2019) Higson, E., Handley, W., Hobson, M., & Lasenby, A. 2019, Statistics and Computing, 29, 891, doi: 10.1007/s11222-018-9844-0
  • Hogg & Foreman-Mackey (2018) Hogg, D. W., & Foreman-Mackey, D. 2018, ApJS, 236, 11, doi: 10.3847/1538-4365/aab76e
  • Hopkins et al. (2014) Hopkins, P. F., Kereš, D., Oñorbe, J., et al. 2014, MNRAS, 445, 581, doi: 10.1093/mnras/stu1738
  • Hoversten & Glazebrook (2008) Hoversten, E. A., & Glazebrook, K. 2008, ApJ, 675, 163, doi: 10.1086/524095
  • Hunter (2007) Hunter, J. D. 2007, Computing in Science & Engineering, 9, 90, doi: 10.1109/MCSE.2007.55
  • Ilbert et al. (2009) Ilbert, O., Capak, P., Salvato, M., et al. 2009, ApJ, 690, 1236, doi: 10.1088/0004-637X/690/2/1236
  • Ivezić et al. (2019) Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111, doi: 10.3847/1538-4357/ab042c
  • Iyer & Gawiser (2017) Iyer, K., & Gawiser, E. 2017, ApJ, 838, 127, doi: 10.3847/1538-4357/aa63f0
  • Iyer et al. (2019) Iyer, K. G., Gawiser, E., Faber, S. M., et al. 2019, ApJ, 879, 116, doi: 10.3847/1538-4357/ab2052
  • Iyer et al. (2020) Iyer, K. G., Tacchella, S., Genel, S., et al. 2020, MNRAS, 498, 430, doi: 10.1093/mnras/staa2150
  • Joachimi (2016) Joachimi, B. 2016, in Astronomical Society of the Pacific Conference Series, Vol. 507, Multi-Object Spectroscopy in the Next Decade: Big Questions, Large Surveys, and Wide Fields, ed. I. Skillen, M. Balcells, & S. Trager, 401
  • Kaviraj et al. (2017) Kaviraj, S., Laigle, C., Kimm, T., et al. 2017, MNRAS, 467, 4739, doi: 10.1093/mnras/stx126
  • Kewley et al. (2019) Kewley, L. J., Nicholls, D. C., & Sutherland, R. S. 2019, ARA&A, 57, 511, doi: 10.1146/annurev-astro-081817-051832
  • Knowles et al. (2021) Knowles, A. T., Sansom, A. E., Allende Prieto, C., & Vazdekis, A. 2021, MNRAS, 504, 2286, doi: 10.1093/mnras/stab1001
  • Knowles et al. (2019) Knowles, A. T., Sansom, A. E., Coelho, P. R. T., et al. 2019, MNRAS, 486, 1814, doi: 10.1093/mnras/stz754
  • Kriek & Conroy (2013) Kriek, M., & Conroy, C. 2013, ApJ, 775, L16, doi: 10.1088/2041-8205/775/1/L16
  • Laigle et al. (2016) Laigle, C., McCracken, H. J., Ilbert, O., et al. 2016, ApJS, 224, 24, doi: 10.3847/0067-0049/224/2/24
  • Laigle et al. (2019) Laigle, C., Davidzon, I., Ilbert, O., et al. 2019, MNRAS, 486, 5104, doi: 10.1093/mnras/stz1054
  • Laureijs et al. (2011) Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints. https://arxiv.org/abs/1110.3193
  • Lawler & Acquaviva (2021) Lawler, A. J., & Acquaviva, V. 2021, MNRAS, 502, 3993, doi: 10.1093/mnras/stab138
  • Lee et al. (2010) Lee, S.-K., Ferguson, H. C., Somerville, R. S., Wiklind, T., & Giavalisco, M. 2010, ApJ, 725, 1644, doi: 10.1088/0004-637X/725/2/1644
  • Lee et al. (2009) Lee, S.-K., Idzi, R., Ferguson, H. C., et al. 2009, ApJS, 184, 100, doi: 10.1088/0067-0049/184/1/100
  • Leja et al. (2019) Leja, J., Carnall, A. C., Johnson, B. D., Conroy, C., & Speagle, J. S. 2019, ApJ, 876, 3, doi: 10.3847/1538-4357/ab133c
  • Leja et al. (2017) Leja, J., Johnson, B. D., Conroy, C., van Dokkum, P. G., & Byler, N. 2017, ApJ, 837, 170, doi: 10.3847/1538-4357/aa5ffe
  • Leja et al. (2020) Leja, J., Speagle, J. S., Johnson, B. D., et al. 2020, ApJ, 893, 111, doi: 10.3847/1538-4357/ab7e27
  • Lower et al. (2020) Lower, S., Narayanan, D., Leja, J., et al. 2020, ApJ, 904, 33, doi: 10.3847/1538-4357/abbfa7
  • Lower et al. (2022) —. 2022, ApJ, 931, 14, doi: 10.3847/1538-4357/ac6959
  • Lyu & Rieke (2022) Lyu, J., & Rieke, G. 2022, Universe, 8, 304, doi: 10.3390/universe8060304
  • Ma et al. (2016) Ma, X., Hopkins, P. F., Faucher-Giguère, C.-A., et al. 2016, MNRAS, 456, 2140, doi: 10.1093/mnras/stv2659
  • Madau (1995) Madau, P. 1995, ApJ, 441, 18, doi: 10.1086/175332
  • Maiolino & Mannucci (2019) Maiolino, R., & Mannucci, F. 2019, A&A Rev., 27, 3, doi: 10.1007/s00159-018-0112-2
  • Maraston (2005) Maraston, C. 2005, MNRAS, 362, 799, doi: 10.1111/j.1365-2966.2005.09270.x
  • Maraston et al. (2006) Maraston, C., Daddi, E., Renzini, A., et al. 2006, ApJ, 652, 85, doi: 10.1086/508143
  • Marigo et al. (2008) Marigo, P., Girardi, L., Bressan, A., et al. 2008, A&A, 482, 883, doi: 10.1051/0004-6361:20078467
  • Morishita (2022) Morishita, T. 2022, gsf: Grism SED Fitting package, Astrophysics Source Code Library, record ascl:2211.012. http://ascl.net/2211.012
  • Narayanan et al. (2018) Narayanan, D., Conroy, C., Davé, R., Johnson, B. D., & Popping, G. 2018, ApJ, 869, 70, doi: 10.3847/1538-4357/aaed25
  • National Academies of Sciences, Engineering, and Medicine (2021) National Academies of Sciences, Engineering, and Medicine. 2021, Pathways to Discovery in Astronomy and Astrophysics for the 2020s (The National Academies Press), doi: 10.17226/26141
  • Netzer (2015) Netzer, H. 2015, ARA&A, 53, 365, doi: 10.1146/annurev-astro-082214-122302
  • Noll et al. (2009) Noll, S., Burgarella, D., Giovannoli, E., et al. 2009, A&A, 507, 1793, doi: 10.1051/0004-6361/200912497
  • Oke (1974) Oke, J. B. 1974, ApJS, 27, 21, doi: 10.1086/190287
  • Pacifici et al. (2012) Pacifici, C., Charlot, S., Blaizot, J., & Brinchmann, J. 2012, MNRAS, 421, 2002, doi: 10.1111/j.1365-2966.2012.20431.x
  • Padoan et al. (1997) Padoan, P., Nordlund, A., & Jones, B. J. T. 1997, MNRAS, 288, 145
  • Pforr et al. (2012) Pforr, J., Maraston, C., & Tonini, C. 2012, MNRAS, 422, 3285, doi: 10.1111/j.1365-2966.2012.20848.x
  • Pforr et al. (2013) —. 2013, MNRAS, 435, 1389, doi: 10.1093/mnras/stt1382
  • Qiu & Kang (2022) Qiu, Y., & Kang, X. 2022, ApJ, 930, 66, doi: 10.3847/1538-4357/ac63a1
  • Reddy et al. (2012) Reddy, N. A., Pettini, M., Steidel, C. C., et al. 2012, ApJ, 754, 25, doi: 10.1088/0004-637X/754/1/25
  • Reddy et al. (2015) Reddy, N. A., Kriek, M., Shapley, A. E., et al. 2015, ApJ, 806, 259, doi: 10.1088/0004-637X/806/2/259
  • Rieke et al. (2005) Rieke, M. J., Kelly, D., & Horner, S. 2005, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 5904, Cryogenic Optical Systems and Instruments XI, ed. J. B. Heaney & L. G. Burriesci, 1–8, doi: 10.1117/12.615554
  • Robertson (2022) Robertson, B. E. 2022, ARA&A, 60, 121, doi: 10.1146/annurev-astro-120221-044656
  • Robotham et al. (2020) Robotham, A. S. G., Bellstedt, S., Lagos, C. d. P., et al. 2020, MNRAS, 495, 905, doi: 10.1093/mnras/staa1116
  • Salim & Boquien (2019) Salim, S., & Boquien, M. 2019, ApJ, 872, 23, doi: 10.3847/1538-4357/aaf88a
  • Salim et al. (2018) Salim, S., Boquien, M., & Lee, J. C. 2018, ApJ, 859, 11, doi: 10.3847/1538-4357/aabf3c
  • Salim & Narayanan (2020) Salim, S., & Narayanan, D. 2020, ARA&A, 58, 529, doi: 10.1146/annurev-astro-032620-021933
  • Salmon et al. (2016) Salmon, B., Papovich, C., Long, J., et al. 2016, ApJ, 827, 20, doi: 10.3847/0004-637X/827/1/20
  • Salvato et al. (2018) Salvato, M., Ilbert, O., & Hoyle, B. 2018, Nature Astronomy, doi: 10.1038/s41550-018-0478-0
  • Schmidt (1959) Schmidt, M. 1959, ApJ, 129, 243, doi: 10.1086/146614
  • Seon & Draine (2016) Seon, K.-I., & Draine, B. T. 2016, ApJ, 833, 201, doi: 10.3847/1538-4357/833/2/201
  • Sharma (2017) Sharma, S. 2017, ARA&A, 55, 213, doi: 10.1146/annurev-astro-082214-122339
  • Shivaei et al. (2020) Shivaei, I., Reddy, N., Rieke, G., et al. 2020, ApJ, 899, 117, doi: 10.3847/1538-4357/aba35e
  • Skilling (2004) Skilling, J. 2004, in American Institute of Physics Conference Series, Vol. 735, American Institute of Physics Conference Series, ed. . U. V. T. R. Fischer, R. Preuss, 395–405, doi: 10.1063/1.1835238
  • Skilling (2006) Skilling, J. 2006, Bayesian Analysis, 1, 833
  • Speagle (2020) Speagle, J. S. 2020, MNRAS, 493, 3132, doi: 10.1093/mnras/staa278
  • Spergel et al. (2003) Spergel, D. N., Verde, L., Peiris, H. V., et al. 2003, ApJS, 148, 175, doi: 10.1086/377226
  • Suess et al. (2022) Suess, K. A., Leja, J., Johnson, B. D., et al. 2022, ApJ, 935, 146, doi: 10.3847/1538-4357/ac82b0
  • Tacconi et al. (2020) Tacconi, L. J., Genzel, R., & Sternberg, A. 2020, ARA&A, 58, 157, doi: 10.1146/annurev-astro-082812-141034
  • Tanaka (2015) Tanaka, M. 2015, ApJ, 801, 20, doi: 10.1088/0004-637X/801/1/20
  • Thomas & Maraston (2003) Thomas, D., & Maraston, C. 2003, A&A, 401, 429, doi: 10.1051/0004-6361:20030153
  • Thorne et al. (2021) Thorne, J. E., Robotham, A. S. G., Davies, L. J. M., et al. 2021, MNRAS, 505, 540, doi: 10.1093/mnras/stab1294
  • Tinsley (1978) Tinsley, B. M. 1978, ApJ, 222, 14, doi: 10.1086/156116
  • Tinsley (1980) —. 1980, Fund. Cosmic Phys., 5, 287, doi: 10.48550/arXiv.2203.02041
  • Tinsley & Gunn (1976) Tinsley, B. M., & Gunn, J. E. 1976, ApJ, 203, 52, doi: 10.1086/154046
  • Valentini et al. (2019) Valentini, M., Borgani, S., Bressan, A., et al. 2019, MNRAS, 485, 1384, doi: 10.1093/mnras/stz492
  • van der Walt et al. (2011) van der Walt, S., Colbert, S. C., & Varoquaux, G. 2011, Computing in Science & Engineering, 13, 22, doi: 10.1109/MCSE.2011.37
  • van Dokkum (2008) van Dokkum, P. G. 2008, ApJ, 674, 29, doi: 10.1086/525014
  • Volonteri et al. (2016) Volonteri, M., Dubois, Y., Pichon, C., & Devriendt, J. 2016, MNRAS, 460, 2979, doi: 10.1093/mnras/stw1123
  • Walcher et al. (2011) Walcher, J., Groves, B., Budavári, T., & Dale, D. 2011, Ap&SS, 331, 1, doi: 10.1007/s10509-010-0458-z
  • Wang & Lilly (2020) Wang, E., & Lilly, S. J. 2020, ApJ, 892, 87, doi: 10.3847/1538-4357/ab7b7d
  • Weingartner & Draine (2001) Weingartner, J. C., & Draine, B. T. 2001, ApJ, 548, 296, doi: 10.1086/318651
  • Witt & Gordon (2000) Witt, A. N., & Gordon, K. D. 2000, ApJ, 528, 799, doi: 10.1086/308197
  • Yallup et al. (2022) Yallup, D., Janßen, T., Schumann, S., & Handley, W. 2022, European Physical Journal C, 82, 678, doi: 10.1140/epjc/s10052-022-10632-2
  • Yan et al. (2019) Yan, R., Chen, Y., Lazarz, D., et al. 2019, ApJ, 883, 175, doi: 10.3847/1538-4357/ab3ebc
  • Zhan (2011) Zhan, H. 2011, Scientia Sinica Physica, Mechanica & Astronomica, 41, 1441, doi: 10.1360/132011-961
  • Zhan (2018) Zhan, H. 2018, in 42nd COSPAR Scientific Assembly, Vol. 42, E1.16–4–18
  • Zhan (2021) —. 2021, Chinese Science Bulletin, 66, 1290
  • Zhang et al. (2005) Zhang, F., Han, Z., Li, L., & Hurley, J. R. 2005, MNRAS, 357, 1088, doi: 10.1111/j.1365-2966.2005.08739.x
  • Zhou et al. (2022a) Zhou, X., Gong, Y., Meng, X.-M., et al. 2022a, arXiv e-prints, arXiv:2206.13696. https://arxiv.org/abs/2206.13696
  • Zhou et al. (2022b) —. 2022b, MNRAS, 512, 4593, doi: 10.1093/mnras/stac786