BayeSED-GALAXIES I. Performance test for simultaneous photometric redshift and stellar population parameter estimation of galaxies in the CSST wide-field multiband imaging survey

Yunkun Han Yunnan Observatories, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China Center for Astronomical Mega-Science, Chinese Academy of Sciences, 20A Datun Road, Chaoyang District, Beijing, 100012, P. R. China Key Laboratory for the Structure and Evolution of Celestial Objects, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China International Centre of Supernovae, Yunnan Key Laboratory, Kunming 650216, P. R. China Lulu Fan Deep Space Exploration Laboratory / Department of Astronomy, University of Science and Technology of China, Hefei 230026, China School of Astronomy and Space Science, University of Science and Technology of China, Hefei 230026, China Xian Zhong Zheng Purple Mountain Observatory, Chinese Academy of Sciences, 10 Yuanhua Road, Nanjing 210023, China Jin-Ming Bai Yunnan Observatories, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China Center for Astronomical Mega-Science, Chinese Academy of Sciences, 20A Datun Road, Chaoyang District, Beijing, 100012, P. R. China Key Laboratory for the Structure and Evolution of Celestial Objects, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China Zhanwen Han Yunnan Observatories, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China Center for Astronomical Mega-Science, Chinese Academy of Sciences, 20A Datun Road, Chaoyang District, Beijing, 100012, P. R. China Key Laboratory for the Structure and Evolution of Celestial Objects, Chinese Academy of Sciences, 396 Yangfangwang, Guandu District, Kunming, 650216, P. R. China International Centre of Supernovae, Yunnan Key Laboratory, Kunming 650216, P. R. China

Abstract

The forthcoming CSST wide-field multiband imaging survey will produce seven-band photometric spectral energy distributions (SEDs) for billions of galaxies. The effective extraction of astronomical information from these massive datasets of SEDs relies on the techniques of both SED synthesis (or modeling) and analysis (or fitting). We evaluate the performance of the latest version of BayeSED code combined with SED models with increasing complexity for simultaneously determining the photometric redshifts and stellar population parameters of galaxies in this survey. By using an empirical statistics-based mock galaxy sample without SED modeling errors, we show finding that the random observational errors in photometries are more important sources of errors than the parameter degeneracies and Bayesian analysis method and tool. By using a Horizon-AGN hydrodynamical simulation-based mock galaxy sample with SED modeling errors about the star formation histories (SFHs) and dust attenuation laws (DALs), the simple typical assumptions lead to significantly worse parameter estimation with CSST photometries only.The SED models with more flexible (or complicated) forms of SFH/DAL do not necessarily lead to better estimation of redshift and stellar population parameters. We discuss the selection of the best SED model by means of Bayesian model comparison in different surveys. Our results reveal that the Bayesian model comparison with Bayesian evidence may favor SED models with different complexities when using photometries from different surveys. Meanwhile, the SED model with the largest Bayesian evidence tends to give the best performance of parameter estimation, which is more clear for photometries with larger discriminative power.

galaxies: fundamental parameters — galaxies: stellar content — galaxies: statistics — methods: data analysis — methods: statistical

^†^†journal: ApJS

1 Introduction

Understanding the complex ecosystem of stars, interstellar gas and dust, and supermassive black holes in galaxies is one of the most important challenges in modern astrophysics (National Academies of Sciences, Engineering, and Medicine, 2021). The new generation of space and ground telescopes and the corresponding large surveys will provide vast amounts of multi-band data for understanding the cosmic ecosystems and all the complex physical processes involved. For example, the James Webb Space Telescope (JWST, Rieke et al., 2005; Gardner et al., 2006; Beichman et al., 2012) is able to detect the earliest stages of galaxies from infrared at unprecedented depths, and is expected to provide decisive observations of the first generation of stars and galaxies (Beichman et al., 2012; Robertson, 2022). Meanwhile, forthcoming deep and wide field surveys with the Chinese Space Station Telescope (CSST, Zhan, 2011, 2018, 2021), the Euclid Space Telescope (Laureijs et al., 2011; Joachimi, 2016), the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST, Ivezić et al., 2019; Breivik et al., 2022), and the Nancy Grace Roman Space Telescope (NGRST, Green et al., 2012; Dore et al., 2019) will provide multi-band photometric and spectroscopic information for billions of galaxies. Especially, the CSST wide-field multiband imaging survey is set to image approximately 17,500 square degrees of the sky using NUV, u, g, r, i, z, and y bands in about 10 years of orbital time, which aims to achieve a $5\sigma$ limiting magnitude of 26 (AB mag) or higher for point sources in the g and r bands. How to effectively and reliably measure the redshift and the properties of various physical components of galaxies from the obtained huge amount of photometric spectral energy distributions (SEDs) data has become an urgent task to be done. A new generation of SED synthesis and analysis methods and tools are strongly demanded to effectively extract physical information from those massive datasets of observational SEDs.

The SED synthesis and analysis of galaxies are two aspects that are both opposite and unified in nature. The reliability and efficiency of the SED synthesis and analysis methods and tools will directly determine the reliability and efficiency of physical information extraction from the massive multi-wavelength datasets. In terms of SED synthesis of galaxies, the evolutionary synthesis technique of stellar population has become the core method from the pioneering works of Tinsley & Gunn (1976); Tinsley (1978). Nowadays, the stellar population synthesis models of BC03 (Bruzual & Charlot, 2003), M05 (Maraston, 2005), FSPS (Conroy et al., 2009), BPASS (Eldridge & Stanway, 2009), among others are widely used in the study of the formation and evolution of galaxies. However, in the SED synthesis models of galaxies, many important uncertainties remain in almost all the model ingredients (Conroy et al., 2009, 2010; Conroy & Gunn, 2010; Conroy, 2013), such as the initial stellar mass function (IMF) (Padoan et al., 1997; Hoversten & Glazebrook, 2008; van Dokkum, 2008; Bastian et al., 2010; Cappellari et al., 2012; Ferreras et al., 2013; Gennaro et al., 2018), the physics of stellar evolution (Thomas & Maraston, 2003; Zhang et al., 2005; Maraston et al., 2006; Han et al., 2007; Marigo et al., 2008; Bertelli et al., 2008; Brott et al., 2011; Hernández-Pérez & Bruzual, 2013), stellar spectral libraries (Coelho, 2009; Choi et al., 2019; Knowles et al., 2019; Coelho et al., 2020; Knowles et al., 2021; Yan et al., 2019), the complex star formation and metallicity enrichment histories (SFHs and MEHs) (Debsarma et al., 2016; Iyer et al., 2019; Carnall et al., 2019; Leja et al., 2019; Aufort et al., 2020; Wang & Lilly, 2020; Iyer et al., 2020; Côté et al., 2016; Maiolino & Mannucci, 2019; Valentini et al., 2019), the reprocessing by the interstellar gas and dust (Draine, 2003, 2010; Galliano et al., 2018; Kewley et al., 2019; Salim & Narayanan, 2020; Tacconi et al., 2020), and the possible contribution from active galactic nuclei (AGNs) at the center of galaxies (Antonucci, 1993, 2012; Netzer, 2015; Hickox & Alexander, 2018; Brown et al., 2019a, b; Lyu & Rieke, 2022). Different choices of these model ingredients will lead to very different estimation of the redshifts and physical parameters of galaxies, as well as different and even conflicting conclusions about the formation and evolution of galaxies. Therefore, the proper selection of these model ingredients is an essential step in any SED analysis work of galaxies (Han & Han, 2019; Han et al., 2020).

In terms of SED analysis of galaxies, the Bayesian method has been widely adopted in the last decade. For example, the widely used and actively developing SED fitting codes, such as MAGPHYS (da Cunha et al., 2008), CIGALE (Noll et al., 2009; Boquien et al., 2019), GalMC (Acquaviva et al., 2011), BayeSED (Han & Han, 2012, 2014), BEAGLE (Chevallard & Charlot, 2016), Prospector (Leja et al., 2017), BAGPIPES (Carnall et al., 2018), and ProSpect (Robotham et al., 2020) are all based on the Bayesian methods. Besides, a long list of new SED fitting codes, such as MCSED (Bowman et al., 2020), piXedfit (Abdurro’uf et al., 2021), gsf (Morishita, 2022), and Lightning (Doore et al., 2023) among others, are build along this way. The application of Bayesian methods implies that the SED analysis of galaxies is considered as a more general Bayesian inference problem instead of the previous Chi-square minimization-based optimization problem known as SED fitting. For the parameter estimation of a give SED model, the Bayesian approach provides the complete posterior probability distribution of parameters as the solution to the SED analysis problem, which is computationally more demanding but allows a more formal and simultaneous estimation of parameter values and their uncertainties. More importantly, for the selection of model ingredients, the Bayesian approach also provides the very useful Bayesian evidence which can be considered as a quantified Occam’s razor for effective model selection.

A noteworthy difference among the Bayesian SED analysis tools is that the earlier tools (e.g. MAGPHYS and CIGALE) are based on irregular or regular grid search, while the newer generation of tools (e.g. GalMC and BayeSED) are based on more advanced random sampling techniques such as Markov Chain Monte Carlo (MCMC, Sharma, 2017; Hogg & Foreman-Mackey, 2018) and Nested Sampling (NS, Skilling, 2006; Buchner, 2021; Ashton et al., 2022). The advantage of the grid-based Bayesian approach is that an SED library with regular or irregular model grids can be built in advance for only once. Besides, the prior probabilities can be set more freely during this procedure. Then it can be used in the analysis of a large sample of galaxies of any size without the generation of new SEDs. However, the size of SED library needs to be very large to allow a reasonable parameter estimation for all galaxies in the sample, especially in the case with regular grids where the number of required grids will increase dramatically with the number of free parameters. In contrast, a sampling-based Bayesian approach allows for a more detailed and efficient sampling of the parameter space for each galaxy and allows for a finer reconstruction of the posterior, leading to more reliable parameter estimates. However, the theoretical SED synthesis needs to be done in real-time and repeated many times, which could be very computationally expensive for the analysis of very large samples of galaxies. Fortunately, a much more efficient synthesis SED models can be achieved with the help of machine learning techniques. For example, in Han & Han (2014) we have employed the artificial neural network (ANN) and K-nearest neighbor (KNN) searching techniques to speed up the sampling-based Bayesian approach. The combination of sampling-based Bayesian inference and machine learning techniques enables the detailed Bayesian SED analysis of very large samples of galaxies (Han et al., 2019). Although the training phase of machine-learning-based SED synthesis method could be very time-consuming, especially for very complex SED models with many free parameters and the accurate synthesis of high-resolution SED, it is very promising with more advanced training techniques (Alsing et al., 2020; Gilda et al., 2021; Qiu & Kang, 2022; Hahn & Melchior, 2022) and worthwhile to carry out further exploration in this direction.

For the study of galaxy formation and evolution, the ideal SED synthesis and analysis tool should be able to simultaneously account for the contributions of stars, interstellar gas and dust, and AGN components, and to provide accurate and efficient estimates of the redshift and the physical properties of all components. However, in practice, it is very difficult, if not impossible, to fully satisfy all of these requirements. Therefore, a good SED synthesis and analysis tool should attempt to achieve a reasonable balance among these requirements as much as possible. This is what we are trying to achieve during the development of the BayeSED code (Han & Han, 2012, 2014, 2019; Han et al., 2019, 2020). In this work, we will rigorously test the performance of the latest version of BayeSED code combined with SED models with increasing complexity for simultaneous photometric redshift and stellar population parameter estimation of galaxies, so as to be ready for the analysis of the forthcoming massive datasets from the CSST wide-field multiband imaging survey and others.

We begin in §2 by introducing the methods we have employed for the generation of empirical statistics-based (§2.1) and hydrodynamical simulation-based (§2.2) mock catalog of galaxies, observational error modeling (§2.3) and the selection of samples (§2.4) that will be used for the performance test. In §3, we briefly describe the Bayesian approach of photometric SED analysis methods, including parameter estimation (§3.1) and model selection (§3.2). Especially, in §3.3, we will introduce some runtime parameters of MultiNest algorithm which is the core engine of BayeSED. They need to be properly tuned to improve the performance of BayeSED. We present the results of performance test for the case without SED modeling errors using an empirical statistics-based mock galaxy sample in §4. In §5, by employing the simplest SED model, we present the results of performance test for the case with SED modeling errors about the SFH and DAL of galaxies using a Horizon-AGN hydrodynamical simulation-based mock galaxy sample for CSST-like imaging survey. In §6, we discuss the effectiveness of more flexible (or complex) forms of SFH and DAL of galaxies for improving the performance of simultaneous redshift and stellar parameter estimation in CSST-like (§6.1), CSST+Euclid-like (§6.2), and COSMOS-like (§6.3) surveys with increasing discriminative power, respectively. Especially, we also discuss the relation between the metrics of the quality of parameter estimation and Bayesian evidence, as well as how they depend on the different surveys. Finally, a summary of our results and conclusions is presented in §7.

Throughout this work, we assume a flat $\Lambda$ -cold dark matter cosmology with $H_{0}=70$ km s^-1 Mpc^-1, $\Omega_{m}=0.3$ , $\Omega_{\Lambda}=0.7$ (Spergel et al., 2003, WMAP-5), and the Chabrier (2003) IMF. All the presented magnitudes are in the AB system (Oke, 1974).

2 Bayesian photometric SED synthesis with BayeSED

The SED synthesis (or modeling) module is an essential part in any Bayesian SED fitting code. In BayeSED-V3, we have added more functions for SED synthesis, especially for the simulation of mock observation of galaxies in a Bayesian way. This is not just crucial for the current work, but also lays the foundation for future applications of machine learning and simulation-based Bayesian inference methods in Bayesian SED fitting (Hahn & Melchior, 2022; Hahn et al., 2022). For this work, we use the empirical statistics-based (§2.1) and hydrodynamical simulation-based (§2.2) methods to generate mock photometric catalog, add noise with a simple magnitude limit-based approach (2.3), and select a sample (2.4) similar to previous works for the performance test in the next two sections. In the following, we introduce them in more detail.

2.1 Empirical statistics-based photometric mock catalog

The first method to generate mock photometric catalog is built by randomly draw samples from the parameter space of a particular SED model while under the constraints of some empirical statistical properties of galaxies. The sampling is performed with the same nested sampler MultiNest as in the Bayesian SED analysis mode of BayeSED. A sample from this catalog will be used in §4 to test the performance of redshift and stellar population parameter estimation in the case where the SED modeling is perfect, since exactly the same SED modeling method will be used in the Bayesian SED analysis of it.

2.1.1 SED modeling

As in Han & Han (2019), the SED of a galaxy is modeled as the luminosity of starlight from stellar populations of varying ages and metallicities, transmitted through the Interstellar Medium (ISM) and the Intergalactic Medium (IGM) to the observer. Specifically, the luminosity emitted at wavelength $\lambda$ by a galaxy with $age=t$ can be given as:

	$\displaystyle L_{\lambda}(t)$	$\displaystyle=\int_{0}^{t}{\rm d}\hbox{$t^{\prime}$}\,\psi(t-\hbox{$t^{\prime}$})\,S_{\lambda}[\hbox{$t^{\prime}$},{Z}(t-\hbox{$t^{\prime}$})]\,T^{\textsc{ism}}_{\lambda}(t,t^{\prime})$		(1)
		$\displaystyle=T^{\textsc{ism}}_{\lambda}\,\int_{0}^{t}{\rm d}\hbox{$t^{\prime}$}\,\psi(t-\hbox{$t^{\prime}$})\,S_{\lambda}[\hbox{$t^{\prime}$},{Z}(t-\hbox{$t^{\prime}$})]\,,$		(2)

where $\psi(t-\hbox{$t^{\prime}$})$ is the star formation history (SFH) describing the SFR as a function of the time $t-\hbox{$t^{\prime}$}$ , and $S_{\lambda}[\hbox{$t^{\prime}$},{Z}(t-t^{\prime})]$ is the luminosity emitted per unit wavelength per unit mass by a simple stellar population (SSP) of age $t^{\prime}$ and metallicity ${Z}(t-t^{\prime})$ .

$T^{\textsc{ism}}_{\lambda}(t,t^{\prime})$ is the transmission function of the ISM (Charlot & Longhetti, 2001), which is contributed by two components:

T^{\textsc{ism}}_{\lambda}(t,t^{\prime})=T^{+}_{\lambda}(t,t^{\prime})T^{0}_{\lambda}(t,t^{\prime}),

(3)

where $T^{+}_{\lambda}(t,t^{\prime})$ and $T^{0}_{\lambda}(t,t^{\prime})$ are the transmission functions of the ionized gas and the neutral ISM, respectively. The transmission through ionized gas can be modeled with photoionization code such as CLOUDY. However, we set $T^{+}_{\lambda}(t,t^{\prime})=1$ in this work to be consistent with the hydrodynamical-simulation based catalog (§2.2). A detailed modeling of $T^{+}_{\lambda}(t,t^{\prime})$ with CLOUDY to account for the combined effects of starlight absorption, nebular line emission, ionized continuum emission, and possible emission from warm dust within HII regions will be presented in a companion paper. Meanwhile, the transmission functions of the neutral ISM is considered with a simple time-independent dust attenuation law (DAL) and uniformly applied to the whole galaxy.

${Z}(t-t^{\prime})$ is the stellar metallicity as a function of the time $t-t^{\prime}$ , which describe the chemical evolution history of the galaxy. In previous works, we assume a time-independent metallicity, i.e. ${Z}(t-t^{\prime})=Z_{0}$ , as in many SED fitting codes of galaxy. To properly consider the evolution of stellar metallicity, we additionally employ a linear SFH-to-metallicity mapping model(Driver et al., 2013; Robotham et al., 2020; Thorne et al., 2021; Alsing et al., 2023):

Z(t-t^{\prime})=\left(Z(t_{\mathrm{age}})-Z_{\min}\right)\frac{1}{M}\int_{0}^{t}\psi(t-\hbox{$t^{\prime}$}){\rm d}\hbox{$t^{\prime}$}+Z_{\min}

(4)

Generally, the main ingredients for our SED modeling of galaxies are the SSP model, SFH, CEH and DAL. In this work,as the construction of the Horizon-AGN hydrodynamical-simulation based catalog (§2.2), we use the SSP model assuming a Chabrier (2003) stellar IMF from the widely used stellar population synthesis model of Bruzual & Charlot (2003). The SFH of galaxies is typically parameterized as the exponentially declining form: ${\rm SFR}(t)\propto e^{-t/\tau}\,$ (hereafter $\tau$ model). The $\tau$ model only describes the SFH of galaxies in a closed box without inflow of pristine gas and outflow of processed gas, where the gases are converted to stars at a rate proportional to the remaining gas and with a fixed efficiency (Schmidt, 1959; Tinsley, 1980). It is widely discussed in the literature that this simple assumption may lead to systematically biased estimation of stellar population parameters, especially for galaxies at $z\gtrsim 2$ (Lee et al., 2009, 2010; Reddy et al., 2012; Ciesla et al., 2017; Carnall et al., 2018). Therefore, some more flexible and physically inspired form of models have been suggested to improve the measurement of SFHs of galaxies and the estimation of their stellar population parameters and photometric redshift (Pacifici et al., 2012; Ciesla et al., 2017; Iyer & Gawiser, 2017; Carnall et al., 2019; Leja et al., 2019; Iyer et al., 2019; Lower et al., 2020; Suess et al., 2022).

In the present work, we employ three extension of the $\tau$ model with different complexity. The first one is described as:

\psi(t)\propto t^{\beta}\times\exp({-t/\tau}),

(5)

which is just an extended form of the delayed- $\tau$ model (Lee et al., 2010). Apparently, the typical $\tau$ model and delayed- $\tau$ model are just two special cases of this model (hereafter $\beta$ - $\tau$ model) with $\beta=0$ and $\beta=1$ , respectively. The second one is the $\beta$ - $\tau$ model combined with a quenching (or rejuvenation) component which is described as (Ciesla et al., 2016):

\psi(t)\propto\begin{cases}t^{\beta}\times\exp(-t/\tau),&\text{ when }t<=t_{\text{trunc }}\\ r_{\mathrm{SFR}}\times\psi\left(t=t_{\text{trunc}}\right),&\text{ when }t>t_{\text{trunc }}\end{cases}

(6)

where $t_{\text{trunc}}$ is the time when the star formation is quenched ( $r_{\mathrm{SFR}}<1$ ) or rejuvenated ( $r_{\mathrm{SFR}}>1$ ), and $r_{\mathrm{SFR}}$ is the ratio between $\psi(t>t_{\text{trunc }})$ and $\psi(t=t_{\text{trunc }})$ :

r_{\mathrm{SFR}}=\frac{\psi\left(t>t_{\text{trunc }}\right)}{\psi\left(t_{\text{trunc }}\right)}.

(7)

This model (hereafter $\beta$ - $\tau$ -r model) is a further extension of the $\beta$ - $\tau$ model with the latter being the special case with $r_{\mathrm{SFR}}=1$ . The third one is the double power-law model (Diemer et al., 2017; Carnall et al., 2018; Alsing et al., 2023) combined with a quenching (or rejuvenation) component which is described as:

\psi(t)\propto\begin{cases}\frac{1.0}{{(\frac{t}{t^{*}})^{\alpha}+(\frac{t}{t^{*}})^{-\beta}}},&\text{ when }t<=t_{\text{trunc }}\\ r_{\mathrm{SFR}}\times\psi\left(t=t_{\text{trunc}}\right),&\text{ when }t>t_{\text{trunc }}\end{cases}

(8)

where $\alpha$ and $\beta$ are the falling and rising slopes, respectively, and $t^{*}$ is related to the time at which star formation peaks, which is defined as $t^{*}\equiv\tau t_{\mathrm{age}}$ for the age of the galaxy $t_{\mathrm{age}}$ . A major advantage of the double-power-law model is the decoupling of the rising and falling parts of the SFH. Therefore, this model (hereafter $\alpha$ - $\beta$ - $\tau$ -r model) is even more flexible than the $\beta$ - $\tau$ -r model.

The dust attenuation law (DAL) is another very important ingredient for the SED modeling of galaxies (Walcher et al., 2011; Conroy, 2013). When deriving the photometric redshift and physical properties of galaxies from the analysis of their photometric or spectroscopic observations, a universal DAL as a simple uniform screen is commonly assumed. However, different choices of the universal law may lead to very different estimation of photometric redshift and physical parameters of galaxies (Pforr et al., 2012, 2013; Salim & Narayanan, 2020). Especially, many studies show that the dust attenuation curve of different galaxies are very different (Kriek & Conroy, 2013; Reddy et al., 2015; Salmon et al., 2016; Salim & Boquien, 2019; Shivaei et al., 2020), and therefore there is no universal DAL as expected on theoretical grounds (Witt & Gordon, 2000; Seon & Draine, 2016; Narayanan et al., 2018; Lower et al., 2022). By a detailed study of the dust attenuation curves of about 230,000 individual galaxies in the local universe using photometric data covering from UV to IR bands, Salim et al. (2018) presented new forms of attenuation laws that are suitable for normal star-forming galaxies, high-z analogs, and quiescent galaxies (See also Noll et al., 2009). In this work, we additionally employ this new form of DAL which is parameterized as following:

\frac{A_{\rm\lambda,{\rm mod}}}{A_{\rm V}}=\frac{k_{\lambda}}{R_{\rm V}}=\frac{k_{\lambda,{\rm Cal}}}{R_{V,{\rm Cal}}}\left(\frac{\lambda}{5500\text{\AA}}\right)^{\delta}+\frac{D_{\lambda}}{R_{V,{\rm mod}}},

(9)

where ${k_{\lambda,{\rm Cal}}}/{R_{V,{\rm Cal}}}$ is the Calzetti et al. (2000) DAL with ${R_{V,{\rm Cal}}}=4.05$ . The power law term with an exponent $\delta$ is introduced to deviate from the slope of the Calzetti et al. (2000) DAL. $R_{V,{\rm mod}}$ is the $\delta$ -dependent ratio of total to selective extinction for the modified law. The term $D_{\lambda}$ is introduced to add a UV bump. The relationship between $R_{V,{\rm mod}}$ and $\delta$ is given by:

R_{V,{\rm mod}}=\frac{R_{V,{\rm Cal}}}{(R_{V,{\rm Cal}}+1)(0.44/0.55)^{\delta}-R_{V,{\rm Cal}}}.

(10)

The UV bump following a Drude profile (Fitzpatrick, 1986) is represented as:

D_{\lambda}(E_{\rm b})=\frac{E_{\rm b}\lambda^{2}\gamma^{2}}{[\lambda^{2}-(\lambda_{0})^{2}]^{2}+\lambda^{2}\gamma^{2}},

(11)

with the amplitude $E_{\rm b}$ , fixed central wavelength $\lambda_{0}=0.2175\mu{\rm m}$ and width $\gamma=0.35\mu{\rm m}$ .

In total, we have considered six different combinations of SFH, CEH, and DAL with increasing complexity. A summary of these models, their parameters and priors are shown in Table 1.

Table 1: Summary of SED models, parameters and priors.

Model	Parameter	Range	Prior
	$z$	$[0,4]$	U
	${\rm log}(M_{*}/{\rm M_{\odot}})$	$[8,12]$	U
	${\rm log}(age/\rm{yr})$ ^c^cfootnotemark:	$[8,10.14]$	U
	$A_{\rm v}$	$[0,4]$	U
SFH= $\tau$ ,CEH=no	${\rm log}(Z_{0}/Z_{\odot})$	$[-2.30,0.70]$	U
DAL=Cal+00^a^afootnotemark:	${\rm log}(\tau/\rm{yr})$	$[6,10]$	U
SFH= $\tau$ ,CEH=yes	${\rm log}(Z(t_{\mathrm{age}})/Z_{\odot})$	$[-2.30,0.70]$	U
DAL=Cal+00^a^afootnotemark:	${\rm log}(\tau/\rm{yr})$	$[6,10]$	U
SFH= $\tau$ ,CEH=yes	${\rm log}(Z(t_{\mathrm{age}})/Z_{\odot})$	$[-2.30,0.70]$	U
DAL=Sal+18^b^bfootnotemark:	${\rm log}(\tau/\rm{yr})$	$[6,10]$	U
	$E_{\rm b}$	$[-2,6]$	U
	$\delta$	$[-1.2,0.4]$	U
SFH= $\beta$ - $\tau$	${\rm log}(Z(t_{\mathrm{age}})/Z_{\odot})$	$[-2.30,0.70]$	U
CEH=yes	${\rm log}(\tau/\rm{yr})$	$[6,10]$	U
DAL=Sal+18^b^bfootnotemark:	$\beta$	$[0,1]$	U
	$E_{\rm b}$	$[-2,6]$	U
	$\delta$	$[-1.2,0.4]$	U
SFH= $\beta$ - $\tau$ -r	${\rm log}(Z(t_{\mathrm{age}})/Z_{\odot})$	$[-2.30,0.70]$	U
CEH=yes	${\rm log}(\tau/\rm{yr})$	$[6,10]$	U
DAL=Sal+18^b^bfootnotemark:	rSFR	$[1e-6,1e6]$	LU^d^dfootnotemark:
	$t_{\rm trunc}/t_{\mathrm{age}}$	$[0,1]$	U
	$\beta$	$[0,1]$	U
	$E_{\rm b}$	$[-2,6]$	U
	$\delta$	$[-1.2,0.4]$	U
SFH= $\alpha$ - $\beta$ - $\tau$ -r	${\rm log}(Z(t_{\mathrm{age}})/Z_{\odot})$	$[-2.30,0.70]$	U
CEH=yes	$\tau=t^{*}/t_{\mathrm{age}}$	$[0.007,1]$	U
DAL=Sal+18^b^bfootnotemark:	$\alpha$	$[0.01,1000]$	LU
	$\beta$	$[0.01,1000]$	LU
	rSFR	$[1e-6,1e6]$	LU^d^dfootnotemark:
	$t_{\rm trunc}/t_{\mathrm{age}}$	$[0,1]$	U
	$E_{\rm b}$	$[-2,6]$	U
	$\delta$	$[-1.2,0.4]$	U

Finally, we also include the effect of IGM absorption with the description of Madau (1995). Other other more recent consideration of IGM absorption are also available in BayeSED. However, the exploration of the effects of different choices of IGM absorption models on the redshift and stellar population parameter estimation is beyond the scope of this work, which will not change the conclusions given here.

2.1.2 Galaxy population modeling

To model the galaxy population, we need to set the joint probability distribution that characterizes the statistical properties of the galaxy population. The statistical properties of the galaxy population are the results of the complex physical procedures happened during the formation and evolution of galaxies. In this work, we employ some widely discussed empirical statistical properties of galaxies to model the galaxy population phenomenologically. It should be mentioned that there are large uncertainties in these statistical properties, and we do not attempt to use the most up-to-date results for all of them in this work. The other choices of statistical properties of galaxies will not change the conclusions of this work.

Refer to caption — Figure 1: The joint distributions of redshift and physical parameters of the empirical statistics-based mock galaxy population produced with BayeSED combined with SED models of different complexity. With the same set of empirical statistics, different SED models lead to slightly different redshift and age distribution, which is likely due to different mapping relations from free parameters to derived parameters. Meanwhile, only the two SED models with a quenching component produce a clear region of quiescent galaxies below the star-forming main sequence.

Similar to Tanaka (2015)(See also Alsing et al., 2023), we assume that the joint probability distribution of the stellar population parameters and redshift of galaxy population can be factorized as

		$\displaystyle P(\bm{G},z)=P(M_{*},z)$		(12)
		$\displaystyle\quad\times P(SFR\mid M_{*},z)$
		$\displaystyle\quad\times P(A_{\rm V}\mid SFR,z)$
		$\displaystyle\quad\times P(age\mid M_{*},z)$
		$\displaystyle\quad\times P(Z_{\rm MW}\mid M_{*},z).$

The joint distribution of stellar mass and redshift is defined as

P(M_{*},z)\propto\Phi(M_{*},z)dV(z),

(13)

where $\Phi(M_{*},z)$ is the unnormalized stellar mass function and $dV(z)$ is the differential comoving volume element. We employ the recent measurement of stellar mass function and its redshift evolution from Leja et al. (2020), while a WMAP-5 (Spergel et al., 2003) cosmology for the comoving volume element.

Following Tanaka (2015), we assume that $P(SFR\mid M_{*},z)$ can be expressed as the sum of two Gaussians to represent two distinct sequences formed by star forming and quiescent galaxies:

\begin{gathered}P\left(\mathrm{SFR}\mid M_{*},z\right)\propto\\ \frac{(1-f_{\mathrm{Q}})}{\sigma_{\mathrm{SF}}}\exp\left[-\frac{1}{2}\left(\frac{\log\mathrm{SFR}-\log\mathrm{SFR}_{\mathrm{SF}}\left(M_{*},z\right)}{\sigma_{\mathrm{SF}}}\right)^{2}\right]\\ \quad+\frac{f_{\mathrm{Q}}}{\sigma_{\mathrm{Q}}}\exp\left[-\frac{1}{2}\left(\frac{\log\mathrm{SFR}-\log\mathrm{SFR}_{\mathrm{SF}}\left(M_{*},z\right)+2}{\sigma_{\mathrm{Q}}}\right)^{2}\right],\end{gathered}

(14)

where $\operatorname{SFR}_{\mathrm{SF}}\left(M_{*},z\right)$ is the mean SFR of star forming galaxies given by

\operatorname{SFR}_{\mathrm{SF}}\left(M_{*},z\right)=\operatorname{SFR}^{*}(z)\times\frac{M_{*}}{10^{11}M_{\odot}}M_{\odot}\mathrm{yr}^{-1},

(15)

with

\operatorname{SFR}^{*}(z)=\begin{cases}10\times(1+z)^{2.1}&(z<2)\\ 19\times(1+z)^{1.5}&(z\geqslant 2)\end{cases},

(16)

and the fraction of quiescent galaxies is given as a function of stellar mass and redshift (Behroozi et al., 2013):

f_{\mathrm{Q}}\left(M_{*},z\right)=\left[\left(\frac{M_{*}}{10^{10.2+0.5z}M_{\odot}}\right)^{-1.3}+1\right]^{-1}.

(17)

As in Tanaka (2015), the dust attenuation is considered to positively correlate with SFR:

P\left(\tau_{V}\mid\mathrm{SFR},z\right)\propto\frac{1}{{\sigma_{\tau_{V}}}}\exp\left[-\frac{1}{2}\left(\frac{\tau_{V}-\left\langle\tau_{V}\right\rangle}{\sigma_{\tau_{V}}}\right)^{2}\right],

(18)

where $\sigma_{\tau_{V}}=0.5$ , and

\left\langle\tau_{V}\right\rangle=\begin{cases}0.2&\left(\mathrm{SFR}_{0}<1\right)\\ 0.2+0.5\log\mathrm{SFR}_{0}&\left(\mathrm{SFR}_{0}>1\right)\end{cases},

(19)

\mathrm{SFR}_{0}=100\frac{\mathrm{SFR}}{\mathrm{SFR}^{*}(z)},

(20)

Then, we use the relation between $\tau_{V}$ and $A_{\rm V}$ to obtain $P(A_{\rm V}\mid SFR,z)$ .

The probability of the age of a galaxy is described conditionally on the stellar mass and redshift:

P\left(\operatorname{age}\mid M_{*},z\right)\propto\exp\left[-\frac{\langle\mathrm{SFR}\rangle}{\operatorname{SFR}^{*}(z)}\right],

(21)

where

\langle\mathrm{SFR}\rangle=M_{*}/\text{ age. }

(22)

This lead to a low probability for a massive galaxy with young age, while a high probability for a low-mass galaxy with the same age.

Finally, the probability of mass weighted stellar metallicity is modeled as:

\begin{array}[]{l}P\left(Z_{\rm MW}\mid M_{*},z\right)\propto\\ \frac{1}{{\sigma_{\log(Z_{\rm MW})}}}\exp\left[-\frac{1}{2}\left(\frac{\log(Z_{\rm MW})-\left\langle\log(Z_{\rm MW})\right\rangle}{\sigma_{\log(Z_{\rm MW})}}\right)^{2}\right],\end{array}

(23)

where ${\sigma_{\log(Z_{\rm MW})}}=0.1$ , and

\begin{array}[]{l}\left\langle\log(Z_{\rm MW})\right\rangle=\\ 0.40\left[\log M_{*}-\right.10]+0.67\exp(-0.50z)-1.04\end{array}

(24)

is the redshift dependent stellar mass and metallicity relation (Ma et al., 2016) which is predicted by using the high-resolution cosmological zoom-in simulations from the Feedback in Realistic Environment (FIRE) project (Hopkins et al., 2014).

To generate empirical statistics-based mock catalog of galaxies, we employ the MultiNest algorithm to draw samples from the joint probability distribution of the stellar population parameters and redshift of galaxy population by setting $P(\bm{G},z)$ in Equation 12 to be the likelihood function. Besides, to simulate a magnitude-limited sample, we can additionally set the likelihood function to be $0$ when the magnitude in a given band is larger than a given value. Since the sampling points with likelihood to be $0$ will be ignored by MultiNest, the obtained posterior sample can be used to buid a magnitude-limited sample of mock galaxies with some physical constraints from the empirical statistical properties of galaxies. More details about the selection of magnitude-limited sample is presented in §2.4. The mock catalog can be build with the posterior sample of redshift and all physical parameters given by MultiNest. However, this is a weighted sample (Yallup et al., 2022), which can not be directly used as a mock sample of galaxies. To build a more realistic mock sample of galaxies, we use bootstrap resampling method to obtain an unweighted sample.

In total, we have build six mock catalog of galaxies by employing SED models with different combinations of SFH, CEH, and DAL and increasing complexity as shown in Table 1, respectively. The employed priors of redshift and stellar population parameters are listed in the same Table for each model, respectively. In Figure 1, we show the joint distributions of redshift and physical parameters of the six empirical statistics-based mock galaxy population. Although the sampe set of empirical are employed, different SED models lead to slightly different distribution of parameters, especially for redshift and galaxy age. This is likely due to different mapping relations from free parameters to derived parameters. For example, different forms of SFH may lead to different relations between the age of galaxy and its recent SFR.

2.2 Hydrodynamical simulation-based photometric mock catalog

The second method to generate mock photometric catalog is based on an SED library which is built by the post-processing of galaxies from a hydrodynamical simulation. This catalog will be used in §5 to test the performance of redshift and stellar population parameter estimation in the case where the SED modeling is imperfect, since the SED modeling method employed in the Bayesian SED analysis will be very different from the one used to built it.

We start from the rest-frame spectra of galaxies which are produced using the light-cone from the cosmological hydrodynamical simulation Horizon-AGN (Dubois et al., 2014). The computation of these spectra, which accounts for the complex star formation history and metal enrichment of Horizon-AGN galaxies, and consistently includes dust attenuation, is described in details by Laigle et al. (2019) and Davidzon et al. (2019). The dust attenuation of galaxies in Horizon-AGN simulation is modelled for each stellar particle, assumed to be a SSP, by using the gas metal mass distribution as an approximation of the dust mass distribution, assuming a constant dust-to-metal mass ratio (Laigle et al., 2019). Besides, to obtain the amount of extinction at a given wavelength, the Weingartner & Draine (2001) model of Milky Way dust grain with $R_{\rm V}=3.1$ and the prominent $2175{\rm\AA}$ -graphite bump is employed for post-processing the simulated galaxies. As mentioned in Laigle et al. (2019), the overall attenuation curve becomes less steep and the bump tends to be reduced when summing up the contribution of all the SSPs to obtain the resulting galaxy spectrum. They also noticed that the averaged attenuation curve in Horizon-AGN simulation can not be well reproduced by either the model of Calzetti et al. (2000) or Arnouts et al. (2013). The more flexible form of DAL as given by Equation 9 is more likely to reproduce the attenuation curves of galaxies in the Horizon-AGN simulation. In order to isolate the possible differences in the convolution with filter response function, observational error modeling, and consideration of IGM absorption, we choose to convert their rest-frame spectra of mock galaxies to corresponding mock photometries with BayeSED, instead of using their virtual photometries directly¹¹1We have compared the two set of photometries careful and found that they are actually very similar, only with some differences at the very faint end.. The consideration for the effects of IGM absorption is the same as in §2.1.1.Therefore, the difference between the empirical statistics-based (§2.1) and hydrodynamical simulation-based photometric mock catalog are only driven by the different SFH, CEH, DAL of mock galaxies and their different distribution of redshift and physical parameters.

2.3 Observational error modeling

The modeling of realistic errors on the flux is crucial for a meaningful performance test of redshift and stellar population parameter estimation. Here, we introduce the method we have employed to compute flux errors of mock galaxies and perturb their fluxes accordingly. The flux error for a wavelength band $i$ with N $\sigma$ AB magnitude limit $m_{{\rm lim},i}=-2.5*{\rm log}(F_{\rm lim,i})+23.9$ is given by:

\sigma_{F,i}=\sqrt{(F_{\rm lim,i}/N)^{2}+\sigma^{2}_{F,i,{\rm sys}}},

(25)

where the flux limit $F_{\rm lim,i}$ and the systematic flux error $\sigma_{F,i,{\rm sys}}$ are in unit of $\mu{\rm Jy}$ . From the relation that magnitude error $\sigma_{m}\approx 1.08574/SNR$ and signal-to-noise ratio $SNR=F/\sigma_{F}$ , we can obtain:

\sigma_{F,i,{\rm sys}}=\sigma_{m,{\rm sys}}*F_{i}/1.08574.

(26)

As in Cao et al. (2018), we assume a systematic magnitude error $\sigma_{m,{\rm sys}}=0.02$ . The final mock flux is obtained by the original flux perturbed by a Gaussian noise $\epsilon\sim\mathcal{N}\left(0,\sigma_{F,i}^{2}\right)$ . In practice, the magnitude limit may have a dispersion $\sigma_{m,{\rm lim},i}$ for galaxies with different sizes. So, the actually used magnitude limit is drawn from the Gaussian distribution $\mathcal{N}\left(m_{{\rm lim},i},\sigma_{m,{\rm lim},i}^{2}\right)$ . In this work, we set $\sigma_{m,{\rm lim},i}=0.1$ . We have generated three sets of mock catalog for CSST-like, Euclid-like, and COSMOS-like surveys, respectively. A summary of the adopted depths in all bands of the three surveys is shown in Table 2. The response functions and modeled relation between magnitude and magnitude error are shown in the panels of Figure 2 for the 7 CSST bands, 3 Euclid bands and 26 COSMOS bands, respectively. To separate the effects of observational errors on the accuracy of parameter estimation, we also generated another two sets of mock catalog without adding observational errors. In this case (the no noise case in Figure 2), the magnitude errors are all fixed to be $0.01$ , but the photometries have not been perturbed accordingly.

Figure 2: (a) Response functions for CSST and Euclid bands. (b) The modeled relation between magnitude and magnitude error for CSST bands. Sources with

SNR<1

(i.e.

\sigma_{\rm m,i}>1.08574

) are considered as non-detections. The non-detections in a wavelength band

i

with N

\sigma

flux limit

F_{\rm lim,i}

and magnitude limit

m_{{\rm lim},i}

are represented as sources with

F_{i}=F_{\rm lim,i}

and

\sigma_{F,i}=-F_{\rm lim,i}/N

(the flux case), or

m_{i}=m_{{\rm lim},i}

and

\sigma_{\rm m,i}=-1.08574/N

(the magnitude case). These conventions make sure the consistent conversion between flux data and magnitude data in the input file of BayeSED. (c), (d): Same as in (a), (b), but for COSMOS bands. For clarity, the twelve intermediate bands (IBs) and two narrow bands (NBs) are not shown.

Survey	band	depth
	$NUV$	$24.2\pm 0.1$
	$u$	$24.2\pm 0.1$
CSST-like	$g$	$25.1\pm 0.1$
$5\sigma$ depth^b^bfootnotemark:	$r$	$24.8\pm 0.1$
	$i$	$24.6\pm 0.1$
	$z$	$24.1\pm 0.1$
	$y$	$23.2\pm 0.1$
Euclid-like	$J$	$24.0\pm 0.1$
$5\sigma$ depth^a^afootnotemark:	$H$	$24.0\pm 0.1$
	$Y$	$24.0\pm 0.1$
	$u$	$26.0\pm 0.1$
	$B$	$26.4\pm 0.1$
	$V$	$25.6\pm 0.1$
	$r$	$25.9\pm 0.1$
	$i^{+}$	$25.6\pm 0.1$
	$z^{++}$	$25.3\pm 0.1$
	$Y$	$24.7\pm 0.1$
COSMOS-like	$J$	$24.3\pm 0.1$
$5\sigma$ depth ^a^afootnotemark:	$H$	$24.0\pm 0.1$
	$K_{\rm s}$	$24.1\pm 0.1$
	$IB$	$24-25\pm 0.1$
	$NB711$	$24.5\pm 0.1$
	$NB816$	$24.6\pm 0.1$
	$3.6\mu{\rm m}$	$24.9\pm 0.1$
	$4.5\mu{\rm m}$	$24.9\pm 0.1$

Table 2: A summary of the adopted depths in all bands for CSST-like, Euclid-like, and COSMOS-like mock observations.

2.4 Sample selection

To test the performance of BayeSED, we selected two sets of samples of galaxies with $K_{s}<24.7$ (Laigle et al., 2019) and $i+<25$ (Cao et al., 2018)²²2In Cao et al. (2018) and Zhou et al. (2022a, b), only the high-quality sources with the $SNR\geq 10$ in g or i band were selected to test the performance of photometric redshift estimation. This selection is not used in this work to obtain a more full picture for the performance of the CSST wide-field multiband imaging survey. from the empirical statistics-based mock catalog (§2.1) and the hydrodynamical simulation-based mock catalog (§2.2), respectively. The first set of samples are obtained directly with BayeSED combined with SED models with different complexity by using the method present in §2.1.2. The second sample is selected from the Horizon-AGN hydrodynamical simulation-based photometric catalogs for COSMOS-like configuration³³3https://www.horizon-simulation.org/PHOTOCAT/HorizonAGN_LAIGLE-DAVIDZON+2019_COSMOS_v1.6.fits which contains $789,354$ galaxies. We find that a sample with $10,000$ galaxies is large enough to obtain stable results for the performance tests as presented in §4 and §5. The redshift and magnitude distributions of the two samples are presented in Figure 3. When employing different SED models with different complexity and the same set of empirical statistics, the empirical statistics-based samples show some differences, especially the redshift distribution. This is likely due to the different mapping relations from physical parameters to photometries and from free parameters to derived parameters for different SED models. Generally, the hydrodynamical simulation-based sample is consistent with the empirical statistics-based samples. We attribute the difference between the two set of samples to the different modeling of the SFH, CEH and DAL of galaxies and their different distribution of redshift and physical parameters.

3 Bayesian photometric SED analysis with BayeSED

The general method for the application of Bayesian inference to photometric SED analysis of galaxies is the same as in Han & Han (2014, 2019). In this section, we introduce some special aspects of Bayesian parameter estimation (§3.1) and model selection (§3.2) which are relevant to the current work.

3.1 Bayesian parameter estimation

For the Bayesian analysis of the mock data generated in the last section, we employ the same SED modeling procedure and setting of priors for free parameters as in §2.1.1, while the commonly used Gaussian form of likelihood function is employed. The performance of this Bayesian analysis, including its speed and quality, is crucial for the analysis of large sample of galaxies in the big data era. We need some metrics to quantify the performance of parameter estimation which is the main subject of this work.

While the speed of parameter estimation can be easily quantified by the running time, some metrics for the quality of parameter estimation are required. Similar to Acquaviva et al. (2015), we use three metrics to quantify the quality of parameter estimation. Bias, which characterizes the median separation between the predicted and the true values, is defined as:

BIA={\rm Median}(\Delta x),

(27)

while the precision, which describes the scatter between predicted and the true values, is defined as:

\sigma_{\rm NMAD}=1.4826*{\rm Median}(\lvert\Delta x\rvert),

(28)

where $\Delta x=(x_{\mathrm{phot}}-x_{\mathrm{true}})|/(1+x_{\mathrm{true}})$ for redshift (Ilbert et al., 2009; Dahlen et al., 2013; Salvato et al., 2018), and $\Delta x=(x_{\mathrm{phot}}-x_{\mathrm{true}})/(x_{\mathrm{true}}^{\mathrm{max}}-x_{\mathrm{true}}^{\mathrm{min}})$ for other parameters. The median-base definition makes them to be less sensitive to outliers (sources with unexpectedly large errors). The fraction of outliers is defined as:

OLF=\frac{1}{N_{\mathrm{obj}}}\times N(|\Delta x|>0.15).

(29)

3.2 Bayesian model selection

An important advantage of nested sampling-based algorithm, such as MultiNest, over MCMC-based method is the ability to carry out a simultaneous parameter estimation and model selection. While the main subject of this work is parameter estimation, it is also interesting to explore the effects of sampling parameters (namingly, $nlive$ and $efr$ ) of MultiNest on the computation of Bayesian evidence, the quantity which is crucial for Bayesian model selection.

In Han & Han (2019), we presented a mathematical framework to discriminate the different assumptions about SSP, DAL and SFH in the SED modeling of galaxies based on the Bayesian evidence for a sample of galaxies. In this work, since the SSP model employed in the generation of mock data is the same as that employed in their Bayesian analysis, we do not need to consider the different choices of SSP. So, the problem is significantly simplified. In this work, we focus on the computation of the Bayesian evidence for the SED modeling of a sample of galaxies with SSP, SFH, and DAL all being assumed to be universal (i.e. ${{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})}$ -like model (See Section 5.1 of Han & Han, 2019).) The sample Bayesian evidence in this case (as Equation 33 of Han & Han (2019)) is:

	$\displaystyle p(\bm{d}_{1},\bm{d}_{2},\dots,\bm{d}_{N}\|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})=$
	$\displaystyle\prod\limits_{g=1}^{N}\int p(\bm{d}_{g}\|{\bm{\theta}_{g}},{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})$
	$\displaystyle p({\bm{\theta}_{g}}\|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I}){\mathrm{d}\bm{\theta}_{g}}$
	$\displaystyle=\prod\limits_{g=1}^{N}p(\bm{d}_{g}\|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I}).$		(30)

Although the detailed SFH and DAL of different galaxies can vary significantly, the sample Bayesian evidence computed in this manner remains valuable for identifying the most efficient combination of SFH and DAL for analyzing a vast sample of galaxies, such as the one provided by the CSST wide-field imaging survey.

In practice, the natural logarithmic of Bayesian evidence is commonly used for Bayesian model selection. Therefore, Equation 30 can be rewritten as:

	$\displaystyle\ln(BE)\equiv\ln(p(\bm{d}_{1},\bm{d}_{2},\dots,\bm{d}_{N}\|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I}))$
	$\displaystyle=\sum\limits_{g=1}^{N}\ln(p(\bm{d}_{g}\|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})),$		(31)

where $\ln(p(\bm{d}_{g}|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I}))$ , the Bayesian evidence for an individual galaxy, can be directly obtained in BayeSED with MultiNest. However, the individual Bayesian evidences estimated with MultiNest contain errors. A more strict Bayesian model selection should consider the effects of error propagation. In our case, the error of the sample Bayesian evidence $\ln(BE)$ is simply the sum of errors for individual galaxies which is provided by MultiNest as well.

The minimum $\chi^{2}$ method is also widely used for model selection. For the case with Gaussian observational errors, there is only a constant difference between the minimum $\chi^{2}$ and the natural logarithmic of maximum likelihood. The sample maximum likelihood (as Equation 32 of Han & Han (2019)) is:

	$\displaystyle\mathcal{L}_{\rm max}(\hat{\bm{\theta}}_{1},\hat{\bm{\theta}}_{2},\dots,\hat{\bm{\theta}}_{N})$
	$\displaystyle\equiv{\mathop{\rm max}\limits_{\bm{\theta}_{1},\bm{\theta}_{2},\dots,\bm{\theta}_{N}}}[p(\bm{d}_{1},\bm{d}_{2},\dots,\bm{d}_{N}\|{\bm{\theta}_{1},\bm{\theta}_{2},\dots,\bm{\theta}_{N}},$
	$\displaystyle{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})]$
	$\displaystyle=\prod\limits_{g=1}^{N}{\mathop{\rm max}\limits_{\bm{\theta}_{g}}}[p(\bm{d}_{g}\|{\bm{\theta}_{g}},{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})].$		(32)

Then, the natural logarithmic of sample maximum likelihood is:

	$\displaystyle\ln(ML)\equiv\ln(\mathcal{L}_{\rm max}(\hat{\bm{\theta}}_{1},\hat{\bm{\theta}}_{2},\dots,\hat{\bm{\theta}}_{N}))$
	$\displaystyle=\sum\limits_{g=1}^{N}\ln({\mathop{\rm max}\limits_{\bm{\theta}_{g}}}[p(\bm{d}_{g}\|{\bm{\theta}_{g}},{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})]),$		(33)

where $\ln({\mathop{\rm max}\limits_{\bm{\theta}_{g}}}[p(\bm{d}_{g}|{\bm{\theta}_{g}},{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})])$ , the natural logarithmic of maximum likelihood for an individual galaxy, can be directly obtained in BayeSED with MultiNest. Similar to the model selection with Bayesian evidence, only the difference of $\ln(ML)$ between different models is useful for the model selection. Therefore, the model selection with $\ln(ML)$ is equivalent to that with minimum $\chi^{2}$ . In §6, we will discuss the difference between the two model selection methods.

3.3 Runtime parameters of MultiNest algorithm

As the Bayesian inference engine of BayeSED, MultiNest has some runtime parameters. The values of these runtime parameters have very important effects on the performance of BayeSED for redshift and stellar population parameter estimation of galaxies. Here, we briefly introduce the meaning of these runtime parameters of MultiNest algorithm.

Nested sampling (NS) (Skilling, 2004, 2006), as a Monte Carlo (MC) method primarily designed for the efficient computation of the Bayesian evidence, allows posterior inference as a by-product at the same time. So, it provides a way to carry out simultaneous Bayesian parameter estimation and model selection. As an algorithm built on the NS framework, MultiNest (Feroz & Hobson, 2008; Feroz et al., 2009) is special for its efficiency in sampling from posteriors that may contain several modes and/or degeneracies. It has been improved further by the implementation of importance nested sampling (INS) (Cameron & Pettitt, 2014; Feroz et al., 2019) to increase the efficiency for evidence computation. In the latest version of BayeSED, the V3.12 version of MultiNest, which includes the implementation of INS, is employed.

Similar to most nested sampling algorithms, MultiNest explores the posterior distribution by maintaining a fixed number (See also Higson et al., 2019; Speagle, 2020, for new methods using variable number) of samples drawn from the prior distribution, called live points, and iteratively replaces the point with the lowest likelihood value (the dead point) with another point drawn from the prior but has a higher value of likelihood. While there are many runtime parameters of MultiNest which can be set in BayeSED, only two of them are of particular importance. They largely determined the accuracy and computational cost for the running of MultiNest algorithm and therefore BayeSED. The first one is the total number of live points ( $nlive$ ), which determines the effective sampling resolution. The second one is the target sampling efficiency ( $efr$ ), which determines the ratio of points accepted to those sampled. Generally, the larger $nlive$ and lower $efr$ lead to more accurate posteriors and evidence values but higher computational cost. The optimal value of $nlive$ and $efr$ should be problem-dependent, although $efr$ equals to $0.8$ and $0.3$ are recommended by the authors of MultiNest for parameter estimation and evidence evalutaion, respectively.

In this work, we will explore the effects of $nlive$ and $efr$ on the estimation of photometric redshift and stellar population parameters. The results are presented in §4.1 and §4.2, respectively.

4 Results of performance tests using empirical statistics-based mock galaxy sample

In this section, we present the results of performance tests of photometric redshift and stellar population parameter estimation by using empirical statistics-based mock galaxy sample for CSST wide-field multiband imaging survey. Since the SED model employed in the Bayesian parameter estimation is exactly the same as that used in the generation of mock observations, the error of parameter estimation is mainly contributed by the random error in the data, parameter degeneracies, the stochastic nature of the employed MultiNest sampling algorithm and other potential errors in the BayeSED code. To separate out the effects of the random photometric error in the data, we will consider the two cases with and without adding random noise to the photometric data. Besides, to find out the optimal run parameters, we have considered six different choices of the target sampling efficiency ( $efr$ ) and eight choices of the number of live points ( $nlive$ ) for the MultiNest sampling algorithm. Furthermore, we compare the performance of different SED models with increasing complexity in terms of running time and quality of parameter estimation.

Figure 4: Performance test with empirical statistics-based mock galaxy sample for the photometric redshift estimation of galaxies in the CSST wide-field multiband imaging survey. (a) The results for only the simplest SED model (SFH=

\tau

,-CEH,DAL=Cal+00) employed in this work. We have considered six different choices of

efr

(target sampling efficiency) as shown by different symbols. For each

efr

, we have considered eight cases with the number of live points (

nlive

), which determines the effective sampling resolution, equals to

10

15

20

25

50

100

200

, and

400

, respectively. The relations between the computation time (in sec/obj/cpu, by employing one core of a 2.2 GHz cpu) and the performance metrics

\sigma_{\rm NMAD}

(top panels), BIA (middle panels), OLF (bottom panels) are shown respectively. The results for the two cases with (left panels) and without (right panels) observational noise in the mock data are shown respectively. In general, larger value of

nlive

and smaller value of

efr

lead to better quality of redshift estimation, but with the cost of longer running time. (b) The results for six different SED models with increasing complexity. Only the results with

efr=0.1

are shown. In general, more complicated SED models require longer running time. They also lead to worse quality of photometric redshift estimation, which is likely due to more severe parameter degeneracies. Actually, for the last four more complicated SED models with the DAL of Salim et al. (2018), the number of free parameter is greater than the number of photometric data points (7 for CSST imaging survey), as shown in Table 1.

4.1 Photometric redshift estimation

The results of performance tests for photometric redshift estimation are shown in Figure 4. In Figure 4(a), we show the results for only the simplest SED model (SFH= $\tau$ ,-CEH,DAL=Cal+00) employed in this work. As shown in the top right panel of this figure, in the case without noise, there is a clear anti-correlation between the computation time (or the sampling resolution $nlive$ ) and the error $\sigma_{\rm NMAD}$ (defined in Equation 28) of photometric redshift estimation. Meanwhile, a smaller $efr$ makes the anti-correlation converge faster with the increasing of $nlive$ . There is a clear lower limit for the value of $\sigma_{\rm NMAD}$ , which is about $0.006$ . As shown in the top left panel of Figure 4(a), in the case with noise, the error of photometric redshift estimation does not always decrease with the sampling resolution $nlive$ . When we set $efr=0.1$ , the lowest error ( $\lesssim 0.056$ ) of redshift estimation is obtained when $nlive$ is about $25$ .When $nlive>25$ , the error of redshift estimation start to increase with $nlive$ and finally converge to $\sim 0.058$ . This is most likely due to the overfitting to the noise added to the mock data.

The middle two panels of Figure 4(a) show the relation between the computation time (or $nlive$ ) and the bias (defined in Equation 27) of photometric redshift estimation. In the case with noise, the relation between the computation time (or $nlive$ ) and bias has almost the opposite profile of that of the error $\sigma_{\rm NMAD}$ . However, the bias of photometric redshift estimation is generally very small, which is almost zero in the noise-free case.

The bottom two panels of Figure 4(a) show the relation between the computation time (or $nlive$ ) and the fraction of outliers OLF (defined in Equation 29) of photometric redshift estimation. Similar to that for $\sigma_{\rm NMAD}$ , in the noise-free case, there is a clear anti-correlation between the computation time (or $nlive$ ) and OLF. In this case, the lower limit for the value of OLF is about $0.002$ . In the case with noise, the relation between the computation time (or $nlive$ ) and OLF has the same profile as that of the error $\sigma_{\rm NMAD}$ . When we set $efr=0.1$ , the lowest OLF ( $\lesssim 0.215$ ) of redshift estimation is also obtained when $nlive$ is about $25$ . When $nlive>25$ , the OLF of redshift estimation start to increase with $nlive$ and finally converge to $\sim 0.225$ .

In Figure 4(b), we show the results for all of the six SED models with increasing complexity. Here, only the results with $efr=0.1$ are shown. In the case without noise, as shown in the top right panel of this figure, the error $\sigma_{\rm NMAD}$ of photometric redshift estimation tends to converge to a larger value when more complicated SED model is employed. This is not strange, since more complicated SED models have more free parameters and thus suffer from more severe parameter degeneracies. Besides, more complicated SED models apparently require longer running time. The bias of redshift estimation is always very small no matter which SED is employed. In general, when more complicated SED model is employed, the OLF of redshift estimation also increases significantly, and decreases much slower with the increasing of $nlive$ .

In the case with noise, as shown in the left panels of Figure 4(b), the results are a little more complicated. For the first three simplest SED models, the error $\sigma_{\rm NMAD}$ of photometric redshift estimation apparently increases with the increasing of model complexity. However, when more complicated forms of SFH is considered, $\sigma_{\rm NMAD}$ start to decreases with the increasing of model complexity, although not very significantly. Meanwhile, the most complicated SED model (SFH= $\alpha$ - $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18) lead to the smallest absolute value of bias, although the bias is actually very small in all cases. The situation for OLF is somewhat similar to that of $\sigma_{\rm NMAD}$ . No matter which SED model is employed, when $nlive\gtrsim 25$ , both $\sigma_{\rm NMAD}$ and OLF of redshift estimation start to increase, and then slowly decrease to a stable value.

4.2 Stellar population parameter estimation

In this subsection, we show the results of performance tests for the photometric stellar population parameter estimation. While the estimates of many stellar population parameter are available, we only show the results for stellar mass and SFR, which are two of the most important physical parameters for the study of the formation and evolution of galaxies.

Figure 5: As in Figure 4, but for the stellar mass estimation. The quality of stellar mass estimation, in terms of

\sigma_{\rm NMAD}

, BIA and OLF, is worse than that of redshift estimation and more sensitive to the selection of SED models. In the case with noise, for the most complicated SED model (SFH=

\alpha

\beta

\tau

-r,+CEH,DAL=Sal+18) used in this work, the

\sigma_{\rm NMAD}

, bias and OLF of stellar mass estimation increases significantly when

nlive>100

. This should be a clear indication of overfitting to the noise in the data. In general, more complicated SED models lead to worse quality of stellar mass estimation.

4.2.1 Stellar mass

The results of performance tests for stellar mass estimation are shown in Figure 5. In Figure 5(a), we show the results for only the simplest SED model (SFH= $\tau$ ,-CEH,DAL=Cal+00) employed in this work. As shown in the top right panel of this figure, in the case without noise, there is also a clear anti-correlation between the computation time (or the sampling resolution $nlive$ ) and the error $\sigma_{\rm NMAD}$ of photometric stellar mass estimation. The behavior of error $\sigma_{\rm NMAD}$ of stellar mass with respect to the change of efr is similar to that of photometric redshift. There is also a clear lower limit for the value of $\sigma_{\rm NMAD}$ , which is about $0.04$ . As shown in the top left panel of Figure 5(a), in the case with noise, the error of stellar mass estimation does not always decrease with the sampling resolution $nlive$ as well. When we set $efr=0.1$ , the lowest error ( $\sim 0.1130$ ) of stellar mass estimation is also obtained when $nlive$ is about $25$ . When $nlive>25$ , the error of stellar mass estimation only slightly increase with $nlive$ . The error of stellar mass is about two times larger than that of photometric redshift estimation. The bias of stellar mass estimation is also larger, but still very small when comparing with $\sigma_{\rm NMAD}$ . In the noise-free case, there is also a clear anti-correlation between the computation time (or $nlive$ ) and the OLF of stellar mass estimation, where the lower limit for the value of OLF is about $0.03$ . In the case with noise, when we set $efr=0.1$ , the lowest OLF ( $\lesssim 0.285$ ) of stellar mass estimation is also obtained when $nlive$ is about $25$ . When $nlive>25$ , the OLF of stellar mass estimation only slightly increase with $nlive$ as well. The OLF of stellar mass estimation is slightly larger than that of photometric redshift estimation.

In Figure 5(b), we show the results for all of the six SED models with increasing complexity, where only the results with $efr=0.1$ are shown. In the two cases with or without noise, as shown in the top right panel of this figure, the error $\sigma_{\rm NMAD}$ of photometric stellar mass estimation tends to converge to a larger value when more complicated SED model is employed. The same is true for the OLF of photometric stellar mass estimation. The behavior of bias is somewhat different, but it is generally very small when comparing with $\sigma_{\rm NMAD}$ . Besides, in the case with noise, for the most complicated SED model (SFH= $\alpha$ - $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18) used in this work, the $\sigma_{\rm NMAD}$ , bias and OLF of stellar mass estimation increases significantly when $nlive>100$ . This should be a very clear indication of overfitting to the noise in the data. In general, more complicated SED models lead to worse quality of stellar mass estimation.

Figure 6: As in Figure 4, but for the star-formation rate estimation. The quality of star-formation rate estimation, in terms of

\sigma_{\rm NMAD}

, BIA and OLF, is slightly better than that of stellar mass estimation, but even more sensitive to the selection of SED models. In the case with noise, for the four most complicated SED models used in this work, the

\sigma_{\rm NMAD}

, bias and OLF of SFR estimation significantly increases with

nlive

. This is another even more clear indication of overfitting to the noise in the data. In general, more complicated SED models lead to worse quality of SFR estimation.

4.2.2 Star-formation rate

The results of performance tests for SFR estimation are shown in Figure 6. In Figure 6(a), we show the results for only the simplest SED model (SFH= $\tau$ ,-CEH,DAL=Cal+00) employed in this work. Similar to the results for photometric redshift and stellar mass estimation, in the case without noise, there is also a clear anti-correlation between the computation time (or the sampling resolution $nlive$ ) and the error $\sigma_{\rm NMAD}$ of photometric stellar mass estimation. The behavior of error $\sigma_{\rm NMAD}$ of SFR with respect to the change of efr is similar to that of photometric redshift and stellar mass. There is also a clear lower limit for the value of $\sigma_{\rm NMAD}$ , which is about $0.02$ . As shown in the top left panel of Figure 5(a), in the case with noise, the error of SFR estimation increases apparently when the sampling resolution $nlive\gtrsim 25$ . Generally, the error of SFR estimation is slightly smaller than that of stellar mass estimation. The bias of SFR estimation is also slightly smaller, and ignorable with respect to $\sigma_{\rm NMAD}$ . In the noise-free case, the relation between the computation time (or $nlive$ ) and the OLF of SFR estimation is somewhat different from that of photometric redshift and stellar mass. Even with $nlive=500$ , the OLF of SFR estimation still does not seem to converge. The lower limit for the value of OLF seems near $0.1$ . In the case with noise, the OLF of SFR estimation converge much faster to about $0.255$ when we set $efr=0.1$ . This is slightly smaller than that of stellar mass estimation.

In Figure 6(b), we show the results for all of the six SED models with increasing complexity, where only the results with $efr=0.1$ are shown. In the two cases with or without noise, as shown in the top right panel of this figure, the error $\sigma_{\rm NMAD}$ of SFR estimation tends to converge to a larger value when more complicated SED model is employed, and is more sensitive to the selection of SED model than that of stellar mass. The same is true for the OLF of SFR estimation. The behavior of bias is somewhat different, but it is generally very small when comparing with $\sigma_{\rm NMAD}$ . Besides, in the case with noise, for the four most complicated SED models used in this work, the $\sigma_{\rm NMAD}$ , bias and OLF of SFR estimation significantly increases with $nlive$ . This is another even more clear indication of overfitting to the noise in the data. In general, more complicated SED models lead to worse quality of SFR estimation.

4.3 Computation of Bayesian evidence

In this subsection, we present the results of performance test for the computation of Bayesian evidence, a quantity which is crucial for Bayesian model selection.

In Figure 7(a), we show the results for only the simplest SED model (SFH= $\tau$ ,-CEH,DAL=Cal+00) employed in this work. As shown in the top and middle panels of this figure, the Bayesian evidence computed with importance sampling is more stable than that without importance sampling, especially in the case with noise. So, hereafter and especially in §5 and 6, all mentioned Bayesian evidences are computed with importance sampling. The value of Bayesian evidence increases with the number of live points ( $nlive$ ) which determines the effective sampling resolution. In all cases, it eventually converges to a stable value when $nlive$ is very large, while a smaller sampling efficiency ( $efr$ ) leads to faster convergence rate. A good balance between the speed and quality of Bayesian evidence estimation can be achieved when the MultiNest runtime parameters $efr$ equals to $0.1$ and $nlive$ equals to $50$ .

On the other hand, as shown in the bottom panels of Figure 7(a), the error of Bayesian evidence decreases with $nlive$ , while a larger sampling efficiency ( $efr$ ) also leads to faster convergence rate. However, unlike the value of Bayesian evidence, the error of Bayesian evidence converges slower with the increasing of $nlive$ in all cases. As a result, if we set $efr=0.1$ and $nlive=50$ , the error of Bayesian evidence would be overestimated. A much larger value of $nlive$ seems required to obtain a more reliable estimation for the error of Bayesian evidence with MultiNest, which would be very computationally expansive and not suitable for the analysis of massive photometric data. However, in practice, this may not be a serious issue, since an overestimated error of Bayesian evidence only leads to a more conservative conclusion about model comparison. We just need to keep this in mind.

In Figure 7(b), we show the results for all SED models with increasing complexity, where only the results with $efr=0.1$ are shown. Although the data used for the computation of Bayesian evidence is different for different SED models, the value of Bayesian evidence clearly decreases with the increasing of the complexity of SED model. This is reasonable. Since the same SED model is employed for the generation of mock data and their Bayesian analysis, the mock data can always be interpreted well. However, a more complicated SED model is penalized for being distributed over a larger space, of which only a smaller fraction is useful for the given mock data.

Figure 7: Performance test of Bayesian evidence estimation with empirical statistics-based mock galaxy sample for CSST imaging survey. The relations between the computation time (in sec/obj by employing a single 2.2 GHz cpu core) and the value of the natural logarithm of Bayesian evidence (as computed with Equation 31) and its error (bottom panels) for the whole galaxy sample are shown. The results for two versions of Bayesian evidence with (top panels) or without (middle panels) importance sampling have been shown. (a) The results for only the simplest SED model (SFH=

\tau

,-CEH,DAL=Cal+00) employed in this work. We have considered six different choices of

efr

(target sampling efficiency) as shown by different symbols. The results for seven cases with

nlive

equals to

15

20

25

50

100

200

, and

400

are shown respectively. The left panels show the results with noisy data while the right panels show the results with noise-free data. (b) The results for six different SED models with increasing complexity. Only the results with

efr=0.1

are shown. The value of Bayesian evidence clearly decreases with the increasing of the complexity of the SED model.

5 Results of performance tests using hydrodynamical simulation-based mock galaxy sample

In this section, we present the results of performance tests of photometric redshift and stellar population parameter estimation by using hydrodynamical simulation-based mock galaxy sample for CSST-like imaging survey. Only the results obtained with the simplest SED model are shown. In §6, we will discuss the effect of more flexible SFH and DAL for CSST-like, CSST+Euclid-like and COSMOS-like surveys, respectively.

As mentioned in §2.2, the generation of this mock galaxy sample accounts for the complex SFH and metal enrichment of Horizon-AGN galaxies, and consistently includes dust attenuation. However, for the Bayesian analysis of this more theoretical mock galaxy sample, we firstly employ the widely used assumptions about SFH (exponentially declining), metal enrichment history (constant but free), and dust attenuation (uniform foreground dust screen with Calzetti et al. (2000) DAL). The results in this section will help us to quantify the systematic errors resulting from these simplified assumptions. Besides, as mentioned in §2.4, galaxies in the mock sample used here are selected with $K_{s}<24.7$ and $i_{+}<25$ . In the literature, it is quite common to exclude some pathological cases with a kind of $\chi^{2}$ selection (Davidzon et al., 2017; Caputi et al., 2015; Laigle et al., 2019) before presenting the results of performance test. However, no such cut was made here because the pathological cases are precisely what we want to investigate.

Figure 8: The results of photometric redshift estimation with (righ panels) and without (left panels) noise, and for the analysis of the empirical statistics-based (top panels) and hydrodynamical simulation-based (bottom panels) mock galaxy sample, respectively. The error from imperfect SED modeling will only present for the analysis of the hydrodynamical simulation-based mock galaxy sample. The photometric redshifts (

z_{\rm phot}

) are estimated by employing the

\tau

model of SFH without consideration of metallicity evolution and the Calzetti et al. (2000) model of DAL. The red solid line indicate the identity while the red dotted lines indicate the outlier limits, i.e.

\left|z_{\mathrm{phot}}-z_{\mathrm{true}}\right|/(1+z_{\mathrm{true}})>0.15

. Here, we show the results obtained with the MultiNest runtime parameters

efr

equals to

0.1

and

nlive

equals to

50

. In general, the observational noise is the more important source of error for the photometric redshift estimation of galaxies, and the contribution from imperfect SED modeling is also very important.

Figure 9: As Figure 8, but for the stellar mass. In general, the observational noise is the more important source of error for the photometric stellar mass estimation of galaxies, and the contribution from imperfect SED modeling is almost comparable.

Figure 10: As Figure 8, but for the SFR. In general, the imperfect SED modeling is more important source of error for the photometric SFR estimation of galaxies, and the contribution from imperfect SED modeling is also very important. Especially, in the case without noise, it is clear that the simple

\tau

model of SFH and Calzetti et al. (2000) form of DAL lead to severely biased estimation of SFR for the Horizon-AGN simulation based mock sample of galaxies.

5.1 Photometric redshift estimation

In Figure 8, we investigate the performance of BayeSED combined with the simplest SED model to estimate the photometric redshifts of hydrodynamical simulation-based mock galaxy sample for CSST-like imaging survey. Aa a reference, the panel a of this figure show the ideal case without observational noise and SED modeling errors. So, in this case, the effects of parameter degeneracies, the stochastic nature of MultiNest sampling algorithm and other potential errors in the BayeSED code are the sources of error. As shown clearly, the total error from all of these sources is very small. By comparing panels a and b of this figure, with only the observational noise added, the $\sigma_{\rm NMAD}$ of photometric redshift estimation increases by eight times and the OLF increases by more than forty times. The bias also increases, but is ignorable with respect to $\sigma_{\rm NMAD}$ . By comparing panels a and c of this figure, with only the error from the imperfect SED modeling added, the $\sigma_{\rm NMAD}$ of photometric redshift estimation increases by more than three times and the OLF decreases slightly. Besides, there are some additional systematic patterns in the relation between true and estimated values of photometric redshift. The bias also increases and is comparable to $\sigma_{/rmNMAD}$ . In general, the observational noise is more important source of error for the photometric redshift estimation of galaxies, although the other one is also very important.

Finally, as shown in panel d of Figure 8, when all sources of error are included, the $\sigma_{\rm NMAD}$ of photometric redshift estimation increases to $0.097$ , the OLF increases to $0.264$ , and the bias becomes $0.003$ . The systematic patterns shown in panel c seems being hidden due to the added noise. The algorithm seems to be struggling to estimate the photometric redshifts correctly by only using the seven-band photometries from CSST-like imaging survey, especially for galaxies with redshift larger than one.

5.2 Stellar population parameter estimation

In Figure 9, we investigate the performance of BayeSED combined with the simplest SED model to estimate the stellar population parameters of hydrodynamical simulation-based mock galaxies for CSST-like imaging survey. Aa a reference, the panel a of this figure show the ideal case without observational noise and SED modeling errors. In this ideal case, the $\sigma_{\rm NMAD}$ , bias and OLF of stellar mass estimation are $0.047$ , $0.005$ and $0.052$ , respectively. As shown in panel b, with only the observational noise added, the results become $0.115$ , $0.02$ and $0.283$ , respectively. As shown in panel c, with only the error from the imperfect SED modeling added, the results become $0.103$ , $0.003$ and $0.141$ , respectively. By comparing panels a and c, the performance of stellar mass estimation is severely affected by the simplified assumptions in the SED modeling. However, by comparing panels b and c, the observational noise is more important source of error for the photometric stellar mass estimation of galaxies, although the other one is also very important. Finally, as shown in panel d of this figure, when all sources of error are included, the $\sigma_{\rm NMAD}$ of photometric stellar mass estimation increases to $0.135$ , the bias increases to $0.034$ , and the OLF increases to $0.341$ . The algorithm seems to be even more struggling to estimate the photometric stellar mass correctly by only using the seven-band photometries from CSST-like imaging survey.

Similarly, Figure 10 shows the performance of BayeSED combined with the simplest SED model to estimate the SFR of hydrodynamical simulation-based mock galaxy sample for CSST-like imaging survey. Comparing with the results for stellar mass estimation in Figure 9, the photometric SFR estimation is even more severely affected by the simplified assumptions in the SED modeling. Actually, by comparing panels b and c of this figure, it is clear that the error from the imperfect SED modeling is more important source of error for the photometric SFR estimation of galaxies, although the other one is also very important. Finally, it becomes even more struggling to estimate the photometric SFR correctly by only using the seven-band photometries from CSST-like imaging survey.

6 Discussion

By comparing the results of performance tests for simultaneous photometric redshift and stellar parameters estimation using empirical statistics-based mock galaxy sample (§4) and hydrodynamical simulation-based mock galaxy sample(§5), especially those presented in Figures 8, 9 and 10, it is clear that the simple typical assumptions about the SFH and DAL of galaxies have severe impact on the performance of photometric parameter estimation of galaxies for CSST-like imaging survey. It is not very surprising, since the SFHs and MEHs of galaxies in the cosmic hydrodynamical simulation, such as Horizon-AGN (Volonteri et al., 2016; Beckmann et al., 2017a; Kaviraj et al., 2017; Beckmann et al., 2017b), are much more complex and diverse (See also Iyer et al., 2020) than the simple assumptions that have been employed in the previous Bayesian analysis of photometric mock data.

In this section, we will discuss the effects of more flexible forms of SFH and DAL on the performance of simultaneous photometric redshift and stellar population parameter estimation of galaxies. As in Han & Han (2012, 2014, 2019) (See also Salmon et al., 2016; Dries et al., 2016, 2018; Lawler & Acquaviva, 2021), we mainly employ the Bayesian model comparison method to compare six different combinations of these model ingredients with increasing complexity (See Table 1 for details). In addition to the CSST-like survey (§6.1) where only the photometries from seven broad-bands are available, we also discuss the results obtained by using mock data for CSST+Euclid-like (§6.2) and COSMOS-like surveys (§6.3) with increasing discriminative power, respectively.

6.1 Effects of more flexible SFH and DAL for CSST-like survey

survey	noise	SFH	DAL	ln(BE)	ln(ML)	parameter	NMAD	BIA	OLF
						$z_{\rm phot}$	0.024	0.011	0.003
CSST	-	$\tau$ ,-CEH	Cal+00	-88042 $\pm$ 5936	127197	${\rm log}(M_{*})_{\rm phot}$	0.103	0.003	0.141
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.140	-0.117	0.436
						$z_{\rm phot}$	0.023	0.010	0.002
CSST	-	$\tau$ ,+CEH	Cal+00	-74128 $\pm$ 5973	144380	${\rm log}(M_{*})_{\rm phot}$	0.092	-0.010	0.141
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.148	-0.141	0.491
						$z_{\rm phot}$	0.016	0.014	0.004
CSST	-	$\tau$ ,+CEH	Sal+18	58878 $\pm$ 5823	282267	${\rm log}(M_{*})_{\rm phot}$	0.086	0.041	0.170
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.153	-0.019	0.390
						$z_{\rm phot}$	0.015	0.014	0.004
CSST	-	$\beta$ - $\tau$ ,+CEH	Sal+18	56343 $\pm$ 5780	283410	${\rm log}(M_{*})_{\rm phot}$	0.079	0.041	0.150
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.133	0	0.370
						$z_{\rm phot}$	0.017	0.011	0.006
CSST	-	$\beta$ - $\tau$ -r,+CEH	Sal+18	34213 $\pm$ 5867	275861	${\rm log}(M_{*})_{\rm phot}$	0.097	0.052	0.227
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.257	0.045	0.579
						$z_{\rm phot}$	0.016	0.010	0.005
CSST	-	$\alpha$ - $\beta$ - $\tau$ -r,+CEH	Sal+18	33044 $\pm$ 5800	275886	${\rm log}(M_{*})_{\rm phot}$	0.098	0.051	0.239
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.214	0.065	0.544
						$z_{\rm phot}$	0.097	0.003	0.264
CSST	+	$\tau$ ,-CEH	Cal+00	-18871 $\pm$ 2649	49807	${\rm log}(M_{*})_{\rm phot}$	0.135	-0.034	0.341
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.157	0.011	0.371
						$z_{\rm phot}$	0.096	0.005	0.264
CSST	+	$\tau$ ,+CEH	Cal+00	-18661 $\pm$ 2649	49469	${\rm log}(M_{*})_{\rm phot}$	0.135	-0.034	0.344
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.155	0	0.366
						$z_{\rm phot}$	0.098	0.012	0.263
CSST	+	$\tau$ ,+CEH	Sal+18	-26098 $\pm$ 2956	54493	${\rm log}(M_{*})_{\rm phot}$	0.158	-0.023	0.387
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.201	0	0.468
						$z_{\rm phot}$	0.099	0.011	0.263
CSST	+	$\beta$ - $\tau$ ,+CEH	Sal+18	-28141 $\pm$ 2904	54484	${\rm log}(M_{*})_{\rm phot}$	0.161	-0.029	0.396
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.194	0.028	0.472
						$z_{\rm phot}$	0.100	0.013	0.268
CSST	+	$\beta$ - $\tau$ -r,+CEH	Sal+18	-35048 $\pm$ 3020	54444	${\rm log}(M_{*})_{\rm phot}$	0.171	-0.037	0.415
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.232	0.041	0.544
						$z_{\rm phot}$	0.104	0.007	0.290
CSST	+	$\alpha$ - $\beta$ - $\tau$ -r,+CEH	Sal+18	-33983 $\pm$ 2869	54440	${\rm log}(M_{*})_{\rm phot}$	0.177	-0.052	0.430
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.224	0.080	0.570

Table 3: A summary of Bayesian evidences (BE), maximum likelihoods (ML) and the metrics of the quality of parameter estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for CSST-like survey by employing six SED models with increasing complexity in the form of SFH and DAL, as well as for the cases with and without noise.

In Table 3, we present a summary of the Bayesian evidences, maximum likelihoods and metrics of the quality of parameter estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for CSST-like survey by employing six different combinations of SFH and DAL with increasing complexity, as well as for the cases with and without noise. The same results are also shown more clearly in Figure 11.

6.1.1 Model comparison

In the case without noise, as shown in the top left panel of Figure 11, the simplest model “SFH= $\tau$ ,-CEH,DAL=Cal+18” has the lowest Bayesian evidence of ${\rm ln}(BE)=-88042\pm 5936$ . With the additional consideration of metallicity evolution, the Bayesian evidence of the model “SFH= $\tau$ ,+CEH,DAL=Cal+18” increases to ${\rm ln}(BE)=-74128\pm 5973$ . Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH= $\tau$ ,+CEH,DAL=Sal+18” increases significantly to ${\rm ln}(BE)=58878\pm 5823$ . Apparently, the DAL of Salim et al. (2018) is a much better choice than that of Calzetti et al. (2000) for the hydrodynamical simulation-based mock galaxy sampe in CSST-like survey. Furthermore, by employing a more complicated $\beta$ - $\tau$ form of SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18” seems decreases a little to ${\rm ln}(BE)=56343\pm 5780$ . Actually, the latter two SED models (“SFH= $\tau$ ,+CEH,DAL=Sal+18” and “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18”) have the largest Bayesian evidences which are comparable within error bar. They are neither the simplest nor the most complex models. With a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” obviously decreases to ${\rm ln}(BE)=34213\pm 5867$ . It seems that, while the rejuvenation or rapid quenching events may happen in some galaxies, this additional component of SFH is not very effective for most of the galaxies in the sample. Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH= $\alpha$ - $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” seems decreases a little to ${\rm ln}(BE)=33044\pm 5800$ . However, the latter two SED models are actually comparable within error bar. On the other hand, it is worth to mention that the model selection with maximum likelihood (or equivalently minimum $\chi^{2}$ ) lead to similar results.

In the case with noise, as shown in the bottom left panel of Figure 11, the two simplest SED models “SFH= $\tau$ ,-CEH,DAL=Cal+00” and “SFH= $\tau$ ,+CEH,DAL=Cal+00” have the largest Bayesian evidences of ${\rm ln}(BE)=-18871\pm 2649$ and ${\rm ln}(BE)=-18661\pm 2649$ , respectively. Although the latter which has additionally considered the metallicity evolution seems better, their Bayesian evidences are actually comparable within error bar. Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH= $\tau$ ,+CEH,DAL=Sal+18” decreases significantly to ${\rm ln}(BE)=-26098\pm 2956$ , exactly the opposite of the situation without noise. It is likely that the more complicated form of DAL does not give a better fit to the noisier data. By employing a more complicated $\beta$ - $\tau$ form of SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18” seems decreases further to ${\rm ln}(BE)=-28141\pm 2904$ , although comparable with the former within error bar. With a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” obviously decreases to ${\rm ln}(BE)=-35048\pm 3020$ , which is similar to the case without noise. Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH= $\alpha$ - $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” seems increases a little to ${\rm ln}(BE)=-33983\pm 2869$ , although comparable with the former within error bar. On the other hand, it is very interesting to notice that the model selection with maximum likelihood (or equivalently minimum $\chi^{2}$ ) lead to exactly the opposite results, where the more complex (or flexible) models are always more favored.

6.1.2 Parameter estimation

In the right three panels of Figure 11, we show the three metrics of the quality of photometric redshift, stellar mass and SFR estimation for different SED models, respectively. In general, in the case without noise, the SED models “SFH= $\tau$ ,+CEH,DAL=Sal+18” and “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18” which are just the two with largest Bayesian evidence give the highest quality parameter estimates. In the case with noise, the two simplest SED models “SFH= $\tau$ ,-CEH,DAL=Cal+00” and “SFH= $\tau$ ,+CEH,DAL=Cal+00” which are also the two with the largest Bayesian evidences give the highest quality parameter estimates. In the following, we discuss these results in more detail.

Figure 12: The results of photometric redshift estimation from the Bayesian analysis of the hydrodynamical simulation-based mock data for CSST-like imaging survey by employing the two SED models with the largest Bayesian evidence. (a) By comparing with the results in panel c of Figure 8, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the

\sigma_{\rm NMAD}

of photometric redshift estimation is obviously reduced, although the bias and OLF are slightly increased. Meanwhile, the systematic patterns in the former results are also largely reduced. (c) By additionally employing a more complicated

\beta

\tau

form of SFH, the

\sigma_{\rm NMAD}

of photometric redshift estimation is only slightly reduced while the bias and OLF are exactly the same. (b, d) In the case with noise, the best two models are quite similar in the quality of photometric redshift estimation. Besides, there are two clear branches of outliers caused by the mis-identification of Lyman and Balmer break features.

The detailed results of photometric redshift estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 12. In the case without noise, by comparing the results in panel c of Figure 8 with that in panel a of Figure 12, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the $\sigma_{\rm NMAD}$ of photometric redshift estimation is obviously reduced while the bias and OLF are only slightly increased. Meanwhile, the systematic patterns in the former results are also largely reduced. However, as shown in panel c of Figure 12, by additionally employing a more complicated $\beta$ - $\tau$ form of SFH, the $\sigma_{\rm NMAD}$ of photometric redshift estimation is only slightly reduced while the bias and OLF are exactly the same. Besides, as shown in Table 3 and Figure 11, the other two even more complicated forms of SFH lead to similar quality of photometric redshift estimation. In the case with noise, the best two models are quite similar in the quality of photometric redshift estimation. Besides, with the increasing of the complexity of SED models, the quality of photometric redshift estimation tend to decrease, although not very significantly.

Figure 13: As in Figure 12, but for the stellar mass. (a) By comparing with the results in panel c of Figure 9, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the

\sigma_{\rm NMAD}

of photometric stellar mass estimation is reduced, although the bias and OLF are somewhat increased. (c) By additionally employing a more complicated

\beta

\tau

form of SFH, the quality of photometric stellar mass estimation increase further. (b, d) In the case with noise, the best two models are exactly the same in the quality of photometric stellar mass estimation.

The detailed results of photometric stellar mass estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 13. In the case without noise, by comparing the results in panel c of Figure 9 with that in panel a of Figure 13, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the $\sigma_{\rm NMAD}$ of photometric stellar mass estimation is reduced, although the bias and OLF are somewhat increased. By additionally employing a more complicated $\beta$ - $\tau$ form of SFH, as shown in panel c of Figure 13, the quality of photometric stellar mass estimation increase further. However, as shown in Table 3 and Figure 11, with the increasing of the complexity of SED models, the quality of photometric stellar mass stimation decreases obviously. In the case with noise, the best two models are exactly the same in the quality of photometric stellar mass estimation. Meanwhile, with the increasing of the complexity of SED models, the quality of photometric stellar mass stimation decreases even more obviously.

Figure 14: As in Figure 12, but for the SFR. (a) By comparing with the results in panel c of Figure 10 with that in panel a of Figure 14, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the systematic bias and OLF of photometric SFR are largely reduced, although the

\sigma_{\rm NMAD}

is slightly increased. (c) By additionally employing a more complicated

\beta

\tau

form of SFH the quality of photometric SFR estimation increase to the best. (b, d) In the case with noise, the best two models are also very similar in the quality of photometric stellar mass estimation.

The detailed results of photometric SFR estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 14. In the case without noise, by comparing with the results in panel c of Figure 10 with that in panel a of Figure 14, it is clear that, with the additional consideration of metallicity evolution and the adoption of the DAL of Salim et al. (2018), the systematic bias and OLF of photometric redshift estimation are largely reduced while the $\sigma_{\rm NMAD}$ is slightly increased. By additionally employing a more complicated $\beta$ - $\tau$ form of SFH, as shown in panel c of Figure 14, the quality of photometric SFR estimation increase to the best. However, as shown in Table 3 and Figure 11, with a additional quenching (or rejuvenation) component, the $\beta$ - $\tau$ -r form of SFH lead to a much worse quality of photometric SFR estimation. Finally, the most complicated $\alpha$ - $\beta$ - $\tau$ -r form of SFH lead to a slightly better SFR estimation. In the case with noise, the best two models are also very similar in the quality of photometric stellar mass estimation. Meanwhile, with the increasing of the complexity of SED models, the quality of photometric SFR stimation increases significantly.

For photometric redshift, stellar mass and SFR, their accurate estimation becomes increasingly difficult. Besides, the latter two are also more sensitive to the selection of SED models. Generally, the quality of parameter estimates is closely related to the level of Bayesian evidence, which is especially clear in the more realistic case with noise. Meanwhile, the model selection with maximum likelihood (or equivalently minimum $\chi^{2}$ ), where the more complex (or flexible) models are always more favored, is not consistent with the measurements of the quality of parameter estimation. In practice, the direct measurements of the quality of parameter estimation as indicated by NMAD, BIA and OLF are usually unavailable. So, the Bayesian model comparison with Bayesian evidence can be used to find the best SED model which is not only the most efficient but also give the best parameter estimation.

6.2 Effects of more flexible SFH and DAL for CSST+Euclid-like survey

The results of both model comparison and parameter estimation are strongly dependent on the used datasets which may have very different discriminative powers. In Figure 15, we show an example of 1D and 2D posterior probability distribution functions (PDFs) of free parameters obtained from the Bayesian analysis of the photometric data of a mock galaxy in the CSST-like, CSST-like+Euclid-like, and COSMOS-like surveys, respectively. It is clear that different datasets lead to very different PDFs, due to their very different discriminative powers. In this section and §6.3, we discuss the effects of more flexible SFH and DAL for CSST+Euclid-like and COSMOS-like surveys, respectively. The addition of Euclid data extends the wavelength coverage of the data to the longer NIR band than with CSST-only data, which should be useful for enhancing the discriminative power of model comparison and the quality of the parameter estimation.

survey	noise	SFH	DAL	ln(BE)	ln(ML)	parameter	NMAD	BIA	OLF
						$z_{\rm phot}$	0.020	0.002	0.001
CSST+Euclid	-	$\tau$ ,-CEH	Cal+00	-243527 $\pm$ 6307	-3717	${\rm log}(M_{*})_{\rm phot}$	0.055	-0.023	0.045
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.097	-0.088	0.299
						$z_{\rm phot}$	0.021	0.001	0.001
CSST+Euclid	-	$\tau$ ,+CEH	Cal+00	-216613 $\pm$ 6390	30195	${\rm log}(M_{*})_{\rm phot}$	0.057	-0.014	0.037
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.094	-0.108	0.342
						$z_{\rm phot}$	0.013	0.010	0.005
CSST+Euclid	-	$\tau$ ,+CEH	Sal+18	72092 $\pm$ 6475	340038	${\rm log}(M_{*})_{\rm phot}$	0.029	-0.015	0.009
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.081	-0.046	0.218
						$z_{\rm phot}$	0.012	0.010	0.004
CSST+Euclid	-	$\beta$ - $\tau$ ,+CEH	Sal+18	70340 $\pm$ 6409	339916	${\rm log}(M_{*})_{\rm phot}$	0.027	-0.010	0.007
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.065	-0.021	0.195
						$z_{\rm phot}$	0.017	0.012	0.010
CSST+Euclid	-	$\beta$ - $\tau$ -r,+CEH	Sal+18	25728 $\pm$ 6566	315401	${\rm log}(M_{*})_{\rm phot}$	0.032	-0.005	0.017
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.087	0.006	0.302
						$z_{\rm phot}$	0.016	0.012	0.016
CSST+Euclid	-	$\alpha$ - $\beta$ - $\tau$ -r,+CEH	Sal+18	19560 $\pm$ 6454	305020	${\rm log}(M_{*})_{\rm phot}$	0.035	-0.007	0.030
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.087	0.011	0.286
						$z_{\rm phot}$	0.065	-0.003	0.119
CSST+Euclid	+	$\tau$ ,-CEH	Cal+00	-31783 $\pm$ 3314	58861	${\rm log}(M_{*})_{\rm phot}$	0.058	-0.030	0.083
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.164	-0.020	0.379
						$z_{\rm phot}$	0.064	-0.001	0.118
CSST+Euclid	+	$\tau$ ,+CEH	Cal+00	-31347 $\pm$ 3302	58766	${\rm log}(M_{*})_{\rm phot}$	0.058	-0.027	0.080
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.171	-0.027	0.392
						$z_{\rm phot}$	0.067	0.016	0.129
CSST+Euclid	+	$\tau$ ,+CEH	Sal+18	-38081 $\pm$ 3529	64421	${\rm log}(M_{*})_{\rm phot}$	0.061	-0.016	0.103
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.186	0.011	0.449
						$z_{\rm phot}$	0.068	0.015	0.130
CSST+Euclid	+	$\beta$ - $\tau$ ,+CEH	Sal+18	-41164 $\pm$ 3437	64349	${\rm log}(M_{*})_{\rm phot}$	0.063	-0.017	0.112
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.177	0.042	0.476
						$z_{\rm phot}$	0.069	0.017	0.141
CSST+Euclid	+	$\beta$ - $\tau$ -r,+CEH	Sal+18	-49796 $\pm$ 3561	63873	${\rm log}(M_{*})_{\rm phot}$	0.065	-0.023	0.127
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.213	0.059	0.571
						$z_{\rm phot}$	0.068	0.016	0.136
CSST+Euclid	+	$\alpha$ - $\beta$ - $\tau$ -r,+CEH	Sal+18	-49608 $\pm$ 3426	64065	${\rm log}(M_{*})_{\rm phot}$	0.073	-0.025	0.169
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.181	0.084	0.560

Table 4: As in Table 3, but for CSST+Euclid-like survey.

In Table 4, we present a summary of the Bayesian evidences, maximum likelihoods and metrics of the quality of parameter estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for CSST+Euclid-like survey by employing six different combinations of SFH and DAL with increasing complexity, as well as for the cases with and without noise. The same results are also shown more clearly in Figure 16.

6.2.1 Model comparison

In the case without noise, as shown in the top left panel of Figure 16, the simplest model “SFH= $\tau$ ,-CEH,DAL=Cal+18” has the lowest Bayesian evidence of ${\rm ln}(BE)=-243527\pm 6307$ . With the additional consideration of metallicity evolution, the Bayesian evidence of the model “SFH= $\tau$ ,+CEH,DAL=Cal+18” increases to ${\rm ln}(BE)=-216613\pm 6390$ . Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH= $\tau$ ,+CEH,DAL=Sal+18” increases significantly to ${\rm ln}(BE)=72092\pm 6475$ . Apparently, the DAL of Salim et al. (2018) is also a much better choice than that of Calzetti et al. (2000) for the hydrodynamical simulation-based mock galaxy sampe in CSST+Euclid-like survey. Furthermore, by employing a more complicated $\beta$ - $\tau$ form of SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18” seems decreases a little to ${\rm ln}(BE)=70340\pm 6409$ . As in the case for CSST-like survey, the latter two SED models (“SFH= $\tau$ ,+CEH,DAL=Sal+18” and “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18”) have the largest Bayesian evidences which are comparable within error bar. With a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” decreases significantly to ${\rm ln}(BE)=25728\pm 6566$ . Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH= $\alpha$ - $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” seems decreases a little to ${\rm ln}(BE)=19560\pm 6454$ which is comparable with the former within error bar. On the other hand, the model selection with maximum likelihood (or equivalently minimum $\chi^{2}$ ) lead to similar results.

In the case with noise, the two simplest SED models “SFH= $\tau$ ,-CEH,DAL=Cal+00” and “SFH= $\tau$ ,+CEH,DAL=Cal+00” have the largest Bayesian evidences of ${\rm ln}(BE)=-31783\pm 3314$ and ${\rm ln}(BE)=-31347\pm 3302$ , respectively. Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH= $\tau$ ,+CEH,DAL=Sal+18” decreases significantly to ${\rm ln}(BE)=-38081\pm 3529$ . By employing a more complicated $\beta$ - $\tau$ form of SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18” seems decreases further to ${\rm ln}(BE)=-41164\pm 3437$ , although comparable with the former within error bar. With a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” obviously decreases to ${\rm ln}(BE)=-49796\pm 3561$ , which is similar to the case without noise. Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH= $\alpha$ - $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” seems increases a little to ${\rm ln}(BE)=-49608\pm 3426$ , although comparable with the former within error bar. However, the model selection with maximum likelihood (or equivalently minimum $\chi^{2}$ ) lead to exactly the opposite results, where the more complex (or flexible) models are always more favored.

In general, all of these results are very similar to that for CSST-like survey. The model selection with Bayesian evidence is still more consistent with the measurements of the quality of parameter estimation. However, the relative error of Bayesian evidences have been reduced, especially in the case without noise.

6.2.2 Parameter estimation

In the right three panels of Figure 16, we show the three metrics of the quality of photometric redshift, stellar mass and SFR estimation for different SED models, respectively. In general, in the case without noise, the SED models “SFH= $\tau$ ,+CEH,DAL=Sal+18” and “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18” which are just the two with largest Bayesian evidence give the highest quality parameter estimates. In the case with noise, the two simplest SED models “SFH= $\tau$ ,-CEH,DAL=Cal+00” and “SFH= $\tau$ ,+CEH,DAL=Cal+00” which are also the two with the largest Bayesian evidences give the highest quality parameter estimates. In the following, we discuss these results in more detail.

Figure 17: The results of photometric redshift estimation from the Bayesian analysis of the hydrodynamical simulation-based mock data for CSST+Euclid-like survey by employing the two SED models with the largest Bayesian evidence. By comparing with the results for CSST-like survey in Figure 12, it is clear that the quality of photometric redshift estimation has been obviously increased in both the cases with and without noise. Especially, in the more realistic case with noise, the outliers caused by the mis-identification of Lyman and Balmer break features have been largely reduced. This suggest that, the inclusion of J, H and Y bands from Euclid is helpful for improving the photometric redshift estimation. However, the more complicated

\beta

\tau

form of SFH is not very helpful for improving the quality of photometric redshift estimation.

The detailed results of photometric redshift estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 17. By comparing with the results for CSST-like survey in Figure 12, it is clear that the quality of photometric redshift estimation has been obviously increased in both the cases with and without noise. Especially, in the more realistic case with noise, the outliers caused by the mis-identification of Lyman and Balmer break features have been largely reduced. This suggest that, the inclusion of J, H and Y bands from Euclid is helpful for improving the photometric redshift estimation. However, the more complicated $\beta$ - $\tau$ form of SFH is not very helpful for improving the quality of photometric redshift estimation.

Figure 18: As in Figure 17, but for the stellar mass. By comparing with the results for CSST-like survey in Figure 13, it is clear that the quality of photometric stellar mass estimation has been significantly improved in both the cases with and without noise. Apparently, the inclusion of J, H and Y bands from Euclid is crucial for a more accurate estimation of stellar mass. Besides, with the more complicated

\beta

\tau

form of SFH, the quality of photometric stellar mass estimation is only slightly improved.

The detailed results of photometric stellar mass estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 18. By comparing with the results for CSST-like survey in Figure 13, it is clear that the quality of photometric stellar mass estimation has been significantly improved in both the cases with and without noise. Apparently, the inclusion of J, H and Y bands from Euclid is crucial for a more accurate estimation of stellar mass. With the more complicated $\beta$ - $\tau$ form of SFH, the quality of photometric stellar mass estimation is slightly improved. However, as shown in Table 4 and Figure 16, the even more complicated forms of SFH is still not helpful for improving the quality of photometric stellar mass estimation.

Figure 19: As in Figure 17, but for the SFR. (a, c) The cases without noise. By comparing with the results for CSST-like survey in the left panes of Figure 14, it is clear that the

\sigma_{\rm NMAD}

and OLF of photometric SFR estimation are largely reduced, although the bias is slightly increased. Besides, with the more complicated

\beta

\tau

form of SFH, the quality of photometric SFR estimation is improved. (b, d) The cases with noise. By comparing with the results for CSST-like survey in the right panes of Figure 14, all of the

\sigma_{\rm NMAD}

, bias and OLF of photometric SFR estimation slightly increase. This suggest that, the inclusion of J, H and Y bands from Euclid is not very helpful for improving the photometric SFR estimation. Besides, with the additional consideration of metallicity evolution, the quality of photometric SFR estimation becomes slightly worse.

The detailed results of photometric SFR estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 19. In the case without noise, by comparing with the results for CSST-like survey in Figure 14, it is clear that the $\sigma_{\rm NMAD}$ and OLF of photometric SFR estimation are largely reduced, although the bias is slightly increased. However, in the case with noise, all of the $\sigma_{\rm NMAD}$ , bias and OLF of photometric SFR estimation slightly increase. So, the inclusion of J, H and Y bands from Euclid is not very helpful for improving the photometric SFR estimation.

6.3 Effects of more flexible SFH and DAL for COSMOS-like survey

The COSMOS-like survey covers many more bands than CSST-like data. Although there is no NUV data, it extends to the longer wavelengths and includes some intermediate bands (IBs) (Laigle et al., 2016). Since the COSMOS-like mock data has much stronger discriminative power than CSST-like and CSST+Euclid-like mock data, the Bayesian evidence of different SED models should show much larger difference, and the photometric redshift and stellar population parameter estimation should be better.

survey	noise	SFH	DAL	ln(BE)	ln(ML)	parameter	NMAD	BIA	OLF
						$z_{\rm phot}$	0.007	-0.002	0
COSMOS	-	$\tau$ ,-CEH	Cal+00	-1710604 $\pm$ 6811	-1431790	${\rm log}(M_{*})_{\rm phot}$	0.030	-0.026	0.013
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.110	-0.062	0.290
						$z_{\rm phot}$	0.006	-0.002	0
COSMOS	-	$\tau$ ,+CEH	Cal+00	-1499618 $\pm$ 6997	-1207887	${\rm log}(M_{*})_{\rm phot}$	0.032	-0.017	0.010
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.101	-0.082	0.299
						$z_{\rm phot}$	0.002	0	0
COSMOS	-	$\tau$ ,+CEH	Sal+18	354581 $\pm$ 7310	681554	${\rm log}(M_{*})_{\rm phot}$	0.023	-0.022	0.012
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.069	-0.068	0.124
						$z_{\rm phot}$	0.002	0	0
COSMOS	-	$\beta$ - $\tau$ ,+CEH	Sal+18	387009 $\pm$ 7273	717135	${\rm log}(M_{*})_{\rm phot}$	0.020	-0.011	0.006
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.047	-0.041	0.057
						$z_{\rm phot}$	0.003	0.001	0
COSMOS	-	$\beta$ - $\tau$ -r,+CEH	Sal+18	255069 $\pm$ 7552	613256	${\rm log}(M_{*})_{\rm phot}$	0.028	-0.003	0.017
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.070	-0.001	0.124
						$z_{\rm phot}$	0.003	0.001	0
COSMOS	-	$\alpha$ - $\beta$ - $\tau$ -r,+CEH	Sal+18	275129 $\pm$ 7218	612401	${\rm log}(M_{*})_{\rm phot}$	0.031	-0.005	0.012
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.067	-0.006	0.121
						$z_{\rm phot}$	0.017	-0.003	0.004
COSMOS	+	$\tau$ ,-CEH	Cal+00	99881 $\pm$ 4881	259510	${\rm log}(M_{*})_{\rm phot}$	0.044	-0.020	0.006
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.127	-0.032	0.282
						$z_{\rm phot}$	0.017	-0.004	0.003
COSMOS	+	$\tau$ ,+CEH	Cal+00	107009 $\pm$ 4896	268492	${\rm log}(M_{*})_{\rm phot}$	0.044	-0.012	0.005
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.126	-0.044	0.288
						$z_{\rm phot}$	0.015	0.002	0.003
COSMOS	+	$\tau$ ,+CEH	Sal+18	131578 $\pm$ 5115	313110	${\rm log}(M_{*})_{\rm phot}$	0.042	-0.017	0.036
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.107	-0.021	0.211
						$z_{\rm phot}$	0.015	0.002	0.003
COSMOS	+	$\beta$ - $\tau$ ,+CEH	Sal+18	130766 $\pm$ 5064	315205	${\rm log}(M_{*})_{\rm phot}$	0.037	-0.012	0.029
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.090	0.002	0.176
						$z_{\rm phot}$	0.016	0.006	0.004
COSMOS	+	$\beta$ - $\tau$ -r,+CEH	Sal+18	109572 $\pm$ 5116	305267	${\rm log}(M_{*})_{\rm phot}$	0.044	-0.021	0.052
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.102	0.063	0.321
						$z_{\rm phot}$	0.015	0.005	0.003
COSMOS	+	$\alpha$ - $\beta$ - $\tau$ -r,+CEH	Sal+18	110909 $\pm$ 5006	306387	${\rm log}(M_{*})_{\rm phot}$	0.045	-0.020	0.066
						${\rm log}(SFR/[\rm M_{\odot}/yr])_{\rm phot}$	0.097	0.055	0.304

Table 5: As in Table 3, but for COSMOS-like survey.

In Table 5, we present a summary of the Bayesian evidences, maximum likelihoods and metrics of the quality of parameter estimation from the Bayesian analysis of the hydrodynamical simulation-based mock galaxy sampe for COSMOS-like survey by employing six different combinations of SFH and DAL with increasing complexity, as well as for the cases with and without noise. The same results are also shown more clearly in Figure 20.

6.3.1 Model comparison

In the case without noise, as shown in the top left panel of Figure 20, the simplest model “SFH= $\tau$ ,-CEH,DAL=Cal+18” has the lowest Bayesian evidence of ${\rm ln}(BE)=-1710604\pm 6811$ . With the additional consideration of metallicity evolution, the Bayesian evidence of the model “SFH= $\tau$ ,+CEH,DAL=Cal+18” increases to ${\rm ln}(BE)=-1499618\pm 6997$ . Then, with the adoption of the DAL of Salim et al. (2018), the Bayesian evidence of the model “SFH= $\tau$ ,+CEH,DAL=Sal+18” increases significantly to ${\rm ln}(BE)=354581\pm 7310$ . Apparently, the DAL of Salim et al. (2018) is also a much better choice than that of Calzetti et al. (2000) for the hydrodynamical simulation-based mock galaxy sampe in COSMOS-like survey. Furthermore, by employing a more complicated $\beta$ - $\tau$ form of SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18” increases further to ${\rm ln}(BE)=387009\pm 7273$ . However, with a quenching (or rejuvenation) component added to the SFH, the Bayesian evidence of the model “SFH= $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” significantly decreases to ${\rm ln}(BE)=255069\pm 7552$ , which is similar to the case without noise. Finally, by employing a even more flexible double power-law form of SFH, the Bayesian evidence of the model “SFH= $\alpha$ - $\beta$ - $\tau$ -r,+CEH,DAL=Sal+18” increases to ${\rm ln}(BE)=275129\pm 7218$ . As in the cases for CSST-like and CSST+Euclid-like surveys, the model selection with maximum likelihood (or equivalently minimum $\chi^{2}$ ) lead to similar results.

In the case with noise, as shown in the bottom left panel of Figure 20, almost the same conclusions about SED model comparison are obtained, although the detailed values of Bayesian evidence are apparently different. Unlike the cases for CSST-like and CSST+Euclid-like surveys, the model selection with Bayesian evidence and maximum likelihood (or equivalently minimum $\chi^{2}$ ) lead to similar results. In general, for COSMOS-like survey, we can obtain more clear results of SED model comparison. Meanwhile, in the more realistic case with noise, the more complicated SED models are more favored than in the cases for CSST-like and CSST+Euclid-like survey. This is reasonable, since COSMOS-like survey has much stronger discriminative power than CSST-like and CSST+Euclid-like surveys.

6.3.2 Parameter estimation

In the right three panels of Figure 20, we show the three metrics of the quality of photometric redshift, stellar mass and SFR estimation for different SED models, respectively. In the cases with and without noise, the same SED model “SFH= $\beta$ - $\tau$ ,+CEH,DAL=Sal+18” which is just the one with largest Bayesian evidence give the highest quality parameter estimates. In the following, we discuss these results in more detail.

Figure 21: The results of photometric redshift estimation from the Bayesian analysis of the hydrodynamical simulation-based mock data for COSMOS-like survey by employing the two SED models with the largest Bayesian evidence. By comparing with the results for CSST-like survey in Figure 12 and that for for CSST+Euclid-like survey in Figure 17, it is clear that the quality of photometric redshift estimation has been significantly increased in both the cases with and without noise. Meanwhile, in both cases, the best two SED models given identical quality of photometric redshift estimation. In the case without noise, the bias and OLF of photometric redshift estimation are almost zero while the

\sigma_{\rm NMAD}

is only 0.002. Since the errors from parameter degeneracy and SED model error should be largely reduced, this suggest that the contribution of errors from the stochastic nature of MultiNest sampling algorithm and other potential errors in the BayeSED code should be less than 0.002. In the more realistic case with noise, the outliers caused by the mis-identification of Lyman and Balmer break features have been largely resolved.

The detailed results of photometric redshift estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 21. By comparing with the results for CSST-like survey in Figure 12 and that for for CSST+Euclid-like survey in Figure 17, it is clear that the quality of photometric redshift estimation has been significantly increased in both the cases with and without noise. Meanwhile, in both cases, the best two SED models given identical quality of photometric redshift estimation. In the case without noise, the bias and OLF of photometric redshift estimation are almost zero while the $\sigma_{\rm NMAD}$ is only 0.002. In this case, the errors from parameter degeneracy and SED model error should be largely reduced. This suggest that the contribution of errors from the stochastic nature of MultiNest sampling algorithm and other potential errors in the BayeSED code should be less than 0.002. In the more realistic case with noise, the outliers caused by the mis-identification of Lyman and Balmer break features have been largely resolved. Finally, as in the cases for CSST-like and CSST+Euclid-like surveys, the more complicated SED models are not very helpful for improving the quality of photometric redshift estimation.

Figure 22: As in Figure 21, but for the stellar mass. By comparing with the results for CSST+Euclid-like survey in Figure 18, it is clear that the quality of photometric stellar mass estimation has been improved further in both the cases with and without noise. Besides, the more complicated

\beta

\tau

form of SFH makes the quality of photometric stellar mass estimation a little better.

The detailed results of photometric stellar mass estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 22. By comparing with the results for CSST+Euclid-like survey in Figure 18, it is clear that the quality of photometric stellar mass estimation has been improved further in both the cases with and without noise. Besides, the more complicated $\beta$ - $\tau$ form of SFH makes the quality of photometric stellar mass estimation a little better. However, as shown in Table 5 and Figure 16, the even more complicated forms of SFH make the quality of photometric stellar mass estimation worse.

Figure 23: As in Figure 21, but for the SFR. By comparing with the results for CSST+Euclid-like survey in Figure 18, it is clear that the quality of photometric SFR estimation has been significantly improved in both the cases with and without noise. Besides, the more complicated

\beta

\tau

form of SFH makes the quality of photometric SFR estimation much better.

The detailed results of photometric SFR estimation obtained by employing the two models with the largest Bayesian evidences are shown in Figure 23. By comparing with the results for CSST+Euclid-like survey in Figure 18, it is clear that the quality of photometric SFR estimation has been significantly improved in both the cases with and without noise. Besides, the more complicated $\beta$ - $\tau$ form of SFH makes the quality of photometric SFR estimation much better. However, as shown in 20, with a quenching (or rejuvenation) component added to the SFH, the quality of photometric SFR estimation becomes obviously worse. Finally, by employing a even more flexible double power-law form of SFH, the quality of photometric SFR estimation becomes a little better.

7 Summary and conclusion

In this work, based on the Bayesian SED synthesis and analysis techniques employed in the BayeSED-V3 code, we present a comprehensive and systematic test of its performance for simultaneous photometric redshift and stellar population parameter estimation of galaxies when combined with six SED models with increasing complexity in the form of SFH and DAL. The main purpose is to make a systematic analysis of various factors affecting the simultaneous photometric redshift and stellar population parameter estimation of galaxies in the context of Bayesian SED fitting, so as to provide clues for further improvement.

To separate the different factors which could contribute to the errors of photometric redshift and stellar population parameter estimation of galaxies, the empirical statistics-based and hydrodynamical simulation-based approaches have been employed to generate mock photometric sample of galaxies with or without noise for CSST-like, COSMOS-like and CSST+Euclid-like surveys, respectively. We compare the difference in performance of photometric parameter estimation with different run parameters of Bayesian analysis algorithm, different assumptions about the SFH and DAL of galaxies, and different observational datasets. Our main findings are as follows.

For the performance tests using empirical statistics-based mock galaxy sample with idealized SED modeling:

1.

The performance of photometric redshift and stellar population parameter estimation, in terms of speed and quality, is sensitive to the runtime parameters (the target sampling efficiency $efr$ and the number of live points $nlive$ ) of MultiNest algorithm.
2.

A good balance among the speed, quality of parameter estimation, and accuracy of model comparison can be achieved when adopting the MultiNest runtime parameters $efr$ equals to $0.1$ and $nlive$ equals to $50$ .
3.

By employing the optimized runtime parameters of MultiNest and simplest SED modeling, a speed of $\sim 2{\rm s/obj/cpu}$ ( $\sim 10{\rm s/obj/cpu}$ ) can be achieved for a detailed Bayesian analysis of photometries from CSST-like survey, which is sufficient for the analysis of massive photometric data. Meanwhile, a quality of photometric redshift estimation with $\sigma_{\rm NMAD}=0.056$ , $BIA=-0.0025$ , $OLF=0.215$ , a quality of photometric stellar mass estimation with $\sigma_{\rm NMAD}=0.113$ , $BIA=-0.025$ , $OLF=0.285$ , and a quality of photometric SFR estimation with $\sigma_{\rm NMAD}=0.08$ , $BIA=-0.01$ , $OLF=0.255$ can be achieved.
4.

With the optimized runtime parameters of MultiNest, the value of Bayesian evidence which is crucial for Bayesian model comparison can also be well estimated, although the error of Bayesian evidence tends to be overestimated, which may lead to a more conservative conclusion about model comparison.
5.

The random observational errors in photometries are more important sources of errors than the parameter degeneracies and Bayesian analysis method and tool.
6.

More complicated SED models apparently require longer running time. They also tend to overfit noisy photometries and lead to worse quality of photometric redshift, stellar mass and SFR estimation, which is likely due to more free parameters and more severe parameter degeneracies.
7.

The value of Bayesian evidence clearly decreases with the increasing of the complexity of the SED model in both of the cases with and without noise.

For the performance tests using hydrodynamical simulation-based mock galaxy sample without idealized SED modeling:

1.

The commonly used simple assumptions about the SFH and DAL of galaxies have severe impact on the quality of photometric parameter estimation of galaxies, especially for CSST-like survey with only photometries from seven broad-bands.
2.

The performance of both Bayesian parameter estimation and model comparison highly depends on the discriminative power of the observational photometries. With more informative photometries, more clear results about SED model comparison and higher quality of photometric parameter estimation can be obtained.
3.

While the SED model comparison with Bayesian evidence may favor SED models with very different complexities when using photometries from different surveys, the maximum likelihood (or equivalently minimum $\chi^{2}$ ) tend to favor more complex models. For photometries with strong enough discriminative power, the two methods lead to more consistent results. However, for photometries without strong enough discriminative power, the two methods may lead to contradictory results. In both cases, the results of model selection with Bayesian evidence are more consistent with the measurements of the quality of parameter estimation.
4.

In both of the cases with and without noise, the additional consideration of metallicity evolution helps to improve the quality of photometric redshift and stellar parameter estimation of galaxies, and increases the Bayesian evidence of corresponding SED model.
5.

In the case without noise, the DAL of Salim et al. (2018) is a much better choice than that of Calzetti et al. (2000) for the hydrodynamical simulation-based mock galaxy sampe in CSST-like, CSST+Euclid-like and COSMOS-like surveys. However, in the more realistic case with noise, it is only more favored in the COSMOS-like survey with Bayesian evidence-based model selection. With maximum likelihood (or equivalently minimum $\chi^{2}$ )-based model selection, it could be more favored, but lead to worse parameter estimation.
6.

In the case without noise, the more flexible forms of SFH lead to better quality of parameter estimation and increase the Bayesian evidence of corresponding SED model. However, in the more realistic case with noise, they are only more favored in the COSMOS-like survey.
7.

With a quenching (or rejuvenation) component added to the SFH, the quality of parameter estimation and the Bayesian evidence of corresponding SED model decrease in all cases. Although the rejuvenation or rapid quenching events may happen in some galaxies, this additional component of SFH is not very effective for most of the galaxies in the hydrodynamical simulation-based mock galaxy sample.
8.

The quality of parameter estimation is closely related to the level of Bayesian evidence such that the SED model with largest Bayesian evidence tends to give the best quality of parameter estimation, which is more clear for photometries with larger discriminative power. By using photometries without strong enough discriminative power, the quality of parameter estimation, especially that of stellar mass and SFR estimation, tend to decrease with the increasing of SED model complexity
9.

Since the direct measurements of the quality of parameter estimation as indicated by NMAD, BIA and OLF are usually unavailable, the Bayesian model comparison with Bayesian evidence can be used to find the best SED model which is not only the most efficient but also give the best parameter estimation.
10.

For photometric redshift, stellar mass and SFR, their accurate estimation becomes increasingly difficult, while the latter two are also more sensitive to the selection of SED models.
11.

For the photometric redshift estimation of galaxies in CSST-like survey, the observational noise is the more important source of error than the imperfect SED modeling. However, for the photometric stellar mass and SFR estimation of galaxies, the opposite is true.
12.

The combination of photometries from CSST-like and Euclid-like surveys is helpful for improving the quality of photometric redshift estimation and crucial for the more accurate stellar mass estimation, but not very useful for SFR estimation.
13.

With photometries in 26 bands from COSMOS-like surveys, by employ the same SED model, BayeSED-V3 can achieve similar quality of photometric redshift, stellar mass and SFR estimation to previous works. Besides, with photometries in 26 bands from COSMOS-like surveys, more complicated SED models tend to be more favored, which is very different from the two cases with only photometries from CSST-like (7 bands) or CSST+Euclid-like (10 bands) surveys.

We conclude that the latest version of BayeSED is capable of achieving a good balance among speed, the quality of simultaneous photometric redshift and stellar population parameter estimation of galaxies and the reliable SED model comparison. This makes it suitable for the analysis of existing and forthcoming massive photometric data of galaxies in CSST wide-field multiband imaging survey and others.

Generally, the current main bottleneck that limits the performance of the Bayesian approach for the simultaneous photometric redshift and stellar population parameter estimation of galaxies is the reliability of the SED synthesis (or modeling) procedure. Assuming a more flexible model is not a complete solution. We need a SED model that is not only more flexible but also more precisely accurate. It can be achieved by gradually adding more informative priors and physical constraints to the SED synthesis (modeling) procedure of galaxies, which is the subject of future works. The Bayesian model selection method with Bayesian evidence, a quantified Occam’s razor, is very helpful to identify the best SED model which is not only the most efficient but also give the best parameter estimation.

The results about simultaneous photometric redshift and stellar population estimation presented in this work are not yet optimal, especially those about CSST. The contributions of nebular lines and continuum emission to the SED, which may help break some parameter degeneracies, are still mising in this work. It is also worth to mention that the results of Bayesian SED model comparison and the metrics (OLF, BIA or NMAD) of parameter estimation highly depend on the selected samples. In this paper, we have chosen a relative broader sample to test the overall performance in the CSST wide-field imaging survey. For a differently selected sample which is designed for answering a more specific scientific question, the results could be different.

Finally, in addition to multi-bands photometries, we may need more informations from other forms of data, such as slitless spectroscopy and morphology parameters from the imaging to break the severe parameter degeneracies. More advanced methods may be able to take advantage of all information to give better redshift and/or stellar population parameter estimation of galaxies. These will be the subjects of future works as well.

The authors gratefully acknowledge the “PHOENIX Supercomputing Platform” jointly operated by the Binary Population Synthesis Group and the Stellar Astrophysics Group at Yunnan Observatories, Chinese Academy of Sciences. We warmly thank the Horizon-AGN team, especially C. Laigle, for making their photometric catalogs and spectra data publicly available. We thank Fengshan Liu, Ye Cao, and Xinwen Shu for helpful discussion about SNR and magnitude limit. We acknowledge support from the National Key R&D Program of China (Nos. 2021YFA1600401 and 2021YFA1600400), the National Science Foundation of China (grant nos. 11773063, 12173037, 12233008, 12233005, 12073078, 12288102) the China Manned Space Project (grant nos. CMS-CSST-2021-A02, CMS-CSST-2021-A04, CMS-CSST-2021-A06, and CMS-CSST-2021-A07), the “Light of West China” Program of Chinese Academy of Sciences, the Yunnan Ten Thousand Talents Plan Young & Elite Talents Project, the International Centre of Supernovae, Yunnan Key Laboratory (No. 202302AN360001). LF also gratefully acknowledges the support of the CAS Project for Young Scientists in Basic Research (No. YSBR-092), the Fundamental Research Funds for the Central Universities (WK3440000006), and the Cyrus Chun Ying Tang Foundations. This work made use of the following softwares: ASTROPY(Astropy Collaboration et al., 2013, 2018), MATPLOTLIB(Hunter, 2007), NUMPY(van der Walt et al., 2011), and H5PY(Collette et al., 2023).

References

Abdurro’uf et al. (2021) Abdurro’uf, Lin, Y.-T., Wu, P.-F., & Akiyama, M. 2021, ApJS, 254, 15, doi: 10.3847/1538-4365/abebe2
Acquaviva et al. (2011) Acquaviva, V., Gawiser, E., & Guaita, L. 2011, ApJ, 737, 47, doi: 10.1088/0004-637X/737/2/47
Acquaviva et al. (2015) Acquaviva, V., Raichoor, A., & Gawiser, E. 2015, ApJ, 804, 8, doi: 10.1088/0004-637X/804/1/8
Alsing et al. (2023) Alsing, J., Peiris, H., Mortlock, D., Leja, J., & Leistedt, B. 2023, ApJS, 264, 29, doi: 10.3847/1538-4365/ac9583
Alsing et al. (2020) Alsing, J., Peiris, H., Leja, J., et al. 2020, ApJS, 249, 5, doi: 10.3847/1538-4365/ab917f
Antonucci (1993) Antonucci, R. 1993, ARA&A, 31, 473, doi: 10.1146/annurev.aa.31.090193.002353
Antonucci (2012) —. 2012, Astronomical and Astrophysical Transactions, 27, 557. https://arxiv.org/abs/1210.2716
Arnouts et al. (2013) Arnouts, S., Le Floc’h, E., Chevallard, J., et al. 2013, A&A, 558, A67, doi: 10.1051/0004-6361/201321768
Ashton et al. (2022) Ashton, G., Bernstein, N., Buchner, J., et al. 2022, Nature Reviews Methods Primers, 2, 39, doi: 10.1038/s43586-022-00121-x
Astropy Collaboration et al. (2013) Astropy Collaboration, Robitaille, T. P., Tollerud, E. J., et al. 2013, A&A, 558, A33, doi: 10.1051/0004-6361/201322068
Astropy Collaboration et al. (2018) Astropy Collaboration, Price-Whelan, A. M., Sipőcz, B. M., et al. 2018, AJ, 156, 123, doi: 10.3847/1538-3881/aabc4f
Aufort et al. (2020) Aufort, G., Ciesla, L., Pudlo, P., & Buat, V. 2020, A&A, 635, A136, doi: 10.1051/0004-6361/201936788
Bastian et al. (2010) Bastian, N., Covey, K. R., & Meyer, M. R. 2010, ARA&A, 48, 339, doi: 10.1146/annurev-astro-082708-101642
Beckmann et al. (2017a) Beckmann, R. S., Devriendt, J., Slyz, A., et al. 2017a, MNRAS, 472, 949, doi: 10.1093/mnras/stx1831
Beckmann et al. (2017b) —. 2017b, ArXiv e-prints. https://arxiv.org/abs/1701.07838
Behroozi et al. (2013) Behroozi, P. S., Wechsler, R. H., & Conroy, C. 2013, ApJ, 770, 57, doi: 10.1088/0004-637X/770/1/57
Beichman et al. (2012) Beichman, C. A., Rieke, M., Eisenstein, D., et al. 2012, in Space Telescopes and Instrumentation 2012: Optical, Infrared, and Millimeter Wave, ed. M. C. Clampin, G. G. Fazio, H. A. MacEwen, & J. M. O. Jr., Vol. 8442, International Society for Optics and Photonics (SPIE), 84422N, doi: 10.1117/12.925447
Bertelli et al. (2008) Bertelli, G., Girardi, L., Marigo, P., & Nasi, E. 2008, A&A, 484, 815, doi: 10.1051/0004-6361:20079165
Boquien et al. (2019) Boquien, M., Burgarella, D., Roehlly, Y., et al. 2019, A&A, 622, A103, doi: 10.1051/0004-6361/201834156
Bowman et al. (2020) Bowman, W. P., Zeimann, G. R., Nagaraj, G., et al. 2020, ApJ, 899, 7, doi: 10.3847/1538-4357/ab9f3c
Breivik et al. (2022) Breivik, K., Connolly, A. J., Ford, K. E. S., et al. 2022, arXiv e-prints, arXiv:2208.02781. https://arxiv.org/abs/2208.02781
Brott et al. (2011) Brott, I., de Mink, S. E., Cantiello, M., et al. 2011, A&A, 530, A115, doi: 10.1051/0004-6361/201016113
Brown et al. (2019a) Brown, A., Nayyeri, H., Cooray, A., et al. 2019a, ApJ, 871, 87, doi: 10.3847/1538-4357/aaf73b
Brown et al. (2019b) Brown, M. J. I., Duncan, K. J., Landt, H., et al. 2019b, MNRAS, 489, 3351, doi: 10.1093/mnras/stz2324
Bruzual & Charlot (2003) Bruzual, G., & Charlot, S. 2003, MNRAS, 344, 1000, doi: 10.1046/j.1365-8711.2003.06897.x
Buchner (2021) Buchner, J. 2021, arXiv e-prints, arXiv:2101.09675. https://arxiv.org/abs/2101.09675
Calzetti et al. (2000) Calzetti, D., Armus, L., Bohlin, R. C., et al. 2000, ApJ, 533, 682, doi: 10.1086/308692
Cameron & Pettitt (2014) Cameron, E., & Pettitt, A. 2014, Statistical Science, 29, 397 , doi: 10.1214/13-STS465
Cao et al. (2018) Cao, Y., Gong, Y., Meng, X.-M., et al. 2018, MNRAS, 480, 2178, doi: 10.1093/mnras/sty1980
Cappellari et al. (2012) Cappellari, M., McDermid, R. M., Alatalo, K., et al. 2012, Nature, 484, 485, doi: 10.1038/nature10972
Caputi et al. (2015) Caputi, K. I., Ilbert, O., Laigle, C., et al. 2015, ApJ, 810, 73, doi: 10.1088/0004-637X/810/1/73
Carnall et al. (2019) Carnall, A. C., Leja, J., Johnson, B. D., et al. 2019, ApJ, 873, 44, doi: 10.3847/1538-4357/ab04a2
Carnall et al. (2018) Carnall, A. C., McLure, R. J., Dunlop, J. S., & Davé, R. 2018, MNRAS, 480, 4379, doi: 10.1093/mnras/sty2169
Chabrier (2003) Chabrier, G. 2003, PASP, 115, 763, doi: 10.1086/376392
Charlot & Longhetti (2001) Charlot, S., & Longhetti, M. 2001, MNRAS, 323, 887, doi: 10.1046/j.1365-8711.2001.04260.x
Chevallard & Charlot (2016) Chevallard, J., & Charlot, S. 2016, MNRAS, 462, 1415, doi: 10.1093/mnras/stw1756
Choi et al. (2019) Choi, J., Conroy, C., & Johnson, B. D. 2019, ApJ, 872, 136, doi: 10.3847/1538-4357/aaff67
Ciesla et al. (2017) Ciesla, L., Elbaz, D., & Fensch, J. 2017, A&A, 608, A41, doi: 10.1051/0004-6361/201731036
Ciesla et al. (2016) Ciesla, L., Boselli, A., Elbaz, D., et al. 2016, A&A, 585, A43, doi: 10.1051/0004-6361/201527107
Coelho (2009) Coelho, P. 2009, in American Institute of Physics Conference Series, Vol. 1111, American Institute of Physics Conference Series, ed. G. R. M. L. L. A. A. N. M. . E. B. G. Giobbi, A. Tornambe, 67–74, doi: 10.1063/1.3141624
Coelho et al. (2020) Coelho, P. R. T., Bruzual, G., & Charlot, S. 2020, MNRAS, 491, 2025, doi: 10.1093/mnras/stz3023
Collette et al. (2023) Collette, A., Kluyver, T., Caswell, T. A., et al. 2023, h5py/h5py: 3.8.0-aarch64-wheels, 3.8.0-aarch64-wheels, Zenodo, doi: 10.5281/zenodo.7568214
Conroy (2013) Conroy, C. 2013, ARA&A, 51, 393, doi: 10.1146/annurev-astro-082812-141017
Conroy & Gunn (2010) Conroy, C., & Gunn, J. E. 2010, ApJ, 712, 833, doi: 10.1088/0004-637X/712/2/833
Conroy et al. (2009) Conroy, C., Gunn, J. E., & White, M. 2009, ApJ, 699, 486, doi: 10.1088/0004-637X/699/1/486
Conroy et al. (2010) Conroy, C., White, M., & Gunn, J. E. 2010, ApJ, 708, 58, doi: 10.1088/0004-637X/708/1/58
Côté et al. (2016) Côté, B., Ritter, C., O’Shea, B. W., et al. 2016, ApJ, 824, 82, doi: 10.3847/0004-637X/824/2/82
da Cunha et al. (2008) da Cunha, E., Charlot, S., & Elbaz, D. 2008, MNRAS, 388, 1595, doi: 10.1111/j.1365-2966.2008.13535.x
Dahlen et al. (2013) Dahlen, T., Mobasher, B., Faber, S. M., et al. 2013, ApJ, 775, 93, doi: 10.1088/0004-637X/775/2/93
Davidzon et al. (2017) Davidzon, I., Ilbert, O., Laigle, C., et al. 2017, A&A, 605, A70, doi: 10.1051/0004-6361/201730419
Davidzon et al. (2019) Davidzon, I., Laigle, C., Capak, P. L., et al. 2019, MNRAS, 489, 4817, doi: 10.1093/mnras/stz2486
Debsarma et al. (2016) Debsarma, S., Chattopadhyay, T., Das, S., & Pfenniger, D. 2016, MNRAS, doi: 10.1093/mnras/stw1909
Diemer et al. (2017) Diemer, B., Sparre, M., Abramson, L. E., & Torrey, P. 2017, ApJ, 839, 26, doi: 10.3847/1538-4357/aa68e5
Doore et al. (2023) Doore, K., Monson, E. B., Eufrasio, R. T., et al. 2023, ApJS, 266, 39, doi: 10.3847/1538-4365/accc29
Dore et al. (2019) Dore, O., Hirata, C., Wang, Y., et al. 2019, BAAS, 51, 341. https://arxiv.org/abs/1904.01174
Draine (2010) Draine, B. 2010, Physics of the Interstellar and Intergalactic Medium, Princeton Series in Astrophysics (Princeton University Press). https://books.google.co.kr/books?id=FycJvKHyiwsC
Draine (2003) Draine, B. T. 2003, ARA&A, 41, 241, doi: 10.1146/annurev.astro.41.011802.094840
Dries et al. (2016) Dries, M., Trager, S. C., & Koopmans, L. V. E. 2016, MNRAS, 463, 886, doi: 10.1093/mnras/stw2049
Dries et al. (2018) Dries, M., Trager, S. C., Koopmans, L. V. E., Popping, G., & Somerville, R. S. 2018, MNRAS, 474, 3500, doi: 10.1093/mnras/stx2979
Driver et al. (2013) Driver, S. P., Robotham, A. S. G., Bland-Hawthorn, J., et al. 2013, MNRAS, 430, 2622, doi: 10.1093/mnras/sts717
Dubois et al. (2014) Dubois, Y., Pichon, C., Welker, C., et al. 2014, MNRAS, 444, 1453, doi: 10.1093/mnras/stu1227
Eldridge & Stanway (2009) Eldridge, J. J., & Stanway, E. R. 2009, MNRAS, 400, 1019, doi: 10.1111/j.1365-2966.2009.15514.x
Feroz & Hobson (2008) Feroz, F., & Hobson, M. P. 2008, MNRAS, 384, 449, doi: 10.1111/j.1365-2966.2007.12353.x
Feroz et al. (2009) Feroz, F., Hobson, M. P., & Bridges, M. 2009, MNRAS, 398, 1601, doi: 10.1111/j.1365-2966.2009.14548.x
Feroz et al. (2019) Feroz, F., Hobson, M. P., Cameron, E., & Pettitt, A. N. 2019, The Open Journal of Astrophysics, 2, 10, doi: 10.21105/astro.1306.2144
Ferreras et al. (2013) Ferreras, I., La Barbera, F., de la Rosa, I. G., et al. 2013, MNRAS, 429, L15, doi: 10.1093/mnrasl/sls014
Fitzpatrick (1986) Fitzpatrick, E. L. 1986, AJ, 92, 1068, doi: 10.1086/114237
Galliano et al. (2018) Galliano, F., Galametz, M., & Jones, A. P. 2018, ARA&A, 56, 673, doi: 10.1146/annurev-astro-081817-051900
Gardner et al. (2006) Gardner, J. P., Mather, J. C., Clampin, M., et al. 2006, Space Sci. Rev., 123, 485, doi: 10.1007/s11214-006-8315-7
Gennaro et al. (2018) Gennaro, M., Tchernyshyov, K., Brown, T. M., et al. 2018, ApJ, 855, 20, doi: 10.3847/1538-4357/aaa973
Gilda et al. (2021) Gilda, S., Lower, S., & Narayanan, D. 2021, ApJ, 916, 43, doi: 10.3847/1538-4357/ac0058
Gong et al. (2019) Gong, Y., Liu, X., Cao, Y., et al. 2019, ApJ, 883, 203, doi: 10.3847/1538-4357/ab391e
Green et al. (2012) Green, J., Schechter, P., Baltay, C., et al. 2012, arXiv e-prints, arXiv:1208.4012. https://arxiv.org/abs/1208.4012
Hahn & Melchior (2022) Hahn, C., & Melchior, P. 2022, ApJ, 938, 11, doi: 10.3847/1538-4357/ac7b84
Hahn et al. (2022) Hahn, C., Kwon, K. J., Tojeiro, R., et al. 2022, arXiv e-prints, arXiv:2202.01809. https://arxiv.org/abs/2202.01809
Han & Han (2012) Han, Y., & Han, Z. 2012, ApJ, 749, 123, doi: 10.1088/0004-637X/749/2/123
Han & Han (2014) —. 2014, ApJS, 215, 2, doi: 10.1088/0067-0049/215/1/2
Han & Han (2019) —. 2019, ApJS, 240, 3, doi: 10.3847/1538-4365/aaeffa
Han et al. (2019) Han, Y., Han, Z., & Fan, L. 2019, in The Art of Measuring Galaxy Physical Properties, 5, doi: 10.5281/zenodo.3553688
Han et al. (2020) Han, Y., Han, Z., & Fan, L. 2020, in IAU Symposium, Vol. 341, IAU Symposium, ed. M. Boquien, E. Lusso, C. Gruppioni, & P. Tissera, 143–146, doi: 10.1017/S1743921319002746
Han et al. (2007) Han, Z., Podsiadlowski, P., & Lynas-Gray, A. E. 2007, MNRAS, 380, 1098, doi: 10.1111/j.1365-2966.2007.12151.x
Hernández-Pérez & Bruzual (2013) Hernández-Pérez, F., & Bruzual, G. 2013, MNRAS, 431, 2612, doi: 10.1093/mnras/stt368
Hickox & Alexander (2018) Hickox, R. C., & Alexander, D. M. 2018, ARA&A, 56, 625, doi: 10.1146/annurev-astro-081817-051803
Higson et al. (2019) Higson, E., Handley, W., Hobson, M., & Lasenby, A. 2019, Statistics and Computing, 29, 891, doi: 10.1007/s11222-018-9844-0
Hogg & Foreman-Mackey (2018) Hogg, D. W., & Foreman-Mackey, D. 2018, ApJS, 236, 11, doi: 10.3847/1538-4365/aab76e
Hopkins et al. (2014) Hopkins, P. F., Kereš, D., Oñorbe, J., et al. 2014, MNRAS, 445, 581, doi: 10.1093/mnras/stu1738
Hoversten & Glazebrook (2008) Hoversten, E. A., & Glazebrook, K. 2008, ApJ, 675, 163, doi: 10.1086/524095
Hunter (2007) Hunter, J. D. 2007, Computing in Science & Engineering, 9, 90, doi: 10.1109/MCSE.2007.55
Ilbert et al. (2009) Ilbert, O., Capak, P., Salvato, M., et al. 2009, ApJ, 690, 1236, doi: 10.1088/0004-637X/690/2/1236
Ivezić et al. (2019) Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111, doi: 10.3847/1538-4357/ab042c
Iyer & Gawiser (2017) Iyer, K., & Gawiser, E. 2017, ApJ, 838, 127, doi: 10.3847/1538-4357/aa63f0
Iyer et al. (2019) Iyer, K. G., Gawiser, E., Faber, S. M., et al. 2019, ApJ, 879, 116, doi: 10.3847/1538-4357/ab2052
Iyer et al. (2020) Iyer, K. G., Tacchella, S., Genel, S., et al. 2020, MNRAS, 498, 430, doi: 10.1093/mnras/staa2150
Joachimi (2016) Joachimi, B. 2016, in Astronomical Society of the Pacific Conference Series, Vol. 507, Multi-Object Spectroscopy in the Next Decade: Big Questions, Large Surveys, and Wide Fields, ed. I. Skillen, M. Balcells, & S. Trager, 401
Kaviraj et al. (2017) Kaviraj, S., Laigle, C., Kimm, T., et al. 2017, MNRAS, 467, 4739, doi: 10.1093/mnras/stx126
Kewley et al. (2019) Kewley, L. J., Nicholls, D. C., & Sutherland, R. S. 2019, ARA&A, 57, 511, doi: 10.1146/annurev-astro-081817-051832
Knowles et al. (2021) Knowles, A. T., Sansom, A. E., Allende Prieto, C., & Vazdekis, A. 2021, MNRAS, 504, 2286, doi: 10.1093/mnras/stab1001
Knowles et al. (2019) Knowles, A. T., Sansom, A. E., Coelho, P. R. T., et al. 2019, MNRAS, 486, 1814, doi: 10.1093/mnras/stz754
Kriek & Conroy (2013) Kriek, M., & Conroy, C. 2013, ApJ, 775, L16, doi: 10.1088/2041-8205/775/1/L16
Laigle et al. (2016) Laigle, C., McCracken, H. J., Ilbert, O., et al. 2016, ApJS, 224, 24, doi: 10.3847/0067-0049/224/2/24
Laigle et al. (2019) Laigle, C., Davidzon, I., Ilbert, O., et al. 2019, MNRAS, 486, 5104, doi: 10.1093/mnras/stz1054
Laureijs et al. (2011) Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints. https://arxiv.org/abs/1110.3193
Lawler & Acquaviva (2021) Lawler, A. J., & Acquaviva, V. 2021, MNRAS, 502, 3993, doi: 10.1093/mnras/stab138
Lee et al. (2010) Lee, S.-K., Ferguson, H. C., Somerville, R. S., Wiklind, T., & Giavalisco, M. 2010, ApJ, 725, 1644, doi: 10.1088/0004-637X/725/2/1644
Lee et al. (2009) Lee, S.-K., Idzi, R., Ferguson, H. C., et al. 2009, ApJS, 184, 100, doi: 10.1088/0067-0049/184/1/100
Leja et al. (2019) Leja, J., Carnall, A. C., Johnson, B. D., Conroy, C., & Speagle, J. S. 2019, ApJ, 876, 3, doi: 10.3847/1538-4357/ab133c
Leja et al. (2017) Leja, J., Johnson, B. D., Conroy, C., van Dokkum, P. G., & Byler, N. 2017, ApJ, 837, 170, doi: 10.3847/1538-4357/aa5ffe
Leja et al. (2020) Leja, J., Speagle, J. S., Johnson, B. D., et al. 2020, ApJ, 893, 111, doi: 10.3847/1538-4357/ab7e27
Lower et al. (2020) Lower, S., Narayanan, D., Leja, J., et al. 2020, ApJ, 904, 33, doi: 10.3847/1538-4357/abbfa7
Lower et al. (2022) —. 2022, ApJ, 931, 14, doi: 10.3847/1538-4357/ac6959
Lyu & Rieke (2022) Lyu, J., & Rieke, G. 2022, Universe, 8, 304, doi: 10.3390/universe8060304
Ma et al. (2016) Ma, X., Hopkins, P. F., Faucher-Giguère, C.-A., et al. 2016, MNRAS, 456, 2140, doi: 10.1093/mnras/stv2659
Madau (1995) Madau, P. 1995, ApJ, 441, 18, doi: 10.1086/175332
Maiolino & Mannucci (2019) Maiolino, R., & Mannucci, F. 2019, A&A Rev., 27, 3, doi: 10.1007/s00159-018-0112-2
Maraston (2005) Maraston, C. 2005, MNRAS, 362, 799, doi: 10.1111/j.1365-2966.2005.09270.x
Maraston et al. (2006) Maraston, C., Daddi, E., Renzini, A., et al. 2006, ApJ, 652, 85, doi: 10.1086/508143
Marigo et al. (2008) Marigo, P., Girardi, L., Bressan, A., et al. 2008, A&A, 482, 883, doi: 10.1051/0004-6361:20078467
Morishita (2022) Morishita, T. 2022, gsf: Grism SED Fitting package, Astrophysics Source Code Library, record ascl:2211.012. http://ascl.net/2211.012
Narayanan et al. (2018) Narayanan, D., Conroy, C., Davé, R., Johnson, B. D., & Popping, G. 2018, ApJ, 869, 70, doi: 10.3847/1538-4357/aaed25
National Academies of Sciences, Engineering, and Medicine (2021) National Academies of Sciences, Engineering, and Medicine. 2021, Pathways to Discovery in Astronomy and Astrophysics for the 2020s (The National Academies Press), doi: 10.17226/26141
Netzer (2015) Netzer, H. 2015, ARA&A, 53, 365, doi: 10.1146/annurev-astro-082214-122302
Noll et al. (2009) Noll, S., Burgarella, D., Giovannoli, E., et al. 2009, A&A, 507, 1793, doi: 10.1051/0004-6361/200912497
Oke (1974) Oke, J. B. 1974, ApJS, 27, 21, doi: 10.1086/190287
Pacifici et al. (2012) Pacifici, C., Charlot, S., Blaizot, J., & Brinchmann, J. 2012, MNRAS, 421, 2002, doi: 10.1111/j.1365-2966.2012.20431.x
Padoan et al. (1997) Padoan, P., Nordlund, A., & Jones, B. J. T. 1997, MNRAS, 288, 145
Pforr et al. (2012) Pforr, J., Maraston, C., & Tonini, C. 2012, MNRAS, 422, 3285, doi: 10.1111/j.1365-2966.2012.20848.x
Pforr et al. (2013) —. 2013, MNRAS, 435, 1389, doi: 10.1093/mnras/stt1382
Qiu & Kang (2022) Qiu, Y., & Kang, X. 2022, ApJ, 930, 66, doi: 10.3847/1538-4357/ac63a1
Reddy et al. (2012) Reddy, N. A., Pettini, M., Steidel, C. C., et al. 2012, ApJ, 754, 25, doi: 10.1088/0004-637X/754/1/25
Reddy et al. (2015) Reddy, N. A., Kriek, M., Shapley, A. E., et al. 2015, ApJ, 806, 259, doi: 10.1088/0004-637X/806/2/259
Rieke et al. (2005) Rieke, M. J., Kelly, D., & Horner, S. 2005, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 5904, Cryogenic Optical Systems and Instruments XI, ed. J. B. Heaney & L. G. Burriesci, 1–8, doi: 10.1117/12.615554
Robertson (2022) Robertson, B. E. 2022, ARA&A, 60, 121, doi: 10.1146/annurev-astro-120221-044656
Robotham et al. (2020) Robotham, A. S. G., Bellstedt, S., Lagos, C. d. P., et al. 2020, MNRAS, 495, 905, doi: 10.1093/mnras/staa1116
Salim & Boquien (2019) Salim, S., & Boquien, M. 2019, ApJ, 872, 23, doi: 10.3847/1538-4357/aaf88a
Salim et al. (2018) Salim, S., Boquien, M., & Lee, J. C. 2018, ApJ, 859, 11, doi: 10.3847/1538-4357/aabf3c
Salim & Narayanan (2020) Salim, S., & Narayanan, D. 2020, ARA&A, 58, 529, doi: 10.1146/annurev-astro-032620-021933
Salmon et al. (2016) Salmon, B., Papovich, C., Long, J., et al. 2016, ApJ, 827, 20, doi: 10.3847/0004-637X/827/1/20
Salvato et al. (2018) Salvato, M., Ilbert, O., & Hoyle, B. 2018, Nature Astronomy, doi: 10.1038/s41550-018-0478-0
Schmidt (1959) Schmidt, M. 1959, ApJ, 129, 243, doi: 10.1086/146614
Seon & Draine (2016) Seon, K.-I., & Draine, B. T. 2016, ApJ, 833, 201, doi: 10.3847/1538-4357/833/2/201
Sharma (2017) Sharma, S. 2017, ARA&A, 55, 213, doi: 10.1146/annurev-astro-082214-122339
Shivaei et al. (2020) Shivaei, I., Reddy, N., Rieke, G., et al. 2020, ApJ, 899, 117, doi: 10.3847/1538-4357/aba35e
Skilling (2004) Skilling, J. 2004, in American Institute of Physics Conference Series, Vol. 735, American Institute of Physics Conference Series, ed. . U. V. T. R. Fischer, R. Preuss, 395–405, doi: 10.1063/1.1835238
Skilling (2006) Skilling, J. 2006, Bayesian Analysis, 1, 833
Speagle (2020) Speagle, J. S. 2020, MNRAS, 493, 3132, doi: 10.1093/mnras/staa278
Spergel et al. (2003) Spergel, D. N., Verde, L., Peiris, H. V., et al. 2003, ApJS, 148, 175, doi: 10.1086/377226
Suess et al. (2022) Suess, K. A., Leja, J., Johnson, B. D., et al. 2022, ApJ, 935, 146, doi: 10.3847/1538-4357/ac82b0
Tacconi et al. (2020) Tacconi, L. J., Genzel, R., & Sternberg, A. 2020, ARA&A, 58, 157, doi: 10.1146/annurev-astro-082812-141034
Tanaka (2015) Tanaka, M. 2015, ApJ, 801, 20, doi: 10.1088/0004-637X/801/1/20
Thomas & Maraston (2003) Thomas, D., & Maraston, C. 2003, A&A, 401, 429, doi: 10.1051/0004-6361:20030153
Thorne et al. (2021) Thorne, J. E., Robotham, A. S. G., Davies, L. J. M., et al. 2021, MNRAS, 505, 540, doi: 10.1093/mnras/stab1294
Tinsley (1978) Tinsley, B. M. 1978, ApJ, 222, 14, doi: 10.1086/156116
Tinsley (1980) —. 1980, Fund. Cosmic Phys., 5, 287, doi: 10.48550/arXiv.2203.02041
Tinsley & Gunn (1976) Tinsley, B. M., & Gunn, J. E. 1976, ApJ, 203, 52, doi: 10.1086/154046
Valentini et al. (2019) Valentini, M., Borgani, S., Bressan, A., et al. 2019, MNRAS, 485, 1384, doi: 10.1093/mnras/stz492
van der Walt et al. (2011) van der Walt, S., Colbert, S. C., & Varoquaux, G. 2011, Computing in Science & Engineering, 13, 22, doi: 10.1109/MCSE.2011.37
van Dokkum (2008) van Dokkum, P. G. 2008, ApJ, 674, 29, doi: 10.1086/525014
Volonteri et al. (2016) Volonteri, M., Dubois, Y., Pichon, C., & Devriendt, J. 2016, MNRAS, 460, 2979, doi: 10.1093/mnras/stw1123
Walcher et al. (2011) Walcher, J., Groves, B., Budavári, T., & Dale, D. 2011, Ap&SS, 331, 1, doi: 10.1007/s10509-010-0458-z
Wang & Lilly (2020) Wang, E., & Lilly, S. J. 2020, ApJ, 892, 87, doi: 10.3847/1538-4357/ab7b7d
Weingartner & Draine (2001) Weingartner, J. C., & Draine, B. T. 2001, ApJ, 548, 296, doi: 10.1086/318651
Witt & Gordon (2000) Witt, A. N., & Gordon, K. D. 2000, ApJ, 528, 799, doi: 10.1086/308197
Yallup et al. (2022) Yallup, D., Janßen, T., Schumann, S., & Handley, W. 2022, European Physical Journal C, 82, 678, doi: 10.1140/epjc/s10052-022-10632-2
Yan et al. (2019) Yan, R., Chen, Y., Lazarz, D., et al. 2019, ApJ, 883, 175, doi: 10.3847/1538-4357/ab3ebc
Zhan (2011) Zhan, H. 2011, Scientia Sinica Physica, Mechanica & Astronomica, 41, 1441, doi: 10.1360/132011-961
Zhan (2018) Zhan, H. 2018, in 42nd COSPAR Scientific Assembly, Vol. 42, E1.16–4–18
Zhan (2021) —. 2021, Chinese Science Bulletin, 66, 1290
Zhang et al. (2005) Zhang, F., Han, Z., Li, L., & Hurley, J. R. 2005, MNRAS, 357, 1088, doi: 10.1111/j.1365-2966.2005.08739.x
Zhou et al. (2022a) Zhou, X., Gong, Y., Meng, X.-M., et al. 2022a, arXiv e-prints, arXiv:2206.13696. https://arxiv.org/abs/2206.13696
Zhou et al. (2022b) —. 2022b, MNRAS, 512, 4593, doi: 10.1093/mnras/stac786

	$\displaystyle p(\bm{d}_{1},\bm{d}_{2},\dots,\bm{d}_{N}\|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})=$
	$\displaystyle\prod\limits_{g=1}^{N}\int p(\bm{d}_{g}\|{\bm{\theta}_{g}},{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I})$
	$\displaystyle p({\bm{\theta}_{g}}\|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I}){\mathrm{d}\bm{\theta}_{g}}$
	$\displaystyle=\prod\limits_{g=1}^{N}p(\bm{d}_{g}\|{{\bm{M}}({ssp}_{0},{sfh}_{0},{dal}_{0})},\bm{I}).$		(30)