This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Constrained NILC method for CMB B mode observations

Zirui Zhang    Yang Liu111Corresponding author    Si-Yu Li    Haifeng Li    Hong Li1
Abstract

The Internal Linear Combination (ILC) method is commonly employed to extract the cosmic microwave background (CMB) signal from multi-frequency observation maps. However, the performance of the ILC method tends to degrade when the signal-to-noise ratio (SNR) is relatively low, particularly when measuring the primordial BB-modes to detect the primordial gravitational waves. To address this issue, an enhanced version of the ILC method, known as constrained ILC, is proposed. This method is designed to be more suitable for situations with low signal-to-noise ratio (SNR) by incorporating additional prior foreground information. In our study, we have modified the constraint Needlet ILC method and successfully improved its performance at low SNR. We illustrate our methods using mock data generated from the combination of WMAP, Planck and a ground-based experiment in the northern hemisphere, and the chosen noise level for the ground-based experiment are very conservative which can be easily achieved in the very near future. The results show that the level of foreground residual can be well controlled. In comparison to the standard NILC method, which introduces a bias to the tensor-to-scalar ratio (rr) of approximately 0.050.05, the constrained NILC method exhibits a significantly reduced bias of only around 5×1035\times 10^{-3} towards rr which is much smaller than the statistical error.

1 Introduction

The cosmic microwave background (CMB) radiation, originating approximately 380,000 years after the Big Bang, represents the oldest light in the universe and carries a wealth of cosmological information. Over the past decades, significant advancements in CMB observations have made it the most accurate cosmic probe, playing a pivotal role in establishing the standard cosmological model known as Λ\LambdaCDM, and ushering in a new era of precise cosmology.

To address certain challenges encountered within the Λ\LambdaCDM model, such as the horizon, flatness, and magnetic monopole problems, the inflation theory was proposed[1, 2]. In addition to resolving these issues, inflation theory also predicted the existence of primordial gravitational waves (PGWs). The detection of PGWs stands as a crucial frontier in cosmology, with the precise measurement of the CMB’s primordial BB-mode polarization signal serving as the most promising approach. The strength of this signal is directly proportional to the amplitude of PGWs, which can be quantified by the tensor-to-scalar ratio rr. Various experimental groups, such as BICEP/Keck series [3, 4, 5], Simons Observatory[6], CMB-S4[7], LiteBIRD [8] etc., are actively involved in the pursuit of detecting and characterizing PGWs.

Currently, the strongest constraint on rr was given from the joint analysis of BICEP2/Keck Array, Planck and WMAP as r0.05<0.036r_{0.05}<0.036 at 95%95\% confidence level[9]. However, detecting and characterizing the primordial BB-mode signal is a formidable challenge due to its inherent weakness compared to other sky signals. One of the foremost obstacles in primordial BB-mode measurements is the presence of foreground contaminants. Astrophysical sources, such as Galactic dust emissions and synchrotron radiation, emit polarized signals that can significantly obscure the desired primordial BB-mode polarization signal. Component separation between the cosmological signal and foreground emissions is a complex task that necessitates the implementation of sophisticated foreground cleaning techniques.

In CMB analysis, there are various methods employed for component separation, broadly categorized into three groups: blind methods, parametric methods, and template removal methods. Blind methods, such as ILC (Internal Linear Combination)[10, 11, 12, 13], FastICA (Fast Independent Component Analysis)[14], and CCA (Correlated Component Analysis)[15], make no assumptions about the foreground radiation. Parametric methods, like Commander[16] and ForeGroundBuster[17], rely on assumptions about the components’ models, such as their respective spectral laws, and fit the model parameters using the observation data. Template removal methods, such as SEVEM (Spectral Estimation Via Expectation Maximisation)[18, 19, 20], constructing the foreground templates from the pairs of maps differences from the low and high frequency channels and obtaining the cleaned CMB maps by finding the coefficients to minimize the variance of map, SMICA (Spectral Matching Independent Component Analysis)[21, 22], modelling the foregrounds as templates with spectral indices, power spectra and correlation between components. In most cases, these methods focus on quasi-fullsky. When it comes to small partial sky which is the common case for ground based experiment, extracting the right cleaned BB-mode signal becomes more and more challenging because the resulting BB map (or power spectrum) can be strongly contaminated by EE to BB leakage [23, 24].

In our previous work[25], we have shown that a Needlet space ILC working on the EBEB leakage corrected BB map has a good performance. For a future ground-based experiment with a deep survey like CMB-S4[7], the foreground residual can be well controlled which biases the tensor to scalar ratio (rr) at the order of 10310^{-3}. However, the inherent limitation of the ILC algorithm itself is that it does not differentiate between instrument noise and foreground signals. As a result, in situations with low signal-to-noise ratio (SNR), the ILC methods tend to produce high foreground residual. So for BB-mode analysis, the foreground residual finally bias the constraint of rr. To improve the performance of ILC, i.e., reduce the foreground residual, additional information need to be involved to help ILC method more effective in distinguishing the foreground. One efficient approach is to incorporate an emission model of the foreground and enforce orthogonality of the ILC weights with respect to the foreground intensity. This can help in accurately distinguishing between the two components and improving the overall performance of the ILC method. That is the so called constrained ILC method, which was first proposed by the reference [26], and this method has also been widely employed to study CMB BB-mode physics[27, 28].

In this paper, we present an enhanced version of the NILC methodology for BB-mode analysis. We utilize end-to-end simulations to validate the effectiveness of the improved method, demonstrating our approach with mock data and noise levels derived from WMAP[29], Planck[30], and a future ground-based experiment in the northern hemisphere. These noise levels simulate the low SNR challenges encountered in actual observations in the near future. Our results show that the level of foreground residual can be well controlled with constrained NILC. Comparing the performance of our constrained NILC method with the standard NILC method, we find a remarkable reduction in bias on rr. Specifically, the bias is reduced from 0.060.06 to 0.0070.007, which is much smaller than the statistical error.

The paper is organized as follows. Section 2 describes origin of the bias in ILC and introduces our proposed constrained NILC method. Section 3 outlines the methodology employed for generating the mock data with given instrumental configuration. Section 4 details how we implement our component separation method on the mock data to get the clean BB map and the subsequent constraint on rr. Section 5 presents the results on map level, angular power spectral level as well as the rr constraint; section 6 gives the conclusion.

2 Motivation and Methodology

2.1 The ILC estimator and its bias

The objective of the ILC method is to accurately isolate the specific emission of interest, such as CMB, from a set of multi-frequency maps. This technique assumes that the emission of interest remains consistent across different frequencies, allowing for the construction of a single cleaned map that predominantly represents the desired signal.

Suppose that all the frequency maps are calibrated to CMB units and the same resolution, the observation can be modeled as:

𝐲(p)=𝟏s(p)+𝐟(p)+𝐧(p).\displaystyle\mathbf{y}(p)=\mathbf{1}s(p)+\mathbf{f}(p)+\mathbf{n}(p). (2.1)

Here, 𝐲(p)\mathbf{y}(p), 𝟏\mathbf{1} and 𝐧(p)\mathbf{n}(p) are nνn_{\nu} dimensional vector, where nνn_{\nu} denotes the number of frequency channels, pp can be any domain like pixel, harmonic, needlet, etc. The terms 𝐟\mathbf{f} and 𝐧\mathbf{n} describe the contribution from galactic diffuse foreground contamination and instrumental noise respectively. To minimize the impact of the “junk” term 𝐟\mathbf{f}+𝐧\mathbf{n}, the ILC provides an estimator s^ILC\hat{s}_{\rm ILC} of ss by forming the linear combination s^ILC(p)=𝐰T𝐲(p)\hat{s}_{\rm ILC}(p)=\mathbf{w}^{T}\mathbf{y}(p), which has minimum variance and also preserves the CMB signal with 𝐰T𝟏=1\mathbf{w}^{T}\mathbf{1}=1. Such the weights are given by [10]

𝐰=𝐂^1𝟏𝟏T𝐂^1𝟏,\displaystyle\mathbf{w}=\frac{\hat{\mathbf{C}}^{-1}\mathbf{1}}{\mathbf{1}^{T}\hat{\mathbf{C}}^{-1}\mathbf{1}}, (2.2)

where 𝐂^\hat{\mathbf{C}} is the empirical covariance matrix of the observations.

Suppose the number of frequency channel is larger than the combined number of foreground components and the CMB signal, and in an ideal scenario with no noise, the ILC estimator is unbiased. However, in the real world, bias always exists and the bias of ILC may come from different situations as follows:

  • The non-stationary of foreground and noise within the selected pp domain introduces a challenge as it leads to varying weights, which can introduce bias in the ILC method. To address this issue, various approaches have been explored to ensure the stationary of the field 𝐟\mathbf{f} and noise 𝐧\mathbf{n}. One approach is to carefully choose appropriate pp domains by 1) decomposing the sky maps into several regions[10], 2) harmonic domain[12], 3) needlet domain[31]. Additionally, it is beneficial to scan the sky as uniformly as possible during observation.

  • The residual of “junk” term can be defined as r=𝟏T𝐂^1(𝐟+𝐧)/(𝟏T𝐂^1𝟏)r=\mathbf{1}^{T}\hat{\mathbf{C}}^{-1}(\mathbf{f}+\mathbf{n})/(\mathbf{1}^{T}\hat{\mathbf{C}}^{-1}\mathbf{1}). When 𝐟\mathbf{f} and 𝐧\mathbf{n} are random field with the zero means, s^ILC\hat{s}_{\rm ILC} is unbiased, but when there exits non-zero higher order terms of auto-term and cross-term of 𝐫\mathbf{r}, for example r2+2sr\left<r^{2}\right>+2\left<sr\right> will introduce bias[13]:

    a) The auto-term bias can the written as following with the assumption that there is no correlation between foreground and noise:

    r2=𝟏T𝐂^1(𝐟𝐟T+𝐧𝐧T)𝐂^1𝟏(𝟏T𝐂^1𝟏)2,\displaystyle\left<r^{2}\right>=\frac{\mathbf{1}^{T}\hat{\mathbf{C}}^{-1}(\left<\mathbf{f}\mathbf{f}^{T}\right>+\left<\mathbf{n}\mathbf{n}^{T}\right>)\hat{\mathbf{C}}^{-1}\mathbf{1}}{(\mathbf{1}^{T}\hat{\mathbf{C}}^{-1}\mathbf{1})^{2}}, (2.3)

    both the bias from foreground and noise increase with the noise level. However, we can easily remove noise bias by employing noise-only simulations with the knowledge of statistical property of noise which is described in Section 4.2. Unfortunately, the same approach cannot be applied to the foreground. For low SNR BB-mode physics, such bias is typically non-negligible. It is possible to obtain an unbiased ILC estimator by replacing the 𝐂\mathbf{C} with 𝐂𝐍\mathbf{C}-\mathbf{N}, where 𝐍\mathbf{N} is the noise covariance, but this will leads to a substantial increase in the variance of CMB power[32].

    b) The cross-term bias mainly arise from the difference between the true underlying covariance matrix 𝐂\mathbf{C} and the estimated empirical covariance matrix 𝐂^\hat{\mathbf{C}}. The bias values approximated amounts to 2(1nν)σs2/Np2(1-n_{\nu})\sigma^{2}_{s}/N_{p}, which means that the ILC estimator underestimates the CMB power by roughly 2(1nν)C/Np2(1-n_{\nu})C_{\ell}/N_{p}, more details refer to [13].

In this paper, we focus on reducing the ILC bias with the case of low SNR. To minimize this bias, more additional information are needed to distinguish the Galactic foreground signal from the instrumental noise. In other words, extra constraints, which depend on the additional information, can be imposed on the weights.

2.2 Constrained Needlet ILC

Suppose the emission law of all the Galactic foreground and the CMB signal are known, thus the observation can be modeled as

𝐲(p)=𝐀𝐬(p)+𝐧(p).\displaystyle\mathbf{y}(p)=\mathbf{As}(p)+\mathbf{n}(p). (2.4)

Here we also assume that all maps have been re-smoothed to the same resolution. The mixing matrix 𝐀\mathbf{A} are known from the prior. 𝐬(p)\mathbf{s}(p) is a column vector with NcompN_{\rm comp} elements denote the templates for all the components (CMB + foreground). And thus the term 𝐧(p)\mathbf{n}(p) includes the instrumental noise only. The goal of constrained ILC is to reduce the impact of residual foreground signal while preserving the CMB signal. Thus the constraint becomes 𝐀T𝐰=𝐞\mathbf{A}^{T}\mathbf{w}=\mathbf{e} with eCMB=1e_{\rm CMB}=1, enonCMB=0e_{\rm non-CMB}=0. The weights for minimizing the variance of the resulting map with the constraints can be solved using Lagrange multipliers[11]. The solution is

𝐰=𝐂^1𝐀(𝐀T𝐂^1𝐀)1𝐞.\displaystyle\mathbf{w}=\hat{\mathbf{C}}^{-1}\mathbf{A}(\mathbf{A}^{T}\hat{\mathbf{C}}^{-1}\mathbf{A})^{-1}\mathbf{e}. (2.5)

The foreground bias in this issue will arise bias of 𝐀\mathbf{A} in two main aspects: misunderstanding of the number of foreground components and the bias associated with the individual elements of 𝐀\mathbf{A}.

From our previous work, we have shown that the NILC on BB maps is very promising on rr constraint, so here we choose to estimate the weights in needlet domain. The following will describe the constrained NILC in detail, the multi-frequency data is in the form of needlet coefficients which given by

𝜷jk=m𝐲mh(j)Ym(ξ^k(j))\displaystyle\bm{\beta}_{jk}=\sum_{\ell m}\mathbf{y}_{\ell m}h_{\ell}^{(j)}Y_{\ell m}\left(\hat{\xi}_{k}^{(j)}\right) (2.6)

where 𝐲m\mathbf{y}_{\ell m} is the spherical harmonic coefficients of the observational 𝐲(p)\mathbf{y}(p) which can be arbitrary scalar or pseudo-scalar map. ξ^k(j)\hat{\xi}^{(j)}_{k} is a set of grid points on sphere. In practice, they are exactly the center of the pixel kk in Healpix pixelization scheme with different NSIDE(j)\texttt{NSIDE}(j). h(j)h_{\ell}^{(j)} is the spectral windows satisfying j(h(j))2=1\sum_{j}(h_{\ell}^{(j)})^{2}=1. The NILC estimation on needlet space is

β^jkNILC=𝐰jkT𝜷jk.\displaystyle\hat{\beta}^{\rm NILC}_{jk}=\mathbf{w}^{T}_{jk}\bm{\beta}_{jk}. (2.7)

The final estimation s^NILC\hat{s}_{\rm NILC} in real space is

s^NILC(p)=jmβ^j,mNILCh(j)Ym(p)\displaystyle\hat{s}_{\rm NILC}(p)=\sum_{j}\sum_{\ell m}\hat{\beta}_{j,\ell m}^{\rm NILC}h_{\ell}^{(j)}Y_{\ell m}(p) (2.8)

The weights in (2.7) are calculated from the equation (2.5) with an empirical covariance [𝐂^]jk[\hat{\mathbf{C}}]_{jk} which are estimated by the formula

𝐂^jk=1N𝒟kωj(k,k)𝜷jk𝜷jkT.\displaystyle\hat{\mathbf{C}}_{jk}=\frac{1}{N_{\mathcal{D}}}\sum_{k^{\prime}}\omega_{j}(k,k^{\prime})\bm{\beta}_{jk^{\prime}}\bm{\beta}_{jk^{\prime}}^{T}. (2.9)

It means that the covariance matrix obtained by computing the mean of the needlet coefficients. In practice, the weights in (2.9) is a 2D Gaussian function. Due to the locality of the NILC method, it is convenient to extend equation (2.5) by using different mixing matrices 𝐀\mathbf{A} to compute weights at different positions. The estimation of 𝐀\mathbf{A} matrices from the multi-frequency maps will be shown in section 4.

3 Map Simulations

3.1 Sky model

We adopted a sky model that is as close as possible to the real observation. It contains CMB signal, diffuse foreground emission composed of synchrotron, thermal dust, free-free and spinning dust. The polarized foreground maps are simulated by using the Planck Sky Model[33] (PSM). Here are the details of the sky model:

  1. 1.

    CMB: the input CMB maps are Gaussian realizations from a particular power spectrum obtained from the Boltzmann code CLASS222http://lesgourg.github.io/class_public/class.html[34], using Planck 2018 best-fit cosmological parameters [35] with the tensor scalar ratio r=0r=0. The CMB dipole was not considered in the simulations.

    The effect of CMB distorted by weak gravitational lensing was also considered in our simulation. Weak lensing distorts a part of EE-mode signal to BB-mode and it is not a linear effect and therefore it is non-Gaussian. The precise way to simulate such lensing distortion effects is to rearrange the pixel on the map level. For this part, the package LensPix333https://cosmologist.info/lenspix/ is used to simulate it. the algorithm is presented in the reference[36], which distorts the primordial signal given a realization of lensing potential map from CϕϕC_{\ell}^{\phi\phi}.

  2. 2.

    Synchrotron: Galactic synchrotron radiation is produced by charged relativistic cosmic rays that are accelerated by the Galactic magnetic field. At frequencies below 80 GHz, synchrotron radiation is the primary polarized foreground. In our simulations, we did not account for the effects of Faraday rotation, and instead modeled the synchrotron emission across frequencies using template maps based on specific emission laws. To simplify matters, we modeled the frequency dependence of synchrotron radiation in the CMB frequency range as a power-law function (in Rayleigh-Jeans units):

    [Tsync(ν)Qsync(ν)Usync(ν)][TtemplateQtemplateUtemplate]×νβs.\displaystyle\begin{bmatrix}T_{\rm sync}(\nu)\\ Q_{\rm sync}(\nu)\\ U_{\rm sync}(\nu)\end{bmatrix}\propto\begin{bmatrix}T_{template}\\ Q_{template}\\ U_{template}\end{bmatrix}\times\nu^{\beta_{s}}. (3.1)

    The spectral index βs\beta_{s} typically has a value of 3-3. When generating temperature maps, we utilized the 408 MHz radio continuum full-sky map of Haslam et al. [37, 38] as our template. For polarization maps, we used the SMICA component separation of Planck PR3 (Planck Public Data Release 3) [39] polarization maps as templates, with the same synchrotron spectral index.

  3. 3.

    Thermal dust: At higher frequencies (above 80 GHz), the thermal emission from heated dust grains becomes the dominant foreground signal. In the PSM, we modeled the emission of thermal dust as modified black-bodies [40],

    IννβdBν(Td),\displaystyle I_{\nu}\propto\nu^{\beta_{\rm d}}B_{\nu}(T_{\rm d}), (3.2)

    where TdT_{\rm d} is the temperature of the dust grains, AdA_{\rm d} is the amplitude and βd\beta_{\rm d} is the spectral index. Bν(Td)B_{\nu}(T_{\rm d}) is the Planck black-body function at temperature TdT_{\rm d} at frequency ν\nu. For polarization, the Stokes QQ and UU parameters (in MJy/sr{\rm MJy}/{\rm sr} unit) are correlate with the intensity,

    Qν(𝐧^)\displaystyle Q_{\nu}(\hat{\mathbf{n}}) =f(𝐧^)Iνcos(2γd(𝐧^)),\displaystyle=f(\hat{\mathbf{n}})I_{\nu}\cos(2\gamma_{d}(\hat{\mathbf{n}})), Uν(𝐧^)\displaystyle U_{\nu}(\hat{\mathbf{n}}) =f(𝐧^)Iνsin(2γd(𝐧^)),\displaystyle=f(\hat{\mathbf{n}})I_{\nu}\sin(2\gamma_{d}(\hat{\mathbf{n}})), (3.3)

    where f(𝐧^)f(\hat{\mathbf{n}}) and γd(𝐧^)\gamma_{\rm d}(\hat{\mathbf{n}}) denote polarization fraction and polarization angle at different position 𝐧^\hat{\mathbf{n}}. We use the dust template in Planck PR3 at 353353 GHz in intensity and polarization as our template, which includes the template maps of the dust grains TdT_{\rm d}, the amplitude for all T,Q,UT,Q,U at 353 GHz, and the spectral indices.

  4. 4.

    free-free: Free-free emission is produced by electron-ion scattering in interstellar plasma and is generally fainter than synchrotron or thermal dust emission. It is modeled as a power-law with a spectral index βf\beta_{\rm f} of approximately 2.14-2.14 [41] and is intrinsically unpolarized. Free-free emission only contributes to temperature measurements, and we used the free-free emission map from the Commander method [16] in Planck 2015 data PR2 [41] as our template.

  5. 5.

    Spinning dust: Small rotating dust grains can produce microwave emission if they have a non-zero electric dipole moment, which can also be modeled as a power-law within the CMB observation frequency bands. We used the template spinning dust map from Commander in Planck PR2 as our input template. Spinning dust emission is weakly polarized, and in our simulation, the polarization fraction of spinning dust was set to 0.005.

3.2 Instrumental configuration

In this work, we consider a ground-based observational experiment that can only cover a partial sky area. The scanning strategy is optimized for detecting primordial gravitational waves on Northern hemisphere, in other words, the scanning strategy was designed to perform a deep scan of a small sky region. The hit-map corresponding to the scanning strategy is shown in Figure 1. The deep scanning sky patch is centered at RA 1010h0404m, Dec. 61.4961.49^{\circ}. The effective area of the sky-patch is about 7.5%7.5\% of the full-sky.

Refer to caption
(a)
Refer to caption
(b)
Figure 1: Orthogonal projection of the hit-map and sky patches in Galactic coordinate. (a) shows the hit-map of the ground-based observation used in this work. (b) shows the sky patch we used for further analysis. The background of it is the polarization intensity P=Q2+U2P=\sqrt{Q^{2}+U^{2}} map observed by Planck at 353353 GHz [42] with 11^{\circ} resolution.

Considering the atmospheric window, we have set up two observation frequency bands of 95 and 150 GHz for the ground-based experiments. In order to achieve better foreground removal results for ILC, we also simulated some space-based observed sky maps and taking them as the inputs. The maps including the K and V bands of WMAP, as well as all bands of Planck High Frequency Instrument (HFI). The beam and noise information of them was obtained from the LAMBDA website444https://lambda.gsfc.nasa.gov.

Table 1: Instrumental configuration
Frequency bands Beam size Polarization map-depth* Type
(GHz) (arcmin) (μKarcmin\mu{\rm K}\cdot{\rm arcmin})
23 52.8 271271 WMAP
61 21.0 141141
95 19.0 2020 Ground-based
150 11.0 3131
100 9.94 105105 Planck HFI
143 7.04 6363
217 4.66 9595
353 4.41 394394
  • *

    The average value within the sky-patch

The frequency bands and the related beam resolution are listed in Table 1. All the maps are in the HealPix pixelization scheme at NSIDE=512\texttt{NSIDE}=512. The noise in the sky patch is not homogeneous. For simulated sky maps of ground-based observations, the noise variance on each pixel depends on the hit-map, which shown in Figure 1. The variance is given by

σp2=NET2HpΩp,\displaystyle\sigma^{2}_{p}=\frac{{\rm NET}^{2}}{H_{p}\Omega_{p}}, (3.4)

where HpH_{p} denotes the value of hit-map at pixel pp (in hour/deg2{\rm hour}/{\rm deg^{2}} unit) and Ωp\Omega_{p} is the solid angle of pixel pp. In the case of simulating space-based observed sky maps, the noise variance map can be obtained from the LAMBDA website.

4 Foreground removal implementation and r constraint

4.1 Foreground removal implementation

To perform a constrained NILC on the simulated observational sky maps, there is still a key issue to be solved, which is how to obtain the mixing matrix 𝐀\mathbf{A} in equation (2.5). Generally speaking, the term 𝐀\mathbf{A} encodes the emission law of the foreground. Based on the current understanding of the foreground model, only synchrotron radiation and thermal dust emission can contribute polarization signal. From the synchrotron and dust model equations (3.1) and (3.2), it appears that we need to know these three parameters βs,βd\beta_{\rm s},\beta_{\rm d} and TdT_{\rm d} in advance.

In our implementation, the parameter TdT_{\rm d} is fixed to be 19.619.6 K according to the results of Planck[43]. As for βs\beta_{\rm s} and βd\beta_{\rm d}, We will fit them from the observational intensity and polarization sky maps. The fitting is based on three assumptions: 1. The foreground emission is dominated by large-scale signals. 2. The spectral indices of any foreground components modeled by power-law change very little on small scales. 3. The frequency dependence of foreground emission is the same for both intensity and polarization signals. We fit the values of these two parameters at each pixel using the 8(frequencychannel)×3(T,Q,U)8\ (frequency\ channel)\times 3\ (T,Q,U) observed sky map. The model at each pixel pp is

Tν(p)\displaystyle T_{\nu}(p) =As(p)gRJ(ν)gRJ(νs)(ννs)βs+Af(p)gRJ(ν)gRJ(νs)(ννs)2.14+Asd(p)gJy(ν)gJy(νs)fsd(ν)fsd(νs)\displaystyle=A_{\rm s}(p)\frac{g_{\rm RJ}(\nu)}{g_{\rm RJ}(\nu_{\rm s})}\left(\frac{\nu}{\nu_{\rm s}}\right)^{\beta_{\rm s}}+A_{\rm f}(p)\frac{g_{\rm RJ}(\nu)}{g_{\rm RJ}(\nu_{\rm s})}\left(\frac{\nu}{\nu_{\rm s}}\right)^{-2.14}+A_{\rm sd}(p)\frac{g_{\rm Jy}(\nu)}{g_{\rm Jy}(\nu_{\rm s})}\frac{f_{\rm sd}(\nu)}{f_{\rm sd}(\nu_{\rm s})}
+Ad(p)gJy(ν)gJy(νd)Bν(Td)Bνd(Td)(ννd)βd(p)+TCMB(p),\displaystyle+A_{\rm d}(p)\frac{g_{\rm Jy}(\nu)}{g_{\rm Jy}(\nu_{\rm d})}\frac{B_{\nu}(T_{\rm d})}{B_{\nu_{\rm d}}(T_{\rm d})}\left(\frac{\nu}{\nu_{\rm d}}\right)^{\beta_{\rm d}(p)}+T_{\rm CMB}(p), (4.1)
Qν(p)\displaystyle Q_{\nu}(p) =Qs(p)gRJ(ν)gRJ(νs)(ννs)βs+Qd(p)gJy(ν)gJy(νd)B(ν,Td)B(νd,Td)(ννd)βd(p)+QCMB(p),\displaystyle=Q_{\rm s}(p)\frac{g_{\rm RJ}(\nu)}{g_{\rm RJ}(\nu_{\rm s})}\left(\frac{\nu}{\nu_{\rm s}}\right)^{\beta_{\rm s}}+Q_{\rm d}(p)\frac{g_{\rm Jy}(\nu)}{g_{\rm Jy}(\nu_{\rm d})}\frac{B(\nu,T_{\rm d})}{B(\nu_{\rm d},T_{\rm d})}\left(\frac{\nu}{\nu_{\rm d}}\right)^{\beta_{\rm d}(p)}+Q_{\rm CMB}(p), (4.2)
Uν(p)\displaystyle U_{\nu}(p) =Us(p)gRJ(ν)gRJ(νs)(ννs)βs+Ud(p)gJy(ν)gJy(νd)B(ν,Td)B(νd,Td)(ννd)βd(p)+UCMB(p).\displaystyle=U_{\rm s}(p)\frac{g_{\rm RJ}(\nu)}{g_{\rm RJ}(\nu_{\rm s})}\left(\frac{\nu}{\nu_{\rm s}}\right)^{\beta_{\rm s}}+U_{\rm d}(p)\frac{g_{\rm Jy}(\nu)}{g_{\rm Jy}(\nu_{\rm d})}\frac{B(\nu,T_{\rm d})}{B(\nu_{\rm d},T_{\rm d})}\left(\frac{\nu}{\nu_{\rm d}}\right)^{\beta_{\rm d}(p)}+U_{\rm CMB}(p). (4.3)

Here νs23\nu_{\rm s}\equiv 23 GHz and νd353\nu_{\rm d}\equiv 353 GHz. The factors gRJ(ν)g_{\rm RJ}(\nu) and gJy(ν)g_{\rm Jy}(\nu) denote the conversion factors from antenna temperature μKRJ\mu\rm{K}_{\rm RJ} or flux density Jy/sr to CMB unit μKCMB\mu{\rm K}_{\rm CMB} at frequency ν\nu. We considered the spinning dust emission are unpolarized in our fitting model. fsdf_{sd} is the external template calculated from SpDust2 [44]. Totally there are 1212 free parameters in the model include the amplitudes and the spectral indices of the foreground, and the CMB signal.

In order to achieve better fitting results, all the observational maps are re-smoothed to the same resolution 52.852.8 arcmin and down graded the NSIDE to 88. This step can average out the noise and CMB signal on each pixel, thereby improving the fitting results. After smoothing and downgrading, the spectral index for each pixel is the average of the true spectral index values within the pixel area. It can be easily figure out by perform Taylor expansion

sFG(p)=d𝒏A(𝒏)νβ(𝒏)A¯νβ¯+A¯(lnν)2(β(𝒏)β¯)2+,s_{\rm FG}(p)=\int{\rm d}\bm{n}A(\bm{n})\nu^{\beta(\bm{n})}\simeq\bar{A}\nu^{\bar{\beta}}+\bar{A}(\ln\nu)^{2}\int(\beta(\bm{n})-\bar{\beta})^{2}+\dots, (4.4)

with A¯d𝒏W(𝒏)A(𝒏)\bar{A}\equiv\int{\rm d}\bm{n}W(\bm{n})A(\bm{n}) and β¯d𝒏W(𝒏)β(𝒏)\bar{\beta}\equiv\int{\rm d}\bm{n}W(\bm{n})\beta(\bm{n}), where W(𝒏)W(\bm{n}) is the weight that encodes the information for smoothing and downgrading. Based on our aforementioned assumptions, we only consider the zeroth-order term.

The likelihood for each pixel, denoted as (p)\mathcal{L}(p), is given by

2log(p)=ν[T^ν(p)Tν(p)]2σT2(ν,p)+[Q^ν(p)Qν(p)]2σP2(ν,p)+[U^ν(p)Uν(p)]2σP2(ν,p)+const.\displaystyle-2\log\mathcal{L}(p)=\sum_{\nu}\frac{\left[\hat{T}_{\nu}(p)-T_{\nu}(p)\right]^{2}}{\sigma^{2}_{T}(\nu,p)}+\frac{\left[\hat{Q}_{\nu}(p)-Q_{\nu}(p)\right]^{2}}{\sigma^{2}_{P}(\nu,p)}+\frac{\left[\hat{U}_{\nu}(p)-U_{\nu}(p)\right]^{2}}{\sigma^{2}_{P}(\nu,p)}+const. (4.5)

Here σT2\sigma^{2}_{T} and σP2\sigma^{2}_{P} represent the variances of instrumental noise of the temperature map and polarization map, respectively. The variances are recalculated at NSIDE=8\texttt{NSIDE}=8. For each pixel, we sampled the likelihood (4.5) by using Markov Chain Monte Carlo (MCMC). For efficiency purpose, we use the python package NumPyro555https://github.com/pyro-ppl/numpyro[45, 46] to sample the likelihood. When running MCMC, the priors for all parameters are uniform distributed. For βs\beta_{\rm s} and βd\beta_{\rm d}, we use the ranges βs[4,2]\beta_{\rm s}\in[-4,-2] and βd[1,3]\beta_{\rm d}\in[1,3]. For every pixel pp, we take the best-fit value sampled from MCMC as the fitting results βs(p)\beta_{\rm s}(p) and βd(p)\beta_{\rm d}(p). Thus, βs\beta_{\rm s} and βd\beta_{\rm d} are the maps with the resolution NSIDE=8\texttt{NSIDE}=8 after the fitting. An fitting example is given in Figure 2 shows the fitting results of some of the parameters.

Refer to caption
(a)
Refer to caption
(b)
Figure 2: (a) The solid step-line histogram and the contour represent the posterior distribution for the spectral index of synchrotron and thermal dust, as shown in Equations (4.1) - (4.3). These distributions were obtained through the MCMC fitting algorithm for 8×38\times 3 map-level data points within one pixel. The histogram, filled with gray color, shows the actual spectral indices in the fitting area, while the red line represents the averaged spectral indices in the area. The fitting results are βs=3.010.09+0.10\beta_{\rm s}=-3.01^{+0.10}_{-0.09} and βd=1.5360.011+0.011\beta_{\rm d}=1.536^{+0.011}_{-0.011} within 95%95\% CL. The central value is represented by the blue line in the plot. (b) Shows the difference in βd\beta_{\rm d} between the blue and red lines as depicted in (a). A total of 81 pixels at Nside=8N_{\text{side}}=8 were fitted.

According to the current understanding of the polarization foregrounds[41, 47], the model for polarized signal includes CMB, synchrotron and thermal dust emission. This suggests that we can apply two other separate constraints on the NILC method. One for removing synchrotron emission signal νwνAν,s=0\sum_{\nu}w_{\nu}A_{\nu,{\rm s}}=0 where Aν,sA_{\nu,{\rm s}}, which is the element of mixing matrix 𝐀\mathbf{A} and given by Equation (3.1) and another for removing thermal dust emission signal νwνAν,d=0\sum_{\nu}w_{\nu}A_{\nu,{\rm d}}=0 where Aν,dA_{\nu,{\rm d}} is given by Equation (3.2). It should be noted that the conversion factors to CMB unit are needed when calculating Aν,sA_{\nu,{\rm s}} and Aν,dA_{\nu,{\rm d}}. Due to the scarcity of low-frequency observation data, fitting the model (4.14.3) independently on each pixel results in a substantial error in the estimation of parameter βs\beta_{\rm s} for all pixels. In this case, the 95% CL for βs\beta_{\rm s} is about 0.50.5. Therefore, in order to mitigate the error, we impose the constraint that parameter βs\beta_{\rm s} remains constant across all pixels during the fitting process. However, it is important to note that parameter βd\beta_{\rm d} is allowed to vary across different pixels, as it represents distinct parameters for each pixel. In this case, the signal-to-noise ratio for the parameter βs\beta_{\rm s} is improved, reaching a value of 0.10.1 at a 95% CL. The posterior distribution for βs\beta_{\rm s} and βd\beta_{\rm d} are shown in Figure 2.

To better recover the BB-mode CMB signal, the constrained weights 𝐰jk\mathbf{w}_{jk} in Equation (2.5) are calculated on BB maps defined as

B(p)=mamBYm(p).\displaystyle B(p)=\sum_{\ell m}a_{\ell m}^{B}Y_{\ell m}(p). (4.6)

The EBEB leakage[23] due to limited sky coverage is also corrected by a so-called template subtraction method[48] when obtain BB maps from observational QUQU maps. We have provided a detailed description of the steps for the EB leakage correction in the appendix. The correction method has been proven to be the optimal blind correction on the pixel domain in the literature[49]. We have outlined the specific correction steps and results in Appendix A. We show that the ILC methods performs slightly better on such leakage-free BB-family maps[25]. In addition, because of the localization of NILC, the mixing matrix 𝐀\mathbf{A} can vary with spatial position as 𝐀jk\mathbf{A}_{jk}. In order to match the NSIDE of βd(p)\beta_{\rm d}(p) maps with NSIDE(j)\texttt{NSIDE}(j) for every needlet band jj, we use map2alm and alm2map routines in healpy666https://github.com/healpy/healpy[50, 51] package to up-grade the resolution.

In order to more intuitively describe how the constrained NILC process is executed from beginning to end, Figure 3 shows the flow chart of the entire data processing process.

Refer to caption
Figure 3: The flow chart of the whole constrained NILC process.

4.2 From cleaned map to r constraint

After obtaining the clean BB map through the constrained NILC method, the post-mapmaking analysis procedures are quite straightforward. In general, the subsequent processing involves the following steps.

Angular power spectra estimation: The pseudo-power spectra are estimated by using the python package of NaMaster777https://github.com/LSSTDESC/NaMaster[52]. In order to avoid instability caused by signal jumps at the edges of the sky regions, a mask that was slightly smaller than the mask shown in Figure 1LABEL:sub@subfig:skypatch by about 22 degrees is used. The power spectra are binned every 3030 multipole moments from =35\ell=35 to 215215.

Noise debias: We use the 5050 noise only simulations to go through the same constrained NILC processing while keeping the weights unchanged to reconstruct 5050 reconstructed noise only maps, then estimate the mean value the band-powers NN_{\ell} over these noise simulations, finally subtract it from the band-power calculated from the resulting map of ILC. After subtraction, the band-power is considered to be noise-unbiased. Also, the 5050 noise only simulation band-powers are used to estimate the noise covariance matrix.

Constraint on rr: To show how good our method is, we also use the calculated unbiased clean spectra to constrain the tensor to scalar ratio rr. The full posterior for the individual band powers is non-Gaussian. However, for high enough multipoles (usually 30\ell\geq 30) the central limit theorem justifies a Gaussian approximation[53]. The lowest multipole \ell in our binning scheme is 3535 thus the Gaussian approximation is valid:

2ln=(C^bBBCbBB)TCov(C^bBB,C^bBB)1(C^bBBCbBB)+const.\displaystyle-2\ln\mathcal{L}=(\hat{C}_{\ell_{b}}^{BB}-C_{\ell_{b}}^{BB})^{T}{\rm Cov}(\hat{C}_{\ell_{b}}^{BB},\hat{C}_{\ell_{b}}^{BB})^{-1}(\hat{C}_{\ell_{b}}^{BB}-C_{\ell_{b}}^{BB})+const. (4.7)

where the observed power spectrum is denoted as C^bBB\hat{C}_{\ell_{b}}^{BB}. CbBBC_{\ell_{b}}^{BB} denotes the theoretical power spectrum which was calculated by CAMB code. The likelihood only concerns the BBBB spectra at 30\ell\geq 30, where the primordial gravitational wave and lensing effect matters, so we fixed all the parameters to the best fit value from Planck-2018[54], while only free rr with prior r[0,1]r\in[0,1]. The covariance term is estimated from the noise only simulations. We sample this likelihood for these two parameters using the CosmoMC[55]888https://cosmologist.info/cosmomc/ package. The posteriors are summarized by GetDist[56]999https://pypi.org/project/GetDist/ package.

5 Results and discussion

In this section, we present the numerical results obtained from the constrained NILC analysis. To demonstrate the stability of this method, we performed 500500 independent simulations. We construct the cleaned CMB maps estimated by constrained NILC foreground removal, as well as the residual foreground on the resulting map. We compute the angular power spectra of each of these maps and compare them to see its efficiency. As a comparison, we also perform the standard NILC method without any extra constraint on the same simulated data. As expected, the bias of standard NILC is larger than that of constrained NILC.

5.1 CMB signal reconstruction and foreground residual

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Refer to caption
(e)
Refer to caption
(f)
Figure 4: The resulting and residual foreground BB-mode maps obtained by constrained NILC and standard NILC as well as the band-powers of them. All the maps are smoothed with a Gaussian beam of 1111 arcmin, and the edges of them are apodized by 22 degrees. (a) and (b) show the resulting and residual foreground maps of constrained NILC. The band-powers of these two maps are shown in (e). (c), (d) and (f) show the same results but for standard NILC. The dot-dashed line is the input lensed BB-mode power spectrum with r=0r=0 and the solid curve is an unlensed BB-mode power spectrum with r=0.01r=0.01 for comparison. All the band-powers in (e) and (f) are the averaged band-powers from 500500 independent simulations. The error-bars show the standard deviation of the simulations.

The foreground residual maps can be obtained by applying the same weights obtained from constrained NILC or standard NILC to the input foreground maps. The resulting and foreground residual BB-mode maps are shown in Figure 4. When comparing the results of the constrained NILC and standard NILC methods at the map level, it’s easy to find that qualitatively, the resulting map of the constrained NILC exhibits slightly larger fluctuations compared to the standard NILC. However, this trend is reversed when considering the foreground residuals. More quantifiable results can be observed at the power spectrum level. It’s clear that the foreground residual of the constrained NILC method is much smaller. From the power spectra analysis, the bias caused by foreground residual in the constrained NILC method is estimated to be around r102r\sim 10^{-2}. On the other hand, the standard NILC method exhibits a significantly larger foreground residual, estimated to be approximately at the order of r=101r=10^{-1}.

One notable observation from the residual foreground maps is that the residual foreground of standard NILC exhibits a higher degree of small-scale structure compared to the residual of constrained NILC. This indicates that the foreground and the instrumental noise are indistinguishable especially on small scale where the noise dominate. The root of the problem lies in the fact that standard NILC lacks any prior information regarding noise and foreground components , as mentioned in Section 2.1. Adding prior information helps to get results with lower foreground residuals. But if the foreground and noise are considered as a whole “junk” term, then the overall residual will become larger. This is why the fluctuations in the resulting map of constrained NILC are greater than that of standard NILC. In a word, adding the extra constraints on NILC can help reduce the bias, but slightly increases uncertainty.

5.2 The constraint on r

One of the most important scientific goal of observing CMB BB-mode polarization is to find out the polarized signal produced by the primordial gravitational waves. Figure 5 illustrates the distribution of the best-fit values (left) and the differences between the 95%95\% upper limits and the best-fit values (right) for both the standard NILC and constrained NILC methods, based on 500500 independent simulations. It is evident that the constrained NILC method has a significantly smaller bias, less than 5×1035\times 10^{-3}, compared to the standard NILC method, which exhibits a bias of approximately r0.05r\sim 0.05. Additionally, as anticipated, the figure also demonstrates that the difference, which represents the level of uncertainty, increases accordingly. Indeed, for the standard NILC method, the 2σ2\sigma uncertainty is approximately 0.0250.025, while for the constrained NILC method, it increases to around 0.0450.045. This discrepancy arises because the weights used in the constrained NILC method are optimized specifically for minimizing the foreground residual, rather than minimizing the overall impact of foregrounds and noise. As a result, there is a trade-off between reducing bias and increasing uncertainty in practice. In conclusion, the obtained fitting results align with the analysis of residual foregrounds conducted in the previous section.

Refer to caption
Refer to caption
Figure 5: The left shows the distribution of the best-fit values rbestfitr_{\rm bestfit} of tensor-scalar ratio rr. The right shows the distribution of the difference between one-sided upper bounds rupperr_{\rm upper} and the best-fit values rbestfitr_{\rm bestfit}. Both histogram are sampled from 500500 independent simulations.

6 Conclusions

The effectiveness of the ILC method for foreground removal at low SNR is limited by its inherent constraints. However, by incorporating additional prior information and transforming it into a semi-blind method, the residual foreground can be significantly reduced. In this work, we propose an improved needlet ILC foreground removal method, referred to as constrained NILC, which specifically designed for ground-based CMB observations searching for BB-mode. The complete workflow of the constrained NILC method is depicted in Figure 3.

To validate the effectiveness of this method, we choose a clean sky-patch located in the northern sky. The map-depth are calculated from a reasonable hit-map shown in Figure 1. We compare the reconstructed CMB signals and foreground residuals at the map level, the angular power spectra level, and the final rr constraints level using 500500 independent end-to-end simulations. The results are shown in Figure 4 and Figure 5. Moreover, it is evident that the bias resulting from the foreground residuals is substantially reduced with the incorporation of additional prior information. The bias of the tensor-to-scalar ratio decreases from 0.050.05 to 0.0070.007. However, this improvement comes at the expense of increased uncertainty, which rises from 0.0250.025 to 0.0450.045. In practice, it is necessary to strike a balance between these two aspects.

Furthermore, the output of the constrained NILC method, which is a sky map that retains a significant amount of information about the CMB, can be valuable for investigating non-Gaussian CMB physics such as weak gravitational lensing. As a result, constrained NILC has the potential to be a viable method for mitigating foreground contamination in future ground-based experiments focused on primordial gravitational waves.

The effectiveness of this method also relies on the accuracy of prior information regarding the foreground. To successfully distinguish the primordial BB-modes from galactic foregrounds, it is crucial to conduct comprehensive research on foreground phenomena, especially considering the current constraint on the tensor-to-scalar ratio (r<0.036r<0.036 at 95% confidence level). The complexity and intensity of foreground signals can introduce interference in foreground removal techniques. Further foreground studies should involve in-depth analysis of the physical properties, spatial distribution, and spectral characteristics of foreground components, along with modeling and simulation.

Acknowledgments

We thank Jacques Delabrouille, Mathieu Remazeilles, and those of AliCPT group for useful discussion. This study is supported in part by the National Key R&D Program of China No.2020YFC2201600 and by the NSFC No.11653002.

Appendix A The optimal EB-leakage correction

EB-leakage arises when performing spherical harmonic transformation on a partial sky, as the orthogonality of spherical harmonics is no longer satisfied. In ground-based CMB observations, such leakage is unavoidable due to limited sky coverage, leading to bias on rr. Correcting this EB-leakage is necessary.

Recent studies show that there’s a good way called template cleaning method for a blind estimation and correction of EBEB leakage [48] comes from the partial sky, and can directly give a pure BB map, here we propose to extend the ILC method for scalar TT map to BB map, as BB is a pseudo-scalar for a strictly speaking.

The detailed description of the blind EBEB correction method refers to paper [48], we briefly summarize the steps for pure BB map construction in the following:

  1. 1.

    Use the map2alm function of healpy to directly transform the incomplete observation (Q,U)(Q,U) maps to (amE,amB)(a_{\ell m}^{E},a_{\ell m}^{B}). Then perform the inverse transformation from (amE,0)(a_{\ell m}^{E},0) to (Q,U)(Q,U) map, denoted as (QE(1),UE(1))(Q^{(1)}_{E},U^{(1)}_{E}). Utilize amBa_{\ell m}^{B} solely to obtain the pseudo-scalar BB map, denoted as B(1)B^{(1)}. At this stage, the BB map B(1)B^{(1)} is contaminated by EBEB leakage.

  2. 2.

    Obtain the BB map from (QE(1),UE(1))(Q^{(1)}_{E},U^{(1)}_{E}) using the same mask, denoted as B(2)B^{(2)} in a similar manner.

  3. 3.

    Now, B(2)B^{(2)} serves as the leakage template for B(1)B^{(1)} within the available region. The final leakage-corrected BB map can be expressed as:

    Bclean(p)=B(1)(p)aB(2)(p),B_{\rm clean}(p)=B^{(1)}(p)-aB^{(2)}(p), (A.1)

    Here, the factor aa is obtained through a linear fitting of B(1)B^{(1)} and B(2)B^{(2)}, and pp denotes the pixel location.

Figure 6 displays the results before and after EBEB leakage correction using this method on a simple circular patch. The input sky-map has zero BB mode. It is evident that the EBEB leakage resulting from the partial sky is greater than r=0.01r=0.01 in the power spectrum, which heavily biases the rr measurement, particularly on the degree scale. After the correction, the residual leakage is much smaller than r=0.01r=0.01.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Figure 6: (a): The leaked BB mode signal at the map level. (b): The residual after EBEB leakage correction using the method. (c): The power spectra corresponding to (a) and (b), including the r=0.01r=0.01 BB-mode.

References