This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The low level of correlation observed in the CMB sky at large angular scales and the low quadrupole variance

Robert Z Knight,1 and Lloyd Knox,1
1Physics Department, University of California, Davis, CA 95616
E–mail: riknight@ucdavis.edu (ZK)
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract

The angular two-point correlation function of the temperature of the cosmic microwave background (CMB), as inferred from nearly all-sky maps, is very close to zero on large angular scales. A statistic invented to quantify this feature, S1/2S_{1/2}, has a value sufficiently low that only about 7 in 1000 simulations generated assuming the standard cosmological model have lower values; i.e., it has a pp–value of 0.007. As such, it is one of several unusual features of the CMB sky on large scales, including the low value of the observed CMB quadrupole, whose importance is unclear: are they multiple and independent clues about physics beyond the cosmological standard model, or an expected consequence of our ability to find signals in Gaussian noise? We find they are not independent: using only simulations with quadrupole values near the observed one, the S1/2S_{1/2} pp–value increases from 0.007 to 0.08. We also find strong evidence that corrections for a “look-elsewhere effect” are large. To do so, we use a one-dimensional generalization of the S1/2S_{1/2} statistic, and select along the one dimension for the statistic that is most extreme. Subjecting our simulations to this process increases the pp–value from 0.007 to 0.03; a result similar to that found in Planck XVI (2016). We argue that this optimization process along the one dimension provides an underestimate of the look-elsewhere effect correction for the historical human process of selecting the S1/2S_{1/2} statistic from a very high-dimensional space of alternative statistics after having examined the data.

keywords:
methods: statistical – cosmology: cosmic background radiation
pubyear: 2017pagerange: The low level of correlation observed in the CMB sky at large angular scales and the low quadrupole varianceA

1 Introduction

We view the continued success of a six-parameter model in describing the statistical properties of cosmic microwave background (CMB) temperature anisotropy maps with millions of resolution elements as one of the major takeaway messages from the Planck mission: a simple model works extremely well. This success includes agreement between model and data via multiple statistics, most prominently the two-point correlation function (Planck XV, 2014; Planck XI, 2016), but also via the higher-order correlations expected from gravitational lensing (Planck XVII, 2014; Planck XV, 2016) and relativistic effects of our motion with respect to cosmic rest (Planck XXVII, 2014).

The strongest challenge111While others have described challenges to the success of the six-parameter model arising from some internal tensions revealed by tests using the two-point correlation function, e.g. Addison et. al. (2016), they are not highly significant (Planck LI, 2016). to this takeaway message, based on CMB maps alone, may be the existence of relatively large-scale patterns that have been described as “anomalous.” Some of these large-scale CMB anomalies are: the existence of an unexpectedly large cold spot (Cruz, Mart nez–Gonz lez, & Vielva, 2006, 2007, 2010; Zhang & Huterer, 2010), the preference for odd parity modes (Kim & Naselsky, 2010a, b, 2011), a hemispherical power asymmetry (Eriksen et. al., 2004, 2007; Hoftuft et. al., 2009; Hansen et. al., 2009; Akrami et. al., 2014; Planck XXIII, 2014; Planck XVI, 2016), the alignment of the lowest multipoles with the geometry of the solar system and with each other (de Oliveira–Costa et. al., 2004; Hansen et. al., 2004; Schwarz et. al., 2004; Land & Magueijo, 2005a, b; Bennet et. al., 2011; Copi et. al., 2015b), the low variance in low-resolution maps (Planck XXIII, 2014; Planck XVI, 2016), and the lack of correlation on the largest angular scales (Bennett et. al., 2003; Spergel et. al., 2003; Hajian, 2007; Copi et. al., 2007, 2009; Bennet et. al., 2011; Gruppuso, 2014; Copi et. al., 2015a). For a recent review of large-scale CMB anomalies, see Schwarz et. al. (2015).

One of the apparent anomalies that has received a lot of attention is the low value of the real space two-point correlation function at large angular scales. This feature can be seen in the earliest observations of the correlation function of the CMB as observed by the Cosmic Background Explorer Differential Microwave Radiometer (COBE–DMR) (Hinshaw et. al., 1996), though it was not described in this way at the time. The lack of correlations at large angles received more attention with the Wilkinson Microwave Anisotropy Probe (WMAP) first year data (Spergel et. al., 2003), when the most widely used precise statistic created to capture this feature of the sky was defined: the S1/2S_{1/2} statistic. Spergel et. al. (2003) initiated the use of S1/2S_{1/2}, after having looked at the COBE–DMR and WMAP data, as a convenient way to show how unusual the lack of correlation of the CMB at large angles is under the assumption of Λ\LambdaCDM, making S1/2S_{1/2} an a posteriori statistic. The S1/2S_{1/2} statistic has since been used to confirm the presence of the anomaly in the WMAP 3 year (Copi et. al., 2007) and 5 year (Copi et. al., 2009) data, as well as in the first (Copi et. al., 2015a) and second (Planck XVI, 2016) Planck data releases.

We now define S1/2S_{1/2}, starting with a generalized version:

Sx=1x[C(θ)]2dcos(θ)S_{x}=\int_{-1}^{x}\left[C(\theta)\right]^{2}d\cos(\theta) (1)

where

C(θ)=T(𝐧^)T(𝐧^)C(\theta)=\langle T(\mathbf{\hat{n}})T(\mathbf{\hat{n}^{\prime}})\rangle (2)

is the two-point correlation function and θ\theta is the angle between the two directions 𝐧^\mathbf{\hat{n}} and 𝐧^\mathbf{\hat{n}^{\prime}}. With x=1/2=cos60x=1/2=\cos 60^{\circ}, this becomes the standard definition of S1/2S_{1/2}.

Use of this statistic enables one to quantify how unusual the observed feature is, given Λ\LambdaCDM (the null hypothesis). We use simulations to determine a distribution of S1/2S_{1/2} values given Λ\LambdaCDM, and find the fraction of simulations with lower S1/2S_{1/2} values than that inferred from the data. This fraction of simulations that exceeds the statistic in question is commonly called a pp–value. In a recent re-analysis of the observed lack of correlations on the CMB at large angles, Copi et. al. (2015a) calculated several S1/2S_{1/2} values of the CMB using various WMAP and Planck maps, masks, and methods, and using simulations based on the Λ\LambdaCDM theory, found pp–values for S1/2S_{1/2} of the observed cut-sky CMB. These vary from 0.001910.00191 to 0.003290.00329. Alternatively, rather than use pp–values, one could perform a Bayesian analysis on S1/2S_{1/2} in order to quantify how unexpected the observed value is. Efstathiou, Ma, & Hanson (2010) did such an analysis, in which they calculated an expected posterior distribution of theoretical S1/2S_{1/2} values, given the observed value. This resulted in a very wide distribution of theoretical S1/2S_{1/2} values, and the Λ\LambdaCDM S1/2S_{1/2} value was not strongly disfavoured, indicating that S1/2S_{1/2} is actually a poor discriminator of theoretical models. In this work, we do not use the Bayesian approach, but rather stick with the computation of pp–values.

There have been numerous theoretical attempts to define cosmologies for which a low value of S1/2S_{1/2} is less unlikely. Some of these attempts examine basic features of Λ\LambdaCDM, such as the Gaussianity and statistical homogeneity/isotropy of the primordial fluctuations. For example, Copi et. al. (2009) showed that the low-\ell modes of the CMB seem to work together to cause the lack of correlation on large angles, suggesting a correlation among the modes, indicating a violation of statistical isotropy. Non-trivial topology could be a possible way to explain the low correlation on large angles, because in topological models that employ the spherical multi-connected manifolds, the power at large scales is naturally suppressed (Stevens, Silk, & Scott, 1993; Niarchou & Jaffe, 2006). However, constraints on such manifolds arise from the null results of searches for matched circles on the CMB. In a more recent work, Aurich & Lustig (2014) explore CMB correlations in a cosmology incorporating the Hantzsche–Wendt manifold, which is interesting because this manifold could easily escape detection by matched circle pairs. They calculate the distribution of S1/2S_{1/2} for an ensemble of simulations in this topology, finding it reduced by about a factor of 2 compared to Λ\LambdaCDM.

Here we argue that this attention is misplaced. We find the evidence is quite weak that the low S1/2S_{1/2} value is due to anything other than either the low quadrupole value, or a human’s ability to identify unusual-looking features in a high-dimensional data set with many different ways it can be unusual, or both.

The role of the low-order multipoles and the quadrupole in particular in the lack of correlation at large angles has previously been investigated. Bernui et. al. (2006) examined the role of the low quadrupole in the correlation function by removing the quadrupole from both the observed WMAP data and the Λ\LambdaCDM model, and found a qualitatively better agreement between observation and theory than with the quadrupole included. Furthermore, both the quadrupole-removed WMAP and Λ\LambdaCDM correlation functions showed a lack of correlations at large angles, whereas the non-quadrupole-removed Λ\LambdaCDM correlation function did not, indicating that the low quadrupole is largely, though not entirely, responsible for the lack of correlation at large angles. Hajian (2007) pointed out that the contribution to C(θ)C(\theta) from multipoles =2,3\ell=2,3 in the observed CMB is nearly equal and opposite to the contribution from the rest of the multipoles put together. Additionally, Copi et. al. (2009) found that by tuning the powers C2,C3C_{2},C_{3} of the Λ\LambdaCDM prediction, they could obtain an S1/2S_{1/2} expectation value lower than the observed S1/2S_{1/2} value found from WMAP. However, no work of which we are aware has yet reconciled the low pp–value of the S1/2S_{1/2} statistic at 0.007\sim 0.007, with the larger pp–value of the low quadrupole at 0.04\sim 0.04.

Despite this previous attention to the relationship between the low quadrupole and low correlation at large angles, we find an important result has been overlooked: namely, the impact of the low quadrupole on the distribution of S1/2S_{1/2} values expected under Λ\LambdaCDM. We demonstrate that once the probability of S1/2S_{1/2} is conditioned on the observed value of the quadrupole, the value of S1/2S_{1/2} is no longer anomalous, as the pp–value rises to 0.08. This result has implications for solutions to the puzzle of low S1/2S_{1/2}: any change to the model that suppresses the ensemble average of the quadrupole variance is helpful. For example, several groups have explored the possibility of an era in the early universe before slow-roll inflation in which a “fast-roll” caused a large scale suppression of power, effectively creating a cutoff in the primordial power spectrum which would be observable as a lack of correlations on large scales today (Contaldi et. al., 2003; Lello et. al., 2014; Liu et. al., 2014; Gruppuso & Sagnotti, 2014).

Although the low value of S1/2S_{1/2} is potentially understandable as due to a departure from Λ\LambdaCDM that reduces the ensemble average of the quadrupole, the question still remains of whether S1/2S_{1/2} is compatible with Λ\LambdaCDM in the absence of any such modifications.

The standard criticism of the amount of attention given to the anomalies is the standard criticism of a posteriori statistics: their a posteriori nature makes their interpretation very challenging. Consider that for a map of the sky, even with a finite number of pixels or data points, there is an infinite variety of statistics that one can make up.222As a simple example, consider linear combinations of a finite number of data points. Given the freedom to choose arbitrary coefficients, there are an infinite number of possible combinations of these data. If one makes up 1,000 of them prior to examining some data set, then it is not evidence of a failure of a model if one of those statistics has a value that happens to be that extreme in only 1 out of 1,000 simulations. With a posteriori statistics, by definition, one creates the statistic after seeing the data. When such a statistic ends up having an extreme value, determining whether or not this indicates a failure of a model is difficult because it is not clear how many such statistics could have been made up, had the data looked different to begin with. Additionally, because the choice of statistic to use was informed by the data, this also makes simulation of the data creation and analysis process very difficult, and perhaps practically impossible. What statistic would a different, simulated data set, have led the analyst to choose? Now that an analyst is in the simulation loop, simulation is quite challenging, to say the least.

There are several features about the S1/2S_{1/2} statistic which can be considered to have been selected a posteriori. Broadly speaking, the whole notion that C(θ)C(\theta) is low at large angles is a posteriori. More specifically, the choices to square C(θ)C(\theta), to integrate this quantity over angles, and to select an upper integration limit of 1/21/2 were all a posteriori choices, thereby making the significance of the value of S1/2S_{1/2} difficult to interpret. In order to address the critique that the upper integration limit was selected a posteriori, the Planck team (Planck XVI, 2016) recently implemented a process in which this limit in the definition of S1/2S_{1/2} was allowed to vary over the whole range of xx values (1x1)(-1\leq x\leq 1). We repeat a similar process here, although with a different rationale. We know historically that there actually was no optimization over xx as a continuous variable – this point is indisputable as one can do the optimization and the result is not x=1/2x=1/2. However, the choice of a statistic tailored to capture the apparently unusual near-zero values of the correlation function on large angular scales was clearly an a posteriori selection. We view the SxS_{x} statistic as a one-dimensional proxy for the very high-dimensional space of all possible statistics from which S1/2S_{1/2} was selected. It is a useful proxy as the impact of a posteriori choice can then be calculated. When we consider the effects of the a posteriori choice of the S1/2S_{1/2} statistic, by considering other possible statistics that one could calculate, the significance of the S1/2S_{1/2} anomaly decreases. However, as we are using a one-dimensional proxy, we expect the impact of selection from the larger space is underestimated. 333For completeness, we mention that Efstathiou, Ma, & Hanson (2010), in addition to providing the Bayesian analysis we mentioned earlier, also suggested that a two-dimensional generalization of S1/2S_{1/2} could illustrate the impact of a posteriori choice on pp–values  but did not make any clear quantitative claims about this impact other than to suggest it could be “an order of magnitude or more.”

One research direction motivated by the S1/2S_{1/2} anomaly that we do see as valuable is a search for testable predictions of the hypothesis that Λ\LambdaCDM is correct and the low value of S1/2S_{1/2} is just a statistical fluke. Yoho et. al. (2014) use Λ\LambdaCDM realizations conditioned on the low value of S1/2S_{1/2} to develop a priori statistics for use with future data sets, such as CMB polarization data, designed to test the hypothesis that the low S1/2S_{1/2} measurement is just the consequence of an unlikely Λ\LambdaCDM realization. The conclusion of our work here is that the evidence is quite weak that the low S1/2S_{1/2} value indicates anything other than a somewhat unlikely Λ\LambdaCDM realization. We hesitate to call it a fluke however, as it is only a highly unusual realization if one does not correct for a posteriori effects. Names aside, we expect that fluke-hypothesis tests have the potential to confirm the “fluke” interpretation we favor here, or provide stronger evidence against Λ\LambdaCDM.

The rest of this paper is organized as follows. In section 2 we describe the observational and simulated data and data processing needed to create S1/2S_{1/2} pp–values. In section 3 we describe the effect of the low quadrupole on the observed S1/2S_{1/2} value and how we include the low quadrupole in our filtered-Λ\LambdaCDM null hypothesis, and the effect that that has on the S1/2S_{1/2} pp–value. In section 4 we describe a means to quantify the impact of the a posteriori choice x=1/2x=1/2 by exploring a more generic statistic SxS_{x}. We finish by combining these two methods and examining the resultant quadrupole-filtered look–elsewhere–corrected pp–values.

2 The S1/2S_{1/2} Statistic

In order to calculate how unlikely the S1/2S_{1/2} statistic for the observed CMB is, given a particular cosmological model, one has to calculate S1/2S_{1/2} for the observed universe as well as for an ensemble of simulated universes, created based on that model. When compared to a suitably created ensemble of simulations, the pp–value of the S1/2S_{1/2} statistic can be computed. In this analysis, we have treated the observed CMB map as similarly as possible to those created in simulations, for consistency. This includes the treatment of the pixelization, spatial frequency content, and masking.

2.1 CMB Observations

There are to date several (nearly) full-sky maps of the CMB that are suitable for use in calculating the correlation function C(θ)C(\theta), and the corresponding large angle statistic S1/2S_{1/2}. Copi et. al. (2015a) used two different Galactic+foreground masks444A mask is a set of flags indicating which pixels of a map are to be excluded from use in further analysis. in their study: the WMAP 9-year KQ75y9 mask, which leaves unmasked 6969 per cent of the sky (fsky=0.69f_{\rm sky}=0.69), and a Planck-derived U74 mask, which has fsky=0.74f_{\rm sky}=0.74. They used each of these masks with several WMAP and Planck CMB maps, and in every case, the S1/2S_{1/2} pp–values were considerably higher (meaning the observed S1/2S_{1/2} was less unusual) for the smaller U74 mask (larger fskyf_{\rm sky}) than those for the larger KQ75y9 mask (smaller fskyf_{\rm sky}). This is also consistent with the results of Gruppuso (2014), who showed that larger masks (smaller fskyf_{\rm sky}) make the significance of the S1/2S_{1/2} anomaly larger (smaller pp–value) than with smaller masks. With an array of Galactic masks that ranged in sizes from fsky=0.78f_{\rm sky}=0.78 to 0.280.28, they found a corresponding decrease in pp–value from 1.961.96 per cent to 0.050.05 per cent. (Note that within this range, the two mask used by Copi et. al. (2015a) were on the smaller end, or had larger fskyf_{\rm sky}.) Except where otherwise mentioned, in this analysis we use only one observed CMB map and mask combination: the Planck PR2 (2015) SMICA map with the (2015) Common mask.555 available at the Planck Legacy Archive http://pla.esac.esa.int/pla/#home

The Planck map and mask are stored in the healpix666 See http://healpix.sourceforge.net for information about healpix (G rski et. al., 2005) format at high angular resolution. For calculations of S1/2S_{1/2}, high angular resolution is not needed, so we degrade the resolution of the mask and map from Nside=2048N_{\rm side}=2048 to Nside=128N_{\rm side}=128 following the procedure described by Planck XVI (2016): the map is first transformed to harmonic space using healpix, then a reweighing by the ratio of pixel window functions is applied, and then the harmonic coefficients are re-transformed back into a map at the lower resolution. The same thing is done to the mask, with the one additional step of defining a threshold value of 0.9, over which the mask pixel values are set to 1, and below which they are set to 0. The resulting mask leaves a fraction of the sky fsky=0.676f_{\rm sky}=0.676 available for analysis. The low resolution of Nside=128N_{\rm side}=128 was chosen in order to reduce computational time while preserving the signal of interest, which is at large angular scales, and also to be consistent with the resolution used in previous analyses (e.g. Copi et. al. (2015a)).

The power spectrum CC_{\ell} of the degraded SMICA map was calculated using spice.777 polspice available at http://www2.iap.fr/users/hivon/software/PolSpice/ This code applies the mask to the map and subtracts the monopole and dipole from the resultant cut-sky map as part of the process to find the cut-sky CC_{\ell}. To derive the correlation function C(θ)C(\theta) shown in Fig. 1, we applied the transform

C(θ)=21002+14πCP(cosθ).C(\theta)\equiv\sum_{\ell=2}^{100}\frac{2\ell+1}{4\pi}C_{\ell}P_{\ell}(\cos\theta). (3)

We used the upper limit of 100 following Copi et. al. (2015a), who showed that the effect on S1/2S_{1/2} of using max=100\ell_{\rm max}=100 rather than a higher \ell value was less than 1%.

In order to calculate S1/2S_{1/2} for the SMICA map, we used the harmonic space method of Copi et. al. (2009). Combining equations 1 and 3, we evaluate

Sx=,CI,(x)CS_{x}=\sum_{\ell,\ell^{\prime}}C_{\ell}I_{\ell,\ell^{\prime}}(x)C_{\ell^{\prime}} (4)

where the quantity

I,(x)=(2+1)(2+1)(4π)21xP(μ)P(μ)𝑑μI_{\ell,\ell^{\prime}}(x)=\frac{(2\ell+1)(2\ell^{\prime}+1)}{(4\pi)^{2}}\int_{-1}^{x}P_{\ell}(\mu)P_{\ell^{\prime}}(\mu)d\mu (5)

is an easily calculable function of xx (Copi et. al., 2009), and we have used the generalization to SxS_{x}, with S1/2S_{1/2} being a special case with x=1/2x=1/2.

Using our degraded SMICA map, we calculated C(θ)C(\theta) and S1/2S_{1/2} resulting in a value of S1/2=2145μK4S_{1/2}=2145~{\rm\mu K}^{4}, which when compared to our ensemble of 10510^{5} simulations, has a pp–value of 0.720.72 per cent (see the solid curves in Fig. 1 and Fig. 2). This S1/2S_{1/2} value is quite close to the value S1/2=2153μK4S_{1/2}=2153~{\rm\mu K}^{4} found by Copi, O’Dwyer, & Starkman (2016), who used the same data set and methods, and is similar to those found by Copi et. al. (2015a), who used older Planck and WMAP data sets. The S1/2S_{1/2} pp–values reported by Copi et. al. (2015a) range from about 0.20.2 per cent to 0.30.3 per cent, depending on the map, mask, and method used. For a more detailed comparison, see the appendix.

2.2 Creating Simulations

Simulations of the CMB were created using healpix based on a theoretical power spectrum created by class888 class available at http://class-code.net/. To create this power spectrum, we used the best fit 2015 Planck Λ\LambdaCDM parameters (Planck XIII, 2016): h=0.6774h=0.6774, Ωb=0.04860\Omega_{\rm b}=0.04860, Neff=3.04N_{\rm eff}=3.04, Ωcdm=0.2589\Omega_{\rm cdm}=0.2589, ΩΛ=0.6911\Omega_{\Lambda}=0.6911, Yp=0.249Y_{\rm p}=0.249, and zreio=8.8z_{\rm reio}=8.8, along with the WMAP value (Fixsen, 2009) Tcmb=2.726KT_{\rm cmb}=2.726~{\rm K}. Prior to creating realizations, the power spectrum was dampened slightly with pixel window and Gaussian beam functions, since the observational data, as presented, contains pixel window and telescope beam effects.

The healpix simulations were then created at the same resolution (Nside=128N_{\rm side}=128) as our degraded-resolution SMICA map, with the same harmonic content (21002\leq\ell\leq 100) that is required to perform the S1/2S_{1/2} calculation in equation 4.

Each simulation was treated in the same way as the SMICA map: we fed each simulation into spice, which applied the degraded-resolution Common mask and removed the monopole and dipole in order to deliver an estimate of the power spectrum CC_{\ell}. We created two separate ensembles of power spectra: one with 10410^{4} and one with 10510^{5} members. The spectra in the smaller set were transformed via equation 3 in order to create the ensemble average autocorrelation function C(θ)C(\theta) and 1σ1\sigma band, shown in Fig. 1. Those in the larger set were used directly in equation 4 with x=1/2x=1/2 in order to create an ensemble of S1/2S_{1/2} values, shown in Fig. 2, with 3 different values of min\ell_{\rm min}, the lowest multipole included in the summation.

Refer to caption
Figure 1: The auto-correlation C(θ)=T(𝐧^)T(𝐧^)C(\theta)=\langle T(\mathbf{\hat{n}})T(\mathbf{\hat{n}^{\prime}})\rangle where cosθ=𝕟^𝕟^\cos\theta=\mathbb{\hat{n}}\cdot\mathbb{\hat{n}^{\prime}}, of the masked SMICA map (solid green line), with the average and 1σ1\sigma confidence region of the autocorrelation functions of 10410^{4} masked CMB simulations (dashed and shaded blue).

3 Effect of the Low Quadrupole Power

The summation formulae for C(θ)C(\theta) and S1/2S_{1/2} (equations 3, 4) make it easy to examine the relative importance of each multipole to C(θ)C(\theta) and S1/2S_{1/2}. We found that removing the lowest multipoles from the calculation of S1/2S_{1/2} for both simulations and for the SMICA data drastically increased the pp–value of the SMICA S1/2S_{1/2} statistic (See Fig. 2). In fact, for the min\ell_{\rm min} =4=4 case, the SMICA value is quite near the middle of the simulated distribution. This shows that the low pp–value of the standard S1/2S_{1/2} statistic is highly influenced by the C2C_{2} and C3C_{3} of the observed CMB.

Refer to caption
Figure 2: S1/2S_{1/2} calculated for the masked SMICA map (vertical lines) and for 10510^{5} masked simulations (histograms). The minimum \ell value included in the calculation is varied. Note that for these values, as min\ell_{\rm min} increases, the SMICA S1/2S_{1/2} value increases, while the ensemble values tend to decrease. The pp–values for the ensembles shown here are 0.720.72 per cent, 10.610.6 per cent, and 38.738.7 per cent.

Here we revisit the question of the connection between =2\ell=2 and S1/2S_{1/2}. In particular, we ask how dependent the S1/2S_{1/2} anomaly is on the low observed value of C2C_{2}. In our Λ\LambdaCDM model, the ensemble average quadrupole variance is 1099.94μK21099.94~{\rm\mu K}^{2}, whereas for the degraded and masked SMICA map, we measure a quadrupole variance of only 171.8μK2171.8~{\rm\mu K}^{2}, which is lower than the ensemble average value by a factor of 0.156. However, the low observed quadrupole variance is by itself not exceedingly anomalous. To see this, we created an ensemble of 10510^{5} simulated CMB skies, masked them, and measured their C2C_{2} and S1/2S_{1/2} values (see Fig. 3). In this ensemble, the cut-sky SMICA C2C_{2} has a pp–value of 3.93.9 per cent, which is much higher than the S1/2S_{1/2} pp–value of 0.720.72 per cent. This is consistent with a similar analysis by Efstathiou (2003), who calculated C2C_{2} pp–values using two differently measured WMAP C2C_{2} values of 129129 and 212μK2212~{\rm\mu K^{2}},999The variances presented by Efstathiou (2003) were given in terms of ΔT2=(+1)C/(2π)\Delta T^{2}_{\ell}=\ell(\ell+1)C_{\ell}/(2\pi). which resulted in pp–values of 1.31.3 and 3.63.6 per cent, respectively. This suggests, as Copi et. al. (2009) point out, that the low quadrupole variance anomaly is not the same as the S1/2S_{1/2} anomaly.

This joint S1/2S_{1/2}C2-C_{2} distribution allows us to look at the ensemble relationships between the two values. As can be seen in the scatter plot and density contours of Fig. 3, the distribution has a golf-club like appearance, such that the upper right of the plot shows a thin and narrow structure, while the lower left is wedge shaped. The thin distribution in the upper right indicates that at high values, S1/2S_{1/2} and C2C_{2} are correlated with each other. However, this correlation breaks down at lower values, in the wedge-shaped part of the distribution. At these low values, a low value of S1/2S_{1/2} indicates that C2C_{2} must also be low, but the inverse, that a low value of C2C_{2} indicates a low value of S1/2S_{1/2}, is not true. However, as we show below, conditioning an ensemble to only include low values of C2C_{2} does have a significant impact on the distribution of S1/2S_{1/2} values, and increases the SMICA pp–value substantially.

Refer to caption
Figure 3: C2C_{2} and S1/2S_{1/2} values for 10510^{5} masked-sky CMB simulations, with density contours containing 68.368.3, 95.595.5, and 99.799.7 per cent of the points. Marginalizations are shown in the marginal plots. Inset is a zoom of the area around the SMICA point, with 9191, 9393, 9595, 9797, and 9999 per cent density contours shown. The SMICA value is marked by the blue diamond in each plot; it is within the 95.595.5 per cent contour on the main plot, and just slightly within the 9393 per cent contour on the inset. The location of the diamond near the lower corner of the distribution indicates the low pp–values of S1/2S_{1/2} and C2C_{2}. The S1/2S_{1/2} distribution (with min\ell_{\rm min} =2=2) shown in Fig. 2 can be found by marginalizing over C2C_{2}, whereas the constrained S1/2S_{1/2} distribution shown in Fig. 5 can be thought of as a narrow horizontal slice through the location of the SMICA mark.

We implemented a constraint on the quadrupole variance of our cut-sky simulations by generate–and–test filtering. We first created a simulation, masked it, then measured its quadrupole variance and compared it to the Λ\LambdaCDM expectation value. If C2C_{2} of the simulation fell between 0.1 and 0.2 times the Λ\LambdaCDM expectation value (where 0.1 and 0.2 were chosen to simply bracket the observed ratio of 0.156 with round numbers), it was included in the ensemble of simulations. If not, we threw it out. We kept creating simulations until we had reached the desired number in the ensemble: 10410^{4}.

The results of this ensemble selection process are shown in Fig. 4 and Fig. 5. Comparing these to the previous versions without filtering (Fig. 1 and Fig. 2), we see that the ensemble of correlation functions has changed shapes considerably, converging toward the line indicating zero, and the S1/2S_{1/2} distribution has correspondingly lowered toward zero. The SMICA S1/2S_{1/2} value (unchanged) therefore appears to be in a much less unlikely place in this distribution, with a pp–value of 8.248.24 per cent. This non-anomalous pp–value suggests that, at the very least, S1/2S_{1/2} is not independent from the low quadrupole. Once we condition on the observed quadrupole value, the observed S1/2S_{1/2} value is not that rare.

Refer to caption
Figure 4: The auto-correlation as in Fig. 1, but with simulations selected for the ensemble only if 0.1×C2ΛCDM<C2sim<0.2×C2ΛCDM0.1\times C_{2}^{\rm\Lambda CDM}<C_{2}^{\rm sim}<0.2\times C_{2}^{\rm\Lambda CDM}.
Refer to caption
Figure 5: S1/2S_{1/2} as in Fig. 2 (min\ell_{\rm min} =2=2), but with simulations selected for the ensemble only if 0.1×C2ΛCDM<C2sim<0.2×C2ΛCDM0.1\times C_{2}^{\rm\Lambda CDM}<C_{2}^{\rm sim}<0.2\times C_{2}^{\rm\Lambda CDM}, as in Fig. 4. In this C2C_{2}-filtered ensemble of 10410^{4} simulations, S1/2S_{1/2} has a pp–value of 8.248.24 per cent Note that this is slightly lower than the quadrupole-removed pp–value of 10.610.6 per cent, indicated by the min\ell_{\rm min} =3=3 case in Fig. 2

Finally, we look back at the joint S1/2S_{1/2}C2-C_{2} distribution, and generalize the notion of the pp–value to this 2 dimensional space. To do so, we created iso-probability density contours by approximating the probability density using kernel density estimation. Some of these contours are shown in Fig. 3, labelled by what fraction of the total number of points they contain. The SMICA value lies in a region which has a probability density that is not extremely low, lying just within the 9393 per cent contour, indicating a 2D pp–value of 77 per cent, less than a 2σ2\sigma deviation from the densest region. To compare this to the 1D cases we need to consider that the definition of pp–value that we have been using is only a 1-tailed statistic, and does not account for outliers that are extreme in the opposite direction, at equal or lower probability density. (Our density contours in 2D account for high and low values in all directions.) Therefore, to approximate an equivalent 2-tailed statistic, we multiply the 1-tailed 1D pp–values by 2, giving us 2-tailed 1D pp–values of 1.441.44 per cent for S1/2S_{1/2}, and 7.87.8 per cent for C2C_{2}, which is clearly an underestimate given the asymmetry of the 1D C2C_{2} distribution. Thus we see that the observed values in the joint space occupy a probability density that is similar, in terms of how extreme it is, to the one-dimensional case of C2C_{2}. We conclude that there is no new evidence in this joint space for anomalous behavior.

4 Effect of the a Posteriori Choice

The previous section provides a better description of the influence of the low quadrupole on the lack of correlations at large angles then was available before, suggesting that modifications to Λ\LambdaCDM which lower the expected quadrupole contribution to C(θ)C(\theta) would help reduce the anomalous nature of S1/2S_{1/2}. However, we are still interested in whether S1/2S_{1/2} is consistent with Λ\LambdaCDM without any such modification. To that end, we now turn to addressing what is perhaps the most often criticized aspect of S1/2S_{1/2}: that it was created to capture a feature of the data by analysts who had already taken a look at the data. That is, S1/2S_{1/2} was created a posteriori.101010For a reductio ad absurdum description of a posteriori statistic selection in the context of CMB and π\pi anomalies, see Frolop & Scott (2016).

If, prior to looking at the data, one came up with 1,000 statistics to calculate from the data, and then applied them to the data, assuming they were all uncorrelated, one would expect a uniform distribution of pp–values between 0 and 1. The expectation value for the number of statistics with pp–values between 0 and 0.001 would be 1. Even if they were correlated, one could still perform simulations that would naturally take these correlations into account, and use them to calculate the probability that one of the statistics returned a pp–value of less than 0.001, by counting the fraction of simulations for which this is the case; that is, we calculate the pp–value of the pp–value. Such a probability we call the look–elsewhere–corrected (LEC) pp–value.

To perform the analogous calculation for a single a posteriori statistic, one chosen after inspection of the data, the simulation process would include the process of inspecting the data and inventing a statistic to capture what is perceived to be an unusual feature. This is obviously challenging, if not impossible, as it requires an automated model of the analyst him or herself, so that the simulated data are processed by the simulated analyst who identifies the unusual feature and creates a statistic designed to capture that feature in a single number.

The difficult aspect of the above procedure is the automation of the human process of selection of a statistic from a very high-dimensional array of possibilities. In the following we adopt a crude, but calculable, approximation of this process as the selection of one statistic from a merely one-dimensonal space of alternatives. We create this one dimensional space by extending one single aspect of the already well defined S1/2S_{1/2} statistic: the choice of the upper endpoint of the S1/2S_{1/2} integral, 1/21/2. As described above, we have generalized S1/2S_{1/2} to SxS_{x}, which has an arbitrary upper integration limit, chosen from the whole range of possible values: 1x1-1\leq x\leq 1.

Note that we are not claiming that this is the procedure that was followed historically. We know it is not because, as we will see, the choice of xx that returns the most extreme vaue of SxS_{x} from the real data is x=0.37x=0.37 rather than x=1/2x=1/2. The resulting LEC pp–value we obtain should be viewed as merely indicative of the type of correction that is plausible. We suspect that, if anything, our replacement of the high-dimensional space of alternative statistics with a one-dimensional one underestimates the size of the correction.

We proceed by using the generalized SxS_{x} integral (equation 1) and calculating SxS_{x} as a function of xx over the whole range of xx values for an ensemble of 10510^{5} CMB simulations (without C2C_{2} filtering this time). Then, to save computational time, we choose a subsample of these, the first 10410^{4} curves, to evaluate further. For each S(x)S(x) function in the subsample we find, for each value of xx, a rank in the entire set of 105Sx10^{5}\ S_{x} curves. We divide these ranks by 10510^{5} and they become (1-tailed) pp–values, as a function of xx: p(x)p(x). Then, for each simulation in the subsample, we find its minimum pp–value and corresponding xx and SxS_{x} values. In order to treat the SMICA data in the same manner, we throw it in as just another member of this ensemble and find its optimal xx, SxS_{x}, and p(x)p(x) values with resect to the ensemble as well. Finally, using the ensemble of pp–values, we can calculate the LEC pp–value.

The result of this process is shown111111 plotting package corner.py available at https://github.com/dfm/corner.py in Fig. 6, where the SMICA value is indicated by the blue horizontal and vertical lines. In this ensemble, the SMICA map, optimized for minimum pp–value, has x=0.37x=0.37, Sx=1326μK2S_{x}=1326~{\rm\mu K}^{2}, and a nominal p(x)=0.36p(x)=0.36 per cent. This is lower than the S1/2S_{1/2} pp–value of 0.720.72 per cent that we calculated earlier, which is as expected due to the optimization process. The LEC pp–value is 2.942.94 per cent.

To calculate the error on the LEC pp–value, we estimated the sample variance of these quantities by doing some calculations with smaller SxS_{x} ensembles. These smaller ensembles were constructed as subsets of the same set of 105Sx10^{5}\ S_{x} simulations that we used before, but with 10410^{4} members chosen randomly, and with p(x)p(x) curves calculated using only the corresponding 104Sx10^{4}\ S_{x} curves, rather than the entire set of 10510^{5} of them, as we did previously.121212This is a variation on the jackknife resampling test. Interestingly, these fell into two groups, with 27 of them having xx values near 0.35, and 5 of them being near -0.1. For those near x=0.35x=0.35, the average pp–value was 0.3300.330 per cent, and for those near x=0.1x=-0.1, the average pp–value was 0.3460.346 per cent. There does not appear to be a big difference in pp–values between the two groups, but since our first sample had an xx value of 0.37, we compare it to the first group. Supposing that the sample variance is dominated by the number of SxS_{x} curves used in the calculation, we expect the sample variance of these pp–values to be approximately equal to the Poisson noise associated with the number of simulations used. Since the average pp–value of 0.330.33 per cent corresponds to 33 out of 10000 samples, we hypothesized that these 27 optimal pp–values were drawn from a Poisson distribution with mean 33, and performed a Kolmogorov–Smirnov goodness-of-fit test. This resulted in a Kolmogorov–Smirnov pp–value of 0.24, suggesting that these values are consistent with being drawn from a Poissonian distribution, as hypothesized. Therefore, we also suppose that the S1/2S_{1/2} pp–value that we obtained using 105Sx10^{5}\ S_{x} curves, also subject to a sample variance, can also be thought of as being drawn from a Poissonian distribution. Using the only pp–value that we have (0.361%=361/1050.361\%=361/10^{5}) to derive an estimate of the mean, we find the square root of the Poissonian variance to be 361=19\sqrt{361}=19, constraining our measured pp–value to be 0.361%±0.0190.361\%\pm 0.019 per cent.

Refer to caption
Figure 6: Optimized SxS_{x}, p(x)p(x), and xx results for 10410^{4} minimized p(x)p(x) curves, each based on the SxS_{x} curves of 10510^{5} MC simulated CMBs. The LEC pp–value of 2.942.94 per cent can be seen in the central plot as the small portion of nominal pp–values visible to the left of the vertical line indicating the SMICA value.

We need to now also account for the fact that we checked the entire range of possible xx values in order to find this optimal pp–value (account for the look-elsewhere effect). We do this by examining the fraction of simulated data sets with x-optimized pp–values less than the SMICA value of 0.360.36 per cent, the LEC pp–value. For this ensemble, this value is 2.942.94 per cent. Using the small sample variance for the SMICA pp–value described above, this corresponds to 2.94%0.07%+0.14%2.94\%^{+0.14\%}_{-0.07\%}.

The Planck team (Planck XVI, 2016) recently implemented a very similar process to this that was meant to account for the look-elsewhere effect caused by the a posteriori choice of the integration endpoint, and their analysis produced an LEC pp–value of 2.12.1 per cent131313The Planck Collaboration used the opposite sense of pp–value, such that they calculated the probability of finding values higher than the observed value, rather than lower than the observed value. for the SMICA map. However, their analysis was based on only 10310^{3} simulations, whereas we used 10510^{5}. However, their simulations were much more extensive than ours were. Their simulations were based on their “8th Full Focal Plane simulation set” (FFP8), which includes simulation of instrumental, scanning, and data analysis effects, whereas ours were simply based only on a theoretical power spectrum with some simple Gaussian beam and pixel window smoothing. Using the a similar procedure as above to estimate sampling uncertainties (with 2.1%=21/10002.1\%=21/1000 and 214.6\sqrt{21}\simeq 4.6) we find that their result is consistent with ours to within 1.8σ1.8\sigma using their sampling uncertainties. It is reassuring to note that these two methods gave reasonably consistent results.

Finally, we combine both methods described here: the ensemble filtering by C2C_{2}, as well as the p(x)p(x) optimization. In order to reduce computational time, we simply used 10410^{4} quadrupole-filtered SxS_{x} curves for this.141414The choice of using fewer (10410^{4}) simulations is also justified by the larger resultant pp–value: large numbers of simulations are necessary for high pp–value resolution, which is needed when pp–values are small. Large numbers of simulations are therefore not needed for larger pp–values. The result is shown in Fig. 7. In this ensemble, optimized for pp–value, the SMICA data produced x=0.367x=0.367 and Sx=1326.5S_{x}=1326.5, similar to the values found without C2C_{2} filtering, but a much higher nominal pp–value of 4.504.50 per cent, and an even higher LEC pp–value of 22.322.3 per cent.

Refer to caption
Figure 7: Optimized results from 10410^{4} MC simulated CMBs, where the simulations have been selected based on having a low quadrupole power after applying the mask. The LEC pp–value of 2222 per cent can be seen in the central plot as the large portion of nominal pp–values visible to the left of the vertical line indicating the SMICA value.

5 Summary and Conclusions

Others have previously noted connections between the low quadrupole and the near-zero correlation function on large angular scales, as quantified in the S1/2S_{1/2} statistic. Here we quantitatively investigated the relationship between these statistics as expected in the Λ\LambdaCDM model. We found that if one conditions on the observed low quadrupole power then the distribution of S1/2S_{1/2} values in the Λ\LambdaCDM model is shifted to much lower values and as a result the observed S1/2S_{1/2} has a pp–value of 0.08 rather than its unconditioned value of 0.007. We pointed out that departures from Λ\LambdaCDM that suppress the quadrupole will thus naturally make the observed S1/2S_{1/2} less unusual and potentially take it from a pp–value one might call “anomalous” to one that is merely “somewhat unusual”, if that.

Second, we revisited the exercise in Planck XVI (2016), where S1/2S_{1/2} was generalized to SxS_{x} in order to investigate the impact of the a posteriori choice of the integration cutoff at x=cosθ=1/2.x=\cos\theta=1/2. They found that the process of optimizing the value of xx with the observed data leads to a pp-value that is smaller than what one finds by doing the same process with 21 out of 1000 simulations; i.e., they found a look–elsewhere–corected (LEC) pp–value of 2.1%. Repeating this exercise with 100 times as many simulations (but without the simulation of as many aspects of the instrument), we found an LEC pp–value of 33 per cent. Our result is consistent with the 2.12.1 per cent value at the 2σ2\sigma level given their uncertainties that we estimated from their finite number of samples.

Third, we combined these two techniques of both conditioning on the observed low quadrupole power and optimizing over xx values. This resulted in a quadrupole-filtered LEC pp–value of 2222 per cent. Our point here is not that a proper calculation of the pp–value has to include conditioning on the low quadrupole, but rather that these are by no means independent phenomena: One cannot point to both the low quadrupole and low S1/2S_{1/2} as independent indicators of anomalous behavior on large scales.

The main thing we offer regarding the LEC pp–value for S1/2S_{1/2} (quadrupole-filtered or not) is our interpretation of the result. We view the selection of S1/2S_{1/2} from the one-dimensional space of alternative statistics given by SxS_{x} as a proxy for the choice of S1/2S_{1/2} out of a very high-dimensional space of all possible statistics derivable from C(θ)C(\theta) or CC_{\ell}. The value of the proxy is that it allows for calculation of an LEC pp–value. Since the dimensionality of the alternative space is much lower than that of the space of all possible statistics, it arguably leads to an underestimate of the look-elsewhere correction. In conclusion, we have quantitatively demonstarted that a posteriori selection provides a viable explanation of the low observed value of S1/2S_{1/2}. The correct explanation for low S1/2S_{1/2} might lie with physics beyond Λ\LambdaCDM but there is not yet compelling evidence in favor of such an alternative conclusion. Tests of the so-called “fluke hypothesis” (Yoho et. al., 2014; O’Dwyer et. al., 2016) have the potential to alter this situation.

Acknowledgements

We thank Brent Follin, Silvia Galli, Marius Millea, Marcio O’Dwyer, and Douglas Scott for useful conversations.

Some of the results in this paper have been derived using the healpix (G rski et. al., 2005) package.

The theoretical power spectrum was computed using the class (Blas, Lesgourges, & Tram, 2011) package.

The cut-sky power spectra were calculated using the polspice (aka spice) (Chon et. al., 2004) package.

The observed CMB data and mask were provided by the Planck Legacy Archive (http://pla.esac.esa.int/pla/#home)

References

  • Addison et. al. (2016) Addison, G. E., Huang, Y., Watts, D. J., Bennett, C. L., Halpern, M., Hinshaw, G., Weiland, J. L., 2016, ApJ, 818, 132
  • Akrami et. al. (2014) Akrami, Y., Fantaye, Y., Shafieloo, A., Eriksen, H. K., Hansen, F. K., Banday, A. J., G rski, K. M., 2014, ApJ, 784L, 42
  • Aurich & Lustig (2014) Aurich, R., Lustig, S., 2014, CQGra, 31, 5009
  • Bennett et. al. (2003) Bennett, C. L., Halpern, M., Hinshaw, G., Jarosik, N., Kogut, A., Limon, M., Meyer, S. S., Page, L., Spergel, D. N., Tucker, G. S., Wollack, E., Wright, E. L., Barnes, C., Greason, M. R., Hill, R. S., Komatsu, E., Nolta, M. R., Odegard, N., Peiris, H. V., Verde, L., Weiland, J. L., 2003, ApJS, 148, 1
  • Bennet et. al. (2011) Bennett, C. L., Hill, R. S., Hinshaw, G., Larson, D., Smith, K. M., Dunkley, J., Gold, B., Halpern, M., Jarosik, N., Kogut, A., Komatsu, E., Limon, M., Meyer, S. S., Nolta, M. R., Odegard, N., Page, L., Spergel, D. N., Tucker, G. S., Weiland, J. L., Wollack, E., Wright, E. L., 2011, ApJS 192, 17
  • Bernui et. al. (2006) Bernui, A., Villela, T., Wuensche, C.A., Leonardi, R., Ferreira, I., 2006, A&A, 454, 409
  • Blas, Lesgourges, & Tram (2011) Blas, D., Lesgourgues, J., Tram, T., 2011, JCAP 1107, 034
  • Chon et. al. (2004) Chon, G., Challinor, A., Prunet, S., Hivon, E., Szapudi, I., 2004, MNRAS, 350, 914
  • Contaldi et. al. (2003) Contaldi, C. R., Peloso, M., Kofman, L., Linde, S., 2003, JCAP, 07, 002
  • Copi, O’Dwyer, & Starkman (2016) Copi, C. J., O’Dwyer, M., Starkman, G. D., 2016, MNRAS, 463, 3305
  • Copi et. al. (2007) Copi, C. J., Huterer, D., Schwarz, D. J., Starkman, G. D., 2007, Phys. Rev. D, 75, 023507
  • Copi et. al. (2009) Copi, C. J., Huterer, D., Schwarz, D. J., Starkman, G. D., 2009, MNRAS 399, 295
  • Copi et. al. (2015a) Copi, C. J., Huterer, D., Schwarz, D. J., Starkman, G. D., 2015, MNRAS 451, 2978
  • Copi et. al. (2015b) Copi, C. J., Huterer, D., Schwarz, D. J., Starkman, G. D., 2015, MNRAS 449, 3458
  • Cruz, Mart nez–Gonz lez, & Vielva (2006) Cruz, M., Tucci, M., Mart nez–Gonz lez, E., Vielva, P., 2006, MNRAS, 369, 57
  • Cruz, Mart nez–Gonz lez, & Vielva (2007) Cruz, M., Cay n, L., Mart nez–Gonz lez, E., Vielva, P., Jin, J., 2007, ApJ, 655, 11
  • Cruz, Mart nez–Gonz lez, & Vielva (2010) Cruz, M., Mart nez–Gonz lez, E., Vielva, P., 2010, ASSP, 14, 275
  • de Oliveira–Costa et. al. (2004) de Oliveira–Costa, A., Tegmark, M., Zaldarriaga, M., Hamilton, A., 2004, Phys. Rev. D., 69, 063516
  • Efstathiou (2003) Efstathiou, G., 2003, MNRAS, 346, L26
  • Efstathiou, Ma, & Hanson (2010) Efstathiou, G., Ma, Y., Hanson, D., 2010, MNRAS, 407, 2530
  • Eriksen et. al. (2004) Eriksen, H. K., Hansen, F. K., Banday, A. J., G rski, K. M., Lilje, P. B., 2004, ApJ, 605, 14
  • Eriksen et. al. (2007) Eriksen, H. K., Banday, A. J., G rski, K. M., Hansen, F. K., Lilje, P. B., 2007, ApJ, 660, L81
  • Fixsen (2009) Fixsen, D.J., 2009, ApJ, 707, 916
  • Foreman–Mackey (2016) Foreman–Mackey, D., 2016, JOSS, 24
  • Frolop & Scott (2016) Frolop, A., Scott, D., 2016, arXiv:1603.09703
  • G rski et. al. (2005) G rski, K.M., Hivon, E., Banday, A.J., Wandelt, B.D., Hansen, F.K., Reinecke, M., Bartelmann, M., 2005, Ap.J., 622, 759
  • Gruppuso (2014) Gruppuso, A., 2014, MNRAS 437, 2076
  • Gruppuso & Sagnotti (2014) Gruppuso, A., Sagnotti, A., 2014, IJMPD, 24, 1544008-469
  • Hansen et. al. (2004) Hansen, F. K., Cabella, P., Marinucci, D., Vittorio, N., 2004, ApJL, 607, L67
  • Hansen et. al. (2009) Hansen, F. K., Banday, A. J., G rski, K. M., Eriksen, H. K., Lilje, P. B., 2009, ApJ, 704, 1448
  • Hajian (2007) Hajian, A., 2007, preprint (astro-ph/0702723)
  • Hinshaw et. al. (1996) Hinshaw, G., Branday, A. J., Bennett, C. L., Gorski, K. M., Kogut, A., Lineweaver, C. H., Smoot, G. F., Wright, E. L., 1996, ApJ, 464, L25
  • Hoftuft et. al. (2009) Hoftuft, J., Eriksen, H. K., Banday, A. J., G rski, K. M., Hansen, F. K.; Lilje, P. B., 2009, ApJ, 699, 985
  • Kim & Naselsky (2010a) Kim, J., Naselsky, P., 2010, ApJ, 714, 265
  • Kim & Naselsky (2010b) Kim, J., Naselsky, P., 2010, Phys. Rev. D., 82, 0603002
  • Kim & Naselsky (2011) Kim, J., Naselsky, P., 2011, ApJ, 739, 79
  • Land & Magueijo (2005a) Land, K., and Magueijo, J., 2005, Phys. Rev. D., 72, 101302
  • Land & Magueijo (2005b) Land, K., and Magueijo, J., 2005, Phys. Rev. D., 95, 071301
  • Liu et. al. (2014) Liu, Z., Guo, Z., Piao, Y., 2014, EPJC, 74, 3006L
  • Lello et. al. (2014) Lello, L., Boyanovsky, D., Holman, R., 2014, Phys. Rev. D., 89, 063533
  • Niarchou & Jaffe (2006) Niarchou, A., and Jaffe, A. H., 2006, AIPC, 848, 774
  • O’Dwyer et. al. (2016) O’Dwyer, M., Copi, C. J., Knox, L., Starkman, G. D., 2016, arXiv:1608.02234
  • Planck XV (2014) Planck Collaboration XV, 2014, A&A, 571, A15
  • Planck XVII (2014) Planck Collaboration XVII, 2014, A&A, 571, A17
  • Planck XXIII (2014) Planck Collaboration XXIII, 2014, A&A, 571, A23
  • Planck XXVII (2014) Planck Collaboration XXVII, 2014, A&A, 571, A27
  • Planck XI (2016) Planck Collaboration XI, 2016, A&A, 594, A11
  • Planck XIII (2016) Planck Collaboration XIII, 2016, A&A, 594, A13
  • Planck XV (2016) Planck Collaboration XV, 2016, A&A, 594, A15
  • Planck XVI (2016) Planck Collaboration XVI, 2016, A&A, 594, A16
  • Planck LI (2016) Planck Collaboration LI, 2016, arXiv: 1608.02487
  • Schwarz et. al. (2004) Schwarz, D. J., Starkman, G. D., Huterer, D., Copi, C. J., 2004, Phys. Rev. Let., 93, 221301
  • Schwarz et. al. (2015) Schwarz, D.J., Copi, C.J., Huterer, D., Starkman, G., 2015, arXiv: 1510.07929v1
  • Spergel et. al. (2003) Spergel, D. N., Verde, L., Peiris, H. V., Komatsu, E., Nolta, M. R., Bennett, C. L., Halpern, M., Hinshaw, G., Jarosik, N., Kogut, A., Limon, M., Meyer, S. S., Page, L., Tucker, G. S., Weiland, J. L., Wollack, E., Wright, E. L., 2003, ApJS, 148, 175
  • Stevens, Silk, & Scott (1993) Stevens, D., Scott, D., Silk, J., 1993, Phys Rev Lett, 71, 20
  • Yoho et. al. (2014) Yoho, A., Copi, C. J., Starkman, G. D., Kosowsky, A., 2014, MNRAS, 442, 2392
  • Zhang & Huterer (2010) Zhang, R., Huterer, D., 2010, APh, 33, 69

Appendix A Comparison to other Results

In order to help verify our methods, we approximately repeated one of the S1/2S_{1/2} calculations done by Copi et. al. (2015a), attempting to use the same methods and data that they had used, in order to obtain the same results. One of the data sets which they used included the Planck PR1 SMICA map and the so-called U74 mask, for which Copi et. al. (2015a) found S1/2S_{1/2} =1577.7μK4=1577.7~{\rm\mu K}^{4}, and using an ensemble of 10610^{6} simulations, a pp–value of 0.1910.191 per cent. We obtained the publicly available PR1 SMICA map and the U73 mask, but did not attempt to reproduce the U74 map, which was designed as an approximation to U73, since it was not publicly available at the time of the previous analysis. We also modified our mask and map degradation procedures slightly to match what Copi et. al. (2015a) did in their analysis: they used the healpix ud_grade function, rather than the harmonic space window-weighting method, as well as a mask threshold of 0.8, rather than our 0.9.

We nearly reproduced their result, finding S1/2S_{1/2} =1527.5μK4=1527.5~{\rm\mu K}^{4}, and with an ensemble of 10510^{5} simulations, found a pp–value of 0.2470.247 per cent. To estimate the sampling uncertainty, as in section 4, we suppose that our measured S1/2S_{1/2} pp–value was drawn from a Poissonian distribution and calculate the standard deviation as 247/105.016\sqrt{247}/10^{5}\simeq.016 per cent. Thus the difference between our p-value and that of Copi et. al. (2015a) is about 3.6σ3.6\sigma. This difference in pp–values is likely due to the difference in the masks that we used (U73 vs. U74).