On the accuracy of time-delay cosmography in the Frontier Fields Cluster MACS J1149.5+2223 with supernova Refsdal
Abstract
We study possible systematic effects on the values of the cosmological parameters defining the expansion rate and the geometry of the universe measured through strong lensing analyses of the Hubble Frontier Fields galaxy cluster MACS J1149.5+2223. We use the observed positions of a large set of spectroscopically selected multiple images, including those of supernova “Refsdal” with their estimated time delays. Starting from our reference model in a flat CDM cosmology, published in Grillo et al. (2018), we confirm the relevance of the longest measurable time delay, between SX and S1, and an approximately linear relation between its value and that of . We perform true blind tests by considering a range of time delays around its original estimate of days, as an accurate measurement of this time delay has not been published to date and it is not known at the time of writing. We investigate separately the impact of a constant sheet of mass at the cluster redshift, of a power-law profile for the mass density of the cluster main halo and of some scatter in the cluster member scaling relations. Remarkably, we find that these systematic effects do not introduce a significant bias on the inferred values of and and that the statistical uncertainties dominate the total error budget: a 3% uncertainty on the time delay of image SX translates into approximately 6% and 40% (including both statistical and systematic ) uncertainties for and , respectively. Furthermore, our model accurately reproduces the extended surface brightness distribution of the supernova host, covering more than HST pixels. We also present the interesting possibility of measuring the value of the equation-of-state parameter of the dark energy density, currently with a 30% uncertainty. We conclude that time-delay cluster lenses, when characterized with sufficient spectroscopic data, have the potential to become soon an alternative and competitive cosmological probe.
1 Introduction
After nearly a century from its first estimates (Lemaˆıtre, 1927; Hubble, 1929), the exact value of the cosmic expansion rate, the Hubble constant , is still hotly debated. The most recent measurements of , based on the distance ladder from the SHOES program (Riess et al., 2019) and from the Planck satellite (Planck Collaboration VI, 2018), cannot be reconciled. Accurate and precise estimates from independent methods and calibrations (e.g., Abbott et al. 2017; Freedman et al. 2019; Reid et al. 2013; Wong et al. 2019) are thus crucial to clarify whether an extention of the standard cosmological model is needed. Time delays between the multiple images of supernovae (SNe) and quasars (QSOs) strongly lensed by galaxy clusters can place interesting new constraints on the values of and of the parameters defining the global geometry of the Universe (e.g., Refsdal, 1964; Chen et al., 2019; Wong et al., 2019; Shajib et al., 2019).
In this paper, we use a full strong lensing model of the Hubble Frontier Fields (HFF) galaxy cluster MACS J1149.52223 (hereafter MACS 1149; ), where the first multiply-imaged and spatially-resolved SN “Refsdal” () was discovered (Kelly et al., 2015, 2016a). We extend our previous analysis (Grillo et al. 2018, hereafter G18) and show that a large set of spectroscopically confirmed multiple images from different sources (Treu et al. 2016; Grillo et al. 2016, hereafter G16) and the measured time delays between the multiple images of a variable source (Kelly et al. 2016b; Rodney et al. 2016) in a lens galaxy cluster can provide accurate measurements of the values of the Hubble constant and of cosmological parameters, which are competitive with those from other cosmological probes. Alternative methods exploiting SN Refsdal for cosmography were presented in other studies (Vega-Ferrero et al. 2018; Williams & Liesenborgs 2019).
2 Methods
Our reference strong lensing model of MACS 1149 (labeled as r/REF in the following tables and figures) is the one detailed in G16 and G18. We refer the reader to those papers for a comprehensive decription. Here, we summarize briefly its main caracteristics. To reconstruct the cluster total mass distribution and infer the values of the considered cosmological parameters, we use the observed positions of 89 multiple images from 28 different point-like sources, with redshifts between 1.240 and 3.703, and the values of the time delays of the images S2, S3, and S4, relative to S1, of SN Refsdal, as measured by using polynomial fitting by Rodney et al. (2016). The value of the SX-S1 time delay () is based on the first results by Kelly et al. (2016b) and is varied between 315 and 375 days, with a constant uncertainty of 10 days. The cluster total mass distribution is modeled with a combination of 3 cored elliptical pseudo-isothermal and 300 dual pseudo-isothermal mass density profiles for, respectively, the extended dark-matter halos and the galaxy members of the cluster. Flat () CDM models with uniform priors on the values of , between 20 and 120 km s-1 Mpc-1, and , between 0 and 1, are considered. The software Glee (Suyu & Halkola, 2010; Suyu et al., 2012) is used to conduct the entire strong-lensing study.
First, to analyze the dependence of the inferred value of on the observed value of the time delay between SX and S1, we perform full statistical analyses with the reference model and nine different values for this time delay (i.e., 315, 322, 330, 337, 345, 353, 360, 368 or 375 days), keeping its uncertainty fixed to 10 days. We also check the importance of the SX-S1 time delay in estimating the values of and by comparing the results of the reference model with and without this observable quantity.
Then, to test the effect of the most important sources of systematic uncertainties on the measurement of the values of the cosmological parameters, we consider additional models, where we (1) add to the reference model (with 315, 345 or 375 days for the SX-S1 time delay) a constant sheet of mass at the cluster redshift with a uniform prior between 0.2 and 0.2 on the value of the convergence k0 (models labeled as ); (2) substitute (with 315, 345 or 375 days for the SX-S1 time delay) the pseudo-isothermal profile (i.e., , in the outer regions ) of the cluster main halo with a more general power-law profile (i.e., , in the outer regions ) with a uniform prior between 1.4 and 2.6 on the value of (models labeled as ); (3) use five different realizations (all with 345 days for the SX-S1 time delay) of the values of the Einstein angles () and truncation radii () of the cluster members (models labeled as s1-s5), randomly scattered with normal distributions centered on the values obtained with total mass-to-light ratios increasing according to the galaxy near-IR luminosities and with standard deviations equal to 10% of their values (see Bergamini et al. 2019 for recent results on cluster member modeling with stellar kinematics information).
Next, to quantify the goodness of our best-fitting reference model (with a time delay between SX and S1 of 345 10 days), we compare the observed and model-predicted surface brightness distribution of the multiple images of the SN Refsdal host. We note that in this model all the sources are approximated as point-like objects and the extended surface brightness information has not been exploited yet. We then use the best-fitting reference model, based on the point images, to reconstruct the SN host galaxy on a grid of pixels on the source plane from the lensed and distorted multiple images of the SN host galaxy. To suppress the contaminating contribution of the cluster members’ light to the lensed SN host galaxy light, we use a linear combination of the HST F606W and F435W bands, with a pixel size of 0.06″, from the HFF project. Due to the very large number of pixels over which the multiple images of the SN Refsdal host galaxy extend, our computing memory sets a maximum size of pixels for the reconstruction grid of the source surface brightness. From the reconstructed SN host galaxy surface brightness on the source plane, we then lens it with the mass model to the image plane to compare with the observed lensed images.
Finally, we consider models (with 315, 345 or 375 days for the SX-S1 time delay) in a flat () CDM cosmology, in which the dark energy density is time dependent, with an equation-of-state parameter . Here, the value of is also free to vary between and , with a uniform prior.
3 Results
We remark that all the results presented here come from a blind analysis, because the exact value of the time delay was not known at the time of analysis and writing.

model | |||
---|---|---|---|
r | |||
315d | |||
r | |||
345d | |||
r | |||
375d | |||
s1 | |||
s2 | |||
345d | s3 | ||
s4 | |||
s5 |
We show in Figure 1 the median values of and the confidence level (CL) intervals obtained by varying the value of the time delay between SX and S1 from 315 to 375 days, with linear steps of 7-8 days, as described in Sect. 2. The inferred value of decreases from 79.0 to 66.4 km s-1 Mpc-1 as the time-delay value increases. The assumed constant uncertainty of 10 days on the SX-S1 time delay translates into (1) statistical errors for between 5.8 and 7.0%, with a median value of 6.1%, consistent with the results presented in G18.
We have also checked that an error of 2% on the value of days (i.e., 7 days) results into an approximately 1% smaller error on the value of (similarly to G18) and that a smaller uniform prior between 0.05 and 0.5 for the value of (as adopted, for instance, in Wong et al. 2019) reduces by more than 10% the error on its value and by less than 1% the error on the value of .


In Figure 2, we illustrate explicitly the relevance of the longest measurable time-delay value between the multiple images of SN Refsdal, i.e. the SX-S1 time delay, to the cosmological inference within our reference model. We contrast the inferred results for and with a days time delay and without this time delay. We obtain that the probability distribution function of changes dramatically, from unimodal with a well-defined peak and small dispersion to almost flat over the largest values considered within the uniform prior. Remarkably, the probability distribution function of is instead essentially unaltered, with very similar median values and (1) statistical errors.




We summarize in Table 1 and in Figures 3 and 4 the outcomes of our tests to estimate the systematic uncertainties on the values of and , as detailed in Sect. 2. The most important result is that, for a fixed value of the SX-S1 time delay, the inferred values of the cosmological parameters are not significantly affected by the modeling details (considering their errors). For instance, when we use a time-delay value of days for , the median values of and vary, respectively, between 72.9 and 73.9 km s-1 Mpc-1 and 0.34 and 0.40 and the results of our reference model are not biased low or high. The combination of the eight final MCMC chains of the models r, , , and s1-5 with days provides a global MCMC chain with samples with median values and 1 uncertainties of 73.3 km s-1 Mpc-1 and 0.37 and 6% and 40% for and , respectively. By comparing these results with those published in G18, we can conclude that the error budget on the estimate of is strikingly dominated by the statistical uncertainties and the known systematics play only a secondary role. We notice that the latter are slightly more relevant for the measurement of .
We remark that very similar general conclusions hold when the other two values, i.e. 315 and 375 days, of the SX-S1 time delay are used. As discussed above, the inferred median values of decrease on average with increasing values of . The results of these tests might also suggest that higher values of are preferred by higher values of . Moreover, the 1 errors of and decrease mildly with increasing values of the SX-S1 time delay. This reflects the decreasing relative errors of the assumed constant 10 day uncertainty on longer time delays.
The juxtaposition of the observed and model-predicted surface brightness distributions of the SN Refsdal host shown in Figure 5, similarly to the comparison shown in Figure 7 by G16, strongly supports the validity of our results. Over the selected regions, composed of more than HST pixels, the mean and standard deviation values of the normalized residuals (i.e., the difference between the observed and model-predicted surface brightness values divided by the observational uncertainties) are, respectively, equal to and . This means that the multiple images of the SN Refsdal host, with their many star-forming regions or knots, are very well reconstructed, both in their positions and fluxes. In Figure 6, we show the probability distribution of our normalized residuals and that of a normal distribution with the same standard deviation. We notice that the former has an almost perfectly symmetric shape and we measure that 78.5% (96.5%) of the total number of pixels have normalized residuals smaller than 1 (2), to be compared to the 68.3% (95.5%) probability of a standard normal distribution. The fraction of pixels with normalized residuals larger than 5 is 0.5% (% for a standard normal distribution). We have checked that the vast majority of these pixels are located around the SN host galaxy’s peaks of emission, where the finite resolution of the grid on the source plane, remaining small offsets between the observed and model-predicted image positions, and the so-far neglected contribution of the point spread function make accurate image reconstructions more difficult to achieve. Considering the relatively simple subtraction of the cluster member flux (that can still be improved), the residuals at the positions of the S1-S4 and SX multiple images of SN Refsdal are remarkably small.

Finally, in Figure 7 we show the inference on the values of the cosmological parameters , and . If we consider our reference model (with days) and vary at the same time all the strong lensing model parameters, we obtain the following noteworthy 1 CL constraints: km s-1 Mpc-1 for , for and for . The achieved precision on the values of and is linked to the combined information coming from the time delays and the positions of the multiple images of sources at different redshifts.
4 Discussion
From the results presented in the previous section, it is clear that the most accurate and precise measurements of the values of the cosmological parameters from MACS 1149 and SN Refsdal will require an additional run of the strong lensing models, that include the final measurement of the value and uncertainty of the SX-S1 time delay. Although some scaling relations between the value of and those of the cosmological parameters exist (as highlighted in G18 and above), the best option to infer the exact values of and and their errors, that aim to be competitive with the most recent ones from other cosmological probes, is definitely not through a simple interpolation of the results already obtained with the current plausible but approximate time delay. We have also demonstrated that a precise measurement of the value of is not achievable without some information on .
By comparing different results of the strong lens time delay method, with SN Refsdal in MACS 1149 (as shown here) and with lensed quasars in the galaxy-scale systems of the H0LiCOW program (Suyu et al. 2017), we can conclude that (1) the relative error on the inferred value of from a single (galaxy or cluster) strong lensing system is similar (mean value of 6.4% in Figure 2 of Wong et al. 2019), (2) in a single lens cluster, we have the additional interesting possibility of estimating the value of (and ), thanks to the observations of several multiple-image families with spectroscopically confirmed redshifts and to the measurement of the time-delay values between the multiple images of time-varying sources, and (3) the observed positions of many spectroscopic multiple images (some of which are crucial to constraining the lens tangential and radial critical curves at different redshifts) provide precise calibrations of all the different mass components (i.e., extended dark-matter halos, cluster members, and possibly hot gas) included in the modeling of a galaxy cluster and, thus, also a good approximation of the effect of the “environment” in the regions adjacent to where the time delays are measured. This is supported by the almost negligible impact of the inclusion in the () models of a constant sheet of mass at the lens redshift (Figure 3). In particular, as shown in Figure 4, the inferred value of k0 is approximately 0 and not correlated with that of , whereas this is not the case when only multiple images at a single redshift (i.e., that of SN Refsdal) are used (see Appendix A).
Finally, we notice that relatively large residuals have already been reported in close proximity to the most luminous pixels of multiply imaged QSOs with measured time delays in the image reconstruction of the lensed quasar and host galaxy surface brightness distributions (e.g., Wong et al. 2017). As in the case of SN Refsdal host, this can be ascribed mainly to the finite grid resolution on the source plane. We suggest that a comparison between the observed and model-predicted surface brightness distributions of SN Refsdal host, as that shown in Figures 5 and 6, should be used to test the goodness of any strong lensing model which provides estimates of the values of cosmological parameters from the time delays of the SN Refsdal multiple images. More in general, we believe that the extended surface brightness modeling of multiply imaged QSO and SN hosts, although computationally expensive, will help to make progress in time-delay cosmography with lens galaxy clusters, as already demonstrated with lens galaxies.
5 Conclusions
We have shown that accurate and precise estimates of the value of the Hubble constant can be obtained in lens galaxy clusters, when a large set of spectroscopically confirmed multiple images of sources at different redshifts and the time delays of the multiple images of a time-varying source are measured. In flat CDM models, we have demonstrated that a full and blind strong lensing analysis of the HFF galaxy cluster MACS J1149.52223, with an error of 10 days (i.e., 3%) on the longest measurable time delay between the multiple images of SN Refsdal provides approximately 6% and 40% total uncertainties for and , respectively. A range of values (315-375 days) for this time delay has been explored, since its accurate measurement is still not known at the time of analysis and writing. We have tested several possible sources of systematic uncertainties and confirmed that their contribution to the estimated values of the cosmological parameters is significantly less important than that of the statistical uncertainties. We have verified that the effect of the “mass-sheet degeneracy” is considerably reduced, thanks to the presence of tens of multiply imaged sources at different distances behind the lens. We have proved that the reconstructed extended surface brightness distribution of the SN host validates the reliability of our models. When the values of and are measured, we remark the importance of considering the full covariance between all the parameters defining the cosmological model and the total mass distribution of the cluster lens. We conclude that time-delay cluster lensing will turn into a new valuable cosmological tool, once more high-quality data of cluster strong lenses become available from the next deep and wide surveys.
Appendix A The mass-sheet degeneracy
We compare here the results of two () models with a constant sheet of mass at the cluster redshift and with days, where we consider either the entire sample of 89 multiple images from 28 sources at different redshifts (as presented and discussed above) or only the 63 multiple images from SN Refsdal and its host, all at . In Figure 8, we show the inferred values of and k0. We find that the use of many multiple images belonging to sources located at different distances reduces significantly the effect of the so-called “mass-sheet degeneracy” (Falco et al. 1985; Schneider 2019). At the 1 CL, the credible interval of k0 varies from [0.30,0.24] (extending over a large fraction of the adopted uniform prior, [−0.5,0.5], chosen here to illustrate better the parameter degeneracy), when sources at a single redshift are included in the model, to [0.08,0.06], when all sources at different redshifts are considered. This translates into an approximately difference in the median value of (and of ) and in a remarkable reduction by a factor of more than 3, from 21% to 6%, for its uncertainty (from 63% to 40% for the uncertainty on ).

References
- Abbott et al. (2017) Abbott, B. P., Abbott, R., Abbott, T. D., et al. 2017, Natur., 551, 85
- Bergamini et al. (2019) Bergamini, P., Rosati, P., Mercurio, A., et al. 2019, A&A, 631, 130
- Chen et al. (2019) Chen, G. C.-F., Fassnacht, C. D., Suyu, S. H., et al. MNRAS, 490, 1743
- Falco et al. (1985) Falco, E. E., Gorenstein, M. V., & Shapiro, I. I. 1985, ApJ, 289, 1
- Freedman et al. (2019) Freedman, W. L., Madore, B. F., Hatt, D., et al. 2019, ApJ, 882, 34
- Grillo et al. (2016) Grillo, C., Karman, W., Suyu, S. H., et al. 2016 (G16), ApJ, 822, 78
- Grillo et al. (2018) Grillo, C., Rosati, P., Suyu, S. H., et al. 2018 (G18), ApJ, 860, 94
- Hubble (1929) Hubble, E. 1929, PNAS, 15, 168
- Kelly et al. (2015) Kelly, P. L., Rodney, S. A., Treu, T., et al. 2015, Science, 347, 1123
- Kelly et al. (2016a) Kelly, P. L., Brammer, G., Selsing, J., et al. 2016a, ApJ, 831, 205
- Kelly et al. (2016b) Kelly, P. L., Rodney, S. A., Treu, T., et al. 2016b, ApJ, 819, 8
- Lemaˆıtre (1927) Lemaître, G. 1927, ASSB, 47, 49
- Planck Collaboration VI (2018) Planck Collaboration VI, arXiv:1807.06209
- Refsdal (1964) Refsdal, S. 1964, MNRAS, 128, 307
- Reid et al. (2013) Reid, M. J., Braatz, J. A., Condon, J. J., et al. 2013, ApJ, 767, 154
- Riess et al. (2019) Riess, A. G., Casertano, S., Yuan, W., Macri, L. M., Scolnic, D., ApJ, 876, 85
- Rodney et al. (2016) Rodney, S. A., Strolger, L.-G., Kelly, P. L., et al. 2016, ApJ, 820, 50
- Schneider & Sluse (2013) Schneider, P. & Sluse, D. 2013, A&A, 559, 37
- Schneider (2019) Schneider, P. 2019, A&A, 624, 54
- Shajib et al. (2019) Shajib, A. J., Birrer, S., Treu, T., et al. 2019, arXiv:1910.06306
- Suyu & Halkola (2010) Suyu, S. H., & Halkola, A. 2010, A&A, 524, A94
- Suyu et al. (2012) Suyu S. H., Hensel S. W., McKean J. P., et al. 2012, ApJ, 750, 10
- Suyu et al. (2017) Suyu, S. H., Bonvin, V., Courbin, F., et al. 2017, MNRAS, 468, 2590
- Treu et al. (2016) Treu, T., Brammer, G. B., Diego, J. M., et al. 2016, ApJ, 817, 60
- Vega-Ferrero et al. (2018) Vega-Ferrero, J., Diego, J. M., Miranda, V., & Bernstein, G. M. 2018, ApJ, 853, 31
- Williams & Liesenborgs (2019) Williams, L. L. R. & Liesenborgs, J. 2019, MNRAS, 482, 5666
- Wong et al. (2017) Wong, K. C., Suyu, S. H., Auger, M. W., et al. 2017, MNRAS, 465, 4895
- Wong et al. (2019) Wong K. C., Suyu, S. H., Chen, G. C.-F., et al., arXiv:1907.04869