This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Cosmological constraints of Palatini f()f(\mathcal{R}) gravity

Débora Aguiar Gomes    Rebecca Briffa    Aleksander Kozak    Jackson Levi Said    Margus Saal    and Aneta Wojnar 0000-0002-1545-1483
Abstract

In this study, we investigate a Palatini f(R)f(R) gravity model featuring a quadratic term correction, aligning it with the most recent expansion rate data, with a particular focus on the latest SNIa and BAO data. Our analysis employs CC data as the fundamental dataset, complemented by contributions from the SN sample and a combination of non-overlapping transversal BAO datasets. We conduct a comprehensive MCMC analysis for each data set combination, yielding constraints on all cosmological parameters within the model. Additionally, we incorporate the latest Hubble constant value from the SH0ES Team. Finally, we present a statistical comparison between the Palatini quadratic model and Λ\LambdaCDM using the AIC and BIC metrics, ultimately obtaining the constraint |α|1049m2|\alpha|\leq 10^{49}\,\text{m}^{2}. We also stress the significance of studying stellar and substellar objects for obtaining more precise constraints on modified gravity compared to those derived from cosmological observations.

1 Introduction

The standard Λ\LambdaCDM concordance model continues to be the strongest model that is consistent with observational measurements at all scales of cosmology [1, 2]. In this setting, cold dark matter (CDM) acts on large scale structures [3, 4, 5], whereas dark energy takes on the form of a cosmological constant (Λ\Lambda) [6], and acts in the late Universe to produce cosmic acceleration. While great efforts have been put into the cosmological constant, internal consistency issues persist [7]. On the other hand, dark matter particles remain elusive in particle physics observatories [8]. In addition, other important problems plague the Λ\LambdaCDM model such as the lithium problem [9] in Big Bang nucleosynthesis measurements, as well as anomalies in the cosmic microwave background (CMB) radiation ranging from the lack of large-scale temperature correlations [10] to asymmetries in all-sky surveys [11, 12], as well as other open questions [13, 14]. Besides these long-standing issues, recently, the effectiveness in the ability of Λ\LambdaCDM to describe certain combinations of data sets has been called into question [15, 16, 17, 18]. This is called the cosmic tensions problem, and poses a foundational challenge to the Λ\LambdaCDM concordance model. The tension between direct measurements of cosmic expansion in the late Universe [19, 20, 21], and model-dependent observations using data from the early Universe [13, 22, 23] is a growing statistical feature of recent studies. While it may be that this is an artefact of the type of measurement technique being employed [24, 25, 26, 27], but the consistency of the appearance of this tension seems to indicate the potential requirement for a new physical model of cosmology [28, 29].

The combined challenges to the Λ\LambdaCDM model has led to a re-evaluation of competing formulations of different cosmological models [30, 2, 31, 32, 33, 34, 35, 36, 37, 38]. The paradigm of early- and late-Universe emerges out of the cosmic tensions problem itself, and has prompted attempts at resolutions to the problem that focus in these separate sectors individually. Late-time solutions alter the evolution of the Universe while preserving its early-time behavior. Models of this nature range from decaying dark matter models [39] as well as decaying dark energy models [40]. These sometimes include interaction terms [41, 42]. There are also running vacua models [43], as well as cosmological constant sign switching models [44]. However, the combination of supernovae of Type Ia (SNIa), cosmic chronometer (CC), and baryonic acoustic oscillation (BAO) data severely curtails the possibility of resolving the cosmic tensions problem using only late-time modifications to cosmology [45].

On the other end of the spectrum there have been many early Universe proposals which are principally focused on decreasing the sound horizon, since this infers a higher Hubble constant for a fixed CMB angular scale. Early Universe dark energy models [46] are the most studied proposals in this sector, whereas models of neutrinos beyond the standard particle physics model are also potentially active in this field [47], among many others. The decrease in the sound horizon does pose a problem for these models since it may jeopardize the compatibility of these cosmological models with the growth of large scale structures [48]. Altogether, this points to a global rescaling of the Hubble evolution of the Universe [49]. This may include a combination of modifications to the gravitational and matter sectors.

In recent years, there have been several proposals for a diversity of different classes of modifications to the gravitational sector, which have been shown to tackle a range of physical problems [31, 50, 51, 2, 52, 53]. One interesting class of models are Ricci-based ones, among which Palatini f(R)f(R) is the simplest, apart from General Relativity (GR), and the most studied extensions of the Einstein’s proposal.

Research in Palatini f(R)f(R) gravity, which is comprehensively reviewed in [54], has encompassed a wide spectrum of cosmological inquiries. Notably, [55] unveiled the derivation of the Palatini effective Lagrangian from Loop Quantum Cosmology, shedding light on its covariant characteristics. The authors of [56] delved into the examination of singularities and constraints, particularly with respect to the possibility of inflation, employing Chaplygin gas models. The exploration of bouncing solutions and perturbations within this framework was documented in [57]. In contrast, [58] focused on the reconstruction of a class of Palatini Lagrangians using cosmography and analytical approximations, incorporating Planck data to determine key parameters such as Ωm\Omega_{m}. The consideration of torsion, as illustrated in [59], extended the theory, leading to scalar-tensor Palatini gravity, though it raised concerns about its stability. The examination of Noether symmetries in a cosmological context was the subject of [60]. Early-time inflation and late-time cosmic acceleration without invoking dark energy were investigated in [61], while the derivation of the effective equation of state in Palatini cosmology was presented in [62].

Furthermore, an avenue of research, as exemplified by the studies [63, 64], probed into bouncing cosmologies, scrutinizing scenarios both isotropic and anisotropic. Simultaneously, [65] centered its focus on comprehending the formation of large-scale structures. Additionally, research conducted by a separate group [66] was dedicated to exploring cosmological perturbations within the Palatini f(R)f(R) gravity framework. Lastly, a distinct approach to the study of cosmological singularities was undertaken by another set of researchers [67], significantly enhancing our understanding of the singular behavior inherent in this gravitational model.

On the other hand, research in Palatini gravity with scalar fields has also yielded a diverse array of insights. Among others, [68] explored the non-minimal coupling of scalar fields in Palatini gravity, employing a Noether symmetry approach. Additionally, the concept of Palatini Higgs inflation has been rigorously examined in a series of studies [69, 70, 71, 72, 73]. Furthermore, the interplay of inflationary dynamics and Palatini quadratic gravity has been extensively explored in various works, including [74, 75, 76, 77], and others. The specific dynamics of tachyonic preheating in Palatini R2R^{2} inflation were elucidated in [78]. Moreover, quintessential inflation within Palatini f(R)f(R) gravity was analyzed in [79, 80, 81]. Lastly, the comparison between metric and Palatini formulations of inflation was studied in various works, such as [82, 83, 84, 85, 86].

The main purpose of this paper is to test the well-known Starobinsky (quadratic) model in the Palatini approach in the context of late-time cosmic expansion by comparing its predictions against empirical data and performing a suitable statistical analysis. The next section introduces the necessary theoretical background, including the aforementioned Palatini approach to modified gravity, that will elucidate the key differences between the model considered here and the Λ\LambdaCDM model. We present the modified Friedmann equation, that will be the basis for our numerical calculations. Next, the data sets used in our investigations will be described, as well as the methodology allowing us to impose constraints on the parameters introduced by the model. We then employ an MCMC analysis of the quadratic model and present the results for the Hubble parameter, matter content, the absolute magnitude of SNIa, and the Starobinsky parameter α\alpha, accompanied by two statistical criteria assessing the performance of the model compared to Λ\LambdaCDM. At the end, we draw some conclusions and summarize the results of the paper.

2 Palatini gravity

2.1 Preliminaries

Palatini gravity belongs to the category of metric-affine models of gravity, representing the simplest example of this framework. In this approach, the metric tensor gg and the connection Γ^\hat{\Gamma} are considered as independent entities. Consequently, to derive the field equations governing this gravitational theory, one must perform variations with respect to both of these variables. The action functional for this theory is given as follows:

S=12κ2d4xgf()+Sm[gμν,ψm].S=\frac{1}{2\kappa^{2}}\int d^{4}x\sqrt{-g}f(\mathcal{R})+S_{m}[g_{\mu\nu},\psi_{m}]. (2.1)

Here, g\sqrt{-g} represents the determinant of the metric tensor, f()f(\mathcal{R}) is a function of the curvature scalar \mathcal{R}, and SmS_{m} represents the action for matter fields ψm\psi_{m}, which exclusively depends on the metric tensor gμνg_{\mu\nu}. Importantly, the matter action SmS_{m} remains entirely independent of the connection Γ^\hat{\Gamma}. Although it is conceivable to extend the matter action to encompass a dependence on Γ^\hat{\Gamma}, such an augmentation is deemed pertinent primarily in the context of fermionic interactions as opposed to classical fluid dynamics on a large scale, and thus, we shall omit its consideration in this context.

Furthermore, the curvature scalar \mathcal{R} is constructed utilizing both the metric tensor and the Palatini-Ricci curvature tensor μν(Γ)\mathcal{R}_{\mu\nu}(\Gamma). Specifically, it can be expressed as =gμνμν(Γ)\mathcal{R}=g^{\mu\nu}\mathcal{R}_{\mu\nu}(\Gamma). It’s noteworthy that μν\mathcal{R}_{\mu\nu} must maintain symmetry to prevent instabilities [87, 88, 89].

Should the functional form of ff exhibit linearity with respect to \mathcal{R}, we recover GR. In scenarios involving vacuum or spacetime dominated by pure radiation, we effectively deal with GR featuring a cosmological constant. However, when considering ff as an arbitrary function of \mathcal{R}, we encounter a distinct spacetime structure [90, 91, 54]. Significantly, within our context, these modifications carry implications for the description of stellar phenomena [92] and early and late cosmology [90, 91, 93, 94]. Let us delve deeper into this subject.

Variation of Eq. (2.1) solely with respect to the metric tensor gμνg_{\mu\nu} results in the following field equations:

f()μν12f()gμν=κ2Tμν,f^{\prime}(\mathcal{R})\mathcal{R}_{\mu\nu}-\frac{1}{2}f(\mathcal{R})g_{\mu\nu}=\kappa^{2}T_{\mu\nu}, (2.2)

where TμνT_{\mu\nu} represents the energy-momentum tensor for the matter field and can be derived in the conventional manner:

Tμν=2gδSmδgμν.T_{\mu\nu}=-\frac{2}{\sqrt{-g}}\frac{\delta S_{m}}{\delta g_{\mu\nu}}. (2.3)

In our scenario, we assume the matter content to be in the form of a perfect fluid. In the aforementioned equations, the primes denote derivatives with respect to the argument of the function, denoted as f()=df()/df^{\prime}(\mathcal{R})=df(\mathcal{R})/d\mathcal{R}. Evidently, for linear f()f(\mathcal{R}), these equations reduce to the familiar equations of GR.

Conversely, the variation with respect to the independent connection Γ^\hat{\Gamma} can be expressed as [93, 95]:

^β(gf()gμν)=0.\hat{\nabla}_{\beta}(\sqrt{-g}f^{\prime}(\mathcal{R})g^{\mu\nu})=0. (2.4)

Here, ^β\hat{\nabla}_{\beta} denotes the covariant derivative calculated with respect to Γ^\hat{\Gamma}. Notably, by introducing a new metric tensor defined as:

g^μν=f()gμν,\hat{g}_{\mu\nu}=f^{\prime}(\mathcal{R})g_{\mu\nu}, (2.5)

we can reformulate Eq. (2.4) as follows:

β(g^g^μν)=0,\nabla_{\beta}(\sqrt{-\hat{g}}\hat{g}^{\mu\nu})=0, (2.6)

provided that the independent connection Γ^\hat{\Gamma} corresponds to the Levi-Civita connection associated with the metric g^μν\hat{g}_{\mu\nu}. It is also evident that, for linear ff, the connection Γ^\hat{\Gamma} reduces to the Levi-Civita connection Γ\Gamma associated with the metric gμνg_{\mu\nu}.

Furthermore, we can gain deeper insights into the characteristics of this gravitational theory by scrutinizing the gg-trace of Eq. (2.2):

f()2f()=κ2T,f^{\prime}(\mathcal{R})\mathcal{R}-2f(\mathcal{R})=\kappa^{2}T, (2.7)

where TT represents the trace of the energy-momentum tensor. This expression unveils the possibility of algebraically expressing \mathcal{R} as a function of matter fields, contingent on a given f()f(\mathcal{R}). In instances featuring vacuum and/or pure radiation-dominated spacetime, where T=0T=0, we can solve Eq. (2.7), resulting in the theory converging to GR with a cosmological constant. Additionally, it can be established that this theory does not introduce any supplementary degrees of freedom beyond those already inherent in GR. This contrasts with the case of metric f(R)f(R) gravity, where the relationship between RR and TT arises through a differential equation, thereby introducing an extra degree of freedom, often interpreted as a scalar field, subject to appropriate transformations.

In our subsequent analysis, we will consider the simplest extension of GR, often referred to as the Starobinsky or quadratic model, which is expressed as:

f()=+α2,f(\mathcal{R})=\mathcal{R}+\alpha\mathcal{R}^{2}, (2.8)

where α\alpha serves as a parameter. In this scenario, the solution to Eq. (2.7) adopts the form of GR, i.e., =κ2T\mathcal{R}=-\kappa^{2}T. This signifies that any modifications manifesting in the field equations are contingent on matter fields and are parametrized by a solitary parameter, α\alpha.

2.2 Palatini cosmology

As was already mentioned, Palatini f()f(\mathcal{R}) gravity does not introduce any additional degrees of freedom and reduces to GR with a cosmological constant in the vacuum. Due to this fact, Palatini theories find applications in modeling the late-time acceleration of the Universe [54, 96], where curvature functions of type f()=+αnnf(\mathcal{R})=\mathcal{R}+\alpha_{n}\mathcal{R}^{n} are of particular interest. It was also found that the choice f()=n(t)f(\mathcal{R})=\mathcal{R}^{n(t)}, where the exponent depends on the temperature and varies from n=1n=1 in the low-energy limit, and n=2n=2 at the early Universe, yields a scale-invariant description of gravity in high-energy limit, reduces to GR at later times, and evades Ostroradsky’s instability [97].

In what follows, we will assume the quadratic model in the Palatini approach, i.e. f()=+α2f(\mathcal{R})=\mathcal{R}+\alpha\mathcal{R}^{2}, and the standard Friedmann-Lemaître-Robertson-Walker metric:

gμν=diag(1,a21kr2,a2r2,a2r2sin2θ)g_{\mu\nu}=\text{diag}\left(-1,\frac{a^{2}}{1-kr^{2}},a^{2}r^{2},a^{2}r^{2}\sin^{2}\theta\right) (2.9)

and the stress-energy tensor for the perfect fluid:

Tμν=(p+ρ)uμuν+pgμν,T=3pρ,T_{\mu\nu}=(p+\rho)u_{\mu}u_{\nu}+pg_{\mu\nu},\quad T=3p-\rho, (2.10)

satisfying the continuity equation μ(g)Tμν=0\nabla^{(g)}_{\mu}T^{\mu\nu}=0, where (g)\nabla^{(g)} is the covariant derivative calculated using the metric tensor gμνg_{\mu\nu}. The continuity equation allows us to write:

=3H02(Ωm,0a3+4ΩΛ,0).\mathcal{R}=3H_{0}^{2}(\Omega_{m,0}a^{-3}+4\Omega_{\Lambda,0})\,. (2.11)

It must be noted that the two Ricci tensors, Palatini μν\mathcal{R}_{\mu\nu} and metric RμνR_{\mu\nu}, are related to each other via a conformal transformation since μν=μν(f(T)g)\mathcal{R}_{\mu\nu}=\mathcal{R}_{\mu\nu}(f^{\prime}(T)g), so that111Let us notice that the curvature scalar \mathcal{R} is not the curvature scalar built from the conformal metric only, R^(f(T)g)\hat{R}(f^{\prime}(T)g), but rather it is a contraction of gμνg^{\mu\nu} and the Palatini Ricci tensor μν(f(T)g)\mathcal{R}_{\mu\nu}(f^{\prime}(T)g). [98]

μν=Rμν+321(f(T))2μf(T)νf(T)1f(T)(μ(g)ν(g)12gμν(g))f(T).\begin{split}\mathcal{R}_{\mu\nu}&=R_{\mu\nu}+\frac{3}{2}\frac{1}{(f^{\prime}(T))^{2}}\partial_{\mu}f^{\prime}(T)\partial_{\nu}f^{\prime}(T)\\ &-\frac{1}{f^{\prime}(T)}\left(\nabla^{(g)}_{\mu}\nabla^{(g)}_{\nu}-\frac{1}{2}g_{\mu\nu}\Box^{(g)}\right)f^{\prime}(T)\,.\end{split} (2.12)

To get the Friedmann equation, one needs to plug relations (2.11) and (2.12) to the field equations (2.2) and get rid of a′′a^{\prime\prime} from one of the formulas. Upon doing so, one arrives at the following equation for the quadratic model [94, 99, 56, 100]:

H2H02\displaystyle\frac{H^{2}}{H_{0}^{2}} =b2(b+d2)2[Ωα(Ωm,0a3+ΩΛ,0)2×(K3)(K+1)2b\displaystyle=\frac{b^{2}}{\left(b+\frac{d}{2}\right)^{2}}\left[\Omega_{\alpha}(\Omega_{\text{m},0}a^{-3}+\Omega_{\Lambda,0})^{2}\times\frac{(K-3)(K+1)}{2b}\right.
+(Ωm,0a3+ΩΛ,0)+Ωr,0a4b+Ωk],\displaystyle\left.+(\Omega_{\text{m,0}}a^{-3}+\Omega_{\Lambda,0})+\frac{\Omega_{\text{r},0}a^{-4}}{b}+\Omega_{k}\right], (2.13)

where

Ωk\displaystyle\Omega_{k} =kH02a2,\displaystyle=-\frac{k}{H_{0}^{2}a^{2}}, (2.14)
Ωr,0\displaystyle\Omega_{\text{r},0} =κ2ρr,03H02,\displaystyle=\frac{\kappa^{2}\rho_{\text{r,0}}}{3H_{0}^{2}}, (2.15)
Ωm,0\displaystyle\Omega_{\text{m},0} =κ2ρm,03H02,\displaystyle=\frac{\kappa^{2}\rho_{\text{m,0}}}{3H_{0}^{2}}, (2.16)
ΩΛ,0\displaystyle\Omega_{\Lambda,0} =κ2ρΛ,03H02,\displaystyle=\frac{\kappa^{2}\rho_{\Lambda,0}}{3H_{0}^{2}}, (2.17)
K\displaystyle K =3ΩΛ,0(Ωm,0a3+ΩΛ,0),\displaystyle=\frac{3\Omega_{\Lambda,0}}{(\Omega_{\text{m,0}}a^{-3}+\Omega_{\Lambda,0})}, (2.18)
Ωα\displaystyle\Omega_{\alpha} =3αH02,\displaystyle=3\alpha H_{0}^{2}, (2.19)
b\displaystyle b =f(R^)=1+2Ωα(Ωm,0a3+4ΩΛ,0),\displaystyle=f^{\prime}(\hat{R})=1+2\Omega_{\alpha}(\Omega_{\text{m,0}}a^{-3}+4\Omega_{\Lambda,0}), (2.20)
d\displaystyle d =1Hdbdt=2Ωα(Ωm,0a3+ΩΛ,0)(3K).\displaystyle=\frac{1}{H}\frac{db}{dt}=-2\Omega_{\alpha}(\Omega_{\text{m,0}}a^{-3}+\Omega_{\Lambda,0})(3-K)\,. (2.21)

From the above one can check that the standard model (2.22) can be reconstructed in the limit α0\alpha\mapsto 0:

H2H02=Ωr,0a4+Ωm,0a3+ΩΛ,0+Ωk.\frac{H^{2}}{H_{0}^{2}}=\Omega_{\text{r},0}a^{-4}+\Omega_{\text{m},0}a^{-3}+\Omega_{\Lambda,0}+\Omega_{k}. (2.22)

In what follows, we will skip the spatial curvature term Ωk\Omega_{k}.

3 Constraints from Observational Data

In this section, we start with a description of the observational data that we use together with a brief explanation of how we implement this into our analysis pipeline. We then go on to the main constraint results which is composed of a series of Markov Chain Monte Carlo (MCMC) constraint analyses which we use to both set limits on the parameters in the model, with an emphasis on the new Ωα\Omega_{\alpha} parameter, as well as to assess how the model compared with the standard Λ\LambdaCDM cosmological model in each case.

3.1 Observational Data

We devote this part of the manuscript to the description of the data sets being used in our analysis. Our baseline data set consists of Hubble expansion data which we then complement with supernova Type Ia (SNIa) and Baryonic acoustic oscillation (BAO) data. We also include the recent release of the SH0ES value for the Hubble constant [101], H0S22=73.04±1.04kms1Mpc1H_{0}^{\rm S22}=73.04\pm 1.04\,{\rm km\,s}^{-1}{\rm Mpc}^{-1}, which has an associated reported absolute magnitude of MS22=19.253±0.027magM^{\rm S22}=-19.253\pm 0.027\,{\rm mag}. The other data sets are

  • Cosmic Chronometers (CC) – We use the public list of 3131 data points [102, 103, 104, 105, 106, 107, 108] that utilize the CC method. In this approach, a spectroscopic dating technique is used for passively evolving galaxies. In this way, direct measurements are obtainable for the Hubble parameter up to z2z\lesssim 2. CC also has the advantage that it is independent of any cosmological model as well as the the Cepheid distance scale. On the other hand, some assumptions are made on the modeling of stellar ages. For two passively aging galaxies, observations can constrain Δz/Δt\Delta z/\Delta t which can inturn be used to estimate the Hubble parameter at that redshift through H(z)=(1+z)1Δz/ΔtH(z)=-(1+z)^{-1}\Delta z/\Delta t. The corresponding χH2\chi^{2}_{H} estimator is represented by

    χH2=i=131(H(zi,Θ)Hobs(zi))2σH2(zi),\chi^{2}_{H}=\sum^{31}_{i=1}\frac{\left(H(z_{i},\Theta)-H_{\mathrm{obs}}(z_{i})\right)^{2}}{\sigma^{2}_{H}(z_{i})}\,, (3.1)

    where H(zi,Θ)H(z_{i},\Theta) are the theoretical model Hubble parameter values at redshift ziz_{i} with model parameters Θ\Theta while Hobs(zi)H_{\mathrm{obs}}(z_{i}) are the corresponding Hubble data values at ziz_{i} with an associated uncertainty of σH(zi)\sigma_{H}(z_{i}).

  • Pantheon+ Compilation – This compilation is based on SNIa observations that are calibrated using a Cepheid distance scale. These events occur in binary stellar systems with uniform intrinsic brightness making them ideal standard candles in order to measure expansion profilers for distant galaxies. The Pantheon+ (SN) compilation [109] consists of 1701 SNIa samples which reports distance modulus μ\mu of an object together with the redshift zz as measured in the frame of the cosmic microwave background radiation (CMB). The distance modulus depends on the observed apparent magnitude of an object, mm, together with its absolute magnitude, MM which is a measure of intrinsic brightness. For an object at redshift ziz_{i}, the distance modulus is given as

    μ(zi,Θ)=mM=5log10[DL(zi,Θ)]+25,\mu(z_{i},\Theta)=m-M=5\log_{10}[D_{L}(z_{i},\Theta)]+25\,, (3.2)

    where DL(zi,Θ)D_{L}(z_{i},\Theta) is the luminosity distance defined as

    DL(zi,Θ)=c(1+zi)0zidzH(z,Θ).D_{L}(z_{i},\Theta)=c(1+z_{i})\int_{0}^{z_{i}}\frac{dz^{\prime}}{H(z^{\prime},\Theta)}\,. (3.3)

    The SN data set needs to be calibrated for the absolute magnitude MM, which is treated as a nuisance parameter in the MCMC and is marginalized over. Finally, the cosmological parameters are constrained by minimizing a χ2\chi^{2} likelihood specified by [110]

    χSN2=(Δμ(zi),Θ))TC1(Δμ(zi),Θ)),\chi^{2}_{\mathrm{SN}}=(\Delta\mu(z_{i}),\Theta))^{T}C^{-1}(\Delta\mu(z_{i}),\Theta))\,, (3.4)

    where (Δμ(zi),Θ))=(μ(zi),Θ)μ(zi)obs(\Delta\mu(z_{i}),\Theta))=(\mu(z_{i}),\Theta)-\mu(z_{i})_{\mathrm{obs}} and CC is the corresponding covariance matrix which accounts for the statistical and systematic uncertainties. The SN compilation used in this analysis incorporates the SH0ES Cepheid host distance anchors (H0S22H_{0}^{\rm S22}) [101] in the likelihood which helps to break the degeneracy between the parameters MM and H0H_{0} when analyzing SNIa alone. The redshift range of the SN compilation is 0.01<z<2.50.01<z<2.5 which includes much more information at lower redshift as compared with previous SNIa compilations.

  • Baryon Acoustic Oscillations - Our analysis is nuanced by a joint BAO data set consisting of separately independent data points. This includes measurements from the SDSS Main Galaxy Sample at zeff=0.15z_{\mathrm{eff}}=0.15 [111], the six-degree Field Galaxy Survey at zeff=0.106z_{\mathrm{eff}}=0.106 [112], and the BOSS DR11 quasar Lyman-alpha measurement at zeff=2.36z_{\mathrm{eff}}=2.36 together with the BOSS DR10 measurements at zeff={0.32,0.57}z_{\mathrm{eff}}=\{0.32,0.57\} [113]. We also incorporate the angular diameter distances and H(z)H(z) measurements of the SDSS-IV eBOSS DR14 quasar survey at zeff={0.98,1.23,1.52,1.94}z_{\mathrm{eff}}=\{0.98,1.23,1.52,1.94\} [114], along with the SDSS-III BOSS DR12 consensus BAO measurements of the Hubble parameter and the corresponding comoving angular diameter distances at zeff={0.38,0.51,0.61}z_{\mathrm{eff}}=\{0.38,0.51,0.61\} [115]. These two BAO data sets include covariance matrix information which is included in our MCMC analysis. The BAO data sets involve various measures of cosmic expansion include the Hubble distance DH(z)D_{H}(z), comoving angular diameter distance DM(z)D_{M}(z), and volume-average distance DV(z)D_{V}(z), which are respectively defined through

    DH(z)\displaystyle D_{H}(z) =cH(z),\displaystyle=\frac{c}{H(z)}\,,
    DM(z)\displaystyle D_{M}(z) =(1+z)DA(z),\displaystyle=(1+z)D_{A}(z)\,,
    DV(z)\displaystyle D_{V}(z) =[(1+z)2DA(z)2czH(z)]1/3,\displaystyle=\left[(1+z)^{2}D_{A}(z)^{2}\frac{cz}{H(z)}\right]^{1/3}\,, (3.5)

    and where DA(z)=(1+z)2DL(z)D_{A}(z)=(1+z)^{-2}D_{L}(z) is the angular diameter distance. In accordance with the BAO results, we calculate the combination of parameters 𝒢(zi)=DV(zi)/rs(zd)\mathcal{G}(z_{i})=D_{V}(z_{i})/r_{s}(z_{d}), rs(zd)/DV(zi),r_{s}(z_{d})/D_{V}(z_{i}), DH(zi),D_{H}(z_{i}), DM(zi)(rs,fid(zd)/rs(zd)),D_{M}(z_{i})(r_{s,\mathrm{fid}}(z_{d})/r_{s}(z_{d})), H(zi)(rs(zd)/rs,fid(zd)),H(z_{i})(r_{s}(z_{d})/r_{s,\mathrm{fid}}(z_{d})),
    DA(zi)(rs,fid(zd)/rs(zd))D_{A}(z_{i})(r_{s,\mathrm{fid}}(z_{d})/r_{s}(z_{d})) for which the comoving sound horizon at the end of the baryon drag epoch, at redshift zd1059.94z_{d}\approx 1059.94, has a fiducial value of zd1059.94z_{d}\approx 1059.94 [116]. This lets us define the corresponding χ2\chi^{2} for the BAO data as

    χBAO2(Θ)=ΔG(zi,Θ)TCBAO1ΔG(zi,Θ)\chi^{2}_{\text{BAO}}(\Theta)=\Delta G(z_{i},\Theta)^{T}C_{\text{BAO}}^{-1}\Delta G(z_{i},\Theta) (3.6)

    where ΔG(zi,Θ)=G(zi,Θ)Gobs(zi)\Delta G(z_{i},\Theta)=G(z_{i},\Theta)-G_{\text{obs}}(z_{i})and CBAOC_{\text{BAO}} is the covariance matrix of all the considered BAO observations.

Finally, we include statistical criterions through which to assess the performance of the model against Λ\LambdaCDM for the various data sets as well as the inclusion of the H0S22H_{0}^{\rm S22} prior. To do this, we use the Akaike Information Criteria (AIC) [117] which includes the best performing minimum χmin2\chi^{2}_{\mathrm{min}} values and a measure of the model complexity, through the number of parameters nn. This goodness-of-fit measure is then defined as

AIC=χmin2+2n.\mathrm{AIC}=\chi^{2}_{\mathrm{min}}+2n\,. (3.7)

AIC penalizes high numbers of parameters and so lower values of AIC are indicative of better performing models.

We also examine the Bayesian Information Criterion (BIC) which nuances the AIC by putting more emphasise on the level of complexity that a model exhibits. BIC is defined as

BIC=χmin2+nlnm,\mathrm{BIC}=\chi^{2}_{\mathrm{min}}+n\ln m\,, (3.8)

where mm is the sample size of the observational data combination. The goals of both parameters is the same but BIC helps penalize higher parameters models in a stronger way. Absolute values of these statistical measures are difficult to relate to the real performance of individual models. For this reason, we consider the difference in each criterion for each individual data set for both the model under consideration together with the MCMC analysis of Λ\LambdaCDM.

3.2 Observational Constraints

In this section, we consider how the quadratic Palatini model fairs with observational data through an MCMC constraint analysis. We do this for a combination of all the data sets described previously with CC being our baseline for all analyses. We also perform these analyses again with the H0S22H_{0}^{\rm S22} prior on the Hubble constant, as well as including statistical criterions in order to compare and contrast the different performances.

Considering the Friedmann equation in Eq. (2.13), we can fit for the set of free parameters that make up this model. We assume a flat FLRW geometry to avoid additional degeneracies. By evaluating this relation at current times, we find that

ΩΛ=[14Ωα(2+Ωm,0)±\displaystyle\Omega_{\Lambda}=\Big{[}-1-4\Omega_{\alpha}(-2+\Omega_{m,0})\pm
1+Ωα2(6496Ωm,0)8Ωα(2+3Ωm,0+4Ωr,0)]/16Ωα,\displaystyle\sqrt{1+\Omega_{\alpha}^{2}\left(64-96\Omega_{m,0}\right)-8\Omega_{\alpha}\left(-2+3\Omega_{m,0}+4\Omega_{r,0}\right)}\Big{]}/16\Omega_{\alpha}\,, (3.9)

which has a healthy Λ\LambdaCDM as Ωα0\Omega_{\alpha}\rightarrow 0. The analytical nature of the solution in Eq. (2.13) makes it very fast, and more able to be incorporated into precision calculations. We show the constraints for the full set of parameters, namely {H0,Ωm,0,M,Ωα}\{H_{0},\Omega_{m,0},M,\Omega_{\alpha}\} in Fig. 1. The high level of consistency in the posterior of the Ωα\Omega_{\alpha} shows promising results for the low-energy Palatini model. On the other hand, the parameter seems to be degenerate with the other parameters to a certain degree, except for the CC+SN+BAO case where the contours are rather small. For the other parameters, similar relationships are observed as in the standard cosmological model case. For instance, the nuisance parameter is pushed to lower values when BAO data is incorporated, while an anti-correlation is observed between the Hubble constant and the matter density parameter. The matter density parameter is also very consistent across the different data sets, while the posteriors of the Hubble constant show a mild tension between the joint data sets.

Refer to caption
Figure 1: Posteriors and confidence contours for the model parameters for a combination of data sets. The blue and green contours show the CC with Pantheon+ and CC with BAO data sets respectively, while the red contour shows the full combination.

The outputs of the MCMC constraint analysis are shown in Table 1, while the corresponding Λ\LambdaCDM fits are contained in Appendix A. One notes that higher Hubble constants are obtained in the quadratic Palatini cases except when BAO data is included. On the other hand, smaller matter density parameters are obtained except for the baseline case. As for the Ωα\Omega_{\alpha} parameter, the observational data appears to prefer values that are in the 12σ1-2\sigma range as compared with the Λ\LambdaCDM value where this parameter is zero. Indeed, for the case where the statistical distance is largest from Λ\LambdaCDM, namely for the CC+BAO data set, the statistical criterions give a negative value indicating that the model is slightly preferred, whereas the other cases show a marginal preference for the standard cosmological model.

Table 1: Results for the model where the first column lists the data set combinations, the second column contains the Hubble constant constraint, while the third, fourth, and fifth columns contain the matter density, nuisance, and Ωα\Omega_{\alpha} parameters. The sixth and seventh columns show the difference in AIC and BIC criterions as compared with the corresponding Λ\LambdaCDM constraints.
Data sets H0H_{0} Ωm,0\Omega_{m,0} MM Ωα\Omega_{\alpha} ΔAIC\Delta{\rm AIC} ΔBIC\Delta{\rm BIC}
CC 68.93.1+3.468.9^{+3.4}_{-3.1} 0.2670.113+0.0990.267^{+0.099}_{-0.113} 0.00300.0120+0.0230-0.0030^{+0.0230}_{-0.0120} 1.97 1.46
CC+SN 72.060.98+0.8672.06^{+0.86}_{-0.98} 0.3410.025+0.0240.341^{+0.024}_{-0.025} 19.2950.025+0.026-19.295^{+0.026}_{-0.025} 0.00840.0049+0.0060-0.0084^{+0.0060}_{-0.0049} 0.08 1.32
CC+BAO 67.31.5+1.467.3^{+1.4}_{-1.5} 0.361±0.0510.361\pm 0.051 0.00450.0012+0.0016-0.0045^{+0.0016}_{-0.0012} 2.84-2.84 3.14-3.14
CC+SN+BAO 69.640.64+0.6069.64^{+0.60}_{-0.64} 0.3200.018+0.0170.320^{+0.017}_{-0.018} 19.3660.019+0.015-19.366^{+0.015}_{-0.019} 0.0042±0.0040-0.0042\pm 0.0040 2.38 3.62

We consider the case of adding the H0S22H_{0}^{\rm S22} prior to all the data sets under consideration in Fig. 2 to assess the consistency of the analyses with this value of the Hubble constant reported in the literature. The introduction of the prior makes the absolute magnitude and Hubble constant anti-correlation more pronuanced while the consistency of the matter density parameter is slightly perturbed. On the other hand, the posterior values of the Ωα\Omega_{\alpha} parameter are generically more consistent with Λ\LambdaCDM in this scenario.

Refer to caption
Figure 2: The posteriors and confidence contours for the model parameters are again shown except that the H0S22H_{0}^{\rm S22} prior added in the constraint analysis.

The outputs of the MCMC analyses of the Palatini model for the H0S22H_{0}^{\rm S22} prior case are shown in Table 2. Generically, we find higher values of the fits for the Hubble constant, as one would expect, but we also find lower values of the matter density parameters with reasonable uncertainties, which is consistent with how one would expect the cosmology to evolve. On the other hand, the precise fits for the Ωα\Omega_{\alpha} parameter are consistent with Λ\LambdaCDM but there is a preference for a nontrivial Ωα\Omega_{\alpha}. By and large the statistical performance of the model against the different data sets remains largely the same. The superior performance of the model for the instance where we consider CC+BAO data set is considered remains the case. The various fits for the model parameters and the different data set combinations are shown in Fig. 3 which contains a whisker plot of the final constraints together with an indicative region for the H0S22H_{0}^{\rm S22} prior.

Table 2: The corresponding results for the model when the H0S22H_{0}^{\rm S22} prior is included in the constraint analysis.
Data sets H0H_{0} Ωm,0\Omega_{m,0} MM Ωα\Omega_{\alpha} ΔAIC\Delta{\rm AIC} ΔBIC\Delta{\rm BIC}
CC 72.240.99+1.0072.24^{+1.00}_{-0.99} 0.1880.047+0.0440.188^{+0.044}_{-0.047} 0.00800.0190+0.05100.0080^{+0.0510}_{-0.0190} 1.32 0.81
CC+SN 72.550.72+0.7472.55^{+0.74}_{-0.72} 0.3380.025+0.0240.338^{+0.024}_{-0.025} 19.2780.021+0.019-19.278^{+0.019}_{-0.021} 0.00840.0048+0.0055-0.0084^{+0.0055}_{-0.0048} 0.01 1.25
CC+BAO 71.010.77+0.8571.01^{+0.85}_{-0.77} 0.2500.025+0.0260.250^{+0.026}_{-0.025} 0.00080.0024+0.0031-0.0008^{+0.0031}_{-0.0024} -1.94 -2.23
CC+SN+BAO 70.560.56+0.5270.56^{+0.52}_{-0.56} 0.304±0.0160.304\pm 0.016 19.3430.015+0.014-19.343^{+0.014}_{-0.015} 0.00400.0015+0.00180.0040^{+0.0018}_{-0.0015} 7.05 8.29
Refer to caption
Figure 3: Comparative constraints for the different data sets under consideration together with the corresponding outputs when the H0S22H_{0}^{\rm S22} prior is included. The shaded region shows the 1σ\sigma uncertainty of the H0S22H_{0}^{\rm S22} Hubble constant value.

4 Conclusion

The leading order corrections to GR in the Palatini approach scenario have been probed in this work using the latest expansion rate data with an emphasis on the latest SNIa and BAO data. The Palatini formulation of gravity offers an interesting approach in which the relationship between the gravitational connection and the metric tensor field remains open and may be used as a freedom of the system to address more physical constraints. This is important so that the various implicit assumptions in the standard formulation of gravity can be isolated and explored individually. In our case, we explore the late Universe and so we can assume the low-energy limit of Palatini gravity, which produces the Friedmann equations as explained in Sec. 2.2 where the gravitational sector leads directly to the appearance of an additional parameter Ωα\Omega_{\alpha} which is responsible for the low-energy gravitational action corrections. For our analyses, we use CC data as our baseline data set with contributions also from the SN sample and a combination of transversal BAO data sets, that are non-overlapping. For each combination of data sets, we perform a full MCMC analysis obtaining constraints on all cosmological parameters of the model under investigation. Additionally, we use the latest literature value of the Hubble constant from the SH0ES Team, namely H0S22H_{0}^{\rm S22}. Finally, we give a statistical comparison of the Palatini quadratic model against Λ\LambdaCDM using the AIC and BIC metrics.

The Λ\LambdaCDM reference constraint values are contained in Appendix A, which is important for estimating the AIC and BIC statistical measures for the different data set combinations. In general, our analysis shows promising results for this low-energy limit of Palatini gravity. We find higher values of the Hubble constant and consistent matter density parameter ranges. As for the new Ωα\Omega_{\alpha}, this is mostly consistent with the Λ\LambdaCDM limit but also shows differences in some cases, which may be further nuanced with the addition of data sets. Considering the AIC and BIC measures, these seem to show a mild preference for Λ\LambdaCDM in most cases since it has less model parameters in this setting. However, for the CC+BAO data set combination, we find a slight preference for the Palatini quadratic model which motivates us to further study this model. These features are strengthened when the H0S22H_{0}^{\rm S22} prior is added to the analysis being undertaken.

Let us compare our findings to the results obtained previously in the literature. In the SI units, the theory parameter α\alpha can be expressed as α=c2Ωα3H02\alpha=\frac{c^{2}\Omega_{\alpha}}{3H^{2}_{0}}; for example, for Ωα103\Omega_{\alpha}\sim 10^{-3} we get α1048m2\alpha\sim 10^{48}\,\text{m}^{2}. If we consider the case where a slight preference for the quadratic model is exhibited, i.e. for the CC+BAO data set, we get the following range for the α\alpha parameter: α=0.4961.488+1.921×1049m2\alpha=-0.496^{+1.921}_{-1.488}\times 10^{49}\,\text{m}^{2}. This gives a better bound on the parameter related to the quadratic correction than found in [118] using Union2.1 data set: α=0.0929.025+8.840×1055m2\alpha=-0.092^{+8.840}_{-9.025}\times 10^{55}\,\text{m}^{2}, although the model considered there features also as additional term proportional to 1/1/\mathcal{R}. The same model as ours was analyzed also in [100]; the data used comprised of CMB observations, data from the WMAP, 580 SNIa events, and measurements of H(z)H(z); the results were Ωα=9.709.70+134.80×109\Omega_{\alpha}=9.70^{+134.80}_{-9.70}\times 10^{-9}. For this model, the BIC value was greater than 6, indicating a preference for the Λ\LambdaCDM. In [94], utilizing observational H(z)H(z), BAO, and CMB data, the Lagrangian of the form L=g(+α2+β1+δ+γ1+σLm)L=\sqrt{-g}(\mathcal{R}+\alpha\mathcal{R}^{2}+\beta\mathcal{R}^{1+\delta}+\gamma\mathcal{R}^{1+\sigma}L_{m}) was considered; it was found that, depending on the assumptions, the dimensionless parameter varies from Ωα=56.3422.971+3.102\Omega_{\alpha}=-56.342^{+3.102}_{-2.971} to Ωα=4.401±0.079\Omega_{\alpha}=4.401\pm 0.079. In [99], a similar methodology was used to assess the goodness of the Starobinsky model with a modified EoS - instead of the perfect fluid, the so-called Chaplygin gas was used, mimicking the effects of dark matter at high densities, and dark energy at the low end of the energy scale; the parameter range is Ωα=1.1560.08+1.156×109\Omega_{\alpha}=-1.156^{+1.156}_{-0.08}\times 10^{-9}; the model, however, features three new parameters, as compared to just one introduced by the modification of the Einstein-Hilbert Lagrangian. Summarizing, the latest cosmological data allows us to put the bound on the parameter α\alpha in the range |α|1049|\alpha|\leq 10^{49} m2 .

On the other hand, in the context of the f()=+α2f(\mathcal{R})=\mathcal{R}+\alpha\mathcal{R}^{2} model, investigations into the weak-field limit have indicated that the magnitude of α\alpha typically remains below 2×108m22\times 10^{8}\,\text{m}^{2} [119]. Nevertheless, precise constraints on these parameters have proven challenging due to uncertainties stemming from microphysics, rendering experiments within the Solar System inadequate for such determinations [120]. It is important to note that gravitational tests conducted in a vacuum, such as the Shapiro delay technique, do not impose constraints on Palatini gravity since the theory reduces to General Relativity with a cosmological constant, as elucidated in the discussion following Eq. (2.7).

In a manner reminiscent of General Relativity, none of the f()f(\mathcal{R}) models effectively account for the rotation curves observed in galaxies [121, 122]. Consequently, constraints derived from galaxy catalogs continue to remain elusive up to the present day.

Conversely, when delving into microphysical considerations, seismic data from Earth has imposed more stringent limitations on the parameter, with |α|109m2|\alpha|\lesssim 10^{9}\,\text{m}^{2} (at a 2σ2\sigma level of precision) [123, 124]. Notably, in studies focused on the non-relativistic limit, it has been ascertained that only the quadratic term holds significance, while higher-order terms, starting from the sixth order, become relevant [120]. Therefore, these bounds also hold for the model considered here.

The significant divergence of nearly 40 orders of magnitude in the constraints obtained underscores the imperative need for an intensified exploration of modified gravity within the context of stellar and substellar objects. This demand is further underscored by the multifaceted complexities inherent in astrophysical modeling, necessitating a realistic treatment of matter descriptions that account for potential anisotropies, non-ideal fluid characteristics, energy transport mechanisms, opacity considerations, and more. Furthermore, it is essential to address uncertainties related to the material component, a factor often disregarded in cosmological contexts.

In accordance with the insights presented in [125], particularly evident in diagrams 1 and 2, the curvature regime governing these celestial bodies occupies a pivotal intermediate position. This region bridges the divide between the problematic cosmological realm and the domain where GR suffices to describe gravitational phenomena adequately. Consequently, this transitional realm, serving as a boundary between small-scale systems like compact objects and the Solar System, may conceal the subtle emergence of corrections to GR.

Nevertheless, the low-energy Palatini model analysis being showcased here gives precision insights into the behavior of the model in the context of the late Universe together with its expansion profile in that regime. The study gives a positive sign that this approach shows promise in confronting the major challenges that have appeared in the literature in recent years. We intend to further advance in this direction with a broader numerical analysis of this model including perturbations and related data sets in the near future.

Appendix A Λ\LambdaCDM Constraints

The parameter constraints for the Λ\LambdaCDM baseline model are shown in Table 3. These standard cosmological model constraints can be compared with the constraints obtained in the main sections of this work in order to understand how the cosmic evolution changes in this scenario.

Table 3: Parameter constraints on Λ\LambdaCDM for different combinations of data sets.
Data sets H0H_{0} Ωm,0\Omega_{m,0} MM AIC BIC
CC 67.72.9+3.067.7^{+3.0}_{-2.9} 0.3210.058+0.0620.321^{+0.062}_{-0.058} 20.523 18.997
CC+SN 71.680.93+0.8471.68^{+0.84}_{-0.93} 0.3200.017+0.0150.320^{+0.015}_{-0.017} 19.3010.024+0.026-19.301^{+0.026}_{-0.024} 1561.118 1564.833
CC+BAO 68.740.80+0.7968.74^{+0.79}_{-0.80} 0.294±0.0120.294\pm 0.012 34.703 33.826
CC+SN+BAO 70.57±0.6470.57\pm 0.64 0.310±0.0100.310\pm 0.010 19.336±0.020-19.336\pm 0.020 1583.535 1587.266
CC+H0S22H_{0}^{\rm S22} 67.72.9+3.067.7^{+3.0}_{-2.9} 0.3210.058+0.0620.321^{+0.062}_{-0.058} 23.163902 21.637987
CC+SN+H0S22H_{0}^{\rm S22} 71.680.93+0.8471.68^{+0.84}_{-0.93} 0.3200.017+0.0150.320^{+0.015}_{-0.017} 19.3010.024+0.026-19.301^{+0.026}_{-0.024} 1562.141016 1565.856660
CC+BAO+H0S22H_{0}^{\rm S22} 68.740.80+0.7968.74^{+0.79}_{-0.80} 0.294±0.0120.294\pm 0.012 45.391895 44.514606
CC+SN+BAO+H0S22H_{0}^{\rm S22} 70.57±0.6470.57\pm 0.64 0.310±0.0100.310\pm 0.010 19.336±0.020-19.336\pm 0.020 1587.608045 1591.338647

Acknowledgements

This article is based upon work from COST Action CA21136 Addressing observational tensions in cosmology with systematics and fundamental physics (CosmoVerse) supported by COST (European Cooperation in Science and Technology). AK and DAG were beneficiaries of Short Term Scientific Missions (STSM) of the COST action CosmoVerse (CA21136). JLS would like to acknowledge funding from “The Malta Council for Science and Technology” through the “FUSION R&I: Research Excellence Programme”. AW acknowledges financial support from MICINN (Spain) Ayuda Juan de la Cierva - incorporación 2020 No. IJC2020-044751-I.

References