This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The intrinsic multiplicity distribution of exoplanets revealed from the radial velocity method

Wei Zhu (祝伟) Department of Astronomy, Tsinghua University, Beijing 100084, China weizhu@tsinghua.edu.cn
Abstract

Planet multiplicities are useful in constraining the formation and evolution of planetary systems but usually difficult to constrain observationally. Here, we develop a general method that can properly take into account the survey incompleteness and recover the intrinsic planet multiplicity distribution. We then apply it to the radial velocity (RV) planet sample from the California Legacy Survey (CLS). Within the 1 au (10 au) region, we find 21±4%21\pm 4\% (19.2±2.8%19.2\pm 2.8\%) of Sun-like stars host planets with masses above 10M10\,M_{\oplus} (0.3MJ0.3\,M_{\rm J}), about 30% (40%) of which are multi-planet systems; in terms of the RV semi-amplitude KK, 33±7%33\pm 7\% (25±3%25\pm 3\%) of Sun-like stars contain planets of K>1ms1K>1\,{\rm m\,s^{-1}} (3ms13\,{\rm m\,s^{-1}}), and each system hosts on average 1.8±0.41.8\pm 0.4 (1.63±0.161.63\pm 0.16) planets. We note that the hot Jupiter rate in the CLS Sun-like sample is higher than the consensus value of \sim1% by a factor of about three. We also confirm previous studies on the correlation between inner (<1<1\,au) and outer (>1>1\,au) planets.

Exoplanets (498) — Exoplanet systems (484) — Radial velocity (1332)

1 Introduction

Multi-planet systems are common. Of the thousands of exoplanets (or planet candidates) found by the Kepler mission (Borucki et al., 2010), about half of them are known to have siblings (e.g., Thompson et al., 2018; Berger et al., 2018). This fraction further increases dramatically after the transit probability is properly taken into account (e.g., Lissauer et al., 2011; Tremaine & Dong, 2012; Fabrycky et al., 2014; Zhu et al., 2018; He et al., 2020). Besides the planet sample from transit surveys, multi-planet systems have also been found to be common via (combinations of) other detection techniques (e.g., Wright et al., 2009; Gould et al., 2010; Knutson et al., 2014; Bryan et al., 2016; Zhu & Wu, 2018).

Planet multiplicity conveys rich information about the system formation and evolution. To the zeroth order, planets around the same host are formed out of the same disk, and thus whether and how different types of planets can reside in the same system can provide important constraints on their formation channels and/or locations (e.g., Knutson et al., 2014; Huang et al., 2016; Zhu & Wu, 2018; Bryan et al., 2019). Additionally, the dynamical interactions among planets may have also imprinted signatures on the present-day architecture of the planetary system (e.g., Chatterjee et al., 2008; Jurić & Tremaine, 2008; Pu & Wu, 2015). We refer interested readers to Dawson & Johnson (2018) and Zhu & Dong (2021) for some recent reviews on the related topics.

To better connect observations with theoretical models of planet formation and evolution, we need to go beyond the commonly used planet occurrence rates and constrain directly the intrinsic multiplicity distribution of planetary systems, that is, the fractions of stars with different numbers of planets within the given parameter space. This requires additional modeling effort to properly take into account the observational biases, as they may affect different multiplicities at different levels.

Because the Kepler survey provided so far the largest uniform planet sample, previous studies have relied on the Kepler transit sample to deriving the intrinsic multiplicity distribution (e.g., Lissauer et al., 2011; Fang & Margot, 2012; Zhu et al., 2018; Mulders et al., 2018; Zink et al., 2019; He et al., 2020). However, the transit method is compromised by the strong degeneracy between the intrinsic multiplicity and the mutual inclination distribution, which cannot be resolved without external constraints (e.g., Lissauer et al. 2011; Tremaine & Dong 2012; Zhu et al. 2018; see Zhu & Dong 2021 for further discussions on this issue).

In this work, we make use of the recently released planet sample from the radial velocity (RV) survey—the California Legacy Survey (CLS, Rosenthal et al. 2021a)—and derive the intrinsic multiplicity distribution of planets around Sun-like stars. We describe the general method to model the multiplicity distribution in any RV survey in Section 2. We then apply this method to the CLS sample and report the results in Section 3. A discussion about the method as well as the results from the CLS sample is given in Section 4.

2 Method

2.1 Notations

We follow the general framework of Tremaine & Dong (2012) to derive the intrinsic multiplicity distribution from an exoplanet survey with limited sensitivities. We use 𝑭F for the intrinsic multiplicity vector, with each component FkF_{k} representing the fraction of stars with kk planets in the target parameter space. 111Throughout this paper the letter kk is used for the number of intrinsic planets, regardless of whether they are detected or not, whereas the letter jj is used for the number of planets that are actually detected. The observed multiplicity vector is 𝑵N, with each component NjN_{j} representing the number of stars with jj detected planets. With a statistical sample of NN_{\star} stars searched in the survey, a relation can be established

𝑵=N𝒮¯𝑭,\mbox{\boldmath$N$}=N_{\star}\bar{\mathcal{S}}\mbox{\boldmath$F$}, (1)

or equivalently

Nj=Nk=0S¯jkFk.N_{j}=N_{\star}\sum_{k=0}\bar{S}_{jk}F_{k}. (2)

Here 𝒮¯\bar{\mathcal{S}} is a matrix that captures the survey detection sensitivity, averaged over all stars in the sample. Each entry of the matrix, S¯jk\bar{S}_{jk}, quantifies the (averaged) probability that the survey identifies jj planet detections in a typical system of kk planets (Tremaine & Dong, 2012).

Under the general assumption that the detections of individual planets around the same host are independent, the sensitivity matrix 𝒮\mathcal{S} of the ll-th star in the sample is given as

Sjk(pl)={k!j!(kj)!plj(1pl)kj,jk0,j>k,S_{jk}(p_{l})=\left\{\begin{array}[]{ll}\frac{k!}{j!(k-j)!}p_{l}^{j}(1-p_{l})^{k-j},&j\leq k\\ 0,&j>k\end{array}\right., (3)

where plp_{l} is the detection probability of the ll-th star to a typical planet in the pre-defined parameter space Δ𝜽\Delta\mbox{\boldmath$\theta$}

plΔ𝜽f(𝜽)sl(𝜽)𝑑𝜽Δ𝜽f(𝜽)𝑑𝜽.p_{l}\equiv\frac{\int_{\Delta\mbox{\boldmath$\theta$}}f(\mbox{\boldmath$\theta$})s_{l}(\mbox{\boldmath$\theta$})d\mbox{\boldmath$\theta$}}{\int_{\Delta\mbox{\boldmath$\theta$}}f(\mbox{\boldmath$\theta$})d\mbox{\boldmath$\theta$}}. (4)

Here 𝜽\theta denotes a set of parameters that define the properties of the planet. Note that the evaluation of plp_{l} takes into account not only the detection sensitivity of the star, sl(𝜽)s_{l}(\mbox{\boldmath$\theta$}), but also the intrinsic planet distribution of all stars, f(𝜽)f(\mbox{\boldmath$\theta$}). The mean sensitivity of the survey is thus

𝒮¯=1Nl=1N𝒮(pl).\bar{\mathcal{S}}=\frac{1}{N_{\star}}\sum_{l=1}^{N_{\star}}\mathcal{S}(p_{l}). (5)

The frequency of planets, or the average number of planets per star, is

n¯pk=1kFk.\bar{n}_{\rm p}\equiv\sum_{k=1}kF_{k}. (6)

With the above equations it is straightforward to show that

n¯p=1Np¯j=1jNj,\bar{n}_{\rm p}=\frac{1}{N_{\star}\bar{p}}\sum_{j=1}jN_{j}, (7)

where the averaged detection probability

p¯1Nl=1Npl.\bar{p}\equiv\frac{1}{N_{\star}}\sum_{l=1}^{N_{\star}}p_{l}. (8)

That is, the derivation of the frequency (or occurrence rate) of planets is independent of the multiplicity distribution. This is consistent with the original assumption that the detections of individual planets are independent. Additionally, as the planet frequency only depends on the averaged detection probability p¯\bar{p}, it provides a practical way to derive the intrinsic planet distribution f(𝜽)f(\mbox{\boldmath$\theta$}). This last quantity is central to the calculation of the per-star detection probability plp_{l} (Equation 4). We will further demonstrate it in Section 3.

Another quantity of interest to the exoplanet statistics is the frequency of planetary systems, or the fraction of stars with (at least one) planets,

Fpk=1Fk=1F0.F_{\rm p}\equiv\sum_{k=1}F_{k}=1-F_{0}. (9)

Unlike the derivation of n¯p\bar{n}_{\rm p}, the derivation of FpF_{\rm p} depends on planet multiplicity.

The ratio between the frequency of planets and the frequency of planetary systems gives the average number of planets per planetary system, or average multiplicity (Zhu & Dong, 2021)

m¯pn¯pFp.\bar{m}_{\rm p}\equiv\frac{\bar{n}_{\rm p}}{F_{\rm p}}. (10)

2.2 Maximum likelihood analysis

With a given intrinsic multiplicity vector, 𝑭F, and the sensitivity matrix, 𝒮¯\bar{\mathcal{S}}, one can compute the expected multiplicity distribution, 𝑵¯{N¯j}\bar{\mbox{\boldmath$N$}}\equiv\{\bar{N}_{j}\}, based on Equation (1). The detections of individual systems are best described as Poisson processes, and thus the likelihood of having the observed multiplicity distribution, 𝑵{Nj}\mbox{\boldmath$N$}\equiv\{N_{j}\}, is given by

=j=0N¯jNjexp(N¯j)Nj!,\mathcal{L}=\prod_{j=0}\frac{\bar{N}_{j}^{N_{j}}\exp(-\bar{N}_{j})}{N_{j}!}, (11)

and its logarithm

ln=j=0(NjlnN¯jN¯j)+constant.\ln{\mathcal{L}}=\sum_{j=0}\left(N_{j}\ln{\bar{N}_{j}}-\bar{N}_{j}\right)+{\rm constant}. (12)

The constant term is ignored hereafter. The above likelihood is maximized when 𝑵¯=𝑵\bar{\mbox{\boldmath$N$}}=\mbox{\boldmath$N$}, leading to the solution

𝑭=1N𝒮¯1𝑵.\mbox{\boldmath$F$}=\frac{1}{N_{\star}}\bar{\mathcal{S}}^{-1}\mbox{\boldmath$N$}. (13)

The above analytic solution does not take into account the physical constraints on the intrinsic multiplicities, namely 0Fk10\leq F_{k}\leq 1. Nevertheless, it provides a simply and useful way to check the numerical results and investigate the impact of input parameters on the derived frequencies.

One can also derive/constrain the intrinsic multiplicities in a numerical way. This allows to impose the physical constraints and derive the uncertainties of the intrinsic multiplicities. The data to be fitted by the model is the observed multiplicity distribution, (N0,N1,N2,)(N_{0},N_{1},N_{2},\dots), and the free parameters of the model are (F1,F2,)(F_{1},F_{2},\dots). Note that the first component of 𝑭F, F0F_{0}, is not a free parameter of the model, because it can be derived by F0=1j=1FjF_{0}=1-\sum_{j=1}F_{j}. Therefore, the number of observational constraints is always larger than the number of free parameters, and thus the model can be constrained.

The observed multiplicity distribution can in principle be extended to infinity, which cannot be handled in practice, so where to truncate it? Theoretically, mass conservation of the planetary material and long-term dynamical stability arguments do not allow a system to host infinite number of planets. However, such arguments do not give deterministic criterion on the maximum number of planets per system, so we rely on the statistical argument and regard it as an issue of model selection. The likelihood of the model is optimized under Equation (15), and the inclusion of higher multiplicities with zero detections no longer enhances it. Therefore, both intrinsic and observed multiplicity distributions are truncated at the highest multiplicities with non-zero detections. 222A few criteria, such as the Bayesian Information Criterion (BIC), have been proposed to quantify the superiority of one model over another. However, the commonly used form of BIC assumes that the number of data points substantially exceeds the number of model parameters. This is not true in the present case. We will discuss the robustness of our results against the choice of the maximum multiplicity in the modeling, kmaxk_{\rm max}, in Section 4.2.

We derive the uncertainties on the intrinsic multiplicities in the following way. We generate a fine grid around the maximum-likelihood solution in Equation (15) and calculate the likelihood value at each grid point. The nn-σ\sigma confidence interval is then defined by grid points whose likelihoods are no more than Δln=n2/2\Delta\ln{\mathcal{L}}=n^{2}/2 from the maximum value.

3 Application to the CLS sample

3.1 Sample description

Refer to caption
Figure 1: Surface gravity (logg\log{g}) vs. effective temperature (TeffT_{\rm eff}) for stars in the CLS sample. Here black dots are all CLS stars, red dots are the Sun-like stars according to the selection criteria of Section 3.1, and the blue circles mark the CLS stars with planet detections.
Refer to caption
Figure 2: The distribution of CLS planets with Sun-like hosts in the minimum mass vs. semi-major axis plane. Planets with different observed multiplicities are differentiated with different labels and colors. The contours are the survey completeness curves of 10% (darkest) to 90% (lightest), constructed based on the injection–recovery simulations of all Sun-like stars in the CLS sample (Rosenthal et al., 2021a). The vertical dashed line indicates the outer boundary (10 au) that is considered in this work, and the horizontal dashed line indicates the upper mass limit (13MJ13\,M_{\rm J}). The positions of Jupiter and Saturn are also shown. The regions of three types of planets defined in Section 3 are given as the black boxes. The red solid and dashed lines denote the RV semi-amplitudes of 3 and 1ms11\,{\rm m\,s^{-1}}, respectively. Note that one additional planet, 55 Cnc e, with minimum mass msini=9.4Mm\sin{i}=9.4\,M_{\oplus} and semi-major axis a=0.02a=0.02\,au, is slightly beyond the limit and thus does not appear on the plot.
Refer to caption
Figure 3: The cumulative distribution functions (CDFs) of planetary parameters for planets from systems with one planet detection (blue curves) and more than one planet detections (orange curves). In the left, middle, and right panels are shown the minimum mass, the semi-major axis, and the orbital eccentricity, respectively. The pp value of the two-sample KS test is indicated in each of these comparisons.

The CLS sample contains 178 RV planet detections around 719 stars (Rosenthal et al., 2021a; Fulton et al., 2021). The CLS stars cover a broad range in stellar properties, including FGKM-type dwarf and giant stars. For this study, we will focus on the Sun-like stars, which are defined as stars with effective temperature, TeffT_{\rm eff}, in the range of 4700–6500 K and surface gravity logg>4.0\log{g}>4.0. Applying these cuts to the CLS sample, we are left with 474 Sun-like stars. These include 92 known planet hosts, and the total number of CLS planet detections is 142. Figure 1 shows the selected Sun-like stars and planet hosts in comparison to the overall CLS sample.

We reconstruct the CLS completeness curves for the selected stars based on the per-star injection–recovery simulations of Rosenthal et al. (2021a). The averaged completeness curves are shown in Figure 2 together with the CLS planet detections around Sun-like hosts. Our sample has (382,57,26,5,2,2)(382,57,26,5,2,2) systems with (0,1,2,3,4,5)(0,1,2,3,4,5) planet detections.

In principle, one can use the overall observed multiplicity distribution and the estimated completeness curves to constrain the intrinsic multiplicity vector of planets across the whole mass–semi-major axis parameter space. However, given the limited sensitivity of CLS, such an attempt is determined to be very sensitive to assumptions about the unknown distributions of low-mass, long-period planets. We therefore limit ourselves to certain regions of the parameter space. Specifically, we identify the following types of planets (see Figure 2 for an illustration):

  • Close-in giant planets: planets with semi-major axis a<1a<1\,AU and minimum mass msinim\sin{i} in the range 0.3–13MJ\,M_{\rm J};

  • Close-in intermediate-mass planets: planets with semi-major axis a<1a<1\,AU and minimum mass msinim\sin{i} in the range 10M\,M_{\oplus}–0.3MJ\,M_{\rm J};

  • Cold giant planets: planets with semi-major axis aa in the range 1–10 AU and minimum mass msinim\sin{i} in the range 0.3–13MJ\,M_{\rm J};

We also study the combinations of the above planet types, namely close-in giant and intermediate-mass planets and all giant planets (including close-in and cold giant planets). Additionally, for the purpose of future RV observations, we also study the planet distribution in the RV semi-amplitude KK vs. semi-major axis plane and derive the multiplicity distributions for planets with K>3ms1K>3\,{\rm m\,s^{-1}} and a<10a<10\,au as well as with K>1ms1K>1\,{\rm m\,s^{-1}} and a<1a<1\,au. The observed multiplicity distributions for different types of planets are summarized in Table 1.

The method outlined in Section 2 to derive the intrinsic planet multiplicity has assumed that the properties (e.g., orbital periods, mass, etc) and detections of individual planets around the same host are mutually independent. Given data with high enough quality, the detections of multiple planets in the same system are indeed independent. In practice, however, the detectability of a new planet is dependent on the properties of the detected planets. One well-known example is the degeneracy between one eccentric planet and two near-circular planets in 2:1 resonant orbits (e.g., Anglada-Escudé et al., 2010; Wittenmyer et al., 2019). The recent analysis by Boisvert et al. (2018) found that about 25% of published RV systems might suffer from this degeneracy. This fraction may be taken as an upper limit on the impact of the degeneracy in the CLS sample, given that CLS on average has more data points and covers longer durations. For systems that may actually suffer from this degeneracy, the hidden planets are usually not massive enough, 333Specifically, the hidden planet is lighter than the observed one by a factor of e/21/3\sim e/2^{1/3}, where ee is the orbital eccentricity of the reported planet. and thus our main conclusion about the giant planet multiplicity will not be affected. Future studies and observations will be needed to understand the impact of the degeneracy on the derived intrinsic multiplicity of low-mass planets.

One indirect way to check the validity of the assumption about independence is to compare the parameters of planets from systems with only one planet detection and at least two planet detections. Such a comparison is shown in Figure 3. The two-sample Kolmogorov-Smirnov (KS) tests yield pp-values on the order of unity, suggesting that the two distributions under comparison can be considered to be drawn from the same underlying distribution. In other words, the planetary parameters are statistically similar regardless of the number of planet detections. Therefore, it is reasonable to assume that the planet properties and detections are largely independent in multi-planet systems. See also Tremaine & Dong (2012) for further discussion on this separability issue.

Detected planets in the Kepler multi-planet systems appear to have similar properties (e.g., Lissauer et al., 2011; Ciardi et al., 2013). Whether or not this is of astrophysical origin is still under debate (e.g., Millholland et al. 2017; Weiss et al. 2018; He et al. 2019; Zhu 2020; Murchikova & Tremaine 2020), and we refer to Zhu & Dong (2021) for further discussions about this issue. Regardless of the origin, it has marginal impact on the method and results of the current work for two reasons. First, even if the observed intra-system similarity in Kepler systems is astrophysical, such a correlation will be much weaker in typical RV systems, which have on average more massive and fewer planets. Second, as explained earlier in the paper, we study the multiplicity distribution of specific planet types whose physical properties are already restricted to a relatively narrow range (e.g., all giant planets have very similar sizes). In other words, the hypothetical planets in the hypothetical multi-planet systems implied in our method already have similar properties, even though they are generated in an independent way. The primary parameter that is affected by assumptions like the intra-system similarity is the detection probability pp. We will discuss in Section 4.1 how our results will be changed with different average detection probabilities.

3.2 Frequencies of planets

Refer to caption
Figure 4: This figure shows in blue dots the planet detections from the CLS Sun-like sample whose properties are within our chosen parameter space. The values within each grid point gives the average number of such planets per 100 Sun-like stars. Error bars gives the 68% confidence interval, whereas the values in red indicate the 95% upper limit. The integrated planet frequencies are n¯p=0.076±0.013\bar{n}_{\rm p}=0.076\pm 0.013, 0.21±0.050.21\pm 0.05, 0.22±0.030.22\pm 0.03 for our definition of close-in giants, close-in intermediate-mass, and cold giant planets, respectively.
Refer to caption
Figure 5: Similar to Figure 4 except that now the yy-axis is the RV semi-amplitude KK. The dashed line indicates the upper mass limit of a planet (13MJ13\,M_{\rm J}) around a 1M1\,M_{\odot} host. The frequency of planets with K>3ms1K>3\,{\rm m\,s^{-1}} and a<10a<10\,au is 0.42±0.040.42\pm 0.04. The frequency of planets with K>1ms1K>1\,{\rm m\,s^{-1}} and a<1a<1\,au is 0.58±0.090.58\pm 0.09.
Refer to caption
Figure 6: Cumulative distributions of the per-star detection probabilities pp for the five chosen planet types. For any given planet type, the vertical dashed line of the same color indicate the average detection probability p¯\bar{p}.

We first derive the planet frequency, n¯p\bar{n}_{\rm p}, over the chosen parameter space, in order to determine the mean detection probability p¯\bar{p} as well as the detection probabilities {pl}\{p_{l}\} for individual stars.

We follow the general approach of Zhu & Dong (2021) to derive the planet frequencies. The parameter space is first divided into a combination of grids, with each grid covering an area that is small compared to the overall parameter space but large compared to the positional uncertainties of enclosed planets. Within each grid, the planet distribution function can be treated as a constant, and the average detection probability is given by the mean efficiency of the survey detection. The posterior distribution of the planet frequency within each grid is given by a Gamma distribution. For any grid with at least two planet detections, a log-flat prior on the planet frequency is adopted to determine the median and the 68% confidence interval. Otherwise, the 95% upper limit is derived under a linear prior. The results are illustrated in Figures 4 and 5 for planets in the mass vs. semi-major axis plane and the RV semi-amplitude vs. semi-major axis plane, respectively.

For our definition of close-in giant, close-in intermediate-mass, and cold giant planets, the planet frequency, n¯p\bar{n}_{\rm p}, is determined to be 0.076±0.0130.076\pm 0.013, 0.21±0.050.21\pm 0.05, and 0.22±0.030.22\pm 0.03, respectively. In the KK vs. aa plane, the planet frequency is 0.42±0.040.42\pm 0.04 for planets with K>3ms1K>3\,{\rm m\,s^{-1}} and a<10a<10\,au and 0.58±0.090.58\pm 0.09 for planets with K>1ms1K>1\,{\rm m\,s^{-1}} and a<1a<1\,au. The average detection probabilities for these planet classes can be derived according to Equation (7). Similarly, the average detection probability for planets defined in the joint parameter space (i.e., close-in CLS planets and all giant planets) as well as those in the RV semi-amplitude vs. semi-major axis plane can also be derived. The median values of pp for all planet classes are included in Table 1.

In order to derive of the intrinsic multiplicity distribution, we need to calculate the per-star detection probability, plp_{l}, which is the detection probability of a typical planet around the ll-th star in the sample. Making use of the injection–recovery simulations, this is given by

pl=jf(𝜽j)sl(𝜽j)jf(𝜽j).p_{l}=\frac{\sum_{j^{\prime}}f(\mbox{\boldmath$\theta$}_{j^{\prime}})s_{l}(\mbox{\boldmath$\theta$}_{j^{\prime}})}{\sum_{j^{\prime}}f(\mbox{\boldmath$\theta$}_{j^{\prime}})}. (14)

Here f(𝜽)f(\mbox{\boldmath$\theta$}) is the rate map in the specified parameter space (i.e., Figure 4 for msinim\sin{i} vs. aa and Figure 5 for KK vs. aa), and sl(𝜽j)s_{l}(\mbox{\boldmath$\theta$}_{j^{\prime}}) evaluates whether or not the jj^{\prime}-th injected planet around the ll-th star is detected in the simulation. The summations are performed over all simulated planets that are within the pre-defined parameter space. We show in Figure 6 the distributions of the per-star detection probabilities for the five planet types considered here. As it indicates, this detection probability parameter varies substantially among all stars in the sample, primarily due to the different numbers and quality levels of RV observations for individual stars.

These per-star detection probabilities enable the calculation of the mean sensitivity matrix, 𝒮¯\bar{\mathcal{S}}, a key ingredient in the derivation of the intrinsic multiplicity distribution.

3.3 Intrinsic multiplicity distributions

Table 1: The observed multiplicity distribution and the estimated mean detection probability pp for different types of planets in the CLS Sun-like star sample.
Close-in giant Close-in Cold giant All giant Close-in giant K>3ms1K>3\,{\rm m\,s^{-1}} K>1ms1K>1\,{\rm m\,s^{-1}}
intermediate & intermediate & a<10a<10\,au & a<1a<1\,au
N0N_{0} 444 452 425 408 424 393 419
N1N_{1} 28 17 41 47 42 56 43
N2N_{2} 2 5 8 16 7 21 7
N3N_{3} 0 0 0 2 1 2 3
N4N_{4} 0 0 0 1 0 1 2
N5N_{5} 0 0 0 0 0 1 0
p¯\bar{p} 0.89 0.27 0.55 0.64 0.44 0.57 0.27
Table 2: Intrinsic multiplicities for different classes of planets. Here values with uncertainties are the maximum-likelihood solution (Equation 15) and the associated 1-σ\sigma error bars. For parameters that peak at zero, only the 2-σ\sigma upper limits are provided. The last two rows are the estimated frequencies of planetary systems based on Equations (17) and (18), respectively.
Close-in giant Close-in Cold giant All giant Close-in giant K>3ms1K>3\,{\rm m\,s^{-1}} K>1ms1K>1\,{\rm m\,s^{-1}}
intermediate & intermediate & a<10a<10\,au & a<1a<1\,au
F1F_{1} 6.7±1.2%6.7\pm 1.2\% 7.5±4.3%7.5\pm 4.3\% 12.6±2.5%12.6\pm 2.5\% 11.5±2.4%11.5\pm 2.4\% 14.9±3.6%14.9\pm 3.6\% 13.5±3.0%13.5\pm 3.0\% 22±8%22\pm 8\%
F2F_{2} 0.50.3+0.5%0.5^{+0.5}_{-0.3}\% 7.5±3.1%7.5\pm 3.1\% 4.6±1.5%4.6\pm 1.5\% 6.5±1.8%6.5\pm 1.8\% 4.6±2.6%4.6\pm 2.6\% 10.2±2.4%10.2\pm 2.4\% <17%<17\%
F3F_{3}  \cdots  \cdots  \cdots <3.2%<3.2\% 1.30.9+1.7%1.3^{+1.7}_{-0.9}\% <3.8%<3.8\% <13%<13\%
F4F_{4}  \cdots  \cdots  \cdots 0.70.5+0.9%0.7^{+0.9}_{-0.5}\%  \cdots <2.7%<2.7\% 94+39^{+3}_{-4}
F5F_{5}  \cdots  \cdots  \cdots  \cdots  \cdots 1.00.7+0.9%1.0^{+0.9}_{-0.7}\%  \cdots
FpF_{\rm p} 7.2±1.6%7.2\pm 1.6\% 15±4%15\pm 4\% 17±3%17\pm 3\% 19.2±2.8%19.2\pm 2.8\% 21±4%21\pm 4\% 25±3%25\pm 3\% 33±7%33\pm 7\%
n¯p\bar{n}_{\rm p} a 0.076±0.0130.076\pm 0.013 0.21±0.050.21\pm 0.05 0.22±0.030.22\pm 0.03 0.29±0.050.29\pm 0.05 0.28±0.050.28\pm 0.05 0.42±0.040.42\pm 0.04 0.58±0.090.58\pm 0.09
m¯p\bar{m}_{\rm p} 1.08±0.071.08\pm 0.07 1.5±0.31.5\pm 0.3 1.27±0.121.27\pm 0.12 1.52±0.151.52\pm 0.15 1.36±0.171.36\pm 0.17 1.63±0.161.63\pm 0.16 1.8±0.41.8\pm 0.4
FpsingleF_{\rm p}^{\rm single} 7.1%7.1\% 17%17\% 19%19\% 22%22\% 24%24\% 30%30\% 43%43\%
FpmultiF_{\rm p}^{\rm multi} 7.0%7.0\% 12%12\% 16%16\% 18%18\% 19%19\% 23%23\% 27%27\%

Note. — a Here the values for the planet frequencies come from the derivation of Section 3.2. The maximum-likelihood method yields consistent values, albeit with slightly different uncertainties.

Refer to caption
Figure 7: Illustrations of the derived intrinsic planet multiplicity distributions of the five chosen planet classes. The maximum-likelihood solutions and 1–3σ\sigma confidence intervals are shown in blue colors with increasing transparency. For comparisons, we also show in each panel the scaled Poisson distribution to the total frequency of the given planetary system as the gray points. The shape parameter of the Poisson distribution is set to be the average multiplicity of the given planet type. Values for the maximum-likelihood solution and 1σ\sigma confidence intervals (in some cases, the 2σ\sigma upper limits) are tabulated in Table 2.

We model the observed multiplicity distribution in the CLS Sun-like star sample following the method detailed in Section 2. The results are summarized in Table 2 and illustrated in Figure 7. Below we discuss the individual planet classes in details.

3.3.1 Close-in giant planets

With an average detection probability p¯=0.89\bar{p}=0.89, the parameter space of the close-in giant planets is well covered by CLS. This suggests that the observed multiplicity distribution is close to the intrinsic multiplicity distribution. Indeed, our analysis reveals that 6.7±1.2%6.7\pm 1.2\% and 0.50.3+0.5%0.5^{+0.5}_{-0.3}\% of Sun-like stars host one and two close-in giant planets, respectively. The intrinsic multiplicity fraction is very close to the observed multiplicity fraction.

Combining both multiplicities, we find that 7.2±1.6%7.2\pm 1.6\% of Sun-like stars in the CLS sample host at least one giant planet within 11\,au. The average multiplicity of such planets is marginally consistent with unity (1.08±0.071.08\pm 0.07).

3.3.2 Close-in intermediate-mass planets

The CLS Sun-like star sample contains 17 and 5 systems with one and two close-in (<1<1~{}au), intermediate-mass (10M10\,M_{\oplus}0.3MJ0.3\,M_{\rm J}) planets, respectively. Compared to the other planet classes, these planets are most susceptible to the sensitivity limit of CLS (Figure 2), resulting in an average detection probability of 0.270.27. The average number of close-in, intermediate-mass planets per Sun-like star is 0.21±0.050.21\pm 0.05. It is worth noting that the planet frequency rises rapidly as the planet mass goes below 30M\sim 30\,M_{\oplus}. As shown in Figure 4, the integrated frequencies of close-in planets in the mass ranges of 30M30\,M_{\oplus}0.3MJ0.3\,M_{\rm J} and 10–30M\,M_{\oplus} are 0.039±0.0130.039\pm 0.013 and 0.16±0.040.16\pm 0.04, respectively.

As shown in Table 2, the maximum likelihood analysis reveals that 15±4%15\pm 4\% of Sun-like stars in the CLS sample have close-in intermediate-mass planets, equally divided between one- and two-planet systems. The average multiplicity of such planets is 1.5±0.31.5\pm 0.3.

3.3.3 Cold giants

The CLS Sun-like sample has 41 and 8 systems with one and two cold giant planets, respectively, and the average detection probability is 0.550.55. Here a cold giant planet is a planet with (minimum) mass above 0.3MJ0.3\,M_{\rm J} and semi-major axis in the range of 1–10 au. According to this definition, our Saturn sits at (or very close to) the borderline, and thus the Solar System can be seen as hosting either one or two such cold giant planets. The frequency of cold giant planets around Sun-like stars is 0.22±0.030.22\pm 0.03, after the relevant grids in Figure 4 are combined.

As shown in Table 2, the maximum likelihood analysis reveals that 12.6±2.5%12.6\pm 2.5\% and 4.6±1.5%4.6\pm 1.5\% of Sun-like stars in the CLS sample have one and two cold giant planets, respectively. With both multiplicities combined, the total fraction of Sun-like stars have (at least one) cold giant planet is 17±3%17\pm 3\%, and each system has 1.27±0.121.27\pm 0.12 cold giant planets. These numbers suggest that about 27%27\% of systems with cold giant planets have two such planets and that, compared to the observed multiplicity rate (8/498/49), some 11%11\% of the CLS Sun-like stars with (so far) one known cold giant should have additional, currently undetected cold giant planet.

3.3.4 All giant planets

The CLS Sun-like sample contains 66 systems with at least one giant planet (>0.3MJ>0.3\,M_{\rm J}) within 1010\,au, including (16, 2, 1) hosting (2, 3, 4) such planets. Combining the average detection probabilities of close-in giant and cold giant planets, the average detection probability of a giant planet across the whole axis range (\sim0.03–10 au) is 0.640.64.

As shown in Table 2, the maximum likelihood analysis reveals that 19.2±2.8%19.2\pm 2.8\% of Sun-like stars have at least one giant planet within 10 au. This includes 11.5±2.4%11.5\pm 2.4\% with only one giant planet, 6.5±1.8%6.5\pm 1.8\% with two, <3.2%<3.2\% (2-σ\sigma limit) with three, and 0.70.5+0.9%0.7^{+0.9}_{-0.5}\% with four. Taking all multiplicities together, each giant planet system hosts on average 1.52±0.151.52\pm 0.15 planets in the given parameter space. Compared to the observed multiplicity rate (19/66=0.2919/66=0.29), the intrinsic multiplicity rate (0.4\sim 0.4) is substantially higher.

Out of the 19 systems with multiple giant planets, four contain giant planet pairs whose period ratios are close to (<0.1<0.1) first-order period commesurabilities: HD 128311, HD 37124, HD 82943, and HD 95128 (47 UMa). This fraction, 4/19, is substantially higher than what one gets from a random sampling, 444For example, if the period ratio is randomly drawn from a log-flat distribution between 2 and 10, then one expects a 3%3\% chance to have a ratio between 2 and 2.1. Taking this fraction as the fraction of giant planet pairs in (or near) mean-motion resonances, we find that about 20% of systems with multiple giant planets have apparent low-order period commensurabilities, which is in general consistent with previous studies (e.g., Wright et al., 2011). However, it is noteworthy that the mean-motion resonance nature has not been confirmed in every of the four systems. Furthermore, giant planet pairs close to period commensurabilities are still rare, so our results based on the assumption that planet detections are independent from each other are largely unaffected.

3.3.5 Close-in giant & intermediate-mass planets

There are (42, 7, 1) systems with (1, 2, 3) detected planets with semi-major axis a<1a<1\,au and mass in the range of 10M10\,M_{\oplus}13MJ13\,M_{\rm J} in the CLS Sun-like star sample. Combining the results of close-in giant and close-in intermediate-mass planets, the average detection probabilities of close-in planets in the joint parameter space is 0.440.44.

As shown in Table 2, the maximum likelihood analysis shows that 21±4%21\pm 4\% of Sun-like stars have close-in planets with masses above 10M10\,M_{\oplus}. This includes 14.9±3.6%14.9\pm 3.6\% of Sun-like stars hosting only one, 4.6±2.6%4.6\pm 2.6\% hosting two, and 1.30.9+1.7%1.3^{+1.7}_{-0.9}\% hosting three such close-in planets.

3.3.6 K>3ms1K>3\,{\rm m\,s^{-1}} planets within 10 au

The CLS Sun-like sample contains (56, 21, 2, 1, 1) stars with (1, 2, 3, 4, 5) planets whose orbits are inside 1010\,au and RV semi-amplitudes are above 3ms13\,{\rm m\,s^{-1}}. The average detection probability, calculated in the KK vs. aa plane (see Figure 5), is 0.570.57.

As shown in Table 2, the maximum likelihood analysis reveals that 25±3%25\pm 3\% of Sun-like stars in the CLS sample host at least one planet with K>3ms1K>3\,{\rm m\,s^{-1}} within 1010\,au. This includes 13.5±3.0%13.5\pm 3.0\% with only one, 10.2±2.4%10.2\pm 2.4\% with two, <3.8%<3.8\% with three, <2.7%<2.7\% with four, and 1.00.7+0.9%1.0^{+0.9}_{-0.7}\% with five such planets. Together with the measured frequency of such planets, n¯p=0.42±0.04\bar{n}_{\rm p}=0.42\pm 0.04, each system hosts on average 1.63±0.161.63\pm 0.16 such planets. For comparison, the observed average multiplicity in the 81 CLS stars with such planets is 1.4. This suggests that about 19 planets with K>3ms1K>3\,{\rm m\,s^{-1}} and a<10a<10\,au remain to be detected in this sample of planet hosts.

3.3.7 K>1ms1K>1\,{\rm m\,s^{-1}} planets within 1 au

Within the CLS Sun-like sample, we find (43, 7, 3, 2) stars with (1, 2, 3, 4) planet detections whose orbits are inside 11\,au and RV semi-amplitudes are above 1ms11\,{\rm m\,s^{-1}}. The average detection probability is 0.270.27, with the majority of the per-star detection probability below 0.20.2.

As shown in Table 2, the maximum likelihood analysis shows that 33±7%33\pm 7\% of Sun-like stars in the CLS sample host at least one planet with K>1ms1K>1\,{\rm m\,s^{-1}} and a<1a<1\,au, with the majority (22±8%22\pm 8\%) having only one such planet. The fraction of Sun-like stars with four such planets is found to be 94+3%9^{+3}_{-4}\%. For frequencies of two- and three-planet systems, we find upper limits of 17%17\% and 13%13\%, respectively. Together with the measured planet frequency of 0.58±0.090.58\pm 0.09, the average multiplicity of systems with such planets is determined to be 1.8±0.41.8\pm 0.4. Compared to the observed average multiplicity of 1.31.3 in the 55 hosts of this type of planets, our results suggest that each system is currently missing on average \sim0.5 planets whose RV semi-amplitudes are above the current detection limit (1ms11\,{\rm m\,s^{-1}}). Future RV observations should be able to test this prediction.

4 Discussion

Below we discuss issues that may have impact on our derived results and compare our results with those from previous studies.

4.1 Impact of the detection probability

Refer to caption
Figure 8: The fraction of Sun-like stars with giant planets (top panel), the average number of giant planets per Sun-like star (middle panel), and the average multiplicity of giant planets (bottom panel) as functions of the average detection probability p¯\bar{p}. In each panel, the grey curve corresponds to the value from the analytic solution (Equation 15), the black curve is the maximum likelihood solution that takes into account the boundary conditions (i.e., 𝑭>0\mbox{\boldmath$F$}>0), and the horizontal line and shaded region are the derived frequency and the associated 1-σ\sigma confidence interval, respectively. The vertical dashed line marks the value of pp that is adopted in the maximum likelihood analysis.

The per-star detection probabilities, {pl}\{p_{l}\}, are crucial in the derivation of the intrinsic multiplicity distribution as well as the overall frequencies. As defined by Equation (4), this per-star detection probability takes into account both the intrinsic planet distribution as well as the survey completeness. While the latter is (or can be) known fairly precisely, the former is usually not. As a result, the detection probability may not be so well determined.

Our derivation of the intrinsic multiplicity distribution makes use of the per-star detection probabilities, and thus it is less susceptible to the variation in the intrinsic planet distribution. Nevertheless, we investigate the impact of the mean detection probability parameter, p¯\bar{p}, on the derived frequencies, as the intrinsic multiplicity distribution can be approximated as

𝑭1N𝒮1(p¯)𝑵=1N𝒮(p¯1)𝑵.\mbox{\boldmath$F$}\approx\frac{1}{N_{\star}}\mathcal{S}^{-1}(\bar{p})\mbox{\boldmath$N$}=\frac{1}{N_{\star}}\mathcal{S}(\bar{p}^{-1})\mbox{\boldmath$N$}. (15)

The latter step makes use of the property of the sensitivity matrix (Tremaine & Dong, 2012). Figure 8 shows the derived frequencies—frequency of planetary systems, frequency of planets, and average multiplicity—as functions of the average detection probability p¯\bar{p} for the “all giant planet” case. We may convert the uncertainty in the frequency of planets, n¯p\bar{n}_{\rm p}, into the uncertainty of the average detection probability, given the direct connection between these two quantities (Equation 7). This gives p¯allgiants=0.65±0.11\bar{p}_{\rm all~{}giants}=0.65\pm 0.11. The variation of p¯\bar{p} at such a level will not lead to substantial changes in the other two frequencies, both of which are rather insensitive to the variation of p¯\bar{p}.

4.2 Impact of maximum multiplicity

Refer to caption
Figure 9: Likelihood contours between model parameters in the “all giant planet” case. We have used two different values for the maximum multiplicity, kmaxk_{\rm max}, in the modeling: Blue filled contours correspond to kmax=4k_{\rm max}=4, which is the highest multiplicity with non-zero detections, whereas red dashed contours correspond to kmax=5k_{\rm max}=5. For each set of contours, the plus signs indicate the analytic solution (Equation 15), and the contours with increasing levels of transparency correspond to the 1–3σ\sigma uncertainty regions. The constraints on the frequency of planetary systems, FpF_{\rm p}, and the average multiplicity, m¯p\bar{m}_{\rm p}, are shown in the upper right corner. The inclusion of higher planet multiplicities with zero detections does not change the constraints on almost all low multiplicities. The only exception is seen in F4F_{4}, of which the lower limit is modified.

In our maximum-likelihood analysis, the multiplicity distribution is truncated at the highest multiplicity with non-zero detections. As explained in Section 2.2, this is because the inclusion of higher multiplicities in the data and the modeling introduces additional degrees of freedom but no benefit in the goodness of fit. Here we use the case of “all giant planet” to further demonstrate this point.

The CLS Sun-like sample has zero systems with more than four giant planets inside 1010\,au, and thus we have adopted the maximum multiplicity kmax=4k_{\rm max}=4 in our maximum-likelihood analysis. The results have been reported in the previous section, and the likelihood contours are shown in Figure 9. For comparisons, we also perform the analysis with kmax=5k_{\rm max}=5. The resulting likelihood contours are also illustrated in Figure 9. The inclusion of the higher multiplicity with zero planet detections does not change the maximum-likelihood solution or the uncertainties on all model parameters except for F4F_{4}, of which the lower limits are slightly modified. This is because the introduction of F5F_{5} permits F4=0F_{4}=0, which is otherwise not allowed with kmax=4k_{\rm max}=4. The integrated fraction of planetary systems and the average multiplicity remain largely unaffected, too.

4.3 Frequencies of giant planets

Regarding the frequencies of giant planets around Sun-like stars, we find

{n¯p(HJ)=0.028±0.008n¯p(WJ)=0.048±0.011n¯p(CJ)=0.22±0.03.\left\{\begin{array}[]{ll}\bar{n}_{\rm p}({\rm HJ})&=0.028\pm 0.008\\ \bar{n}_{\rm p}({\rm WJ})&=0.048\pm 0.011\\ \bar{n}_{\rm p}({\rm CJ})&=0.22\pm 0.03\\ \end{array}\right.. (16)

Here “HJ”, “WJ”, and “CJ” denote hot Jupiters (giant planets within 0.1 au), warm Jupiters (giant planets between 0.1–1 au), and cold Jupiters (giant planets between 1–10 au), respectively. The HJ rate we derive is higher than previous RV studies, which collectively points to a value of 1%\sim 1\% (e.g., Cumming et al., 2008; Mayor et al., 2011; Wittenmyer et al., 2020). However, given the large statistical uncertainties and the possibly different stellar distributions (e.g., metallicity, binarity) in these studies, we do not investigate further the cause of this discrepancy.

Regarding the WJ and CJ rates, a very direct comparison is with Fulton et al. (2021), who used the whole CLS sample and derived the frequency of planets around FGKM stars. 555Note the different definition of a “giant planet” used in Fulton et al. (2021) and here. Fulton et al. (2021) set the lower limit of a “giant planet” to be 30M30\,M_{\oplus}. Adopting their best-fit broken power-law model, we find 0.05\sim 0.05 and 0.14\sim 0.14 for the frequencies of WJs and CJs, respectively. While the WJ rate is consistent with ours, their CJ rate is substantially lower. This is probably because Fulton et al. (2021) included M-type stars in their analysis. In the CLS sample, there are only 13 cold giant planets found around 245 M-type stars, whereas the numbers are 57 and 474 for Sun-like stars. Our rates of WJs and CJs are also in general agreement with studies using different samples (e.g., Mayor et al., 2011; Fernandes et al., 2019; Wittenmyer et al., 2020). For example, the re-analysis of the Mayor et al. (2011) sample by Fernandes et al. (2019) found 0.04\sim 0.04 and 0.17\sim 0.17 for frequencies of WJs and CJs, respectively. These values are estimated based on the asymmetric broken power-law model used in Fernandes et al. (2019).

The relative frequencies between HJ and WJ can provide useful insights into the formation of these close-in giant planets (see Dawson & Johnson 2018 and references therein). In the CLS Sun-like star sample, the overall rate of WJ is higher than the overall rate of HJ, and the rate of WJ per log interval is within a factor of two of that of HJ. Both features pose challenges to the theoretical models of HJ formation.

4.4 Frequencies of planetary systems

One approach to derive the frequency of planetary systems in RV surveys, which has been frequently used in the literature, is to include only one out of the multiple planet detections in each system and regard the resulting frequency of planets as the frequency of planetary systems (e.g., Cumming et al., 2008; Mayor et al., 2011). In the mathematical framework of the present work, this is

Fpsingle=1Np~j=1Nj.F_{\rm p}^{\rm single}=\frac{1}{N_{\star}\tilde{p}}\sum_{j=1}N_{j}. (17)

Here p~\tilde{p} is the average detection probability derived from the planet sample that includes at most one planet detection for each star. It will not be too different from the value of p¯\bar{p}, given the dominating number of single-detections in typical RV samples.

For our approach, which takes the planet multiplicity fully into account, a reasonable approximation of the frequency of planetary systems is

Fpmulti=1Npmj=1Nj.F_{\rm p}^{\rm multi}=\frac{1}{N_{\star}p_{m}}\sum_{j=1}N_{j}. (18)

Here the correction factor, pmp_{m}, quantifies the probability of detecting at least one planet in a system with m¯p\bar{m}_{\rm p} planets

pm=1(1p¯)m¯p.p_{m}=1-(1-\bar{p})^{\bar{m}_{\rm p}}. (19)

The estimator given by Equation (17) yields unbiased rates only when the average multiplicity m¯p\bar{m}_{\rm p} approaches unity. We provide in the last two rows of Table 2 the two frequency estimators for our chosen planet types. In all cases, our new estimator, FpmultiF_{\rm p}^{\rm multi}, yields better agreement with the true frequency.

The estimator given by Equation (18) has broader implications to other statistical analyses that involve planet multiplicity issues. For Sun-like stars, the sensitivity region of the Kepler transit mission is almost fully enclosed in the box-shaped parameter space defined by a lower radius limit of 1R\sim 1\,R_{\oplus} and an outer period limit of 1\sim 1\,yr. According to the estimated planet frequencies in this box-shaped region (e.g., Zhu & Dong, 2021), the probability of having any one planet inside the Kepler sensitivity region is p¯kep0.8\bar{p}_{\rm kep}\approx 0.8. For Kepler-like planetary systems with an average multiplicity of three (Zhu et al., 2018), the chance to see at least one planet out of the box-shaped parameter space to be inside the Kepler sensitivity region is therefore 1(1p¯kep)3=0.991-(1-\bar{p}_{\rm kep})^{3}=0.99. In other words, the fraction of Sun-like stars with planets of radii above RR_{\oplus} and periods within 1 yr is almost the same as the fraction of Sun-like stars with at least one planet inside the Kepler sensitivity range. For comparison, the sensitivity correction on the planet-by-planet basis would lead to an increase in the frequency of planetary systems by nearly a factor of two (e.g., Yang et al., 2020).

In addition to providing an unbiased estimate of the total frequency of planetary systems, the approach undertaken in the present work also yields the intrinsic multiplicity distribution 𝑭F, which is more informative to theoretical models as well as other statistical studies. For example, for giant planets within 33\,au, based on which Cumming et al. (2008) derived a total frequency of 10.5%10.5\%, our approach yields the following total rates based on the CLS Sun-like sample

{FpC08=12.2±2.1%n¯pC08=0.15±0.03m¯pC08=1.21±0.10.\left\{\begin{array}[]{ll}F_{\rm p}^{\rm C08}&=12.2\pm 2.1\%\\ \bar{n}_{\rm p}^{\rm C08}&=0.15\pm 0.03\\ \bar{m}_{\rm p}^{\rm C08}&=1.21\pm 0.10\\ \end{array}\right.. (20)

as well as the intrinsic multiplicity distribution

Fk=1,2,3C08=(10.6±1.5%,0.80.4+0.6%,0.90.4+0.5%).F_{k=1,2,3}^{\rm C08}=(10.6\pm 1.5\%,~{}0.8^{+0.6}_{-0.4}\%,~{}0.9^{+0.5}_{-0.4}\%). (21)

The overall rate (FpC08F_{\rm p}^{\rm C08}) is higher than Cumming et al. (2008) probably because of the difference in the stellar samples.

4.5 Correlations (or not) between different planet populations

By including in the analysis the intrinsic multiplicity distribution, our method does not require the presences of multiple planets in the same system to be independent (although the properties of them should be, largely, independent). Therefore, it allows us to investigate whether or not the existence of a certain type of planet depends on the existence of some other type of planet in the same system. Considering the relatively large uncertainties of the derived frequencies, here we focus on confirming previously found (anti-)correlations rather than reporting tentative evidence for new ones.

Under the assumption that the presences of close-in and cold giant planets are not correlated, the frequency of planetary systems with any giant planets within 10 au should be 1(17.2%)×(117%)=23%1-(1-7.2\%)\times(1-17\%)=23\%. This is higher than what is given by the maximum likelihood analysis over the joint parameter space (19.2%19.2\%, see Table 2). Therefore, the assumption is rejected/disfavored and the close-in and cold giant planets are positively correlated. This is consistent with previous findings that close-in giant planets frequently have distant companions (e.g., Knutson et al., 2014; Bryan et al., 2016). Similarly, our results also suggest positive correlations between close-in planets (close-in giant and intermediate-mass planets) and cold giant planets, which also confirms previous findings (e.g., Zhu & Wu, 2018; Bryan et al., 2019; Rosenthal et al., 2021b).

It is noticed that the fraction of stars with close-in giant and intermediate-mass planets equals the summation of frequencies of “close-in giant” and “close-in intermediate-mass” planets. Indeed, under the assumption that the presences of these latter two classes of planets are uncorrelated, one would expect that the frequency of close-in CLS planetary system should be 1(10.072)×(10.15)=21%1-(1-0.072)\times(1-0.15)=21\%, which is consistent with the fraction of planets in the joint parameter space (21±4%21\pm 4\%). In other words, the two planet populations—close-in giant planets and close-in intermediate-mass planets—are uncorrelated. This may be explained by the combination of two different features of close-in planets. On the one hand, the majority of HJs and some WJs are lonely (e.g., Steffen et al., 2012; Hord et al., 2021). On the other hand, at least half of WJs do have nearby, small companions (Huang et al., 2016).

5 Summary

Our present study develops a general method that can recover the intrinsic multiplicity distribution of planets out of observational surveys with incomplete coverage of the planetary parameter space. We then applied it to the CLS Sun-like sample. The key results are summarized in Table 2 and illustrated in Figure 7. A few notable findings to highlight here:

  • About 19.2%19.2\% of Sun-like stars in the CLS sample host giant (>0.3MJ>0.3\,M_{\rm J}) planets within 10 au, with the majority having such planets in the cold (>1>1\,au) region. Of the giant planet hosts, about 40% host at least two such planets.

  • Within the 1 au region, giant planets typically do not find siblings of the same kind, whereas lower-mass (10M0.3MJ10\,M_{\oplus}\mbox{--}0.3\,M_{\rm J}) planets usually do. In total, 21%21\% of Sun-like stars host planets with masses above 10M10\,M_{\oplus} within 1 au.

  • About 25%25\% of Sun-like stars host planets with semi-major axis a<10a<10\,au and RV semi-amplitude K>3ms1K>3\,{\rm m\,s^{-1}}, and about half of them have at least two such planets. With a RV precision of 1ms11\,{\rm m\,s^{-1}}, about 0.5 planets remain to be detected around each of the CLS Sun-like hosts.

  • The CLS Sun-like sample reports a hot Jupiter rate of 2.8%2.8\%, which is a factor of \sim3 higher than what previous studies have found from different star samples.

It is worth noting that, although our method is developed for the RV method, it is generally applicable to other exoplanet-finding techniques such as gravitational microlensing, astrometry, and direct imaging. It also works for the transit method after the geometric transit probability is properly taken into account (Tremaine & Dong, 2012).

I would like to thank Fei Dai and Scott Tremaine for comments and suggestions on an earlier version of the manuscript. This work is supported by the National Science Foundation of China (grant No. 12173021 and 12133005).

References

  • Anglada-Escudé et al. (2010) Anglada-Escudé, G., López-Morales, M., & Chambers, J. E. 2010, ApJ, 709, 168, doi: 10.1088/0004-637X/709/1/168
  • Berger et al. (2018) Berger, T. A., Huber, D., Gaidos, E., & van Saders, J. L. 2018, ApJ, 866, 99, doi: 10.3847/1538-4357/aada83
  • Boisvert et al. (2018) Boisvert, J. H., Nelson, B. E., & Steffen, J. H. 2018, MNRAS, 480, 2846, doi: 10.1093/mnras/sty2023
  • Borucki et al. (2010) Borucki, W. J., Koch, D., Basri, G., et al. 2010, Science, 327, 977, doi: 10.1126/science.1185402
  • Bryan et al. (2019) Bryan, M. L., Knutson, H. A., Lee, E. J., et al. 2019, AJ, 157, 52, doi: 10.3847/1538-3881/aaf57f
  • Bryan et al. (2016) Bryan, M. L., Knutson, H. A., Howard, A. W., et al. 2016, ApJ, 821, 89, doi: 10.3847/0004-637X/821/2/89
  • Chatterjee et al. (2008) Chatterjee, S., Ford, E. B., Matsumura, S., & Rasio, F. A. 2008, ApJ, 686, 580, doi: 10.1086/590227
  • Ciardi et al. (2013) Ciardi, D. R., Fabrycky, D. C., Ford, E. B., et al. 2013, ApJ, 763, 41, doi: 10.1088/0004-637X/763/1/41
  • Cumming et al. (2008) Cumming, A., Butler, R. P., Marcy, G. W., et al. 2008, PASP, 120, 531, doi: 10.1086/588487
  • Dawson & Johnson (2018) Dawson, R. I., & Johnson, J. A. 2018, ARA&A, 56, 175, doi: 10.1146/annurev-astro-081817-051853
  • Fabrycky et al. (2014) Fabrycky, D. C., Lissauer, J. J., Ragozzine, D., et al. 2014, ApJ, 790, 146, doi: 10.1088/0004-637X/790/2/146
  • Fang & Margot (2012) Fang, J., & Margot, J.-L. 2012, ApJ, 761, 92, doi: 10.1088/0004-637X/761/2/92
  • Fernandes et al. (2019) Fernandes, R. B., Mulders, G. D., Pascucci, I., Mordasini, C., & Emsenhuber, A. 2019, ApJ, 874, 81, doi: 10.3847/1538-4357/ab0300
  • Fulton et al. (2021) Fulton, B. J., Rosenthal, L. J., Hirsch, L. A., et al. 2021, ApJS, 255, 14, doi: 10.3847/1538-4365/abfcc1
  • Gould et al. (2010) Gould, A., Dong, S., Gaudi, B. S., et al. 2010, ApJ, 720, 1073, doi: 10.1088/0004-637X/720/2/1073
  • He et al. (2019) He, M. Y., Ford, E. B., & Ragozzine, D. 2019, MNRAS, 490, 4575, doi: 10.1093/mnras/stz2869
  • He et al. (2020) He, M. Y., Ford, E. B., Ragozzine, D., & Carrera, D. 2020, AJ, 160, 276, doi: 10.3847/1538-3881/abba18
  • Hord et al. (2021) Hord, B. J., Colón, K. D., Kostov, V., et al. 2021, AJ, 162, 263, doi: 10.3847/1538-3881/ac2602
  • Huang et al. (2016) Huang, C., Wu, Y., & Triaud, A. H. M. J. 2016, ApJ, 825, 98, doi: 10.3847/0004-637X/825/2/98
  • Jurić & Tremaine (2008) Jurić, M., & Tremaine, S. 2008, ApJ, 686, 603, doi: 10.1086/590047
  • Knutson et al. (2014) Knutson, H. A., Fulton, B. J., Montet, B. T., et al. 2014, ApJ, 785, 126, doi: 10.1088/0004-637X/785/2/126
  • Lissauer et al. (2011) Lissauer, J. J., Ragozzine, D., Fabrycky, D. C., et al. 2011, ApJS, 197, 8, doi: 10.1088/0067-0049/197/1/8
  • Mayor et al. (2011) Mayor, M., Marmier, M., Lovis, C., et al. 2011, arXiv e-prints, arXiv:1109.2497. https://arxiv.org/abs/1109.2497
  • Millholland et al. (2017) Millholland, S., Wang, S., & Laughlin, G. 2017, ApJ, 849, L33, doi: 10.3847/2041-8213/aa9714
  • Mulders et al. (2018) Mulders, G. D., Pascucci, I., Apai, D., & Ciesla, F. J. 2018, AJ, 156, 24, doi: 10.3847/1538-3881/aac5ea
  • Murchikova & Tremaine (2020) Murchikova, L., & Tremaine, S. 2020, AJ, 160, 160, doi: 10.3847/1538-3881/abab9e
  • Pu & Wu (2015) Pu, B., & Wu, Y. 2015, ApJ, 807, 44, doi: 10.1088/0004-637X/807/1/44
  • Rosenthal et al. (2021a) Rosenthal, L. J., Fulton, B. J., Hirsch, L. A., et al. 2021a, ApJS, 255, 8, doi: 10.3847/1538-4365/abe23c
  • Rosenthal et al. (2021b) Rosenthal, L. J., Knutson, H. A., Chachan, Y., et al. 2021b, arXiv e-prints, arXiv:2112.03399. https://arxiv.org/abs/2112.03399
  • Steffen et al. (2012) Steffen, J. H., Ragozzine, D., Fabrycky, D. C., et al. 2012, Proceedings of the National Academy of Science, 109, 7982, doi: 10.1073/pnas.1120970109
  • Thompson et al. (2018) Thompson, S. E., Coughlin, J. L., Hoffman, K., et al. 2018, ApJS, 235, 38, doi: 10.3847/1538-4365/aab4f9
  • Tremaine & Dong (2012) Tremaine, S., & Dong, S. 2012, AJ, 143, 94, doi: 10.1088/0004-6256/143/4/94
  • Weiss et al. (2018) Weiss, L. M., Marcy, G. W., Petigura, E. A., et al. 2018, AJ, 155, 48, doi: 10.3847/1538-3881/aa9ff6
  • Wittenmyer et al. (2019) Wittenmyer, R. A., Bergmann, C., Horner, J., Clark, J., & Kane, S. R. 2019, MNRAS, 484, 4230, doi: 10.1093/mnras/stz236
  • Wittenmyer et al. (2020) Wittenmyer, R. A., Wang, S., Horner, J., et al. 2020, MNRAS, 492, 377, doi: 10.1093/mnras/stz3436
  • Wright et al. (2009) Wright, J. T., Upadhyay, S., Marcy, G. W., et al. 2009, ApJ, 693, 1084, doi: 10.1088/0004-637X/693/2/1084
  • Wright et al. (2011) Wright, J. T., Veras, D., Ford, E. B., et al. 2011, ApJ, 730, 93, doi: 10.1088/0004-637X/730/2/93
  • Yang et al. (2020) Yang, J.-Y., Xie, J.-W., & Zhou, J.-L. 2020, AJ, 159, 164, doi: 10.3847/1538-3881/ab7373
  • Zhu (2020) Zhu, W. 2020, AJ, 159, 188, doi: 10.3847/1538-3881/ab7814
  • Zhu & Dong (2021) Zhu, W., & Dong, S. 2021, ARA&A, 59, doi: 10.1146/annurev-astro-112420-020055
  • Zhu et al. (2018) Zhu, W., Petrovich, C., Wu, Y., Dong, S., & Xie, J. 2018, ApJ, 860, 101, doi: 10.3847/1538-4357/aac6d5
  • Zhu & Wu (2018) Zhu, W., & Wu, Y. 2018, AJ, 156, 92, doi: 10.3847/1538-3881/aad22a
  • Zink et al. (2019) Zink, J. K., Christiansen, J. L., & Hansen, B. M. S. 2019, MNRAS, 483, 4479, doi: 10.1093/mnras/sty3463