This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Massive Black Hole Mergers with Orbital Information: Predictions from the ASTRID Simulation

Nianyi Chen,1 Yueying Ni,1,2 A. Miguel Holgado,1 Tiziana Di Matteo,1,2 Michael Tremmel,3 Colin DeGraf,4 Simeon Bird,5 Rupert Croft,1,2 Yu Feng6
1 McWilliams Center for Cosmology, Department of Physics, Carnegie Mellon University, Pittsburgh, PA 15213
2 NSF AI Planning Institute for Physics of the Future, Carnegie Mellon University, Pittsburgh, PA 15213, USA
3 Astronomy Department, Yale University, P.O. Box 208120, New Haven, CT 06520, USA
4 Department of Physics, Truman State University, Kirksville, MO 63501, USA
5 Department of Physics & Astronomy, University of California, Riverside, 900 University Ave., Riverside, CA 92521, USA
6 Berkeley Center for Cosmological Physics and Department of Physics, University of California, Berkeley, CA 94720, USA
E-mail: nianyic@andrew.cmu.edu
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract

We examine massive black hole (MBH) mergers and their associated gravitational wave signals from the large-volume cosmological simulation Astrid . Astrid includes galaxy formation and black hole models recently updated with a MBH seed population between 3×104h1M3\times 10^{4}{h^{-1}M_{\odot}} and 3×105h1M3\times 10^{5}{h^{-1}M_{\odot}} and a sub-grid dynamical friction (DF) model to follow the MBH dynamics down to 1.5ckpc/h1.5\;\text{ckpc}/h. We calculate initial eccentricities of MBH orbits directly from the simulation at kpc-scales, and find orbital eccentricities above 0.70.7 for most MBH pairs before the numerical merger. After approximating unresolved evolution on scales below 200pc{\sim 200\,\text{pc}}, we find that the in-simulation DF on large scales accounts for more than half of the total orbital decay time (500Myrs\sim 500\,\text{Myrs}) due to DF. The binary hardening time is an order of magnitude longer than the DF time, especially for the seed-mass binaries (MBH<2MseedM_{\text{BH}}<2M_{\text{seed}}). As a result, only 20%\lesssim 20\% of seed MBH pairs merge at z>3z>3 after considering both unresolved DF evolution and binary hardening. These z>3z>3 seed-mass mergers are hosted in a biased population of galaxies with the highest stellar masses of >109M>10^{9}\,M_{\odot}. With the higher initial eccentricity prediction from Astrid , we estimate an expected merger rate of 0.30.70.3-0.7 per year from the z>3z>3 MBH population. This is a factor of 7\sim 7 higher than the prediction using the circular orbit assumption. The LISA events are expected at a similar rate, and comprise 60%\gtrsim 60\% seed-seed mergers, 30%\sim 30\% involving only one seed-mass MBH, and 10%\sim 10\% mergers of non-seed MBHs.

keywords:
gravitational waves – methods: numerical – quasars: supermassive black holes.
pubyear: 2021

1 Introduction

Massive Black Holes (MBHs) are known to exist at the center of galaxies (e.g. Soltan, 1982; Kormendy & Richstone, 1995; Magorrian et al., 1998; Kormendy & Ho, 2013). As these galaxies merge (e.g. Lacey & Cole, 1993; Lotz et al., 2011; Rodriguez-Gomez et al., 2015), the MBHs that they host will also merge, resulting in the mass growth of the MBH population (e.g. Begelman et al., 1980). MBH mergers following their host galaxy mergers are an important aspect of the growth for MBHs in dense environments (e.g. Kulier et al., 2015). Even more importanly, as a by-product of MBH mergers, gravitational waves are emitted, and their detection opens up a new channel for probing the formation and evolution of early MBHs in the universe (e.g. Sesana et al., 2007a; Barausse, 2012).

The gravitational wave (GW) detection by LIGO (Abbott et al., 2016) proves the experimental feasibility of using gravitational waves for studying black hole (BH) binaries. While LIGO cannot detect GWs from binaries more massive than 100M\sim 100M_{\odot} (Mangiagli et al., 2019), long-baseline experiments are being planned for detections of more massive BH binaries. Specifically, the upcoming Laser Interferometer Space Antenna (LISA) (Amaro-Seoane et al., 2017) mission will be sensitive to low-frequency (10410110^{-4}-10^{-1}Hz) gravitational waves from the coalescence of MBHs with masses 104107M10^{4}-10^{7}M_{\odot} up to z20z\sim 20. At lower frequencies, Pulsar Timing Arrays (PTAs) are already collecting data and the Square Kilometer Array (SKA) in the next decade will be a major leap forward in sensitivity. While MBH binaries are the primary sources for PTAs and LISA, these two experiments probe different stages of MBH evolution. PTAs are most sensitive to the early inspiral (orbital periods of years or longer) of nearby (z<1z<1) (massive) sources (Mingarelli et al., 2017). In contrast, LISA is sensitive to the inspiral, merger, and ringdown of MBH binaries at a wide range of redshifts (Amaro-Seoane et al., 2012).

GWs from MBH mergers will provide a unique way of probing the high-redshift universe and understanding the early formation of the MBH seeds, especially when combined with observations of the electromagnetic (EM) counterparts (Natarajan et al., 2017; DeGraf & Sijacki, 2020). For instance, a MBH merger multi-messenger detections should allow us to distinguish between different MBH seeding mechanisms at high-redshift (Ricarte & Natarajan, 2018), to obtain information on the dynamical evolution of massive black holes (Bonetti et al., 2019), and to gain information about the gas properties within the accretion disc (Derdzinski et al., 2019).

To properly access the potential of the upcoming GW signals as well as the EM observations of MBH binaries, we need to gain a thorough understanding of the physics of these events with theoretical tools and be able to make statistical predictions for the binary population. Early studies have provided merger rate predictions for MBH binaries using analytic models (e.g. Haehnelt, 1994; Jaffe & Backer, 2003; Wyithe & Loeb, 2003). Some more recent predictions made use of semi-analytic models (e.g. Sesana et al., 2004; Tanaka & Haiman, 2009; Barausse, 2012; Ricarte & Natarajan, 2018) to enhance the model complexity and physical realism. Recent developments in large-volume cosmological simulations (e.g. Hirschmann et al., 2014; Vogelsberger et al., 2014; Schaye et al., 2015; Feng et al., 2016; Volonteri et al., 2016; Pillepich et al., 2018; Davé et al., 2019) have enabled the study of MBH mergers within the context of cosmological galaxy formation (e.g. Salcido et al., 2016; Kelley et al., 2017a; Katz et al., 2020; Volonteri et al., 2020). These simulations directly associate MBH binaries with their host galaxies, and they are carried out in large enough cosmological volumes to provide the statistical power to make merger rate predictions across cosmic time which are crucial for the upcoming observations.

In order to accurately predict when MBH mergers occur in these simulations, one must account for the orbital decay and binary hardening timescales in a wide dynamical range. During galaxy mergers, the central MBHs start at large separation in the remnant galaxy (as much as a few tens of kpc). These MBHs then gradually lose their orbital energy and sink to the center of the remnant galaxy due to the dynamical friction exerted by the gas, stars and dark matter around them (e.g. Chandrasekhar, 1943; Ostriker, 1999). When their separation is 1\lesssim 1 parsec, a MBH binary forms and other energy-loss channels begin to dominate, such as scattering with stars (e.g. Quinlan, 1996; Sesana et al., 2007b; Vasiliev et al., 2015), gas drag from the circumbinary disk (e.g. Haiman et al., 2009), or, if relevant, three-body scattering with a third black hole (e.g. Bonetti et al., 2018).

Among these processes, only the dynamical friction decay affects the dynamics at orbital separation above the resolution of large-volume cosmological simulations. However, so far there is limited attempt to directly model dynamical friction (at small scales, close to the resolution) in the large-volume cosmological simulations mentioned above. In most cosmological simulations, once MBHs are within a given halo, they are simply repositioned to the minimum potential position of the host galaxy at each time-step. For these simulations, (although sometimes the effects of subgrid dynamical friction are treated in post-processing), many spurious mergers occur during fly-by encounters. Among simulations that do include subgrid modeling of DF on-the-fly, Dubois et al. (2014) only includes the friction from gas but not stars, while Tremmel et al. (2017) and Hirschmann et al. (2014) model the dynamical friction from stars and dark matter particles.

Here we study MBH mergers using the large volume cosmological simulation Astrid which uses a novel power-law seeding with a range of MBH seed masses and so includes relatively low mass MBHs. More importantly, it directly incorporates additional dynamical friction modeling, following the recent model by (Chen et al., 2021) for the MBH dynamics down to the resolution limit (see also similar implementations by Hirschmann et al., 2014; Tremmel et al., 2015). With more physical modeling of the MBH dynamics, we can follow the in-simulation mergers for a more extended period of time over hundreds of Myrs, and almost completely prevent mergers during fly-by encounters. Moreover, for the first time we can aim to measure the orbital evolution and eccentricities of MBH pairs on sub-kpc scales. Such information should be important both for estimating the binary hardening timescales, and for predicting the GW signals from the MBH mergers.

This paper is organized as follows: in Section 2 we introduce the Astrid simulation, in particular the MBH modeling, and describe how we obtain the merger catalog from the simulation; in Section 3, we describe our methods for measuring the MBH orbital eccentricity from the simulation, and present results of our measurements. Section 4 focuses on the modeling of post-processing delay times including the dynamical friction time and binary hardening time after the numerical merger. Then in Section 5, we present our prediction for MBH merger rate at z>3z>3, and investigate the properties of high-redshift MBH merger systems. Finally, in Section 6 we show the GW strain and signal-to-noise ratios for the binary population that merges at z>3z>3.

2 Simulation

2.1 The Astrid Simulation

The Astrid simulation is a large-scale cosmological hydrodynamic simulation in a 250Mpc/h250\,{\rm Mpc}/h box with 2×550032\times 5500^{3} particles. Astrid contains a statistical sample of halos which can be compared to future survey data from JWST, while resolving galactic halos down to 109M10^{9}\,M_{\odot} (corresponding to 200 dark matter particles). Astrid has been run from z=99z=99 to z=3z=3. It contains models for inhomogeneous hydrogen and helium reionization, baryon relative velocities and massive neutrinos, as well as ’full-physics’ star formation model, BH accretion and associated supernova and AGN feedback respectively. The BH model includes mergers driven by dynamic friction rather than repositioning. Our treatment of MBHs largely follows the BlueTides  simulation in terms of the BH accretion and feedback, which is based on the earlier work by Di Matteo et al. (2005); Springel et al. (2005). Compared with BlueTides , we slightly changed the seeding scheme of MBHs by drawing the seed mass from a power-law distribution instead of using a universal seed mass. Furthermore, we use a dynamical friction model (tested and validated in Chen et al., 2021) to evolve the binary black holes and include the sinking and merger of MBHs in the simulation in a more physical way. Here we briefly summarize the black hole seeding and dynamics treatment in Astrid, and refer to Bird et al. (2021) and Ni et al. (2021) for detailed presentations of physical models for star formation and black holes.

2.1.1 MBH Seeding

To seed MBHs in the simulation, we periodically run a FOF group finder on the fly with a linking length of 0.2 times the mean particle separation, to identify halos with a total mass and stellar mass satisfying the seeding criteria { Mhalo,FOF>Mhalo,thrM_{\rm halo,FOF}>M_{\rm halo,thr}; M,FOF>M,thrM_{\rm*,FOF}>M_{\rm*,thr}}. We apply a mass threshold value of Mhalo,thr=5×109h1MM_{\rm halo,thr}=5\times 10^{9}{h^{-1}M_{\odot}} and M,thr=2×106h1MM_{\rm*,thr}=2\times 10^{6}{h^{-1}M_{\odot}}.

Considering the complex astrophysical process involved in BH seed formation in realistic cases, halos with the same mass can form different mass MBH seeds. Therefore, in Astrid, instead of applying a uniform seed mass for all MBHs, we probe a mass range of the MBH seed mass MseedM_{\rm seed} drawn probabilistically from a power-law distribution:

P(Mseed)={0Mseed<Mseed,min𝒩(Mseed)nMseed,minMseedMseed,max0Mseed>Mseed,maxP(M_{\rm seed})=\begin{cases}0&M_{\rm seed}<M_{\rm seed,min}\\ \mathcal{N}(M_{\rm seed})^{-n}&M_{\rm seed,min}\leq M_{\rm seed}\leq M_{\rm seed,max}\\ 0&M_{\rm seed}>M_{\rm seed,max}\end{cases} (1)

where 𝒩\mathcal{N} is the normalization factor. The minimum seed mass is Mseed,min=3×104h1MM_{\rm seed,min}=3\times 10^{4}{h^{-1}M_{\odot}} and the maximum seed mass is Mseed,max=3×105h1MM_{\rm seed,max}=3\times 10^{5}{h^{-1}M_{\odot}}, with a power-law index n=1n=-1. For each halo that satisfies the seeding criteria but does not already contain at least one BH particle, we convert the densest gas particle into a BH particle. The new-born BH particle inherits the position and velocity of its parent gas particle.

2.1.2 MBH Dynamics

Instead of constantly repositioning the black hole towards the potential minimum, as in earlier simulations, in Chen et al. (2021) we implemented and tested a model for sub-grid dynamical friction (similar to Tremmel et al., 2015; Tremmel et al., 2017). Dynamical friction is an artificial force for modelling unresolved small-scale interactions between the MBH and nearby stars and dark matter. These interactions transfer momentum from the MBH to individual stars in the surrounding star clusters, gradually reducing the momentum of the MBH particle relative to the surrounding collisionless objects in the bulge (e.g. Governato et al., 1994; Kazantzidis et al., 2005). The additional dynamical friction also stabilizes the MBH motion at the center of the galaxy.

We estimate dynamical friction on MBHs using Eq. 8.3 of Binney & Tremaine (2008):

𝐅DF=16π2G2MBH2malog(Λ)𝐯BHvBH30vBH𝑑vava2f(va),\mathbf{F}_{\rm DF}=-16\pi^{2}G^{2}M_{\rm BH}^{2}m_{a}\;\text{log}(\Lambda)\frac{\mathbf{v}_{\rm BH}}{v_{\rm BH}^{3}}\int_{0}^{v_{\rm BH}}dv_{a}v_{a}^{2}f(v_{a}), (2)

where MBHM_{\rm BH} is the BH mass, vBH\textbf{v}_{\rm BH} is the BH velocity relative to its surrounding medium, mam_{a} and vav_{a} are the masses and velocities of the particles surrounding the BH, and log(Λ)=log(bmax/bmin)\text{log}(\Lambda)=\text{log}(b_{\rm max}/b_{\rm min}) is the Coulomb logarithm that accounts for the effective range of the friction between the specified bminb_{\rm min} and bmaxb_{\rm max}. f(va)f(v_{a}) in Eq. 2 is the velocity distribution of the surrounding collisionless particles including both stars and dark matter. Here we have assumed an isotropic velocity distribution of the particles surrounding the BH so that we are left with a 1D integration.

In Astrid, the BH seed mass extends down to 3×104M/h3\times 10^{4}M_{\odot}/h, which is one order of magnitude smaller than the stellar particle mass. In this regime, the dynamical friction of BH is likely unrealistic due to its small mass compared to the masses of other particles, and so the dynamics of the seed BH would be unstable due to dynamical heating (when MBHM_{\rm BH} is below the mass resolution). Therefore, we boost the dynamical friction in this regime with Mdyn=2×MDMM_{\rm dyn}=2\times M_{\rm DM} when MBH<Mdyn<1M_{\rm BH}<M_{\rm dyn}<1. This temporarily boosts the BH dynamical mass for BHs near the seed mass and helps stabilize their motion during the early post-seeding evolution.

We approximate the distribution function f(va)f(v_{a}) by the Maxwellian distribution (as, e.g. Binney & Tremaine, 2008), and account for the neighbouring collisionless particles up to the range of the SPH kernel of the BH particle (see, Chen et al., 2021, for more details). Eq. 2 becomes

𝐅DF=4πρsph(GMdynvBH)2log(Λ)(vBHσv)𝐯BHvBH.\mathbf{F}_{\rm DF}=-4\pi\rho_{\rm sph}\left(\frac{GM_{\rm dyn}}{v_{\rm BH}}\right)^{2}\;\text{log}(\Lambda)\mathcal{F}\left(\frac{v_{\rm BH}}{\sigma_{v}}\right)\frac{\bf{v}_{\rm BH}}{v_{\rm BH}}. (3)

Here ρsph\rho_{\rm sph} is the density of dark matter and star particles within the SPH kernel, \mathcal{F} is the integral in Equation 2 assuming a Maxwellian distribution of stellar velocities. σv\sigma_{v} is the velocity dispersion of the dark matter and star particles within the SPH kernel.

The boost of the initial MdynM_{\rm dyn} may overestimate the dynamical friction for small BHs and the resultant sinking timescale will be shortened by a factor of MBH/Mdyn\sim M_{\rm BH}/M_{\rm dyn} compared to the no-boost case. On the other hand, it is also possible that the BH sinking time scale estimated in our simulation in the no-boost case could overestimate the true sinking time, as the high-density stellar bulges are not fully resolved (e.g. Antonini & Merritt, 2012; Dosopoulou & Antonini, 2017; Biernacki et al., 2017). Therefore, boosting the initial MdynM_{\rm dyn} seems a reasonable compromise to model the dynamics of small mass BHs while also alleviating the noisy perturbation of dynamic heating brought by the limit of resolution. Note that even if our dynamic friction implementation overestimates the force, it still provides a substantially more conservative estimation of BH sinking than the common model where BHs are repositioned to the potential minimum.

2.2 Merger Catalog

There are a total of 445635 BH mergers in the simulation for z>3z>3. For each merger event we extract the relevant environmental variables (the density profiles of gas, dark matter and stars, and the stellar velocity dispersion) from the nearest snapshot before and after the merger. The snapshots used are separated by 20\sim 20 Myrs. In a small fraction of cases, the mergers take place within 20Myrs20\,{\rm Myrs} after one of the MBHs are born, and so we cannot find the corresponding MBH in the previous snapshot. We remove these mergers from the catalog, after which 440999 mergers remain.

From the snapshots immediately before and after the merger, we identify the host halos and subhalos containing the binaries using FOF and SubFind, respectively. Out of the mergers that remain in the catalog, we further remove those not associated with any halo/subhalo, and those whose host galaxy has less than 200200 star particles. The hosts for these binaries are not well resolved in our simulation, so we cannot reliably compute the binary hardening time in post processing. This leaves us with a final catalog of 430938 black hole merger candidates. For each host halo identified, we define the halo center as the position of the particle with the minimum potential, and calculate galaxy properties such as the density profiles and half-mass radius with respect to this point.

In Figure 1, we show the last few orbits of a few selected BH pairs in our merger catalog plotted on their host galaxies’ stellar distributions. The distance from left to right of each image is 8ckpc/h8\,{\rm ckpc}/h. The brightness corresponds to the stellar density, and the colours show the stellar age with older stars being redder and younger stars being whiter. The red curves are the BH pairs’ positions relative to their center of mass.

3 Orbital Eccentricity

Refer to caption
Figure 1: The last few orbits (starting from 80Myrs\sim 80\,{\rm Myrs} before the merger) of selected binaries in the Astrid simulation plotted on their host galaxies. The distance from left to right of each image is 8ckpc/h8\,{\rm ckpc}/h. The brightness corresponds to the stellar density, and the colours show the stellar age with older stars being redder. The red curves are the BH pairs’ position relative to their center of mass. In most cases we see a Rosetta orbit, as the local potential is a spherical potential dominated by stars and dark matter. We find that some orbits circularize over time (e.g. third row, fifth column), although the majority of the orbits still remain eccentric when merging (see e.g. Figure 2).
Refer to caption
Figure 2: Comparison between eccentricity measurements from the shape method and the energy method. Left: the distribution of the (generalized) orbital eccentricity from the two measurements. In both cases, the distribution is dominated by highly-eccentric binaries, as we can also see from the images in Figure 1. The shape method has a more skewed distribution compare to the energy method. Middle: A scattered plot of the eccentricity from the two measurements. We can see that the two measurements yield similar results by comparing the distribution to the diagonal line. In most cases, the energy measurement is 10%\sim 10\% lower than the shape measurement. Right: In addition to the eccentricity, we show the apoapses and periapases of the two measurements. The orange dots are the apoapses and the green dots are the periapases. The scatter relation also follows the diagonal line quite closely. When the two black holes merge in the simulation, the apoapsis is usually a few kpc and the periapsis is usually less than 1kpc.

As was described in Section 2.1, our simulation has a build-in sub-grid dynamical friction model, which allows us to follow the black holes’ orbits before their numerical mergers down to the resolution limit. Figure 1 shows several examples of the last few orbits of BH pairs just before they merge in the simulation. The black hole orbits are plotted in the center-of-mass frame of the BH pairs, with a face-on projection on the 2D plane perpendicular to the mean angular momentum of the last orbit. Since we record the BH information at each time step when the BH is active, the orbits are much better resolved in time compared with the galaxies. Most orbits start off at a semi-major axis of >1>1 kpc, and gradually go through orbital decay until merger.

From the images, we see that the majority of the orbits are very non-circular during the initial encounter of the BHs. While some of them circularize with time, most orbits still retain a high eccentricity at the time of merger in the simulation. This motivates us to characterize the orbital eccentricity before merging, as it is an important piece of information not only for estimating the binary hardening time with analytical models, but also for calculating the GW signals from the merger events. In this section, we will describe two ways of characterizing the orbital eccentricities of the BH pairs in our simulation.111We also tried applying the method of osculating elements (e.g., Efroimsky & Goldreich, 2004, and references therein) to the orbital trajectories; however, we found that the stellar environment dominated the binary’s evolution, such that it could not be adequately described as a post-Keplerian orbit.

3.1 Shape Measurement

Given the images in Figure 1, a natural way of measuring the orbital eccentricity is to use the shape of the orbits just before the numerical merger, and this is the first approach we take.

On kpc\sim{\rm kpc} scales, since most orbits are not Keplerian except those of the most massive BHs and the orbits are constantly shrinking, the BH orbits do not fit an ellipse. Instead, they exhibit the feature of a Rosetta orbit (the feature is most prominent in e.g. second row, second column of Figure 1, although standard Rosetta orbits do not shrink over time). For orbits resulted from the spherically symmetric potential, we can characterize the eccentricity by the size of the inner radius and the size of the outer radius. More specifically, for each orbit, we define Δr2\Delta r_{2} to be position of the secondary BH with respect to the center of mass, and we take the local minimum of Δr2\Delta r_{2} as the (generalized) periapsis of the orbit, and the local maximum of Δr2\Delta r_{2} as the apoapsis. Then, we represent the orbital eccentricity of the binary by the generalized eccentricity, defined for a spherical potential as: (Binney & Tremaine, 2008):

ϵ=raporperirapo+rperi,\epsilon=\frac{r_{\rm apo}-r_{\rm peri}}{r_{\rm apo}+r_{\rm peri}}, (4)

where rapor_{\rm apo} and rperr_{\rm per} are the apoapsis and the periapsis of the orbit, respectively. To distinguish between the measurement of the two methods, we will use the subscript ”sh” to refer to the measurements from this shape-based method. We average the eccentricity measurements over the last three orbits. We note, however, that the distribution in eccentricity does not change significantly when we take the average of the last one, two or three orbits.

3.2 Solving the Orbital Equation

In addition to the shape-based measurement in §3.1, we also calculate the generalized orbital eccentricity by simply solving the orbital equation. Using these two independent methods we will then be able to compare the robustness of the BH orbit eccentricity distribution measurement from the simulations.

When the BH merger occurs in the simulation, the separation between the black hole pair is 3\sim 3 ckpc/hh. At this distance, the gravitational potential is dominated by the surrounding stars and dark matter particles instead of the BHs themselves. Under such circumstances, the orbit of the satellite BH is non-Keplerian, as we have shown in Section 3.1. In the case of a spherical potential, the (generalized) apoapsis and periapsis can be obtained by solving the generalized orbital equation (Binney & Tremaine, 2008):

1r2+2[ϕ(r)E]J2=0,\frac{1}{r^{2}}+\frac{2[\phi(r)-E]}{J^{2}}=0, (5)

where ϕ(r)\phi(r) is the gravitational potential computed from the density profile of surrounding particles, EE is the total energy per unit mass and JJ is the angular momentum per unit mass of the secondary black hole with respect to the host galaxy center. The larger root of the equation corresponds to rapor_{\rm apo} and the smaller root is rperir_{\rm peri}.

When solving Equation 5, we take EE and JJ to be the average energy and angular momentum over the last half-orbit (i.e. from the last local maximum to the last local minimum of the BH separation) of the BH. We did not take the average over a more extended period of time because the black hole pair is constantly losing energy. After getting the two apses, we again use Equation 4 to calculate the orbital eccentricity. We refer to this method as the energy method, and use subscript ”en” when showing results.

Figure 2 shows a comparison between the (generalized) eccentricity measurements from the shape method and the energy method. The left panel shows the distribution of eccentricities for all the mergers in the simulation. The measurements from both methods show that the BH binary population dominated by highly eccentric orbits, with a peak at ϵ0.85\epsilon\sim 0.85 for the shape-based method and 0.75\sim 0.75 for the energy-based method. Comparing the two distributions, we see that the shape measurement generally produces a distribution with higher eccentricities than the energy method. In the middle panel we show a scatter plot of the eccentricities from the two measurements. There is a positive correlation between the two eccentricities, with the majority of the measurements close to the diagonal line. This means that the two measurements are not only close in distribution, but also yield correlated results for each individual orbit. Similar to what shown by the 1D distributions, the shape method predicts higher values of eccentricity for most pairs than the energy method (typically \sim 10% lower).

In addition to the eccentricity, in the right panel we further compare the apoapses and periapases from the two measurements. Overall, we can see that the apoapsis peaks around 131\sim 3 kpc, while the periapsis peaks around 0.10.70.1\sim 0.7 kpc. Again there is a good alignment between the two measurements, with the peaks distributed close to the diagonal line. In the majority of cases the shape measurement gives an larger apoapsis value.

To estimate the binary hardening time, we will use the measured binary eccentricities as an input to the model. By doing so, we do not consider any time-evolution of the binary eccentricity due to dynamical friction beyond the point of numerical merger (Colpi et al., 1999; Hashimoto et al., 2003). In particular, we will only show results using the values from the energy-based method (ϵen\epsilon_{\rm en}) in later sections, and we have tested that the effect of using the shape-based values is minor compared with the uncertainties from other sources (e.g. density in the central region of the galaxy).

4 Post-processing Delays

4.1 Dynamical Friction

Refer to caption
Figure 3: Comparison between the pre-merger dynamical friction time and the post-merger dynamical friction time. Top: Distributions of the pre- and post- merger DF times for all MBH pairs in Astrid. The two distributions are similar and both peaks around 200 Myrs, indicating that by adding dynamical friction to the simulation, we have resolved more than half of the total dynamical friction delay. Bottom left: Relation between the DF times and the mass ratio between the two MBHs (qq). We observe the expected negative correlation between DF times and qq. Bottom right: 1D distribution of the mass ratio qq.
Refer to caption
Figure 4: Density profiles (left) and images (right) of the host galaxies of three MBH mergers in the simulation. The blue crosses mark all MBHs in the host galaxy, scaled by the BH mass. The red circles mark the merging binary. Top: Host of a very massive binary with Mtot=5.6×108MM_{\rm tot}=5.6\times 10^{8}M_{\odot} at z=3z=3. The stellar density is the dominant component on scales below 10\sim 10 ckpc/hh. Middle: Host of a binary with Mtot=7.6×106MM_{\rm tot}=7.6\times 10^{6}M_{\odot} at z=3z=3. For this less massive binary, the density of the three components is comparable at r<10r<10 ckpc/h/h, and the density profile flattens at a larger radius. Bottom: Host of a binary with two seed-mass MBHs. The mass of the host galaxy is high relative to the binary mass. The binary is not the most massive MBHs in this galaxy, but the merger still occurs at a relatively central region.
Refer to caption
Figure 5: Left: The density profiles of Astrid galaxies that host a recent numerical merger. The blue solid line shows the median density of all binary hosts measured from the simulation and the shaded region encloses 95% of the population. The power law extrapolation is shown by dashed lines. Here we show the results for extrapolation scales rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g} (purple) and rext=2ϵgr_{\rm ext}=2\epsilon_{g} (green). A larger rextr_{\rm ext} results in a steeper power-law slope. Middle: Distribution of the power-law index of the density profile γ\gamma, measured at rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g} (purple) and rext=2ϵgr_{\rm ext}=2\epsilon_{g} (green). For rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g}, the distribution peaks at γ=1.4\gamma=1.4, while for rext=2.0ϵgr_{\rm ext}=2.0\epsilon_{g}, the distribution peaks at γ=1.9\gamma=1.9. We plot the power-index estimate in Kelley et al. (2017b) for comparison. Right: Distribution of density extrapolated to 10pc10\,{\rm pc}. We compare the two rextr_{\rm ext} values. The extrapolated density is sensitive to the change in rextr_{\rm ext}: rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g} gives a distribution centered at 10M/pc310\,M_{\odot}/{\rm pc}^{3}, while rext=2.0ϵgr_{\rm ext}=2.0\epsilon_{g} gives a distribution centered at 100M/pc3100\,M_{\odot}/{\rm pc}^{3}.

In Astrid, we have already accounted for the dynamical friction timescale above the resolution limit, leading to significant delays of in-simulation mergers compared to the traditional MBH repositioning methods. However, dynamical friction will continue to dominate over other delay processes on scales of 1010010\sim 100 pc (e.g. Kelley et al., 2017a), which is beyond our current resolution. In this section, we will compute the unresolved DF timescales for the MBH mergers, and compare with the in-simulation DF timescale.

For the in-simulation DF time, we measure it in the following way: for each black hole pair that merge within the simulation at zmergez_{\rm merge}, we track their trajectories before the merger event, and find the redshift zencounterz_{\rm encounter} at which they are first within 2ϵg2\epsilon_{g} of each other. zencounterz_{\rm encounter} is the approximate time at which the BHs would merge if we did not account for the dynamical friction time at all (note that under the reposition model, BHs usually merge even before zencounterz_{\rm encounter}). We consider the time difference between zencounterz_{\rm encounter} and zmergez_{\rm merge} as the in-simulation DF time, TDF,simT_{\rm DF,sim}. Among all BH mergers in the simulation, 5713 mergers (1.4%\sim 1.4\% of the whole merger population) happen at the first encounter.

For the post-processed DF time TDF,postT_{\rm DF,post}, we adopt the treatment from Merritt (2013) and Dosopoulou & Antonini (2017), who modifies the Chandrasekhar formalism (e.g. Chandrasekhar, 1943; Binney & Tremaine, 2008) to include the effect of the secondary BH embedded in a tight core of stars brought in from the secondary galaxy. The increased dynamical friction allows the secondary to sink faster towards the primary galaxy’s center, and thus the resulting dynamical friction time is less than the prediction from the canonical Binney & Tremaine (2008) treatment assuming a bare BH. In Dosopoulou & Antonini (2017) the assumption is that the mass of stars bound to the secondary BH is 1000 times the mass of the BH itself, and the resulting dynamical friction timescale is:

tDF,post=0.12Gyr(r10kpc)2(σ300km/s)(108MM2)1log(Λ),t_{\rm DF,post}=0.12\,{\rm Gyr}\left(\frac{r}{10{\rm kpc}}\right)^{2}\left(\frac{\sigma}{300{\rm km/s}}\right)\left(\frac{10^{8}M_{\odot}}{M_{\rm 2}}\right)\frac{1}{{\rm log}(\Lambda)}, (6)

where log(Λ)\text{log}(\Lambda) is the Coulomb logarithm, M2M_{2} is the mass of the secondary black hole. For the initial separation rr, we use the radius of the circular orbit that has the same energy as the last orbit of the binary before the (numerical) merger. Note that the model in Equation 6 does not account for the effect of non-circular orbits on the DF time. Taking the eccentricity into consideration can further reduce the estimated DF time (e.g. Taffoni et al., 2003). Following the method in Chen et al. (2021),we compute the Coulomb logarithm by:

Λ=bmax(GM2)/vBH2,bmax=10 ckpc/h,\Lambda=\frac{b_{\rm max}}{(GM_{\rm 2})/v_{\rm BH}^{2}},\;b_{\rm max}=10\text{ ckpc}/h, (7)

where M2M_{\rm 2} is the mass of the secondary black hole and vBHv_{\rm BH} is the velocity of the secondary black hole with respect to the host galaxy center.

Figure 3 shows the comparison between the in-simulation dynamical friction time and the post-processed dynamical friction time from above. The top panel shows the overall distributions of the DF times. The two distributions are on the same order of magnitude at around 10210^{2} Myrs, with a range from 10 Myrs to 1 Gyrs. For most BH pairs, TDF,simT_{\rm DF,sim} is longer than TDF,postT_{\rm DF,post}. This means that accounting for dynamical friction to the simulation, we have already included about half of the total dynamical friction delay effects. Note that both DF timescales are shorter than 1 Gyr. In the case of the resolved DF time, this is mainly due to the fact that most of the black holes have not existed for more than 1 Gyr at z=3z=3.

In the bottom panel of Figure 3, we show the relation between the two DF times and the binary mass ratio qq. Here we only show the times involving at least one non-seed MBH, defined as mergers with M1>2M1,seedM_{1}>2M_{1,\rm seed}. This is because our merger population is dominated by seed MBHs which have not grown out of their dynamical mass and thus the in-simulation DF time estimation is not exact. From Equation 6, we can see that the DF time is correlated with the mass of the primary galaxy (and MBH) through σ\sigma, and that it is inversely proportional to the secondary black hole mass. Hence, we expect that minor mergers will have longer decay timescales, and in the plot we do see a negative correlation between TDF,postT_{\rm DF,post} and qq. For the in-simulation DF, although this relation is not imposed explicitly, we still observe a negative correlation between qq and the DF time. This indicates that the negative correlation is still captured by the in-simulation dynamical friction modeling.

4.2 Loss Cone Scattering and Gravitational Wave Hardening

Refer to caption
Figure 6: Variables used to calculate dynamical friction and binary hardening timescales. Left: MtotσM_{\rm tot}-\sigma relation measured from all the binaries at the time of merger in the simulation, compared to the analytical relation given in Tremaine et al. (2002) and Kormendy & Ho (2013). Middle: The influence radius derived from γ\gamma and σ\sigma measured the simulation, compared with the analytical model used in Vasiliev et al. (2015) (green dashed line), and in Sesana (2010) with γ=1.5\gamma=1.5 (black dashed line). Our measured σ\sigma and rinfr_{\rm inf} are both close to the analytical models. Right: Density at influence radius extrapolated from the simulation. To illustrate the effect of extrapolation scales on ρinf\rho_{\rm inf}, we show the resulting extrapolation from both 1.5ϵg1.5\epsilon_{g} (pink dots) and 2.0ϵg2.0\epsilon_{g} (green contour). As was demonstrated in Figure 5, the density extrapolation is sensitive to the starting point of the extrapolation. However, even the extrapolated density from an outer radius is smaller compared with the analytical model used in Sesana (2010) with γ=1.5\gamma=1.5 (black dashed line).

Once the two MBHs become gravitationally bound, the dynamical friction formalism is no longer a valid approximation, and individual interactions between singular stars and the binary must be considered. These interactions extract angular momentum from the binary, driving them closer to each other (e.g. Merritt, 2013; Vasiliev & Merritt, 2013). This regime is the loss-cone scattering (LC) regime, which refers to the specific cone in parameter space where stars have to exist in order to extract angular momentum from the binary (e.g. Frank & Rees, 1976; Lightman & Shapiro, 1977). On even smaller scales, the binary will enter the gravitational wave regime where it will evolve until coalescence. Once the binary enters the gravitational wave regime, its dynamics follow the formalism of Peters (1964) at small separations of 10100010-1000 Schwarzschild radii.

For the loss-cone scattering and gravitational-wave hardening phase, we adopt the analytical prescription in Vasiliev et al. (2015) (V15 hereafter). However, the time estimation in Equation (25) of V15 assumes a single family of MtotσinfrinfM_{\rm tot}-\sigma_{\rm inf}-r_{\rm inf} relation, and thus it may over-simplify the properties of the galaxies hosting the merger events. Hence we will adopt the V15 formalism but with some slight changes, so that we can use the host galaxy properties measured from the simulation. In this Section, we first explain how we measure the relevant galaxy properties, then we give our binary hardening time estimation by combining analytic modeling with the measured properties.

4.2.1 Extrapolated Galaxy Properties

To compute the hardening time for the binaries, the important quantities to measure are: the influence radius rinfr_{\rm inf} defined as the radius containing a stellar mass equal to two times the binary mass, the velocity dispersion of stars at the influence radius σinf\sigma_{\rm inf}, the power-law slope of the stellar density profile γ\gamma, and the stellar density at the influence radius ρinf\rho_{\rm inf}. Since the binary hardening phase begins after the dynamical friction phase, we use the snapshot immediately following the numerical merger to measure these properties.

Among the quantities above, the velocity dispersion can be measured directly from the simulation without extrapolation (for an isothermal sphere, the velocity dispersion is independent of radius). Therefore, we take the approximation that σinf=σgal\sigma_{\rm inf}=\sigma_{\rm gal}, and use the measured velocity dispersion within the half-mass radius of the host galaxy.

The next galaxy property we measure from the simulation is the power-law slope γ\gamma of the stellar density profile. In Figure 4, we show three examples of the density profiles of dark matter, gas and stars for galaxies hosting recently merged binaries. We show the profiles of a massive binary with Mtot=5.6×108MM_{\rm tot}=5.6\times 10^{8}M_{\odot}, a less massive one with Mtot=7.6×106MM_{\rm tot}=7.6\times 10^{6}M_{\odot} at z=3z=3, and a seed-mass binary with Mtot=1.9×106MM_{\rm tot}=1.9\times 10^{6}M_{\odot}. For the most massive binary (top), the stellar density is the dominant component on scales below 10\sim 10 ckpc/hh. The stellar density profile follows a power law down to the gravitational softening length ϵg\epsilon_{g}, where the profile flattens due to gravitational softening. In the case of the medium-mass binary, the density of all three components is comparable at r<10r<10 ckpc/h/h, and the density profile flattens at a larger radius compared to the massive one. In the third case of a seed-seed merger, the mass of the host galaxy is high relative to the binary mass. The binary is not the most massive MBHs in this galaxy, but the merger still occurs at a relatively central region. We note that this binary belongs to the seed MBH population that still merge after the post-processing delays.

As we do not resolve the scale of interest for the loss-cone scattering, we assume that below a scale rextr_{\rm ext} close to the resolution limit ϵg\epsilon_{g}, the stellar density profile follows a single power law ρrγ\rho\propto r^{-\gamma}. By doing so, we are able to extrapolate the stellar density to the inner region of the host galaxy. To measure the value of γ\gamma, we take the measured density from 10 bins just above rextr_{\rm ext}, and fit it to the power law profile. Our choice of rextr_{\rm ext} is motivated by the flattening of the profile at 1.5ϵg\sim 1.5\epsilon_{g} in Figure 4 and the fact that gravity is not well-resolved within 2ϵg\sim 2\epsilon_{g}. Since the exact scale on which the simulation density becomes unrealistic is uncertain, we use both rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g} and rext=2.0ϵgr_{\rm ext}=2.0\epsilon_{g} to bracket our predictions.

The left panel of Figure 5 shows the measured stellar density profiles and extrapolations beyond rextr_{\rm ext} for all binaries in Astrid. We show the median as well as the 95% contour of the measured density, and we compare the power-law extrapolation from rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g} and rext=2.0ϵgr_{\rm ext}=2.0\epsilon_{g}. From the comparison, we see that the measurement of γ\gamma is sensitive to the extrapolation scale, and that larger rextr_{\rm ext} results in a steeper power-law slope and thus a higher density at the inner region. However, we also note that the shift due to rextr_{\rm ext} is comparable to the width of the distribution, and that both measurements are consistent with the values assumed in various binary hardening models.

This is further illustrated by the middle/right panel of Figure 5, where we show the distribution of the measured γ\gamma and the density extrapolated to 1010 pc. For rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g}, the distribution peaks at γ=1.4\gamma=1.4, while for rext=2.0ϵgr_{\rm ext}=2.0\epsilon_{g}, the distribution peaks at γ=1.9\gamma=1.9. These values are consistent with the range of values used in most loss-cone scattering models (e.g. Sesana, 2010; Merritt, 2013; Vasiliev et al., 2015; Sesana & Khan, 2015). In the figure, we also compared our distributions with the measured distribution in Kelley et al. (2017b) from the Illustris simulation. Compared to Kelley et al. (2017b), our measured profiles are significantly steeper, which also lead to a higher extrapolated density at r=10pcr=10pc. Our simulation has a higher resolution than Illustris, and thus resolves the stellar density profiles better on kpc scales. This is also due to the fact that we begin our extrapolation at different scales: Kelley et al. (2017b) uses the inner-most eight density bins that contains at least four particles, which could lie well below the gravitational softening. From the right panel, we see that the extrapolated density is sensitive to the change in rextr_{\rm ext}: rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g} gives a distribution centered at 10M/pc310\,M_{\odot}/pc^{3}, while rext=2.0ϵgr_{\rm ext}=2.0\epsilon_{g} gives a distribution centered at 100M/pc3100\,M_{\odot}/pc^{3}. The order-of-magnitude difference motivates us to propagate the uncertainty in rextr_{\rm ext} throughout subsequent analyses, as it may have non-trivial impacts on the final merger rate predictions from the simulation.

Finally, we compute rinfr_{\rm inf} and ρinf\rho_{\rm inf} from the quantities measured above. As we cannot resolve the inner cusp of the galaxies in our simulation, a direct measurement of rinfr_{\rm inf} is not possible. To estimate the influence radius, we adopt the analytical relation (e.g. Sesana, 2010):

rinf=(3γ)GMtotσinf2,r_{\rm inf}=(3-\gamma)\frac{GM_{\rm tot}}{\sigma_{\rm inf}^{2}}, (8)

where γ\gamma is the density power law slope we just showed, and σinf\sigma_{\rm inf} is approximated by the measured galaxy velocity dispersion.

To get the density at the influence radius ρinf\rho_{\rm inf}, we extrapolate the power-law relation of the density profile down to rinfr_{\rm inf}, using the measured γ\gamma and ρ\rho. Note that our simulation does not resolve the high-density peaks below our resolution, or nuclear star clusters, and thus the extrapolated ρinf\rho_{\rm inf} is likely a lower limit. Moreover, since the nuclear star clusters are not resolved, we do not account for effects such as tidal disruption, which can to a shorter binary hardening time (e.g. Arca-Sedda & Gualandris, 2018; Biava et al., 2019; Ogiya et al., 2020).

Figure 6 shows all of the measured or derived variables for computing the binary hardening timescales, and their relation with the binary mass. The MBHσM_{\rm BH}-\sigma relation follows the relation in Tremaine et al. (2002) for binaries with Mtot>2×106MM_{\rm tot}>2\times 10^{6}M_{\odot}, but is flatter compared to the relation in Kormendy & Ho (2013). There is a large scatter in σ\sigma for seed-mass binaries. Since the influence radius rinfr_{\rm inf} is derived from σ\sigma, γ\gamma and the binary mass, we expect it to stay close to the analytical models from binary hardening papers. Here we compared it with the analytical model adopted in Sesana (2010) and Vasiliev et al. (2015). Our values are in line with the Sesana (2010) model with a constant γ=1.5\gamma=1.5, although the scatter is large. This is also consistent with the fact that our distribution in γ\gamma peaks around γ=1.4\gamma=1.4 when measured at rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g}.

Finally, in the right panel we show the density at the influence radius extrapolated from the simulation. To illustrate the effect of extrapolation scales on ρinf\rho_{\rm inf}, we show the resulting extrapolation from both 1.5ϵg1.5\epsilon_{g} and 2.0ϵg2.0\epsilon_{g}. As shown in Figure,5, the density extrapolation is very sensitive to the starting point of the extrapolation. Shifting the starting point by 0.5ϵg=0.75ckpc/h0.5\epsilon_{g}=0.75{\rm ckpc}/h can result in an order of magnitude difference in ρinf\rho_{\rm inf}. However, we note that even the density extrapolated from the outer radius is smaller than the analytical model used in Sesana (2010) with γ=1.5\gamma=1.5.

4.2.2 Binary Hardening Timescales

Refer to caption
Figure 7: Top: The distribution of the loss-cone and gravitational-wave hardening time for all binaries in the simulation. Here we use rextr_{\rm ext} from 1.5ϵg1.5\epsilon_{g}. The shaded distribution is computed using the measured eccentricity ϵen\epsilon_{\rm en}. If we assume ϵ=0\epsilon=0 (unshaded), the decay timescales will generally be longer by a factor of 100\sim 100 and peak at 100 Gyr, which is much longer than a Hubble time. Middle: the relation between the hardening timescale and the density at influence radius ρinf\rho_{\rm inf}. The timescale is negatively correlated with ρinf\rho_{\rm inf}. Changing rextr_{\rm ext} from 1.5ϵg1.5\epsilon_{g} (pink dots) to 2.0ϵg2.0\epsilon_{g} (green contours) shortens the hardening timescale. The right panel shows a clearer dependency when we remove the seed population. Bottom: the relation between the hardening timescale and the measured eccentricity. We see a weak negative correlation between ThardT_{\rm hard} and ϵen\epsilon_{\rm en}.

After measuring the quantities of interested for computing the binary hardening time, we will proceed to describe the analytical model for estimating the hardening timescale. As was mentioned earlier, we base most of our model on Vasiliev et al. (2015) (V15), with appropriate changes to incorporate information from the simulation.

V15 models a separation-dependent LC hardening rate by:

S(a)=μSinf(aah)ν,S_{*}(a)=\mu S_{\rm inf}\left(\frac{a}{a_{h}}\right)^{\nu}, (9)

where aa is the binary separation, aha_{h} is the hardening radius given by ah=q4(1+q)2rinfa_{h}=\frac{q}{4(1+q)^{2}}r_{\rm inf}, μ\mu is the filling fraction of the loss cone, and ν\nu characterizes the radial dependence of the hardening rate. We adopt the fiducial values of μ=0.3\mu=0.3 and ν=0.4\nu=0.4 from V15. SinfS_{\rm inf} is the full LC hardening rate at the influence radius given by:

Sinf=HGρinfσinf,S_{\rm inf}=H\frac{G\rho_{\rm inf}}{\sigma_{\rm inf}}, (10)

where σinf\sigma_{\rm inf} and ρinf\rho_{\rm inf} are the velocity dispersion and stellar density at the influence radius rinfr_{\rm inf}, and HH is a constant LC hardening rate given by H=2πAH=2\pi A, with A=4A=4 in V15. This value is slightly larger than the H=15H=15 rate given by Sesana & Khan (2015).

At a closer separation, GW emission becomes the dominant channel for binary energy loss. The hardening rate in the GW regime is given by (Peters, 1964):

SGW(a)=1a564G3M1M2MtotF(ϵ)5c2,S_{\rm GW}(a)=\frac{1}{a^{5}}\frac{64G^{3}M_{1}M_{2}M_{\rm tot}F(\epsilon)}{5c^{2}}, (11)

where ϵ\epsilon is the eccentricity of the binary orbit and

F(ϵ)=(1ϵ2)7/2[1+(73/24)ϵ2+(37/96)ϵ4]F(\epsilon)=(1-\epsilon^{2})^{-7/2}[1+(73/24)\epsilon^{2}+(37/96)\epsilon^{4}] (12)

accounts for the eccentricity dependence of the GW hardening rate.

The separation at which the binary spends the most time, agwa_{\rm gw}, is calculated by setting S(a)=SGW(a)S_{*}(a)=S_{\rm GW}(a), which leads to:

aGW=(64G3M1M2MtotF(ϵ)5c2ahνσinfμSinf)1/(5+ν)a_{\rm GW}=\left(\frac{64G^{3}M_{1}M_{2}M_{\rm tot}F(\epsilon)}{5c^{2}}\frac{a_{h}^{\nu}\sigma_{\rm inf}}{\mu S_{\rm inf}}\right)^{1/(5+\nu)} (13)

Finally, we can estimate the LC+GW hardening timescale by:

Thardϵgw=1S(aGW)×aGW.T_{\rm hard}^{\epsilon_{gw}}=\frac{1}{S_{*}(a_{\rm GW})\times a_{\rm GW}}. (14)

Note that in this expression, we have only accounted for the eccentricity dependence during the GW hardening stage, and thus the superscript ϵgw\epsilon_{\rm gw}. However, the orbital eccentricity also evolves during the LC scattering phase and can impact the hardening time. V15 models this effect by:

Thard=Thardϵgw=0×(1ϵ2)[k+(1k)(1ϵ2)4]T_{\rm hard}=T_{\rm hard}^{\epsilon_{gw}=0}\times(1-\epsilon^{2})[k+(1-k)(1-\epsilon^{2})^{4}] (15)

where k=0.4+0.1log10(Mtot/108M)k=0.4+0.1\,{\rm log}_{10}(M_{\rm tot}/10^{8}\,M_{\odot}). At higher eccentricities, Equation 14 and 15 give similar results, but for ϵ0\epsilon\sim 0, the former underestimates the hardening timescale by a factor of 3\sim 3.

Refer to caption
Figure 8: Left: The merger rates for all binary population in Astrid down to z=3z=3 with different levels of delays. Without considering any post-processing delays (orange), we expect a total of 2\sim 2 mergers per year of observation down to z=3z=3. The rate when considering only the DF delay (green) has an at most 50%50\% decrease compared to the raw rate at the highest redshifts. The binary hardening time has the most significant effect in reducing the merger rate (purple). The upper limit of the band assumes rext=2ϵgr_{\rm ext}=2\epsilon_{g}, and the lower limit assumes rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g}. The bottom panel shows the ratio between the delayed merger rates and the simulation merger rates. Left: The mass distribution of the two MBHs involved in the mergers. The red curves correspond to the more massive MBH and the blue curves correspond to the less massive MBH. The mass distribution of the simulation mergers is plotted in dashed lines, and that of the delayed mergers is plotted in solid lines. The bottom panel shows the ratio between the mass distributions of simulation mergers and delayed mergers. The seed-mass mergers (enclosed in the vertical dashed lines) are suppressed most strongly by a factor of 6\sim 6.

For the binaries in our simulation, we use the galaxy and binary properties shown in Section 4.2.1, together with the above formalism to estimate the binary hardening time. Note that the hardening timescale depends on the orbital eccentricity as the BHs enter the hardening regime: more eccentric orbits merge faster compare to circular ones. To take this effect into account, we use the orbital eccentricity shown in Section 3 as a proxy for the orbital eccentricity at the beginning of the binary hardening phase, assuming that the post-processed dynamical friction does not change the orbital eccentricity greatly (e.g. Colpi et al., 1999; Hashimoto et al., 2003). However, some studies also suggested that this might not be the case, as the rotation of gas on 10210^{2} pc scales can affect the BH angular momentum and eccentricity (e.g. Dotti et al., 2007; Bonetti et al., 2020). We also note that the galaxy properties we put into the calculation are instantaneous properties from the simulation after the BHs go through the numerical merger. Given that the galaxy and central stellar densities will only grow with time (as well as the BH masses), our estimations are likely upper limits of the hardening time.

Figure 7 shows the relation between the binary hardening time and ρinf\rho_{\rm inf} as well as the energy-based eccentricity ϵen\epsilon_{\rm en}. We also show the 1D distribution of hardening times. The left column includes all binaries in the catalog, while the right column only includes binaries with M1>2Mseed,1M_{1}>2M_{\rm seed,1}. For all binaries, given our measured initial eccentricities, the hardening timescale falls between 100 Myrs and 100 Gyrs, with a peak around 5 Gyrs. The timescale is strongly correlated with ρinf\rho_{\rm inf} and therefore also rextr_{\rm ext}. Changing the value of rextr_{\rm ext} from 1.5ϵg1.5\epsilon_{g} to 2.0ϵg2.0\epsilon_{g} leads to a shorter estimated hardening timescale. This is because higher stellar density leads to shorter hardening timescales, as the LC stars can more efficiently carry away the energy from the binary. In fact, we find that the inner stellar density is the most important property for determining the hardening timescale. Nonetheless, in both cases the hardening timescale is much longer than the dynamical friction timescale. Note that if we do not account for the eccentricities of the binary orbits, the decay timescales will generally be longer by a factor of 100\sim 100 and peak at 100 Gyr, which is much longer than a Hubble time.

The bottom row shows the relation between the hardening timescale and the measured eccentricity. When looking at the whole binary population, we see a negative correlation between ThardT_{\rm hard} and ϵen\epsilon_{\rm en}. This is expected as eccentric orbits have accelerated hardening rates. However, when we only focus on the non-seed mergers, the ϵen\epsilon_{\rm en} dependence is washed out by the strong correlation with ρinf\rho_{\rm inf}.

Because of the strong dependence of the delay timescale on the uncertain variable ρinf\rho_{\rm inf}, we will propagate this uncertainty to the merger rate predictions in the next session, and characterize how the uncertainty due to numerical resolution affects the mergers in Astrid.

5 MBH merger rate and Host galaxy properties

After characterizing the delay time, in this section we present the rate at which GW signals from MBH mergers will reach the earth, taking into account the sub-resolution delay processes. We also examine how the DF and binary hardening delay affects different population of MBH mergers. Finally, we investigate the galaxy properties for different part of the merger population.

5.1 Merger Rate Predictions

Refer to caption
Figure 9: Merger rates for different mass cuts and mass-ratio cuts. Left: The merger rates for the seed-mass population, where the masses of both MBHs are less than two times their seed masses. The colors are the same as in Figure 8. Compared to 8, this population makes up 60%\sim 60\% of the mergers. Middle: Merger rate for MBHs with only one of the two grown out of the seed mass. This rate makes up 30%\sim 30\% of the entire merger population. Compared to the seed-seed mergers, here we see less mergers at high redshifts, but a similar rate at z=3z=3. Right: Mergers with both MBHs larger than two times their seed masses and with q>0.1q>0.1. When constrained to major and non-seed mergers, the effect of DF is barely noticeable. The DF+Hard delayed rate makes up 50%50\% of the total rate. The lower panels show the ratio between the delayed merger rates and the simulation merger rates.

We calculate the rate by integrating the number of mergers in the simulation over redshifts, incorporating the cosmic volume at the given redshift:

dNdzdt=d2n(z)dzdVcdzdtdVcdz11+z\frac{{\rm d}N}{{\rm d}z\,{\rm d}t}=\frac{{\rm d}^{2}n(z)}{{\rm d}z\,{\rm d}V_{c}}\frac{{\rm d}z}{{\rm d}t}\frac{{\rm d}V_{c}}{{\rm d}z}\frac{1}{1+z}\, (16)

where dVc{\rm d}V_{c} is the comoving volume element of the universe at a given redshift and n(z)n(z) is the number of mergers at that redshift. The 1/(1+z)1/(1+z) term redshifts the infinitesimal time element in dz/dt{\rm d}z/{\rm d}t to the observer frame time interval.

To calculate this rate from our simulation, we take the finite-interval approximation:

d2n(z)dzdVc=N(z)ΔzVsim,\frac{{\rm d}^{2}n(z)}{{\rm d}z\,{\rm d}V_{c}}=\frac{N(z)}{\Delta z\,V_{\rm sim}}, (17)

where Δz\Delta z is the width of the redshift bin, N(z)N(z) is the total number of mergers within that redshift bin, and Vsim=(250Mpc/h)3V_{\rm sim}=(250\,{\rm Mpc}/h)^{3} is the volume of our simulation in comoving units.

To clearly see the effect of each stage of the delay, we calculate three different rates. We first compute the ”Sim” rate which uses the numerical merging time as the redshift of the merger (also see DeGraf et al., prep). Then we add the post-processed DF time to the numerical merger time to compute the ”DF-only” rate. Finally, we further account for the binary hardening timescales and calculate the ”DF+Hard” rate.

In the left panel of Figure 8, we plot the merger rates with different levels of post-processed delays, for the whole merger population in Astrid. First, we notice that the number of mergers keeps increasing with decreasing redshift for all three models. This is because we keep seeding BHs as structures form and grow, and that we have not reached the peak in seeding rate at z=3z=3. Without considering any post-processed delays (”Sim”), we expect a total of 1.8\sim 1.8 mergers per year of observation down to z=3z=3. The post-processed DF time does not significantly impact the total observed merger rate (”DF-only”), with a 50%\sim 50\% decrease at the highest redshift (z8z\sim 8). The binary hardening time has the most significant impact on the merger rate at all redshifts (”DF+Hard”). We see that the merger rate is reduced by a factor of 373\sim 7 after adding the delay from binary hardening. The resulting merger rate is 0.30.70.3\sim 0.7 at z>3z>3. Here the upper limit is given by assuming rext=2ϵgr_{\rm ext}=2\epsilon_{g} and the lower limit is given by rext=1.5ϵgr_{\rm ext}=1.5\epsilon_{g}. On the bottom panel, we show the ratio between the delayed merger rate and the simulation merger rate as a function of redshift. For both DF-only and DF+Hard delays, the fractional rates get higher at lower redshifts. This is a result of the high-redshift mergers being pushed down to low redshifts.

On the right panel, we show the mass distribution of the two MBHs involved in each merger. The dashed lines corresponds to the simulation merger without any delays, and the solid lines shows the distribution of the merger population after the DF+hardening delays. First, we can see that both before and after the delay, the merger population is dominated by seed-mass mergers (the ones enclosed by the vertical dashed lines), with M1M_{1} evenly distributed across the seed masses and M2M_{2} concentrated on the lower-mass end of the seeds. It is also this seed-mass merger population that gets suppressed the most by the delay. From the ratio between the mass functions shown in the bottom panel, we see that for the seed-mass mergers, only 15%\sim 15\% still merge at z>3z>3 after the delays, whereas at the high-mass end this fraction increases to 50%50\%.

In order to disentangle different merger populations, in Figure 9 we further split the rate by how many seed MBHs are involved in the merger. The left panel shows the merger rates for the seed-mass population, where the masses of both MBHs are below two times their seed masses. This population makes up 60%\sim 60\% of the mergers. At z>5z>5, the seed-seed mergers are strongly suppressed by the binary hardening delays because the stellar density is relatively low. The middle panel shows the mergers with the only more massive MBH grown beyond two times its seed mass. At z=3z=3, the rate from this group is comparable to the rate from the seed-seed mergers. However, the number decreases more steeply as we go to higher redshifts. Compared to the seed-seed mergers, this group has a higher mass ratio and thus a longer DF time. The effect of the binary hardening delay, however, is smaller because of the higher-density in the remnant galaxy. Finally, on the right panel, we show the more massive and major mergers. Compared to the previous two groups where at least one seed-mass MBH is involved in the merger, the mergers from this group is 6\sim 6 times lower. The effect of delay is also the smallest. In particular, we noticed that the DF-only rate is very similar to the simulation rate. Even for this group where the effect of delays is the smallest, the merger rate is still suppressed by >50%>50\% at each redshift compared to the simulation merger rate.

Figure 10 shows the distribution of MBH mergers on the MtotzmergeM_{\rm tot}-z_{\rm merge} plane for both the simulation and delayed mergers, color coded by the number of mergers per Myr. Without any delay, the majority of the merger events are seed-seed mergers around z=34z=3-4. As we would expect from the black hole mass growth over time, we see more massive mergers at lower redshifts. The middle panel shows the same merger population with the post-merger DF time added. As was discussed in the previous paragraph and in Section 4.1, the post-processed DF peaks around 200200 Myrs and does not significantly delay the mergers. Here, we see a slight shift of the merger population towards lower redshift.

On the right panel, we show the distribution after considering the DF delay and hardening phase. Note that since the final simulation output is at z=3z=3, all the data points at z<3z<3 are the results of delayed z>3z>3 numerical mergers, and are not representative of all merger events at z<3z<3. Compared with the other two panels, we see a significant shift of the mergers towards lower redshifts. The population that is most significantly shifted are the low-mass mergers with Mtot<106.5MM_{\rm tot}<10^{6.5}M_{\odot}, while the most massive binaries are still able to merge at relatively high redshifts. This is a consequence of the large hardening time scale of smaller BHs associated with lower ρinf\rho_{\rm inf}.

5.2 Properties of High-z MBH Mergers

Refer to caption
Figure 10: The distribution of mergers on the MtotzmergeM_{\rm tot}-z_{\rm merge} plane for the simulation and delayed mergers, color coded by the number of mergers per Myr. Left: The distribution for all mergers without delays. Middle: The same merger population with the post-merger DF time added. Here, we see a slight shift of the merger population towards lower redshift, but nothing gets delayed below z=2z=2. Right: The distribution after considering both the DF delay and the hardening time. Note that since the latest redshift of the simulation is z=3z=3, all the data points at z<3z<3 (masked in grey) are results of delay from z>3z>3 numerical mergers, and is not representative of all merger events at z<3z<3. We see a significant shift of the mergers towards lower redshifts. The population most significantly shifted are the low-mass mergers with Mtot<106.5MM_{\rm tot}<10^{6.5}M_{\odot}, while the most massive binaries are still able to merge at relatively high redshifts.

From the previous section, we have seen that while some low-mass mergers are significantly delayed and do not merge at z>3z>3, 15%\sim 15\% of them still do. For the non-seed mergers, although the delay is generally less significant, we still see a 50%50\% decrease in merger rate when accounting for the delays. Now we will investigate which part of the merger population gets significantly delayed, and which still manages to merge at high redshifts.

In Figure 11 we show the properties of MBHs involved in both the simulation mergers and the delayed mergers. The top row shows the properties of the non-seed mergers, and the bottom row shows the properties of the seed-seed mergers. We start by looking at the mass distribution of galaxies hosting the mergers (shown in the first column). For the simulation mergers consisting of two non-seed MBHs, the masses of the host galaxies peak at 4×109M4\times 10^{9}M_{\odot}. For systems that still merge after the delays, we see a clear shift towards the higher-end in stellar masses with a peak at 1010M\sim 10^{10}M_{\odot}. This is because for more massive galaxies, the high stellar density enables more efficient hardening through loss-cone scattering, and thus the delay time is shorter (also see Figure 7). For mergers involving two MBH seeds shown on the bottom, we observe a similar trend. Overall, seed mergers reside in less massive galaxies with stellar masses below 4×108M4\times 10^{8}\,M_{\odot}. The delayed merger events also pick up the more massive galaxy population out of the simulation mergers with galaxy masses distributed around 109M10^{9}\,M_{\odot}.

In addition to the stellar environment which plays an important role in the delay time estimation, the seeding redshift of the MBHs can also affect whether the two MBHs still merge at a high redshift after the delay. This is shown in the second column of Figure 11. While the seeding redshift of the simulation merger MBHs is z7z\sim 7, the MBHs involved in delayed mergers are seeded as early as z=10z=10. For the seed-seed mergers shown on the bottom, the overall seeding redshift is lower, but we also see a shift towards higher redshift when comparing the delayed mergers to the simulation mergers. The bias towards early MBH seeding for delayed mergers is also correlated with the higher host galaxy mass we have seen earlier: because the delayed mergers favor earlier seeds, they also tend to reside in galaxies that are massive enough at high redshifts to host an MBH seed.

Refer to caption
Figure 11: The fraction of the merger population in each bin of the galaxy stellar mass hosting the merger (first column), seeding redshifts of the merged MBHs (second column), number of MBHs in the host galaxy (third column), and the ratio between the total MBH mass in the host galaxy and the binary mass MtotM_{\rm tot} (fourth column). The top row shows the non-seed merger population, and the bottom row shows the seed-mass merger population. The simulation mergers are shown in orange and the DF+Hard delayed mergers are shown in purple. The total number of z>3z>3 mergers in each population is shown in the second column with corresponding colors.

On the right two columns, we examine the properties of other MBHs embedded in the host galaxy of the mergers. The third column shows the total number of MBHs embedded in the host galaxy of the merger, in the snapshot immediately following the numerical merger (so the merging MBHs will be counted as one object). The fourth column is the mass ratio between all MBHs in the host galaxy and the merging system. For both the seed and non-seed merger populations, the merging system is the sole MBH in the host galaxy in a majority of mergers. For the non-seed population, there is still a >50%>50\% fraction of mergers happening next to a third MBH (or more). Interestingly, the delayed merger systems favor galaxies with more MBHs besides the merging ones (also correlated with larger galaxy masses). Nonetheless, the merging system is still the most massive MBH in its host galaxy in the most cases when we look at the MBH,gal/MtotM_{\rm BH,gal}/M_{\rm tot} ratio.

When constrained to seed-seed mergers, we see that 70%\sim 70\% of the mergers are the single MBH in the host galaxy. The delayed mergers also tend to pick out the galaxies with more MBHs compared to the simulation mergers. However, contrary to the non-seed case where the merging MBH is more massive than the other MBHs in the same galaxy, for seed-seed mergers that do occur near a third MBH, the mass of the third MBH is more likely to be larger. This can be seen from the fact that the NBH,galN_{\rm BH,gal} distribution is more peaked at NBH,gal>1N_{\rm BH,gal}>1 compared to the MBH,gal/Mtot>1M_{\rm BH,gal}/M_{{\rm tot}}>1 distribution (it means that if there is a third MBH, its mass can be larger than MtotM_{\rm tot} in some cases, resulting in the longer tail of MBH,gal/MtotM_{\rm BH,gal}/M_{{\rm tot}}).

From the investigations above, we conclude that the z>3z>3 mergers after the DF and hardening delay make up a small and biased sample of the simulation mergers. In particular, they are systems with MBHs seeded earlier and embedded in more massive galaxies compared to the overall simulation merger population. Moreover, the majority of the merger remnant is the only MBH in its host galaxy, especially for the seed-mass mergers. However, the delayed mergers tend to pick out more systems that have other nearby MBHs in the remnant galaxy compared to the overall simulation merger population.

6 Gravitational Wave Emission from MBH Mergers

With a catalog of merging binaries, their merging time, and orbital eccentricities, we can not only compute the rate of mergers reaching the Earth, but also predict the gravitational wave signal that can be observed from these sources. This section is dedicated to predicting the gravitational wave signal and detectability of the Astrid mergers with LISA. We first briefly describe the characteristic strain for circular sources, and then we generalize to the signal from eccentric sources. After that, we combine with the LISA sensitivity curve and compute the signal-to-noise ratio (SNR) for each mergers in the simulation.

6.1 Characteristic Strain of Circular Orbits

MBH binaries provide a variety of signals measurable by LISA since their chirp evolution in the frequency domain occurs near the low-frequency band edge of the LISA sensitivity curve. Binaries with 105107M10^{5}-10^{7}M_{\odot} total mass will provide a measurable inspiral, merger, and ringdown leading to signals out to the cosmic horizon (Amaro-Seoane et al., 2017). The binary inspiral is the initial stage of binary black hole coalescence when the two MBHs orbit one another at separations greater than the innermost stable circular orbit (ISCO; R=6GMBH/c2R=6GM_{\rm BH}/c^{2}). At these separations, the orbit is usually treated with a post-Newtonian formalism. The merger stage follows the binary inspiral with a highly non-linear relativistic process. This process continues until the binary components form a single event horizon, leading to ringdown.

We use the characteristic strain, hsh_{s}, to model the binary signal which accounts for the time the binary spends in each frequency bin (Finn & Thorne, 2000). The characteristic strain is given by (e.g. Moore et al., 2015):

hs(f)=4f2|h~(f)|2h_{s}(f)=4f^{2}|\tilde{h}(f)|^{2} (18)

where h~(f)\tilde{h}(f) represents the Fourier transform of a time domain signal.

To generate the waveforms, we use the phenomenological waveform PhenomD (Husa et al., 2016; Khan et al., 2016) implemented within the gwsnrcalc Python package (Katz & Larson, 2019). The input parameters are the binary masses, merging redshift, and the dimensionless spins of the binary. For the MBH masses, we do not account for mass growth after the numerical merger. However, we note that the MBH can potentially gain a significant fraction of its mass during the >1Gyr>1\,{\rm Gyr} of time in the dynamical friction (e.g. Banks et al., 2021) or loss-cone scattering phase. The dimensionless spin aa characterizes the alignment of the spin angular momentum with the orbital angular momentum, and the value of aa ranges from 1-1 to 11. However, we do not have any information on the spin of the SMBHs in our simulation. Therefore, following the argument in Katz et al. (2020), we assume a constant dimensionless spin of a1=a2=0.8a_{1}=a_{2}=0.8 for all binaries (e.g. Miller, 2007; Reynolds, 2013).

Refer to caption
Figure 12: Left: Example waveforms for three binaries of different masses in Astrid. The thick curve shows the waveform assuming ϵ=0\epsilon=0, while the thin lines are the waveform assuming eccentric orbits. We also show the LISA sensitivity curve from Amaro-Seoane et al. (2017) (black solid) for comparison. The numerical merging time of all example binaries is z3.1z\sim 3.1. Right: The hfh-f distribution after applying the delay models. The arrows indicate the shifts in strain and frequency by the delay. Most signals are shifted to the upper-right due to the lower redshift of the merger after the delays. The light grey region shows the merger population delayed to z<3z<3, which is not part of our prediction.

In Figure 12, we show the distribution of the merging frequency fmergef_{\rm merge} and the strain at this frequency for all binaries in the simulation, before and after applying the DF+Hardening delay models. To evaluate the detectability of the population with LISA, we also plot the proposed LISA sensitivity curves. We use the LISA sensitivity configuration from the LISA Mission Proposal (Amaro-Seoane et al., 2017), and we use hN=SNh_{N}=\sqrt{S_{N}} (Moore et al., 2015) to convert from the proposed power spectral density SNS_{N} to strain hNh_{N}.

In the left panel, we show example waveforms for binaries of different masses but similar numerical merging time. The thick curve shows the waveform assuming ϵ=0\epsilon=0, with the dot representing the merging frequency fmergef_{\rm merge}. We will discuss the thin lines with non-zero eccentricities in later sections. From the example waveforms, we see that at a fixed source redshift, the more massive binary has a higher strain amplitude. However, this does not necessarily lead to a more significant detection, because the lower frequency at which the wave is emitted falls into the region where the LISA sensitivity is worse. Out of these three binaries, the two least massive binaries are detectable by LISA while the most massive one is not. After the DF and hardening delays, all curves have higher strain amplitudes, as the strength of the signal is negatively correlated with redshift.

After looking at individual cases, we turn to the whole binary population. On the right panel, we show the distribution of fmergef_{\rm merge} and hs(fmerge)h_{s}(f_{\rm merge}) for Astrid mergers, after the post-processed delays. We have masked the signals from z<3z<3 mergers in light grey, as they are purely due to the post-processed delays, and are not part of our simulation predictions. The majority of merger events within the simulation lie above the LISA sensitivity curve. From example waveforms, we see that once any given GW signal crosses the detector sensitivity curve, the ratio of the signal to the sensitivity curve rapidly increases by a few orders of magnitude. Since the merger population is dominated by seed-seed mergers, we see a peak around fmerge102f_{\rm merge}\sim 10^{-2}Hz, corresponding to the example green curve. Finally, we demonstrate the shift of the signal due to the delay model by the colored arrows. The tail of the arrows indicates the location of the frequency/strain before the delays. The head of the arrows are the signals after the delays. We see that in the example cases, the signal shifts to the high-strain, high-frequency region of the plane. This is mainly because of the delay of the mergers from z>3z>3 to z<3z<3.

6.2 GW Signal from Eccentric Sources

In the previous section we have shown a single hsfh_{s}-f relation by assuming circular orbits for the binaries. In this section, we will utilize the eccentricity measured from the simulation when calculating the strain and signal-to-noise ratio (SNR) for each binary.

The GW strain from an individual, eccentric source can be related to that of a circular source as (e.g. Amaro-Seoane et al., 2010; Kelley et al., 2017b):

hs2(fr)=(2n)2n=1hr,circ2(fh)g(n,ϵ)|fh=fr/n,h_{s}^{2}(f_{\rm r})=\left(\frac{2}{n}\right)^{2}\sum_{n=1}^{\infty}h_{\rm r,circ}^{2}(f_{\rm h})g(n,\epsilon)|_{f_{\rm h}=f_{\rm r}/n}, (19)

where hr,circh_{\rm r,circ} is the characteristic strain of a circular source given by Equation 18, g(n,ϵ)g(n,\epsilon) is the GW frequency distribution function given by Equation 20 in Peters & Mathews (1963) with n=1g(n,ϵ)=F(ϵ)\sum_{n=1}^{\infty}g(n,\epsilon)=F(\epsilon), where F(ϵ)F(\epsilon) is defined by Equation 12.

During the GW-driven inspiral, the orbital eccentricity also evolves according to Peters (1964) Equation (5.7), such that it decays towards zero as the binary inspirals toward merger. This will affect the orbital frequency by:

forbf0=[1ϵ021ϵ2(ϵϵ0)12/19(1+121304ϵ21+121304ϵ02)870/2299]3/2,\frac{f_{\rm orb}}{f_{0}}=\left[\frac{1-\epsilon_{0}^{2}}{1-\epsilon^{2}}\left(\frac{\epsilon}{\epsilon_{0}}\right)^{12/19}\left(\frac{1+\frac{121}{304}\epsilon^{2}}{1+\frac{121}{304}\epsilon_{0}^{2}}\right)^{870/2299}\right]^{-3/2}\ , (20)

where ϵ0\epsilon_{0} is the initial eccentricity at the reference frequency f0f_{0}.

In Figure 12, the multiple thin lines are the waveforms from higher-order harmonics assuming eccentric orbits. For circular orbits, the GW is emitted at a single frequency at a fixed separation, while the eccentric binaries emit GW at higher-order harmonics at a given time. One consequence of this is that the energy dissipated in higher-order harmonics is below the detection sensitivity, and thus the signal will be smaller compared with the circular orbits.

We note that when using the simulation measurement of the orbital eccentricity as the initial eccentricity in the inspiral phase, we did not account for the possible increase in ϵ\epsilon during the loss-cone scattering phase (see, e.g. Sesana, 2010; Kelley et al., 2017b).

6.3 Detectability Prediction

Refer to caption
Figure 13: Left: the joint distribution of the SNR and redshift for Astrid mergers. The top row is the SNR computed before the DF and hardening delays, and the bottom row is the SNR after the delay time is applied. The mergers delayed to z<3z<3 are masked in grey. Middle: distribution of binary mass for all Astrid mergers (red), the ones with SNR¿8 without the delay model (blue), and the ones that merge before z=3z=3 after the delays (brown). The SNR¿8 cut eliminates all mergers with Mtot>108MM_{\rm tot}>10^{8}M_{\odot}, while the drop in low-mass merger events is due to the delays. Right: the distribution of two MBH masses for LISA detectable merger events at z>3z>3. Most events are expected to involve two seed-mass MBHs.

Although the strain in Figure 12 is a good estimation of the detectibility of a circular binary, for the eccentric case a more careful prediction comes from the signal-to-noise ratio (SNR). The SNR is estimated by integrating the ratio of the signal to the noise in the frequency domain. The sky, orientation, and polarization averaged SNR is given by :

SNR2=165fstartfendhs2hN2f1𝑑f,\langle{\rm SNR}\rangle^{2}=\frac{16}{5}\int_{f_{\rm start}}^{f_{\rm end}}\frac{h_{s}^{2}}{h_{N}^{2}}f^{-1}df, (21)

where fstart=f(tstart)f_{\rm start}=f(t_{\rm start}) and fend=f(tend)f_{\rm end}=f(t_{\rm end}), with tstartt_{\rm start} and tendt_{\rm end} representing the starting and ending time of when the signal is observed. Note that here we are assuming eccentric waveforms for the binaries, and thus hsh_{s} is given by the sum over different modes following Equation 19. As it is not computationally feasible to sum an infinite number of modes, we truncate the sum in Equation 19 at n=50n=50 and we have checked that the difference between the first 5050 and the first 100100 modes is less than 5%5\%.

For the current configuration, we assume that the LISA observation lasts for 4 years. We further assume a most optimistic SNR for all mergers by taking tend=tpeakt_{\rm end}=t_{\rm peak} and tstart=tpeak4yrst_{\rm start}=t_{\rm peak}-4{\rm yrs}. Under this assumption, we are always integrating the part of the waveform where the strain is maximized. However, as was discussed in Salcido et al. (2016) and Katz et al. (2020), the actual SNR may be smaller if there is an offset between the LISA observation window and the merger time of the binary.

Figure 13 shows the distribution of the SNR computed for all mergers in the simulation. The left column shows the joint distribution of SNR and the merging redshift. The top row is the SNR computed with merging redshifts before the DF and hardening delays, and the bottom row is the SNR after the delay time is applied. As was expected from the simpler calculation shown in Figure 12, the majority of the binary population in the simulation has a SNR larger than the LISA detection threshold of 8 (plotted as dashed gray lines). The ones that falls below the SNR cut are mainly massive mergers with Mtot>107MM_{\rm tot}>10^{7}\,M_{\odot}. When we account for the delays, the mergers are pushed towards lower redshifts, and the resulting SNR is higher for each event. However, when restricted to z>3z>3, the

The middle panel shows the effect of delays and the SNR cut on mergers with different masses. The SNR cut removes all mergers with Mtot>108MM_{\rm tot}>10^{8}\,M_{\odot} from the LISA-detectable population. On the low-mass end of MBH mergers, the reduction results from the DF and binary hardening delays. Combining both the delays and SNR cut, we see that the overall detectable mergers at z>3z>3 are 15%\sim 15\% of the original Astrid merger population across all masses. The seed-mass mergers still dominate over other events even though they are most strongly suppressed by the delays. Finally, on the right panel we show the mass distribution of the two MBHs involved in each detectable events. The majority of these events are expected to be mergers from two seed-mass MBHs. On the high-mass end, the detectable events has a mass ratio of q1q\sim 1 (close to the diagonal line). Based on these results, the likelihood that a LISA detection comes from mergers of MBH seeds is high, but the detectable MBH seed mergers is only a small sample of the seed MBH pairs and the associated galaxy mergers.

Here, accounting for the delay time to merger affects the resulting SNR more than the eccentricity. The eccentricity itself, however, may affect the prospects for multi-messenger follow-up. For example, eccentric binaries may spend a shorter amount of time in the LISA band compared to circular binaries. Spin-orbit interactions in eccentric binaries may change the orbital inclination with respect to the line-of-sight, which may also play a role in detectability and sky localization. We will explore such effects and their implications for multi-messenger follow-up in a companion paper.

7 Conclusion and Discussion

In this work we have made predictions for the MBH merger rate and associated LISA events for a cosmological population of MBHs with masses ranging between 5×104M5\times 10^{4}M_{\odot} and 1010M10^{10}M_{\odot} down to z=3z=3, using the large volume cosmological simulation Astrid. At high redshifts, MBH mergers and the associated GW signal should provide strong contraints for models of seed black hole formation. In Astrid , MBH seeds range from 5×104M5\times 10^{4}M_{\odot} to 5×105M5\times 10^{5}M_{\odot}, covering down to masses that LISA will be most sensitive too. Moreover, in Astrid we have included an on-the-fly subgrid dynamical friction prescription, which allows us to trace the MBH orbits down to the resolution limit.

Using the MBH orbits directly from the simulation, we estimated the orbital eccentricity for MBH pairs that undergo merger in Astrid simulation. In addition, we use the most recent post-processing models to account for additional delay in MBH mergers due to dynamical friction (Dosopoulou & Antonini, 2017) and binary hardening (Vasiliev et al., 2015) at scales not resolved directly by Astrid. This is done by accounting for the orbital eccentricities constrained by the simulation which is important for the loss-cone scattering and gravitational-wave hardening phase. After considering the effect of these processes in delaying the MBH merger, we made detailed prediction of the expected number of mergers down to z=3z=3, the redshift that the simulation has currently reached. Finally, we computed the detectability of these events by LISA.

We find that most MBHs pairs in Astrid have eccentric orbits distributed near ϵ=0.8\epsilon=0.8. We verify the eccentricity measurements by using both the shape and the dynamical information of the MBHs and find general agreement on the result. While some orbits circularize during the dynamical friction decay, the majority of them still maintain a high level of eccentricity at the time of the numerical merger. The orbital eccentricity is important in accelerating the binary hardening process. In particular, we show that the assumption of circular orbits for all binaries leads to estimates for the binary hardening time that can exceed 2020 Gyrs for most Astrid binaries. Taking into account the measured orbital eccentricities, our estimated hardening times fall between 1101\sim 10 Gyrs.

Even after considering the accelerated binary hardening rate due to eccentric orbits, for Astrid mergers close to the seed mass, the binary hardening (including LC and GW hardening) time typically provides the longest delay, and it remains more important than the dynamical friction component (including DF time modeled in Astrid directly and the estimated sub-resolution component). For MBH binaries above the seed mass, the hardening time becomes comparable to the DF time and always remains <1<1 Gyrs. By comparing the DF directly modeled in Astrid with the post-processed (sub-resolution) DF time, we find that they are comparable, accounting for 100300100\sim 300 Myr of binary evolution. At the resolution of Astrid, the sub-grid DF added directly in the simulation is able to recover more than half of the dynamical friction decay process before the numerical merger.

Without accounting for any additional post-processed binary dynamics delays, we expect 2\sim 2 merger events per year (DeGraf et al., prep) from the z>3z>3 MBH population in Astrid. With the post-processed dynamical friction and binary hardening taken into account, the expected merger rate reduces to 0.30.70.3\sim 0.7 per year at z>3z>3. Astrid predicts for merger rates that are higher than most previous predictions from hydro-dynamical simulations of comparable size (e.g. Salcido et al., 2016; Katz et al., 2020; Volonteri et al., 2020), because Astrid accounts for a seed population (see DeGraf et al., prep, for a more direct comparison) in halos about an order of magnitude lower in mass than e.g. Illustris (Mhalo,thr=7×1010MM_{\rm halo,thr}=7\times 10^{10}\,M_{\odot}) and EAGLE (Mhalo,thr=1.4×1010MM_{\rm halo,thr}=1.4\times 10^{10}\,M_{\odot}). Among the whole MBH merger population, the seed-mass mergers are most affected by the delays, with only <20%<20\% of the original simulation mergers still merging at z>3z>3. Nonetheless, because the seed-mass mergers dominate the merger population in absolute numbers (250455 out of 440999), they still occupy a large fraction of the delayed mergers. Out of the delayed merger events at z>3z>3, 60%\sim 60\% involve two seed-mass MBHs, 30%\sim 30\% are mergers between one non-seed MBH and one seed-mass MBH, and 10%\sim 10\% are mergers between two large mass MBHs.

We use a 4-year LISA observation time to calculate an upper limit on the SNR for each merger event. Many of these high-z mergers result in SNRs around 200\sim 200. With a SNR¿8 threshold, high-mass merger (Mtot>108MM_{\rm tot}>10^{8}\,M_{\odot}) events are removed from the detectable population at z>3z>3. The Mtot<107MM_{\rm tot}<10^{7}\,M_{\odot} mergers are still detectable. As a result, the LISA detectable population is still dominated by seed MBH mergers, and the expected detection rate is similar to the total merger rate of 0.30.70.3\sim 0.7 per year at z>3z>3.

Based on these results, a LISA detection of merger events from MBH seeds population is highly feasible. However, the detectable MBH seed mergers are predicted to correspond to the sample of the seed MBH pairs that occur in hosts with stellar masses close to 109M10^{9}\,M_{\odot}. This is about three times larger than the typical stellar mass at which seed-mass mergers are expected to occur if loss cone scattering was not accounted for. We also find that 80%\sim 80\% of the seed-seed merger remnants in the simulation are the only MBH residing in their host galaxies. Accounting for the DF and binary hardening delays slightly favors systems embedded in a larger galaxy with a more massive MBH around. This is because the more massive hosts tend to provide a higher stellar density and hence a more effective loss-cone scattering. However, sole MBH remnants still make up 70%\sim 70\% of the seed-seed merger population after the delays. Regardless, Astrid predicts the host galaxies of the detectable z>3z>3 mergers to be galaxies of M1091010MM_{*}\sim 10^{9}-10^{10}\,M_{\odot}. These host galaxies are detectable with current and upcoming telescopes.

We note also that our estimation of the low-mass MBH merger rate is a lower-limit, since we do not resolve the MBHs residing in low-mass dwarf galaxies. Observations have provided evidence that dwarf galaxies host MBHs in their center (e.g. Reines et al., 2013; Moran et al., 2014; Satyapal et al., 2014; Lemons et al., 2015; Sartori et al., 2015; Pardo et al., 2016; Nguyen et al., 2019). Simulations (e.g. van Wassenhove et al., 2010; Bellovary et al., 2019; Volonteri et al., 2020) also shows that dwarf galaxies consistently merge into larger galaxies over time. Hence, missing the dwarf galaxy MBHs could bias our merger rate and detection rate estimation towards the lower end.

We find that current simulations such as Astrid are getting closer to predicting DF timescales for the binary evolution, but the estimation of the binary hardening timescale remains more uncertain as it depends on the properties of central stellar densities below the resolution limit. We have shown that changing the stellar density extrapolation starting point from 1.5ϵg1.5\epsilon_{g} to 2ϵg2\epsilon_{g} increases the estimated density at the influence radius by a factor of 10\sim 10, and thereby shortens the estimated binary hardening timescale by a factor of 10\sim 10. This translates to a factor of 3\sim 3 different in the merger rate predictions. To more confidently estimate the binary hardening timescale and thus the MBH merger rate in the context of cosmological simulations, a better modeling of the inner region of the galaxy would be needed. Nonetheless, we still expect the merger rates to be within a factor of a few of what a cosmological simulation is able to predict (at the resolution of Astrid)

Finally, in this work, we do not evolve the orbital eccentricity during the loss-cone scattering phase. Loss-cone scattering can increase the orbital eccentricity of the binary (e.g. Sesana, 2010; Kelley et al., 2017b), and may affect the detected GW signal. We also do not consider circumbinary-disk interactions (e.g. Haiman et al., 2009), since circumbinary-disk simulations for eccentric binaries have not yet been comprehensively explored for a wide-enough range of binary parameters and disk properties. A significant amount of progress, however, has been made in the hydrodynamic modeling of such systems (e.g., Duffell et al., 2020; Tiede et al., 2020; D’Orazio & Duffell, 2021). Binary-disk interactions may also affect the spin orientations of each MBH. It is also currently uncertain how a circumbinary disk would respond when an eccentric binary undergoes post-Newtonian spin-orbit interactions. We thus leave such analyses with our cosmological binary population for future work.

Acknowledgements

Astrid was run on the Frontera facility at the Texas Advanced Computing Center.

TDM and RACC acknowledge funding from the NSF AI Institute: Physics of the Future, NSF PHY-2020295, NASA ATP NNX17AK56G, and NASA ATP 80NSSC18K101. TDM acknowledges additional support from NSF ACI-1614853, NSF AST-1616168, NASA ATP 19-ATP19-0084, and NASA ATP 80NSSC20K0519, and RACC from NSF AST-1909193. SB was supported by NSF grant AST-1817256. AMH is supported by the McWilliams Postdoctoral Fellowship.

Data Availability

The code to reproduce the simulation is available at https://github.com/MP-Gadget/MP-Gadget, and continues to be developed. Text file forms of the data presented here as well as scripts to generate the figures are available. Binary catalogs including the MBH information and the host galaxy properties are available upon request.

References

  • Abbott et al. (2016) Abbott B. P., et al., 2016, Phys. Rev. Lett., 116, 061102
  • Amaro-Seoane et al. (2010) Amaro-Seoane P., Sesana A., Hoffman L., Benacquista M., Eichhorn C., Makino J., Spurzem R., 2010, MNRAS, 402, 2308
  • Amaro-Seoane et al. (2012) Amaro-Seoane P., et al., 2012, Classical and Quantum Gravity, 29, 124016
  • Amaro-Seoane et al. (2017) Amaro-Seoane P., et al., 2017, arXiv e-prints, p. arXiv:1702.00786
  • Antonini & Merritt (2012) Antonini F., Merritt D., 2012, ApJ, 745, 83
  • Arca-Sedda & Gualandris (2018) Arca-Sedda M., Gualandris A., 2018, MNRAS, 477, 4423
  • Banks et al. (2021) Banks S., Lee K., Azimi N., Scarborough K., Stefanov N., Periwal I., DeGraf C., Di Matteo T., 2021, arXiv e-prints, p. arXiv:2107.09084
  • Barausse (2012) Barausse E., 2012, MNRAS, 423, 2533
  • Begelman et al. (1980) Begelman M. C., Blandford R. D., Rees M. J., 1980, Nature, 287, 307
  • Bellovary et al. (2019) Bellovary J. M., Cleary C. E., Munshi F., Tremmel M., Christensen C. R., Brooks A., Quinn T. R., 2019, MNRAS, 482, 2913
  • Biava et al. (2019) Biava N., Colpi M., Capelo P. R., Bonetti M., Volonteri M., Tamfal T., Mayer L., Sesana A., 2019, MNRAS, 487, 4985
  • Biernacki et al. (2017) Biernacki P., Teyssier R., Bleuler A., 2017, MNRAS, 469, 295
  • Binney & Tremaine (2008) Binney J., Tremaine S., 2008, Galactic Dynamics: Second Edition
  • Bird et al. (2021) Bird S., Ni Y., Di Matteo T., Croft R., Feng Y., Chen N., 2021, arXiv e-prints, p. arXiv:2111.01160
  • Bonetti et al. (2018) Bonetti M., Haardt F., Sesana A., Barausse E., 2018, MNRAS, 477, 3910
  • Bonetti et al. (2019) Bonetti M., Sesana A., Haardt F., Barausse E., Colpi M., 2019, MNRAS, 486, 4044
  • Bonetti et al. (2020) Bonetti M., Bortolas E., Lupi A., Dotti M., Raimundo S. I., 2020, MNRAS, 494, 3053
  • Chandrasekhar (1943) Chandrasekhar S., 1943, ApJ, 97, 255
  • Chen et al. (2021) Chen N., Ni Y., Tremmel M., Di Matteo T., Bird S., DeGraf C., Feng Y., 2021, arXiv e-prints, p. arXiv:2104.00021
  • Colpi et al. (1999) Colpi M., Mayer L., Governato F., 1999, ApJ, 525, 720
  • D’Orazio & Duffell (2021) D’Orazio D. J., Duffell P. C., 2021, ApJ, 914, L21
  • Davé et al. (2019) Davé R., Anglés-Alcázar D., Narayanan D., Li Q., Rafieferantsoa M. H., Appleby S., 2019, MNRAS, 486, 2827
  • DeGraf & Sijacki (2020) DeGraf C., Sijacki D., 2020, MNRAS, 491, 4973
  • DeGraf et al. (prep) DeGraf C., et al., in prep, MNRAS
  • Derdzinski et al. (2019) Derdzinski A. M., D’Orazio D., Duffell P., Haiman Z., MacFadyen A., 2019, MNRAS, 486, 2754
  • Di Matteo et al. (2005) Di Matteo T., Springel V., Hernquist L., 2005, Nature, 433, 604
  • Dosopoulou & Antonini (2017) Dosopoulou F., Antonini F., 2017, ApJ, 840, 31
  • Dotti et al. (2007) Dotti M., Colpi M., Haardt F., Mayer L., 2007, MNRAS, 379, 956
  • Dubois et al. (2014) Dubois Y., et al., 2014, MNRAS, 444, 1453
  • Duffell et al. (2020) Duffell P. C., D’Orazio D., Derdzinski A., Haiman Z., MacFadyen A., Rosen A. L., Zrake J., 2020, ApJ, 901, 25
  • Efroimsky & Goldreich (2004) Efroimsky M., Goldreich P., 2004, A&A, 415, 1187
  • Feng et al. (2016) Feng Y., Di-Matteo T., Croft R. A., Bird S., Battaglia N., Wilkins S., 2016, MNRAS, 455, 2778
  • Finn & Thorne (2000) Finn L. S., Thorne K. S., 2000, Phys. Rev. D, 62, 124021
  • Frank & Rees (1976) Frank J., Rees M. J., 1976, MNRAS, 176, 633
  • Governato et al. (1994) Governato F., Colpi M., Maraschi L., 1994, MNRAS, 271, 317
  • Haehnelt (1994) Haehnelt M. G., 1994, MNRAS, 269, 199
  • Haiman et al. (2009) Haiman Z., Kocsis B., Menou K., 2009, ApJ, 700, 1952
  • Hashimoto et al. (2003) Hashimoto Y., Funato Y., Makino J., 2003, ApJ, 582, 196
  • Hirschmann et al. (2014) Hirschmann M., Dolag K., Saro A., Bachmann L., Borgani S., Burkert A., 2014, MNRAS, 442, 2304
  • Husa et al. (2016) Husa S., Khan S., Hannam M., Pürrer M., Ohme F., Forteza X. J., Bohé A., 2016, Phys. Rev. D, 93, 044006
  • Jaffe & Backer (2003) Jaffe A. H., Backer D. C., 2003, ApJ, 583, 616
  • Katz & Larson (2019) Katz M. L., Larson S. L., 2019, MNRAS, 483, 3108
  • Katz et al. (2020) Katz M. L., Kelley L. Z., Dosopoulou F., Berry S., Blecha L., Larson S. L., 2020, MNRAS, 491, 2301
  • Kazantzidis et al. (2005) Kazantzidis S., et al., 2005, ApJ, 623, L67
  • Kelley et al. (2017a) Kelley L. Z., Blecha L., Hernquist L., 2017a, MNRAS, 464, 3131
  • Kelley et al. (2017b) Kelley L. Z., Blecha L., Hernquist L., 2017b, MNRAS, 464, 3131
  • Khan et al. (2016) Khan S., Husa S., Hannam M., Ohme F., Pürrer M., Forteza X. J., Bohé A., 2016, Phys. Rev. D, 93, 044007
  • Kormendy & Ho (2013) Kormendy J., Ho L. C., 2013, ARA&A, 51, 511
  • Kormendy & Richstone (1995) Kormendy J., Richstone D., 1995, ARA&A, 33, 581
  • Kulier et al. (2015) Kulier A., Ostriker J. P., Natarajan P., Lackner C. N., Cen R., 2015, ApJ, 799, 178
  • Lacey & Cole (1993) Lacey C., Cole S., 1993, MNRAS, 262, 627
  • Lemons et al. (2015) Lemons S. M., Reines A. E., Plotkin R. M., Gallo E., Greene J. E., 2015, ApJ, 805, 12
  • Lightman & Shapiro (1977) Lightman A. P., Shapiro S. L., 1977, ApJ, 211, 244
  • Lotz et al. (2011) Lotz J. M., Jonsson P., Cox T. J., Croton D., Primack J. R., Somerville R. S., Stewart K., 2011, ApJ, 742, 103
  • Magorrian et al. (1998) Magorrian J., et al., 1998, AJ, 115, 2285
  • Mangiagli et al. (2019) Mangiagli A., Bonetti M., Sesana A., Colpi M., 2019, ApJ, 883, L27
  • Merritt (2013) Merritt D., 2013, Dynamics and Evolution of Galactic Nuclei
  • Miller (2007) Miller J. M., 2007, ARA&A, 45, 441
  • Mingarelli et al. (2017) Mingarelli C. M. F., et al., 2017, Nature Astronomy, 1, 886
  • Moore et al. (2015) Moore C. J., Cole R. H., Berry C. P. L., 2015, Classical and Quantum Gravity, 32, 015014
  • Moran et al. (2014) Moran E. C., Shahinyan K., Sugarman H. R., Vélez D. O., Eracleous M., 2014, AJ, 148, 136
  • Natarajan et al. (2017) Natarajan P., Pacucci F., Ferrara A., Agarwal B., Ricarte A., Zackrisson E., Cappelluti N., 2017, ApJ, 838, 117
  • Nguyen et al. (2019) Nguyen D. D., et al., 2019, ApJ, 872, 104
  • Ni et al. (2021) Ni Y., et al., 2021, arXiv e-prints, p. arXiv:2110.14154
  • Ogiya et al. (2020) Ogiya G., Hahn O., Mingarelli C. M. F., Volonteri M., 2020, MNRAS, 493, 3676
  • Ostriker (1999) Ostriker E. C., 1999, ApJ, 513, 252
  • Pardo et al. (2016) Pardo K., et al., 2016, ApJ, 831, 203
  • Peters (1964) Peters P. C., 1964, Physical Review, 136, 1224
  • Peters & Mathews (1963) Peters P. C., Mathews J., 1963, Physical Review, 131, 435
  • Pillepich et al. (2018) Pillepich A., et al., 2018, MNRAS, 473, 4077
  • Quinlan (1996) Quinlan G. D., 1996, New Astron., 1, 35
  • Reines et al. (2013) Reines A. E., Greene J. E., Geha M., 2013, ApJ, 775, 116
  • Reynolds (2013) Reynolds C. S., 2013, Classical and Quantum Gravity, 30, 244004
  • Ricarte & Natarajan (2018) Ricarte A., Natarajan P., 2018, MNRAS, 481, 3278
  • Rodriguez-Gomez et al. (2015) Rodriguez-Gomez V., et al., 2015, MNRAS, 449, 49
  • Salcido et al. (2016) Salcido J., Bower R. G., Theuns T., McAlpine S., Schaller M., Crain R. A., Schaye J., Regan J., 2016, MNRAS, 463, 870
  • Sartori et al. (2015) Sartori L. F., Schawinski K., Treister E., Trakhtenbrot B., Koss M., Shirazi M., Oh K., 2015, MNRAS, 454, 3722
  • Satyapal et al. (2014) Satyapal S., Secrest N. J., McAlpine W., Ellison S. L., Fischer J., Rosenberg J. L., 2014, ApJ, 784, 113
  • Schaye et al. (2015) Schaye J., et al., 2015, MNRAS, 446, 521
  • Sesana (2010) Sesana A., 2010, ApJ, 719, 851
  • Sesana & Khan (2015) Sesana A., Khan F. M., 2015, MNRAS, 454, L66
  • Sesana et al. (2004) Sesana A., Haardt F., Madau P., Volonteri M., 2004, ApJ, 611, 623
  • Sesana et al. (2007a) Sesana A., Volonteri M., Haardt F., 2007a, MNRAS, 377, 1711
  • Sesana et al. (2007b) Sesana A., Haardt F., Madau P., 2007b, ApJ, 660, 546
  • Soltan (1982) Soltan A., 1982, MNRAS, 200, 115
  • Springel et al. (2005) Springel V., Di Matteo T., Hernquist L., 2005, MNRAS, 361, 776
  • Taffoni et al. (2003) Taffoni G., Mayer L., Colpi M., Governato F., 2003, MNRAS, 341, 434
  • Tanaka & Haiman (2009) Tanaka T., Haiman Z., 2009, ApJ, 696, 1798
  • Tiede et al. (2020) Tiede C., Zrake J., MacFadyen A., Haiman Z., 2020, ApJ, 900, 43
  • Tremaine et al. (2002) Tremaine S., et al., 2002, ApJ, 574, 740
  • Tremmel et al. (2015) Tremmel M., Governato F., Volonteri M., Quinn T. R., 2015, MNRAS, 451, 1868
  • Tremmel et al. (2017) Tremmel M., Karcher M., Governato F., Volonteri M., Quinn T. R., Pontzen A., Anderson L., Bellovary J., 2017, MNRAS, 470, 1121
  • Vasiliev & Merritt (2013) Vasiliev E., Merritt D., 2013, ApJ, 774, 87
  • Vasiliev et al. (2015) Vasiliev E., Antonini F., Merritt D., 2015, ApJ, 810, 49
  • Vogelsberger et al. (2014) Vogelsberger M., et al., 2014, MNRAS, 444, 1518
  • Volonteri et al. (2016) Volonteri M., Dubois Y., Pichon C., Devriendt J., 2016, MNRAS, 460, 2979
  • Volonteri et al. (2020) Volonteri M., et al., 2020, MNRAS, 498, 2219
  • Wyithe & Loeb (2003) Wyithe J. S. B., Loeb A., 2003, ApJ, 590, 691
  • van Wassenhove et al. (2010) van Wassenhove S., Volonteri M., Walker M. G., Gair J. R., 2010, MNRAS, 408, 1139