This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Null-stream-based Bayesian Unmodeled Framework to Probe Generic Gravitational-wave Polarizations

Isaac C. F. Wong cfwong@link.cuhk.edu.hk Department of Physics, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong    Peter T. H. Pang thopang@nikhef.nl Nikhef – National Institute for Subatomic Physics, Science Park, 1098 XG Amsterdam, The Netherlands Institute for Gravitational and Subatomic Physics (GRASP), Utrecht University, Princetonplein 1, 3584 CC Utrecht, The Netherlands    Rico K. L. Lo kllo@caltech.edu LIGO, California Institute of Technology, Pasadena, California 91125, USA    Tjonnie G. F. Li Department of Physics, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong Institute for Theoretical Physics, KU Leuven, Celestijnenlaan 200D, B-3001 Leuven, Belgium Department of Electrical Engineering (ESAT), KU Leuven, Kasteelpark Arenberg 10, B-3001 Leuven, Belgium    Chris Van Den Broeck Nikhef – National Institute for Subatomic Physics, Science Park, 1098 XG Amsterdam, The Netherlands Institute for Gravitational and Subatomic Physics (GRASP), Utrecht University, Princetonplein 1, 3584 CC Utrecht, The Netherlands
Abstract

We present a null-stream-based Bayesian unmodeled framework to probe generic gravitational-wave polarizations. Generic metric theories allow six gravitational-wave polarization states, but general relativity only permits the existence of two of them namely the tensorial polarizations. The strain signal measured by an interferometer is a linear combination of the polarization modes and such a linear combination depends on the geometry of the detector and the source location. The detector network of Advanced LIGO and Advanced Virgo allows us to measure different linear combinations of the polarization modes and therefore we can constrain the polarization content by analyzing how the polarization modes are linearly combined. We propose the basis formulation to construct a null stream along the polarization basis modes without requiring modeling the basis explicitly. We conduct a mock data study and we show that the framework is capable of probing pure and mixed polarizations in the Advanced LIGO-Advanced Virgo 3-detector network without knowing the sky location of the source from electromagnetic counterparts. We also discuss the effect of the presence of the uncaptured orthogonal polarization component in the framework, and we propose using the plug-in method to test the existence of the orthogonal polarizations.

I introduction

Since the first gravitational wave (GW) detection Abbott et al. (2016a) in 2015, Advanced LIGO Aasi et al. (2015) and Advanced Virgo Acernese et al. (2014) have detected dozens of compact binary coalescence (CBC) events in the first, second and third observing runs Abbott et al. (2019a, 2020a). The GW detections allow us to test general relativity (GR) in the strong-field and dynamical regime Abbott et al. (2019b); Collaboration et al. (2020).

One property of GWs predicted by GR is that they are described by only two polarization modes namely the plus polarization and the cross polarization or collectively the tensorial polarizations. Generic metric theories allow six GW polarization modes Eardley et al. (1973) which are two tensorial polarization modes, two vectorial polarization modes, and two scalar polarization modes. Non-tensorial polarization modes exist in many modified gravity theories. For example, the Brans-Dicke theory Brans and Dicke (1961) predicts a scalar breathing polarization mode in addition to the two tensorial polarization modes. The bimetric theory proposed by Rosen Rosen (1971, 1974) predicts the existence of all six polarization modes. Different polarization modes stretch and squeeze the space differently, and the strain measured by an interferometer is a linear combination of the polarization modes. The polarization modes are linearly combined in a way that depends on the geometry of the interferometer and the source location. A network of non-coaligned interferometers allows us to measure different linear combinations of the polarization modes in each detector and therefore allows us to constrain the polarization content of the signal.

Different methods have been proposed to constrain the polarization content of GWs. At the time of writing the coincident detections are only up to three detectors, so the analyses so far only focus on pure polarizations i.e. comparing the likelihood of the signal being purely tensorial, purely vectorial, and pure scalar since at least M+1M+1 non-coaligned detectors are needed to resolve MM non-degenerate polarization modes Chatziioannou et al. (2012a). To date, the proposed tests could be categorized into two groups which are heuristic tests and model-independent tests. The heuristic test in Refs. Abbott et al. (2017a, 2019c) is performed by replacing the tensorial beam pattern function with the non-tensorial beam pattern function while keeping the GR waveform template. The model-independent tests include the sine-Gaussian analysis using BayesWave Cornish and Littenberg (2015) performed on GW150914 Abbott et al. (2016b), and the null-stream-based analysis Collaboration et al. (2020) performed on GWTC-2 Abbott et al. (2020a).

In our previous work Pang et al. (2020), we discussed null-stream-based frequentist methods to probe mixed polarizations with the source location informed by electromagnetic (EM) counterparts. The methods do not rely on any waveform models, and we could therefore test the polarization content model-agnostically to confirm or rule out a specific group of modified gravity theories. However, there are two disadvantages of the methods. First, it requires the source location to be known that could be informed by EM counterparts, but to date, GW170817 is the only GW event that has a confident association with a gamma-ray burst Abbott et al. (2017b) and this indicates the rarity of joint GW-EM observations. Second, the methods could only detect the existence of non-tensorial polarizations but are not capable of inferring which non-tensorial components are more likely to present. In this work, we propose a generalized null-stream-based Bayesian framework to infer generic GW polarizations including pure and mixture polarizations without requiring the a priori knowledge of source location. The Bayesian framework allows us to compare the marginal likelihood between different polarization hypotheses in contrast to the frequentist methods that we previously proposed.

The paper is structured as follows. In Sec. II, we review the observation model of GW and null stream, and we discuss the basis formulation and develop the Bayesian framework to probe GW polarizations. In Sec. III, we present the results of a mock data study with non-GR injections and show that our framework is capable of probing GW polarizations without requiring the a priori knowledge of the source location. We also discuss the effect of the presence of an uncaptured orthogonal polarization component in the analysis, and we present the plug-in method to test the existence of an orthogonal polarization component in the signal. The conclusions are summarized in Sec. IV.

II Methodology

In this section, we describe the GW observation model, review the null stream, and present the Bayesian null stream formulation to probe GW polarizations.

II.1 Observation model

The additive noise observation model of GW with all possible polarization modes in a DD-detector network writes

𝒅(t;Δ𝒕)=𝑭(α,δ,ψ,t)𝒉(t)+𝒏(t;Δ𝒕)\boldsymbol{d}(t;\Delta\boldsymbol{t})=\boldsymbol{F}(\alpha,\delta,\psi,t)\boldsymbol{h}(t)+\boldsymbol{n}(t;\Delta\boldsymbol{t}) (1)

where

𝒅(t;Δ𝒕)=[d1(t+Δt1)d2(t+Δt2)dD(t+ΔtD)]\boldsymbol{d}(t;\Delta\boldsymbol{t})=\begin{bmatrix}d_{1}(t+\Delta t_{1})\\ d_{2}(t+\Delta t_{2})\\ \vdots\\ d_{D}(t+\Delta t_{D})\end{bmatrix} (2)

is the observed strain outputs at shifted times,

𝑭(α,δ,ψ,t)=[𝒇+𝒇×𝒇b𝒇l𝒇x𝒇y]=[F1+F1×F1bF1lF1xF1yF2+F2×F2bF2lF2xF2yFD+FD×FDbFDlFDxFDy]\begin{split}\boldsymbol{F}(\alpha,\delta,\psi,t)&=\begin{bmatrix}\boldsymbol{f}_{+}&\boldsymbol{f}_{\times}&\boldsymbol{f}_{b}&\boldsymbol{f}_{l}&\boldsymbol{f}_{x}&\boldsymbol{f}_{y}\end{bmatrix}\\ &=\begin{bmatrix}F_{1}^{+}&F_{1}^{\times}&F_{1}^{b}&F_{1}^{l}&F_{1}^{x}&F_{1}^{y}\\ F_{2}^{+}&F_{2}^{\times}&F_{2}^{b}&F_{2}^{l}&F_{2}^{x}&F_{2}^{y}\\ \vdots&\vdots&\vdots&\vdots&\vdots&\vdots\\ F_{D}^{+}&F_{D}^{\times}&F_{D}^{b}&F_{D}^{l}&F_{D}^{x}&F_{D}^{y}\end{bmatrix}\end{split} (3)

is the beam pattern matrix,

𝒉(t)=[h+(t)h×(t)hb(t)hl(t)hx(t)hy(t)]T\boldsymbol{h}(t)=\begin{bmatrix}h_{+}(t)&h_{\times}(t)&h_{b}(t)&h_{l}(t)&h_{x}(t)&h_{y}(t)\end{bmatrix}^{T} (4)

are the polarization modes where TT denotes transpose and the plus mode, cross mode, scalar breathing mode, scalar longitudinal mode, vector x mode and vector y mode are labeled with the subscripts ++, ×\times, bb, ll, xx and yy respectively,

𝒏(t;Δ𝒕)=[n1(t+Δt1)n2(t+Δt2)nD(t+ΔtD)]\boldsymbol{n}(t;\Delta\boldsymbol{t})=\begin{bmatrix}n_{1}(t+\Delta t_{1})\\ n_{2}(t+\Delta t_{2})\\ \vdots\\ n_{D}(t+\Delta t_{D})\end{bmatrix} (5)

is the detector noise at shifted times, α\alpha, δ\delta are the right ascension and declination of the source location respectively, ψ\psi is the polarization angle, Δ𝒕={Δt1,Δt2,,ΔtD}\Delta\boldsymbol{t}=\{\Delta t_{1},\Delta t_{2},...,\Delta t_{D}\} where

Δtj=𝒓j𝑵^c\Delta t_{j}=-\frac{\boldsymbol{r}_{j}\cdot\hat{\boldsymbol{N}}}{c} (6)

is the time delay of the signal arrival at each detector with reference to the Earth center, 𝒓j\boldsymbol{r}_{j} is the coordinates of the jthj^{\text{th}} detector in the Earth-centered coordinate system,

𝑵^=[cos(δ)cos(tgmstα)cos(δ)sin(tgmstα)sin(δ)]\hat{\boldsymbol{N}}=\begin{bmatrix}\cos(\delta)\cos(t_{\text{gmst}}-\alpha)\\ -\cos(\delta)\sin(t_{\text{gmst}}-\alpha)\\ \sin(\delta)\end{bmatrix} (7)

where tgmstt_{\text{gmst}} is the Greenwich mean sidereal time, and cc is the speed of gravity. For a two-arm interferometer, the beam pattern functions take the following forms when the two arms are directed to the x-axis and y-axis respectively Nishizawa et al. (2009):

F+(θ,ϕ,ψ)=12(1+cos2θ)cos2ϕcos2ψcosθsin2ϕsin2ψ\begin{split}F_{+}(\theta,\phi,\psi)=&\frac{1}{2}(1+\cos^{2}\theta)\cos{2\phi}\cos{2\psi}\\ &-\cos\theta\sin{2\phi}\sin{2\psi}\end{split} (8)
F×(θ,ϕ,ψ)=12(1+cos2θ)cos2ϕsin2ψcosθsin2ϕcos2ψ\begin{split}F_{\times}(\theta,\phi,\psi)=&-\frac{1}{2}(1+\cos^{2}\theta)\cos{2\phi}\sin{2\psi}\\ &-\cos{\theta}\sin{2\phi}\cos{2\psi}\end{split} (9)
Fx(θ,ϕ,ψ)=sinθ(cosθcos2ϕcosψsin2ϕsinψ)F_{x}(\theta,\phi,\psi)=\sin{\theta}(\cos{\theta}\cos{2\phi}\cos{\psi}-\sin{2\phi}\sin{\psi}) (10)
Fy(θ,ϕ,ψ)=sinθ(cosθcos2ϕsinψ+sin2ϕcosψ)F_{y}(\theta,\phi,\psi)=-\sin{\theta}(\cos\theta\cos{2\phi}\sin{\psi}+\sin{2\phi}\cos{\psi}) (11)
Fb(θ,ϕ)=12sin2θcos2ϕF_{b}(\theta,\phi)=-\frac{1}{2}\sin^{2}\theta\cos{2\phi} (12)
Fl(θ,ϕ)=12sin2θcos2ϕF_{l}(\theta,\phi)=\frac{1}{\sqrt{2}}\sin^{2}\theta\cos{2\phi} (13)

where θ\theta and ϕ\phi are the polar angle and the azimuthal angle of the source location in the Earth-centered coordinate system respectively, and ψ\psi is the polarization angle. Fig. 1 demonstrates the effect of different polarization modes on a ring of free-falling test particles.

Refer to caption
Refer to caption
Figure 1: The effect on a ring of free-falling test particles of a GW in tensor “++” mode (upper left), tensor “×\times” mode (upper right), vector “X” mode (middle left), vector “Y” mode (middle right), scalar breathing mode (lower left) and scalar longitudinal mode (lower right). In each case the wave is traveling in the zz-direction. The solid and dotted lines are the states of the ring with a phase difference of π\pi.

In frequency domain, the observation model writes

𝒅~[k;Δ𝒕]=𝑭(α,δ,ψ,tevent)𝒉~[k]+𝒏~[k;Δ𝒕]\tilde{\boldsymbol{d}}[k;\Delta\boldsymbol{t}]=\boldsymbol{F}(\alpha,\delta,\psi,t_{\text{event}})\tilde{\boldsymbol{h}}[k]+\tilde{\boldsymbol{n}}[k;\Delta\boldsymbol{t}] (14)

where

𝒅~[k;Δ𝒕]=[d~1[k]ei2πkΔt1/Td~2[k]ei2πkΔt2/Td~D[k]ei2πkΔtD/T]\tilde{\boldsymbol{d}}[k;\Delta\boldsymbol{t}]=\begin{bmatrix}\tilde{d}_{1}[k]e^{i2\pi k\Delta t_{1}/T}\\ \tilde{d}_{2}[k]e^{i2\pi k\Delta t_{2}/T}\\ \vdots\\ \tilde{d}_{D}[k]e^{i2\pi k\Delta t_{D}/T}\end{bmatrix} (15)

is the Fourier transform (see Appendix A) of the time-shifted strain outputs, TT is the duration of data in second, 𝑭(α,δ,ψ,t)𝑭(α,δ,ψ,tevent)\boldsymbol{F}(\alpha,\delta,\psi,t)\approx\boldsymbol{F}(\alpha,\delta,\psi,t_{\text{event}}) is approximated to be constant over the data segment which is valid for a transient signal and is evaluated at the event time teventt_{\text{event}},

𝒉~[k]=[h~+[k]h~×[k]h~b[k]h~l[k]h~x[k]h~y[k]]T\tilde{\boldsymbol{h}}[k]=\begin{bmatrix}\tilde{h}_{+}[k]&\tilde{h}_{\times}[k]&\tilde{h}_{b}[k]&\tilde{h}_{l}[k]&\tilde{h}_{x}[k]&\tilde{h}_{y}[k]\end{bmatrix}^{T} (16)

is the Fourier transform of the polarization modes, and

𝒏~[k;Δ𝒕]=[n~1[k]ei2πkΔt1/Tn~2[k]ei2πkΔt2/Tn~D[k]ei2πkΔtD/T]\tilde{\boldsymbol{n}}[k;\Delta\boldsymbol{t}]=\begin{bmatrix}\tilde{n}_{1}[k]e^{i2\pi k\Delta t_{1}/T}\\ \tilde{n}_{2}[k]e^{i2\pi k\Delta t_{2}/T}\\ \vdots\\ \tilde{n}_{D}[k]e^{i2\pi k\Delta t_{D}/T}\end{bmatrix} (17)

is the Fourier transform of the time-shifted noise in each detector. Assume the detector noise follows a stationary Gaussian distribution and the noise is independent between detectors, we have

n~j[k]n~j[k]=12ΔfSj[k]δkkδjj\braket{\tilde{n}_{j^{\prime}}^{*}[k^{\prime}]\tilde{n}_{j}[k]}=\frac{1}{2\Delta f}S_{j}[k]\delta_{k^{\prime}k}\delta_{j^{\prime}j} (18)

where \braket{\cdot} denotes expectation over the noise realizations, Sj[k]S_{j}[k] is the one-sided power spectral density (PSD) of the noise in detector jj, δjk\delta_{jk} denotes Kronecker delta, and Δf\Delta f is the frequency resolution of the discrete Fourier transform.

For brevity, in the following sections we write the time-shifted data as the time-shift operator 𝓣(;Δ𝒕):D×KD×K\boldsymbol{\mathcal{T}}\left(\cdot;\Delta\boldsymbol{t}\right):\mathbb{C}^{D\times K}\rightarrow\mathbb{C}^{D\times K} applied on the unshifted data defined by

𝓣(𝒙~;Δ𝒕)=𝒙~𝒮(Δ𝒕)\boldsymbol{\mathcal{T}}(\tilde{\boldsymbol{x}};\Delta\boldsymbol{t})=\tilde{\boldsymbol{x}}\odot\mathcal{S}(\Delta\boldsymbol{t}) (19)

where

𝒙~=[x~1[1]x~1[2]x~1[K]x~2[1]x~2[2]x~2[K]x~D[1]x~D[2]x~D[K]]\tilde{\boldsymbol{x}}=\begin{bmatrix}\tilde{x}_{1}[1]&\tilde{x}_{1}[2]&\ldots&\tilde{x}_{1}[K]\\ \tilde{x}_{2}[1]&\tilde{x}_{2}[2]&\ldots&\tilde{x}_{2}[K]\\ \vdots&\vdots&\ddots&\vdots\\ \tilde{x}_{D}[1]&\tilde{x}_{D}[2]&\ldots&\tilde{x}_{D}[K]\end{bmatrix} (20)

is the unshifted data, (𝒮(Δ𝒕))jk=ei2π(k1)Δtj/T(\mathcal{S}(\Delta\boldsymbol{t}))_{jk}=e^{i2\pi(k-1)\Delta t_{j}/T} where j=1,2,,Dj=1,2,...,D and k=1,2,,Kk=1,2,...,K or more explicitly

𝒮(Δ𝒕)=[ei2π(0)Δt1/Tei2π(1)Δt1/Tei2π(K1)Δt1/Tei2π(0)Δt2/Tei2π(1)Δt2/Tei2π(K1)Δt2/Tei2π(0)ΔtD/Tei2π(1)ΔtD/Tei2π(K1)ΔtD/T],\begin{split}&\mathcal{S}(\Delta\boldsymbol{t})\\ &=\begin{bmatrix}e^{i2\pi(0)\Delta t_{1}/T}&e^{i2\pi(1)\Delta t_{1}/T}&\ldots&e^{i2\pi(K-1)\Delta t_{1}/T}\\ e^{i2\pi(0)\Delta t_{2}/T}&e^{i2\pi(1)\Delta t_{2}/T}&\ldots&e^{i2\pi(K-1)\Delta t_{2}/T}\\ \vdots&\vdots&\ddots&\vdots\\ e^{i2\pi(0)\Delta t_{D}/T}&e^{i2\pi(1)\Delta t_{D}/T}&\ldots&e^{i2\pi(K-1)\Delta t_{D}/T}\end{bmatrix}\end{split}\,, (21)

\odot denotes the elementwise product, and KK is the number of frequency bins.

II.2 Null Stream

Null stream Gürsel and Tinto (1989) is a specific linear combination of strain outputs from a network of detectors such that the linear combination only contains noise regardless of the GW waveform. Let us consider a tensorial signal as follows

𝒔~(f)=[𝒇+𝒇×][h~+(f)h~×(f)],\displaystyle\tilde{\boldsymbol{s}}(f)=\begin{bmatrix}\boldsymbol{f}_{+}&\boldsymbol{f}_{\times}\end{bmatrix}\begin{bmatrix}\tilde{h}_{+}(f)\\ \tilde{h}_{\times}(f)\end{bmatrix}\,, (22)

the signal 𝒔~(f)\tilde{\boldsymbol{s}}(f) could be regarded as living in the subspace spanned by 𝒇+\boldsymbol{f}_{+} and 𝒇×\boldsymbol{f}_{\times} as shown in Fig. 2. The hyperplane spanned by 𝒇+\boldsymbol{f}_{+} and 𝒇×\boldsymbol{f}_{\times} in the figure defines all possible strain measurement of a tensorial signal in each detector. We could project away the hyperplane to remove any tensorial signal regardless of the waveform of h~+(f)\tilde{h}_{+}(f) and h~×(f)\tilde{h}_{\times}(f). The residual remains in the null space of 𝒇+\boldsymbol{f}_{+} and 𝒇×\boldsymbol{f}_{\times} is the so-called null stream which contains noise only.

Refer to caption
Figure 2: The axes d~1\tilde{d}_{1}, d~2\tilde{d}_{2} and d~3\tilde{d}_{3} represent the measurement of the strain output in each detector. The hyperplane spanned by 𝒇+\boldsymbol{f}_{+} and 𝒇×\boldsymbol{f}_{\times} defines all possible strain measurement of a tensorial signal in each detector. The vector 𝒏\boldsymbol{n} is orthogonal to 𝒇+\boldsymbol{f}_{+} and 𝒇×\boldsymbol{f}_{\times} and it spans the null space of 𝒇+\boldsymbol{f}_{+} and 𝒇×\boldsymbol{f}_{\times}. There is no strain measurement of any tensorial signal along 𝒏\boldsymbol{n}.

One application of null stream is to distinguish between coincident GW transients and coincident noise glitches. If the observed coherent excess power is a genuine GW transient, one should obtain a null stream consistent with noise. For noise glitches, in general, the resultant null stream would still contain excess power. Null stream is therefore used in conducting vetos Wen and Schutz (2005); Ajith et al. (2006); Chatterji et al. (2006) and GW searches associated with gamma-ray bursts and other astrophysical triggers Sutton et al. (2010). In this section, we review the method to construct a null stream. The matrix representation follows from Ref. Sutton et al. (2010). The only difference is that we include the time-shift operation explicitly in the construction of null stream since we do not know the source location a priori in contrast to the follow-up analysis of gamma-ray bursts in Ref. Sutton et al. (2010).

Suppose we observe a signal at time teventt_{\text{event}}, the frequency domain observation model writes

𝒅~[k]=𝓣(𝑭(α,δ,ψ,tevent)𝒉~;Δ𝒕)[k]+𝒏~[k].\tilde{\boldsymbol{d}}[k]=\boldsymbol{\mathcal{T}}\left(\boldsymbol{F}(\alpha,\delta,\psi,t_{\text{event}})\tilde{\boldsymbol{h}};-\Delta\boldsymbol{t}\right)[k]+\tilde{\boldsymbol{n}}[k]\,. (23)

Before performing the null projection, we first whiten the data since this would make it easier to handle the statistics of the residual noise. Dividing both sides of Eq. (23) by 12ΔfS[k]\sqrt{\frac{1}{2\Delta f}S[k]}, we have

𝒅~w[k]=𝓣(𝑭w(α,δ,ψ,tevent)𝒉~;Δ𝒕)[k]+𝒏~w[k]\tilde{\boldsymbol{d}}_{w}[k]=\boldsymbol{\mathcal{T}}\left(\boldsymbol{F}_{w}(\alpha,\delta,\psi,t_{\text{event}})\tilde{\boldsymbol{h}};-\Delta\boldsymbol{t}\right)[k]+\tilde{\boldsymbol{n}}_{w}[k] (24)

where

d~w,j[k]=d~j[k]12ΔfSj[k],\tilde{d}_{w,j}[k]=\frac{\tilde{d}_{j}[k]}{\sqrt{\frac{1}{2\Delta f}S_{j}[k]}}\,, (25)
Fw,j[k]=Fj12ΔfSj[k]F_{w,j}[k]=\frac{F_{j}}{\sqrt{\frac{1}{2\Delta f}S_{j}[k]}} (26)

and

n~w,j[k]=n~j[k]12ΔfSj[k]\tilde{n}_{w,j}[k]=\frac{\tilde{n}_{j}[k]}{\sqrt{\frac{1}{2\Delta f}S_{j}[k]}} (27)

where FjF_{j} represents the jthj^{\text{th}} row of 𝑭\boldsymbol{F}, 𝑭wD×M×K\boldsymbol{F}_{w}\in\mathbb{C}^{D\times M\times K} is the noise-weighed beam pattern matrix, and MM is the number of column vectors in 𝑭\boldsymbol{F}. The construction of null stream consists of two steps: 1) time-shifting the data with the time delay implied by the source location and event time i.e. Δ𝒕=Δ𝒕(α,δ,tevent)\Delta\boldsymbol{t}=\Delta\boldsymbol{t}(\alpha,\delta,t_{\text{event}}), and 2) performing the null projection. Denote the null operator as 𝑷:D×D×KD×D×K\boldsymbol{P}:\mathbb{C}^{D\times D\times K}\rightarrow\mathbb{C}^{D\times D\times K} defined by

𝑷(;α,δ,ψ,tevent)=(𝑰𝑭w(𝑭w𝑭w)1𝑭w)𝓣(;Δ𝒕)\boldsymbol{P}\left({}\cdot{};\alpha,\delta,\psi,t_{\text{event}}\right)=(\boldsymbol{I}-\boldsymbol{F}_{w}(\boldsymbol{F}_{w}^{\dagger}\boldsymbol{F}_{w})^{-1}\boldsymbol{F}_{w}^{\dagger})\boldsymbol{\mathcal{T}}\left(\cdot;\Delta\boldsymbol{t}\right) (28)

where 𝑰D×D×K\boldsymbol{I}\in\mathbb{C}^{D\times D\times K} is a stack of KK D×DD\times D identity matrices and \dagger denotes conjugate transpose. The matrix operation in Eq. (28) is performed frequency-wise. One should notice that the null operator is completely determined by the constituent 𝒇m\boldsymbol{f}_{m} of 𝑭\boldsymbol{F} and the set of parameters {α,δ,ψ,tevent}\{\alpha,\delta,\psi,t_{\text{event}}\} is independent of the waveform 𝒉~\tilde{\boldsymbol{h}}. When the set of parameters are correctly specified, one could verify that the null operator projects away 𝒉~\tilde{\boldsymbol{h}} without requiring any knowledge of 𝒉~\tilde{\boldsymbol{h}} itself, and the residual 𝒛~D×K\tilde{\boldsymbol{z}}\in\mathbb{C}^{D\times K} is

𝒛~𝑷(𝒅~w;α,δ,ψ,tevent)=(𝑰𝑭w(𝑭w𝑭w)1𝑭w)𝓣(𝒅~w;Δ𝒕)=(𝑰𝑭w(𝑭w𝑭w)1𝑭w)(𝑭w𝒉~+𝓣(𝒏~;Δ𝒕))=(𝑰𝑭w(𝑭w𝑭w)1𝑭w)𝓣(𝒏~w;Δ𝒕)\begin{split}\tilde{\boldsymbol{z}}&\coloneqq\boldsymbol{P}(\tilde{\boldsymbol{d}}_{w};\alpha,\delta,\psi,t_{\text{event}})\\ &=(\boldsymbol{I}-\boldsymbol{F}_{w}(\boldsymbol{F}_{w}^{\dagger}\boldsymbol{F}_{w})^{-1}\boldsymbol{F}_{w}^{\dagger})\boldsymbol{\mathcal{T}}\left(\tilde{\boldsymbol{d}}_{w};\Delta\boldsymbol{t}\right)\\ &=(\boldsymbol{I}-\boldsymbol{F}_{w}(\boldsymbol{F}_{w}^{\dagger}\boldsymbol{F}_{w})^{-1}\boldsymbol{F}_{w}^{\dagger})\left(\boldsymbol{F}_{w}\tilde{\boldsymbol{h}}+\boldsymbol{\mathcal{T}}(\tilde{\boldsymbol{n}};\Delta\boldsymbol{t})\right)\\ &=(\boldsymbol{I}-\boldsymbol{F}_{w}(\boldsymbol{F}_{w}^{\dagger}\boldsymbol{F}_{w})^{-1}\boldsymbol{F}_{w}^{\dagger})\boldsymbol{\mathcal{T}}\left(\tilde{\boldsymbol{n}}_{w};\Delta\boldsymbol{t}\right)\end{split} (29)

which is the residual noise in the lower dimensional space. To understand the reduced dimensionality, one should be reminded that null projection removes the data on the hyperplane spanned by the beam pattern vectors 𝒇m\boldsymbol{f}_{m}, and we may call the hyperplane as the GW space. Only the data in the null space that is orthogonal to the GW space survive and it is the so-called null stream. One could perform a rotation \mathcal{R} to rotate the residuals to align along the principal axes to visualize the reduced dimensionality (see Appendix B.1). Although we write the null operator in Eq. (28) as a function of the polarization angle ψ\psi, the null operator is in fact independent of the polarization angle if we consider non-degenerate polarization modes (see Appendix B.2).

II.3 Basis formulation

In this section, we present the basis formulation to test GW polarizations. Suppose we would like to compare two polarization hypotheses 1\mathcal{H}_{1} and 2\mathcal{H}_{2} which beam pattern matrices are 𝑭1D×m1\boldsymbol{F}_{1}\in\mathbb{R}^{D\times m_{1}} and 𝑭2D×m2\boldsymbol{F}_{2}\in\mathbb{R}^{D\times m_{2}} respectively where m1<m2m_{1}<m_{2} without loss of generality. One should notice the resultant null streams have unequal dimensionalities. To have a fair comparison between different models, the residuals have to be of the same dimensionality. Instead of performing the most generic null projection with 𝑭2D×m2\boldsymbol{F}_{2}\in\mathbb{R}^{D\times m_{2}}, one could perform the null projection to remove a smaller subspace to match the dimensionality of 𝑭1D×m1\boldsymbol{F}_{1}\in\mathbb{R}^{D\times m_{1}}. We propose the basis formulation to restrict the null projection to remove a smaller subspace constrained by the polarization basis modes. To illustrate the idea, we may consider a scalar-tensorial signal model with the scalar breathing mode in addition to the two tensorial modes

𝒔~[k]=𝒇+h~+[k]+𝒇×h~×[k]+𝒇bh~b[k].\tilde{\boldsymbol{s}}[k]=\boldsymbol{f}_{+}\tilde{h}_{+}[k]+\boldsymbol{f}_{\times}\tilde{h}_{\times}[k]+\boldsymbol{f}_{b}\tilde{h}_{b}[k]\,. (30)

The most generic null projection would remove three dimensions from the data since we have three linearly independent beam pattern vectors. Without loss of generality, we choose the ++ mode as the basis, and we can always decompose any polarization mode h~m[k]\tilde{h}_{m}[k] into the parallel component and the orthogonal component with respect to the basis i.e.

h~m[k]=Cmh~[k]+Cmh~[k]\tilde{h}_{m}[k]=C_{\parallel}^{m}\tilde{h}_{\parallel}[k]+C_{\perp}^{m}\tilde{h}_{\perp}[k] (31)

where h~[k]=h~+[k]\tilde{h}_{\parallel}[k]=\tilde{h}_{+}[k] and C,CC_{\parallel},C_{\perp}\in\mathbb{C}. With this formulation, we can rewrite Eq. (30) as

𝒔~[k]=(𝒇++C×𝒇×+Cb𝒇b)h~[k]+(C×𝒇×+Cb𝒇b)h~[k]=𝒇h~[k]+𝒇h~[k]\begin{split}&\tilde{\boldsymbol{s}}[k]\\ &=(\boldsymbol{f}_{+}+C_{\parallel}^{\times}\boldsymbol{f}_{\times}+C_{\parallel}^{b}\boldsymbol{f}_{b})\tilde{h}_{\parallel}[k]+(C_{\perp}^{\times}\boldsymbol{f}_{\times}+C_{\perp}^{b}\boldsymbol{f}_{b})\tilde{h}_{\perp}[k]\\ &=\boldsymbol{f}_{\parallel}\tilde{h}_{\parallel}[k]+\boldsymbol{f}_{\perp}\tilde{h}_{\perp}[k]\end{split} (32)

where

h~+[k]=h~[k]h~×[k]=C×h~[k]+C×h~[k]h~b[k]=Cbh~[k]+Cbh~[k]\begin{split}&\tilde{h}_{+}[k]=\tilde{h}_{\parallel}[k]\\ &\tilde{h}_{\times}[k]=C_{\parallel}^{\times}\tilde{h}_{\parallel}[k]+C_{\perp}^{\times}\tilde{h}_{\perp}[k]\\ &\tilde{h}_{b}[k]=C_{\parallel}^{b}\tilde{h}_{\parallel}[k]+C_{\perp}^{b}\tilde{h}_{\perp}[k]\end{split} (33)

and

𝒇=𝒇++C×𝒇×+Cb𝒇b𝒇=C×𝒇×+Cb𝒇b.\begin{split}&\boldsymbol{f}_{\parallel}=\boldsymbol{f}_{+}+C_{\parallel}^{\times}\boldsymbol{f}_{\times}+C_{\parallel}^{b}\boldsymbol{f}_{b}\\ &\boldsymbol{f}_{\perp}=C_{\perp}^{\times}\boldsymbol{f}_{\times}+C_{\perp}^{b}\boldsymbol{f}_{b}\,.\end{split} (34)

We could then perform the (partial) null projection to remove the smaller subspace spanned by 𝒇\boldsymbol{f}_{\parallel} only. Qualitatively, we have reduced the most generic scalar-tensor GW space spanned by 𝒇+\boldsymbol{f}_{+}, 𝒇×\boldsymbol{f}_{\times} and 𝒇b\boldsymbol{f}_{b} to the scalar-tensor GW subspace spanned by 𝒇\boldsymbol{f}_{\parallel} where the polarization modes are described by a single basis. With this formulation, the dimensionality of the GW space is then determined by the number of basis modes in contrast to that in the generic null projection determined by the number of beam pattern vectors. In other words, this formulation allows us to capture the non-tensorial components parallel to the basis mode(s) (in general we can have more than one basis mode). One attractive feature of this formulation is that we only need to choose the basis modes of some polarization types, but we do not need to assume the waveform of the basis modes.

In the basis formulation, we have the freedom to choose the basis mode(s) without explicitly modeling them. Also, we have the freedom to choose the number of basis mode(s). In terms of testing GR, we want to compare the likelihood between the tensor hypothesis and the non-tensor hypotheses. Since there are two polarization modes in tensorial polarizations, we could choose either one or two basis mode(s) to parameterize the tensor hypothesis. The competing hypotheses are then formulated with the same number of basis modes accordingly. For brevity, we may call the one-basis-mode analysis as the L=1L=1 analysis, and the two-basis-mode analysis as the L=2L=2 analysis where LL denotes the number of basis mode(s). In the L=2L=2 analysis of the tensor hypothesis, the signal space contains all possible tensorial polarizations, and the waveforms of the plus and cross polarization can take any independent shape. In the L=1L=1 analysis, an additional structure is imposed on the plus and cross polarizations i.e. they are linearly dependent in the Fourier domain, and therefore the signal space is more restricted compared to that in the L=2L=2 analysis.

In general, a higher number of basis modes gives a greater degree of freedom to fit the data but it also reduces the distinguishing power between different polarization hypotheses. The major reason is that with a higher number of basis modes we remove a larger subspace from the data, and therefore the residual has a lower dimensionality. One should be reminded that the polarization subspaces spanned by the beam pattern vectors are in general not orthogonal to each other. For example, removing the tensorial subspace could also remove part of the non-tensorial component (if any) of the signal. The polarization subspaces, in general, have a greater overlap in the L=2L=2 analysis than that in the L=1L=1 analysis, and one should expect the analysis with a higher number of basis modes has a weaker distinguishing power between the polarization hypotheses.

Although the L=1L=1 analysis, in general, has a stronger distinguishing power than the L=2L=2 analysis, the L=2L=2 analysis helps to capture the orthogonal polarization mode(s) that is/are possibly missed in the L=1L=1 analysis. In practice, we can always perform both analyses and examine the consistency of the results.

II.4 Parameterization

In this section, we present the parameterization of the L=1L=1 analysis and the L=2L=2 analysis. Each polarization hypothesis is characterized by the beam pattern vector 𝒇m\boldsymbol{f}_{m} involved in the construction of the null operator. To facilitate the mathematical expressions, we may define 𝔽p\mathbb{F}_{p} as the set of labels of polarizations corresponding to the polarization hypothesis p\mathcal{H}_{p} as follows

𝔽T={+,×}𝔽V={x,y}𝔽S={b,l}𝔽TV=𝔽T𝔽V𝔽TS=𝔽T𝔽S𝔽VS=𝔽V𝔽S𝔽TVS=𝔽T𝔽V𝔽S\begin{split}&\mathbb{F}_{T}=\{+,\times\}\\ &\mathbb{F}_{V}=\{x,y\}\\ &\mathbb{F}_{S}=\{b,l\}\\ &\mathbb{F}_{TV}=\mathbb{F}_{T}\cup\mathbb{F}_{V}\\ &\mathbb{F}_{TS}=\mathbb{F}_{T}\cup\mathbb{F}_{S}\\ &\mathbb{F}_{VS}=\mathbb{F}_{V}\cup\mathbb{F}_{S}\\ &\mathbb{F}_{TVS}=\mathbb{F}_{T}\cup\mathbb{F}_{V}\cup\mathbb{F}_{S}\end{split} (35)

where ++, ×\times, bb, ll, xx and yy denotes the labels for the plus polarization, cross polarization, scalar breathing polarization, scalar longitudinal polarization, vector xx polarization, and vector yy polarization respectively.

II.4.1 One-basis-mode (L=1L=1) analysis

In the L=1L=1 analysis, the beam pattern matrix 𝑭\boldsymbol{F} of the polarization hypothesis p\mathcal{H}_{p} with the basis mode \ell is defined by

𝑭(α,δ,ψ,𝓐,𝝋,tevent)=[𝒇+m𝔽p{}𝒜meiφm𝒇m].\begin{split}&\boldsymbol{F}(\alpha,\delta,\psi,\boldsymbol{\mathcal{A}},\boldsymbol{\varphi},t_{\text{event}})\\ &=\begin{bmatrix}\boldsymbol{f}_{\ell}+\sum\limits_{m\in\mathbb{F}_{p}\setminus\{\ell\}}\mathcal{A}_{m}e^{i\varphi_{m}}\boldsymbol{f}_{m}\end{bmatrix}\,.\end{split} (36)

where 𝔽p{}\mathbb{F}_{p}\setminus\{\ell\} is the set of labels of polarizations in hypothesis p\mathcal{H}_{p} defined in Eq. (35) excluding \ell, 𝒜m0\mathcal{A}_{m}\geq 0 and φm[0,2π)\varphi_{m}\in[0,2\pi). As an example, suppose we choose the plus mode as the basis mode in the scalar-tensor hypothesis, the parallel beam pattern vector 𝒇\boldsymbol{f}_{\parallel} is then

𝒇=𝒇++m={×,b,l}𝒜meiφm𝒇m.\boldsymbol{f}_{\parallel}=\boldsymbol{f}_{+}+\sum_{m=\{\times,b,l\}}\mathcal{A}_{m}e^{i\varphi_{m}}\boldsymbol{f}_{m}\,. (37)

The beam pattern matrix 𝑭\boldsymbol{F} is

𝑭(α,δ,ψ,𝓐,𝝋,tevent)=[𝒇]=[𝒇++m={×,b,l}𝒜meiφm𝒇m]\begin{split}&\boldsymbol{F}(\alpha,\delta,\psi,\boldsymbol{\mathcal{A}},\boldsymbol{\varphi},t_{\text{event}})\\ &=\begin{bmatrix}\boldsymbol{f}_{\parallel}\end{bmatrix}\\ &=\begin{bmatrix}\boldsymbol{f}_{+}+\sum\limits_{m=\{\times,b,l\}}\mathcal{A}_{m}e^{i\varphi_{m}}\boldsymbol{f}_{m}\end{bmatrix}\end{split} (38)

where 𝓐={A×,Ab,Al}\boldsymbol{\mathcal{A}}=\{A_{\times},A_{b},A_{l}\} and 𝝋={φ×,φb,φl}\boldsymbol{\varphi}=\{\varphi_{\times},\varphi_{b},\varphi_{l}\}. The null operator is then constructed with Eq. (28). One should notice that now it is also a function of 𝓐\boldsymbol{\mathcal{A}} and 𝝋\boldsymbol{\varphi} in addition to α\alpha, δ\delta, ψ\psi and teventt_{\text{event}}. The polarization angle ψ\psi is now relevant in the construction of 𝑷\boldsymbol{P} since we can no longer factor out ψ\psi from 𝑭\boldsymbol{F} in constrast to the generic null projection discussed in Appendix B.2. One exception is the scalar hypothesis which involves 𝒇b\boldsymbol{f}_{b} and 𝒇l\boldsymbol{f}_{l}, but they are linearly dependent so we could choose either one of them. With a single beam pattern vector, the ψ\psi can be factored out and the construction of null operator is therefore independent of ψ\psi. Other polarization hypotheses are formulated with the same manner by including the relevant 𝒇m\boldsymbol{f}_{m} into 𝒇\boldsymbol{f}_{\parallel}.

II.4.2 Two-basis-mode (L=2L=2) analysis

In the L=2L=2 analysis, we choose two polarization modes as the basis. The beam pattern matrix 𝑭\boldsymbol{F} of the polarization hypothesis p\mathcal{H}_{p} with the basis modes kk and \ell is defined by

𝑭(α,δ,ψ,𝓐,𝝋,tevent)=[𝒇,k𝒇,]\begin{split}&\boldsymbol{F}(\alpha,\delta,\psi,\boldsymbol{\mathcal{A}},\boldsymbol{\varphi},t_{\text{event}})\\ &=\begin{bmatrix}\boldsymbol{f}_{\parallel,k}&\boldsymbol{f}_{\parallel,\ell}\end{bmatrix}\end{split} (39)

where

𝒇,k=𝒇k+m𝔽p{k,}𝒜1,meiφ1,m𝒇m\boldsymbol{f}_{\parallel,k}=\boldsymbol{f}_{k}+\sum\limits_{m\in\mathbb{F}_{p}\setminus\{k,\ell\}}\mathcal{A}_{1,m}e^{i\varphi_{1,m}}\boldsymbol{f}_{m} (40)

and

𝒇,=𝒇+m𝔽p{k,}𝒜2,meiφ2,m𝒇m\boldsymbol{f}_{\parallel,\ell}=\boldsymbol{f}_{\ell}+\sum\limits_{m\in\mathbb{F}_{p}\setminus\{k,\ell\}}\mathcal{A}_{2,m}e^{i\varphi_{2,m}}\boldsymbol{f}_{m} (41)

where 𝔽p{k,}\mathbb{F}_{p}\setminus\{k,\ell\} is the set of labels of polarizations in hypothesis p\mathcal{H}_{p} defined in Eq. (35) excluding kk and \ell, 𝒜{1,2},m0\mathcal{A}_{\{1,2\},m}\geq 0 and φ{1,2},m[0,2π)\varphi_{\{1,2\},m}\in[0,2\pi). In principle we could choose any two polarization modes as the basis, but we shall see from the non-GR waveforms of e.g. Brans-Dicke theory, Rosen’s theory and Lightman-Lee theory presented in Ref. Chatziioannou et al. (2012b), the dipole radiation only enters the non-tensorial polarizations. Therefore, the more sensible choice is to choose one tensorial mode and one non-tensorial mode as the basis. As a comparison with the example in the L=1L=1 analysis, we shall again take the scalar-tensor hypothesis as an example. Suppose we choose the plus mode and the scalar breathing mode as the basis, the two parallel beam pattern vectors are

𝒇,+=𝒇++m={×,l}𝒜1,meiφ1,m𝒇m\boldsymbol{f}_{\parallel,+}=\boldsymbol{f}_{+}+\sum_{m=\{\times,l\}}\mathcal{A}_{1,m}e^{i\varphi_{1,m}}\boldsymbol{f}_{m} (42)

and

𝒇,b=𝒇b+m={×,l}𝒜2,meiφ2,m𝒇m.\boldsymbol{f}_{\parallel,b}=\boldsymbol{f}_{b}+\sum_{m=\{\times,l\}}\mathcal{A}_{2,m}e^{i\varphi_{2,m}}\boldsymbol{f}_{m}\,. (43)

The beam pattern matrix is therefore

𝑭(α,δ,ψ,𝓐,𝝋,tevent)=[𝒇,+𝒇,b]\begin{split}\boldsymbol{F}(\alpha,\delta,\psi,\boldsymbol{\mathcal{A}},\boldsymbol{\varphi},t_{\text{event}})=\begin{bmatrix}\boldsymbol{f}_{\parallel,+}&\boldsymbol{f}_{\parallel,b}\end{bmatrix}\end{split} (44)

where 𝓐={A1,×,A1,l,A2,×,A2,l}\boldsymbol{\mathcal{A}}=\{A_{1,\times},A_{1,l},A_{2,\times},A_{2,l}\} and 𝝋={φ1,×,φ1,l,φ2,×,φ2,l}\boldsymbol{\varphi}=\{\varphi_{1,\times},\varphi_{1,l},\varphi_{2,\times},\varphi_{2,l}\}. The null operator is then constructed with Eq. (28). The scalar hypothesis is not defined in the L=2L=2 analysis since 𝒇b\boldsymbol{f}_{b} and 𝒇l\boldsymbol{f}_{l} are linearly dependent. For the tensor hypothesis and the vector hypothesis, since they only have two polarization modes, they are fully characterized by themselves without requiring the parameters 𝓐\boldsymbol{\mathcal{A}} and 𝝋\boldsymbol{\varphi}, and therefore ψ\psi is irrelevant in the construction of the null operator as discussed in Appendix B.2.

The extension to even more basis modes (L>2L>2) is trivial. Since we are most interested in detecting GR violation, but the tensor hypothesis is not defined in L>2L>2 analyses, so the L=1L=1 and the L=2L=2 analyses are sufficient in terms of probing deviation from GR.

II.5 Bayesian model selection

Since the model parameters 𝜽={α\boldsymbol{\theta}=\{\alpha, δ,ψ,𝓐,𝝋}\delta,\psi,\boldsymbol{\mathcal{A}},\boldsymbol{\varphi}\} are not known (we take the event time teventt_{\text{event}} from the search pipelines as known), we adopt a Bayesian analysis to marginalize the unknown parameters to obtain the Bayesian evidence of the competing hypotheses.

To improve the sensitivity, we analyze the data in the time-frequency domain and target the region spanned by the candidate signal through time-frequency clustering. We use the Wilson-Daubechies-Meyer (WDM) time-frequency transform Necula et al. (2012) because of its superior spectral leakage control. Given a multi-detector discrete time series 𝒙D×N\boldsymbol{x}\in\mathbb{R}^{D\times N} where DD is the number of detectors and NN is the number of time bins, denote the time-frequency transform as TF:D×ND×J×K\textbf{TF}:\mathbb{R}^{D\times N}\rightarrow\mathbb{R}^{D\times J\times K} where JJ is the number of time bins and KK is the number of frequency bins which performs the WDM time-frequency transform on the time series of each detector independently. The corresponding time-frequency representation is denoted with a subscript as 𝒙TF\boldsymbol{x}_{\text{TF}}.

One should notice that the construction of the null operator involves the noise-weighed beam pattern function matrix 𝑭w\boldsymbol{F}_{w} in Eq. (26). The noise PSD of LIGO-Virgo interferometers typically exhibits sharp spectral lines due to the resonance of wires in suspension, electrical supply power line, and injected calibration lines Littenberg and Cornish (2015). To reduce the spectral leakage, we compute the residual 𝒛~\tilde{\boldsymbol{z}} in Eq. (29) in the frequency domain which has a higher frequency resolution than the time-frequency representation. Then, we perform the inverse Fourier transform 1\mathcal{F}^{-1} to transform the residual back to the time domain and then transform the time domain residual to the time-frequency domain by TF. As shown in Eq. (29), when the null operator 𝑷\boldsymbol{P} is constructed with the correct parameters, the residual 𝒛~\tilde{\boldsymbol{z}} is only noise in the lower dimensional space regardless of 𝒉~\tilde{\boldsymbol{h}}. The time-frequency representation of the residual 𝒛TF\boldsymbol{z}_{\text{TF}} is also only noise with the null energy EnullE_{\text{null}} Sutton et al. (2010) defined as

Enull(𝒛TF)=j=1D{k,l}𝒞zTF,j[k,l]2E_{\text{null}}(\boldsymbol{z}_{\text{TF}})=\sum_{j=1}^{D}\sum_{\{k,l\}\in\mathcal{C}}\left\lVert z_{\text{TF},j}[k,l]\right\rVert^{2} (45)

where zTF,j[k,l]z_{\text{TF},j}[k,l] denotes the coefficient of the time-frequency representation 𝒛TF\boldsymbol{z}_{\text{TF}} corresponding to the kthk^{\text{th}} time index and lthl^{\text{th}} frequency index of the residual of the jthj^{\text{th}} detector, and 𝒞\mathcal{C} denotes the set of time-frequency indices that the candidate signal occupies on the time-frequency plane follows the χ2\chi^{2} distribution with a degree of freedom (DL)N(D-L)N where LL is the number of basis modes and NN is the number of time-frequency pixels that the candidate signal spans when the null operator is correctly constructed. The degree of freedom is half of that in Ref. Sutton et al. (2010) since the WDM coefficients are real but in Ref. Sutton et al. (2010) Fourier transform is used and the real and imaginary components double the degree of freedom.

The likelihood is then given by

p(𝒅|α,δ,ψ,𝓐,𝝋,)=χDoF2(Enull(𝒛TF))p(\boldsymbol{d}|\alpha,\delta,\psi,\boldsymbol{\mathcal{A}},\boldsymbol{\varphi},\mathcal{H})=\chi^{2}_{\text{DoF}}(E_{\text{null}}(\boldsymbol{z}_{\text{TF}})) (46)

where 𝒛TF=TF(1(𝑷(𝒅~;α,δ,ψ,𝓐,𝝋)))\boldsymbol{z}_{\text{TF}}=\textbf{TF}(\mathcal{F}^{-1}(\boldsymbol{P}_{\mathcal{H}}(\tilde{\boldsymbol{d}};\alpha,\delta,\psi,\boldsymbol{\mathcal{A}},\boldsymbol{\varphi}))), 𝑷\boldsymbol{P}_{\mathcal{H}} denotes the null operator constructed with the 𝒇m\boldsymbol{f}_{m} implied by the hypothesis \mathcal{H} and the number of basis modes LL, and χDoF2()\chi_{\text{DoF}}^{2}(\cdot) is the χ2\chi^{2} probability density function with the degree of freedom DoF implied by the hypothesis \mathcal{H}. The Bayesian evidence is then evaluated by marginalizing all parameters

p(𝒅|)=p(𝒅|α,δ,ψ,𝓐,𝝋,)p(α,δ,ψ,𝓐,𝝋|)𝑑α𝑑δ𝑑ψ𝑑𝓐𝑑𝝋\begin{split}&p(\boldsymbol{d}|\mathcal{H})\\ &=\int p(\boldsymbol{d}|\alpha,\delta,\psi,\boldsymbol{\mathcal{A}},\boldsymbol{\varphi},\mathcal{H})p(\alpha,\delta,\psi,\boldsymbol{\mathcal{A}},\boldsymbol{\varphi}|\mathcal{H})d\alpha d\delta d\psi d\boldsymbol{\mathcal{A}}d\boldsymbol{\varphi}\end{split} (47)

where p(α,δ,ψ,𝑨,𝝋|)p(\alpha,\delta,\psi,\boldsymbol{A},\boldsymbol{\varphi}|\mathcal{H}) is the prior distribution of the parameters given \mathcal{H}. The posterior odds between 1\mathcal{H}_{1} and 2\mathcal{H}_{2} defined as

𝒪21=p(1|𝒅)p(2|𝒅)=p(𝒅|1)p(𝒅|2)×p(1)p(2)\begin{split}\mathcal{O}_{\mathcal{H}_{2}}^{\mathcal{H}_{1}}&=\frac{p(\mathcal{H}_{1}|\boldsymbol{d})}{p(\mathcal{H}_{2}|\boldsymbol{d})}\\ &=\frac{p(\boldsymbol{d}|\mathcal{H}_{1})}{p(\boldsymbol{d}|\mathcal{H}_{2})}\times\frac{p(\mathcal{H}_{1})}{p(\mathcal{H}_{2})}\end{split} (48)

that decribes the ratio of probabilities between the two hypotheses 1\mathcal{H}_{1} and 2\mathcal{H}_{2} given the observed data 𝒅\boldsymbol{d}, p(𝒅|1)p(𝒅|2)\frac{p(\boldsymbol{d}|\mathcal{H}_{1})}{p(\boldsymbol{d}|\mathcal{H}_{2})} is the ratio of model evidences and is also called the Bayes factor, and p(1)p(2)\frac{p(\mathcal{H}_{1})}{p(\mathcal{H}_{2})} is the prior odds between the hypotheses 1\mathcal{H}_{1} and 2\mathcal{H}_{2} which describes the a priori belief of the ratio of probabilities of the two hypotheses. When we are ignorant about the relative probability between the competing hypothesis, we take the prior odds to be 11, and then the posterior odds would be equal to the Bayes factor. In the following sections, we use the Bayes factor as the detection statistic. It should be understood that Eq. (47) is presented for generality, {ψ,𝑨,𝝋}\{\psi,\boldsymbol{A},\boldsymbol{\varphi}\} are not present in the integral when the hypothesis \mathcal{H} does not have the extra parameterization.

As a follow-up analysis, in the analysis pipeline, we require the time-frequency cluster found to span across the event time reported by the search pipeline. This could fail when the signal is too weak, and there are loud noise transients around the event. If the cluster does not span across the event time, the analysis is performed in the frequency domain, and the whole data chunk is being analyzed. The null energy is then computed with the frequency domain residual 𝒛~\tilde{\boldsymbol{z}}

Enull(𝒛~)=j=1Dk=1Kz~j[k]2E_{\text{null}}(\tilde{\boldsymbol{z}})=\sum_{j=1}^{D}\sum_{k=1}^{K}\left\lVert\tilde{z}_{j}[k]\right\rVert^{2} (49)

where z~j[k]\tilde{z}_{j}[k] denotes the residual at frequency bin kk of the jthj^{\text{th}} detector, DD denotes the number of detectors and KK denotes the number of frequency bins. 2Enull2E_{\text{null}} then follows the χ2\chi^{2} distribution with degree of freedom 2(DM)K2(D-M)K when the null operator is correctly constructed.

II.6 The orthogonal component h~\tilde{h}_{\perp}

One would still need to pay attention to the orthogonal polarization component h~\tilde{h}_{\perp} with the partial null projection. In terms of testing GR, the orthogonal polarization component must not be significant in the tensor hypothesis, and otherwise, this could lead to false GR violation. For CBC signals, the plus- and the cross-polarization modes in the dominant 22-mode only differ by the amplitude (due to the inclination angle) and the phase by π/2\pi/2 which are frequency-independent Creighton and Anderson (2011). The addition of the higher-order harmonics only adds subdominant corrections. The plus mode and the cross mode are still well described by a single basis mode, and the orthogonal component is vanishingly small. In the example of the L=1L=1 analysis presented in Sec. II.4.1, the orthogonal component would be significant if the dipole radiation only enters the non-tensorial polarizations but not the tensorial polarizations like the Brans-Dicke theory Chatziioannou et al. (2012b). We would argue that detecting the non-tensorial component parallel to the basis modes is sufficient for us to detect a GR violation. However, we also want to know how much uncaptured h~\tilde{h}_{\perp} are in the data to understand how well the model can explain the data. This is important for us to understand the systematics of the ranking of the non-tensor hypotheses if we observe a deviation from GR.

With the same spirit of the residuals test Abbott et al. (2019b); Collaboration et al. (2020), we analyze whether the null energy defined in Eq. (45) with the maximum-likelihood parameters are consistent with noise or not. The consistency could be quantified using the plug-in p-value Demortier (2007) defined by

pplug-in(Enull)=EnullχDoF2(E)𝑑Ep_{\text{plug-in}}(E_{\text{null}})=\int_{E_{\text{null}}}^{\infty}\chi^{2}_{\text{DoF}}(E)dE (50)

where EnullE_{\text{null}} is the null energy with the maximum-likelihood parameters. One should notice that the plug-in p-value does not distribute uniformly between 0 and 11. Instead, it should always be around pplug-in(DoF2)p_{\text{plug-in}}(\text{DoF}-2) (assume DoF2\text{DoF}\geq 2) since the mode of a χ2\chi^{2} distribution with a degree of freedom DoF is DoF2\text{DoF}-2 that is the EnullE_{\text{null}} with the highest possible likelihood. A significantly smaller pplug-in(Enull)p_{\text{plug-in}}(E_{\text{null}}) than the pplug-in(DoF2)p_{\text{plug-in}}(\text{DoF}-2) indicates the presence of an orthogonal polarization component. The pplug-inp_{\text{plug-in}} could therefore be served as a diagnostic tool, but one shoule be reminded that it cannot be interpreted as the usual p-value since it does not distriute uniformly between 0 and 11 under the null hypothesis. A more interpretable statistic could be obtained using the double bootstrap method to calculate the adjusted plug-in p-value Demortier (2007); Davison and Hinkley (1997) that distributes uniformly between 0 and 11.

II.7 Calibration errors

Calibration errors are present in instruments Cahillane et al. (2017); Accadia et al. (2010). Although a previous study Vitale et al. (2012) shows that calibration-induced errors of Advanced LIGO and Advanced Virgo are not a significant detriment to accurate parameter estimation, including the effects of calibration errors has been a standard practice in the LIGO Scientific Collaboration and the Virgo Collaboration Abbott et al. (2019a, 2020a) to improve the accuracy of results and for completeness. The details of including effects of calibration errors into the framework are discussed in Appendix C.

III Results

We experiment on tensor, vector, scalar, tensor-scalar, tensor-vector, vector-scalar and tensor-vector-scalar injections to validate the method. In this section, we start with the ad hoc non-tensorial injections which non-tensorial components are generated by projecting h~+(f)\tilde{h}_{+}(f) and h~×(f)\tilde{h}_{\times}(f) onto the non-tensorial beam pattern functions. The polarization modes can then be described well with a single basis mode. This serves as testing the methods when there is no orthogonal polarization component. We then experiment on the more realistic non-GR waveforms with both cases when the orthogonal polarization component is present and absent respectively.

III.1 Nested sampling configurations

The Bayesian model evidences are computed with nested sampling using MultiNest Feroz and Hobson (2008); Feroz et al. (2009, 2019a). The prior of source sky position Ω^=(α\hat{\Omega}=(\alpha, δ)\delta) is taken to be uniform over the sky sphere. The prior of polarization angle ψ\psi is taken to be uniform over [0,π][0,\pi]. The prior of each relative amplitude in 𝓐\boldsymbol{\mathcal{A}} is taken to be uniform over [0,2][0,2]. The prior of each relative phase in 𝝋\boldsymbol{\varphi} is taken to be uniform over [0,2π][0,2\pi]. 1024 live points are used. The sampling efficiency is set to be 0.30.3 as recommended in the GitHub repository of MultiNest Feroz et al. (2019b).

III.2 Ad hoc injections

III.2.1 Mock data preparation

One should expect to observe a stronger model preference for the non-tensor hypotheses when the underlying non-tensorial signal has a higher signal-to-noise ratio (SNR). Also, one should expect to observe a stronger model preference for the tensor hypothesis when the underlying tensorial signal has a higher SNR. Therefore, we generate tensorial and non-tensorial injections, and check whether the correct hypothesis is more favored. We fix the noise realization and every waveform parameter except the luminosity distance for scaling to the targeted SNR. We use the IMRPhenomPv2 Husa et al. (2016); Khan et al. (2016); Hannam et al. (2014) waveform model to generate the binary black hole (BBH) waveforms h+(t)h_{+}(t) and h×(t)h_{\times}(t) with a sampling rate fs=2048 Hzf_{s}=2048\text{ Hz} and a frequency lower cut flow=20 Hzf_{\text{low}}=20\text{ Hz}. The component masses are set to be m1=m2=20Mm_{1}=m_{2}=20M_{\odot}. The spins, inclination angle, coalescence phase and polarization angle are set to be 0. The geocentric GPS time is set to be 12821078241282107824 and the right ascension and declination of the source location are set to be α=2.72\alpha=2.72 and δ=0.36\delta=-0.36 respectively. We generate the non-tensorial signals by projecting h+(t)h_{+}(t) and h×(t)h_{\times}(t) onto the non-tensorial beam pattern functions. The waveforms are projected onto the Hanford-Livingston-Virgo (HLV) detector network, and injected into simulated Gaussian noise with the advanced LIGO and advanced Virgo design sensitivities, or more specifically aLIGODesignSensitivityP1200087 and AdVDesignSensitivityP1200087 Abbott et al. (2020b) respectively.

We first present the results of the analysis on the injections with pure polarizations i.e. pure tensorial, pure vectorial, and pure scalar signal. The injected signals are scaled to 20 different network SNRs equally spaced between 10 and 100. The single detector SNR is defined by

ρ=40s~(f)2S(f)𝑑f\rho=\sqrt{4\int_{0}^{\infty}\frac{\left\lVert\tilde{s}(f)\right\rVert^{2}}{S(f)}df} (51)

where s~(f)\tilde{s}(f) is the Fourier transform of the observed signal and S(f)S(f) is the one-sided PSD of detector noise, and the NN-detector network SNR is defined by

ρnet=j=1Nρj2.\rho_{\text{net}}=\sqrt{\sum_{j=1}^{N}\rho_{j}^{2}}\,. (52)

We then present the results of the analysis on the injections with mixed polarizations. We study the case when the signal is dominantly tensorial with a non-tensorial correction. We fix the network SNR of the injections to be 100100, and vary the strength of the non-tensorial component. Lastly, we also present the results of the analysis on vector-scalar injections for completeness.

III.2.2 Pure polarizations

The scalar signal is generated by

sS(t)=Fbh+(t)+Flh×(t)s_{S}(t)=F_{b}h_{+}(t)+F_{l}h_{\times}(t) (53)

and a vector signal is generated by

sV(t)=Fxh+(t)+Fyh×(t).s_{V}(t)=F_{x}h_{+}(t)+F_{y}h_{\times}(t)\,. (54)

Fig. 3 shows the results of log10\log_{10} Bayes factor of scalar hypothesis S\mathcal{H}_{S} against tensor hypothesis T\mathcal{H}_{T} (log10TS\log_{10}\mathcal{B}_{T}^{S}) with scalar injections and tensor injections. Since the scalar breathing beam pattern function and scalar longitudinal beam pattern function are linearly dependent, the scalar hypothesis is only defined with one basis mode. Consequently, the competing hypotheses also have to be constructed with one basis mode. The competing tensor hypothesis is hence defined with the ++ mode chosen to be the basis.

Fig. 4 shows the results of log10\log_{10} Bayes factor of vector hypothesis V\mathcal{H}_{V} against tensor hypothesis T\mathcal{H}_{T} (log10TS\log_{10}\mathcal{B}_{T}^{S}) with vector injections and tensor injections. The upper panel shows the results with one basis mode. xx mode is chosen as the basis for the vector hypothesis and ++ mode is chosen as the basis for the tensor hypothesis. The lower panel shows the results with two basis modes. Since both vector polarizations and tensor polarizations have two polarization modes, xx and yy modes are the only choices of basis for vector hypothesis, and ++ and ×\times modes are the only choices of basis for the tensor hypothesis.

The error bars denote the ±1\pm 1 sigma error of log10\log_{10}\mathcal{B} estimated by σ12+σ22\sqrt{\sigma_{1}^{2}+\sigma_{2}^{2}} where σ1,2\sigma_{1,2} are the one sigma error of the competing model log evidences from the MultiNest Feroz and Hobson (2008); Feroz et al. (2009, 2019a) outputs after converting from base ee to base 1010. The plots show a tendency to be in more favor of the true polarization hypotheses with an increasing SNR.

Refer to caption
Figure 3: The plot shows the variation of log10TS\log_{10}\mathcal{B}_{T}^{S} with varying SNR. The signals are injected into the Hanford-Livingston-Virgo 3-detector network. The squares denote the results of analysis of scalar injections, and the triangles denote the results of analysis of tensor injections. The error bars are the standard error of log10TS\log_{10}\mathcal{B}_{T}^{S} that due to the error of evidence estimation of the sampler. The error bars are too small to be seen. Since the scalar breathing and scalar longitudinal beam pattern functions are linearly dependent, the scalar hypothesis is only defined with one basis mode. The tensor hypothesis here takes the ++ mode as the basis. The beam pattern matrix of the hypotheses is defined in Eq. (36). The true polarization hypotheses are more favored with an increasing SNR.
Refer to caption
Figure 4: The plots show the variation of log10TV\log_{10}\mathcal{B}_{T}^{V} with varying SNR. The signals are injected into the Hanford-Livingston-Virgo 3-detector network. The squares denote the results of analysis of vector injections, and the triangles denote the results of analysis of tensor injections. The error bars are the standard error of log10TV\log_{10}\mathcal{B}_{T}^{V} that due to the error of evidence estimation of the sampler. The error bars are too small to be seen. The upper panel shows the results with the xx mode as the basis component for the vector hypothesis, and the ++ mode as the basis component for the tensor hypothesis. The beam pattern matrix of the one-basis-mode analysis is defined in Eq. (36). The lower panel shows the results with the xx and the yy modes as the basis components for the vector hypothesis, and the ++ mode and the ×\times modes as the basis components for the tensor hypothesis. The beam pattern matrix of the two-basis-mode analysis is defined in Eq. (39). The true polarization hypotheses are more favored with an increasing SNR.

III.2.3 Tensor polarizations with non-tensorial corrections

We study the case when the polarization content is dominantly tensorial with non-tensorial corrections. We should expect to observe a stronger model preference for the non-tensor hypotheses with a stronger non-tensorial component. We perform the analysis on tensor-scalar, tensor-vector and tensor-vector-scalar injections with different strengths of non-tensorial components. A tensor-scalar signal is generated by

sTS(t)=F+h+(t)+F×h×(t)+𝒜(Fbh+(t)+Flh×(t))s_{TS}(t)=F_{+}h_{+}(t)+F_{\times}h_{\times}(t)+\mathcal{A}(F_{b}h_{+}(t)+F_{l}h_{\times}(t)) (55)

where 𝒜\mathcal{A} denotes the relative amplitude of the scalar modes to the tensor modes. A tensor-vector signal is generated by

sTV(t)=F+h+(t)+F×h×(t)+𝒜(Fxh+(t)+Fyh×(t))s_{TV}(t)=F_{+}h_{+}(t)+F_{\times}h_{\times}(t)+\mathcal{A}(F_{x}h_{+}(t)+F_{y}h_{\times}(t)) (56)

where 𝒜\mathcal{A} denotes the relative amplitude of the vector modes to the tensor modes. A tensor-vector-scalar signal is generated by

sTVS(t)=F+h+(t)+F×h×(t)+𝒜(Fxh+(t)+Fyh×(t)+Fbh+(t)+Flh×(t))\begin{split}s_{TVS}(t)&=F_{+}h_{+}(t)+F_{\times}h_{\times}(t)\\ &+\mathcal{A}(F_{x}h_{+}(t)+F_{y}h_{\times}(t)+F_{b}h_{+}(t)+F_{l}h_{\times}(t))\end{split} (57)

where 𝒜\mathcal{A} denotes the relative amplitude of the non-tensorial modes to the tensor modes. Signals with 𝒜={0,0.2,0.4,0.6,0.8,1.0}\mathcal{A}=\{0,0.2,0.4,0.6,0.8,1.0\} are generated and injected into the HLV detector network with the network SNR fixed to be 100100.

Refer to caption
Figure 5: The plots show the variation of log10TTS\log_{10}\mathcal{B}_{T}^{TS} with a tensor-scalar injection of a varying relative strength 𝒜\mathcal{A} of the scalar polarization mode relative to the tensorial polarization mode. The signals are injected into the Hanford-Livingston-Virgo 3-detector network and the network SNR is fixed to be 100100 for all injections. The error bars are the standard error of log10TTS\log_{10}\mathcal{B}_{T}^{TS} that due to the error of evidence estimation of the sampler. The upper panel shows the results using the ++ mode as the basis component for both of the tensor hypothesis T\mathcal{H}_{T} and the tensor-scalar hypothesis TS\mathcal{H}_{TS}. The lower panel shows the results using the ++ mode and the ×\times mode as the basis components for both hypotheses. The tensor-scalar hypothesis is more favored when the strength of the scalar component is stronger.
Refer to caption
Figure 6: The plots show the variation of log10TTV\log_{10}\mathcal{B}_{T}^{TV} with a tensor-vector injection of a varying relative strength 𝒜\mathcal{A} of the vector polarization mode relative to the tensorial polarization mode. The signals are injected into the Hanford-Livingston-Virgo 3-detector network and the network SNR is fixed to be 100100 for all injections. The error bars are the standard error of log10TTV\log_{10}\mathcal{B}_{T}^{TV} that due to the error of evidence estimation of the sampler. The error bars on the upper panel are too small to be seen. The upper panel shows the results using the ++ mode as the basis component for both of the tensor hypothesis T\mathcal{H}_{T} and the tensor-vector hypothesis TV\mathcal{H}_{TV}. The lower panel shows the results using the ++ mode and the ×\times mode as the basis components for both hypotheses. The tensor-vector hypothesis is more favored when the strength of the vector component is stronger.
Refer to caption
Figure 7: The plots show the variation of log10TTVS\log_{10}\mathcal{B}_{T}^{TVS} with a tensor-vector-scalar injection of a varying relative strength 𝒜\mathcal{A} of the vector and scalar polarization modes relative to the tensorial polarization mode. The signals are injected into the Hanford-Livingston-Virgo 3-detector network and the network SNR is fixed to be 100100 for all injections. The error bars are the standard error of log10TTVS\log_{10}\mathcal{B}_{T}^{TVS} that due to the error of evidence estimation of the sampler. The error bars on the upper panel are too small to be seen. The upper panel shows the results using the ++ mode as the basis component for both of the tensor hypothesis T\mathcal{H}_{T} and the tensor-vector-scalar hypothesis TVS\mathcal{H}_{TVS}. The lower panel shows the results using the ++ mode and the ×\times mode as the basis components for both hypotheses. The tensor-vector-scalar hypothesis is more favored when the strength of the non-tensorial component is stronger.

The results with the tensor-scalar injections, the tensor-vector injections and the tensor-vector-scalar injections are shown in Fig. 5, Fig. 6 and Fig. 7 respectively. For comparison, we also perform the same analysis with tensor injections and the results are shown in each of the figures. The upper panels show the results of L=1L=1 analysis with the ++ mode chosen to be the basis component, and the lower panels show the results of L=2L=2 analysis with the ++ mode and the ×\times mode chosen to be the basis components. The figures show a general trend favoring the non-tensor hypotheses with a stronger non-tensorial component. However, the tensor hypothesis is not significantly favored when the injection is pure tensorial even with a very high SNR. This is because the non-tensor hypotheses TV\mathcal{H}_{TV} and TVS\mathcal{H}_{TVS} with 𝓐=𝟎\boldsymbol{\mathcal{A}}=\boldsymbol{0} could also perfectly explain the data. The slight preference towards T\mathcal{H}_{T} is due to the penalty on the more complicated models by the Ockham’s razor Sivia and Skilling (2006).

III.2.4 Vector-scalar polarizations

Lastly, we experiment on vector-scalar injections. The vector-scalar signal is generated by

sVS(t)=Fxh+(t)+Fyh×(t)+Fbh+(t)+Flh×(t).s_{VS}(t)=F_{x}h_{+}(t)+F_{y}h_{\times}(t)+F_{b}h_{+}(t)+F_{l}h_{\times}(t)\,. (58)
Refer to caption
Figure 8: The plots show the variation of log10TVS\log_{10}\mathcal{B}_{T}^{VS} with varying SNR. The signals are injected into the Hanford-Livingston-Virgo 3-detector network. The squares denote the results of analysis of vector-scalar injections, and the triangles denote the results of analysis of tensor injections. The error bars are the standard errors of logTVS\log\mathcal{B}_{T}^{VS} that due to the error of evidence estimation of the sampler. The error bars are too small to be seen in the plots. The upper panel shows the results using one basis mode with xx mode as the basis for VS\mathcal{H}_{VS} and ++ mode as the basis for T\mathcal{H}_{T}. The beam pattern matrix of the one-basis-mode analysis is defined in Eq. (36). The lower panel shows the results using two basis modes with xx and yy modes as the basis components for VS\mathcal{H}_{VS} and ++ mode and ×\times mode as the basis components for T\mathcal{H}_{T}. The beam pattern matrix of the two-basis-mode analysis is defined in Eq. (39). The true polarization hypotheses are more favored with an increasing SNR.

The results are shown in Fig. 8. Similarly, the upper panel shows the results using one basis mode. xx mode is chosen to be the basis component for the vector-scalar hypothesis, and ++ mode is chosen to be the basis component for the tensor hypothesis. The lower panel shows the results using two basis modes. xx and yy modes are chosen to be the basis components for the vector-scalar hypothesis, and ++ mode and ×\times mode are chosen to be the basis components for the tensor hypothesis. The plots show a general trend favoring the true underlying polarization models with a higher SNR.

III.2.5 Discussion

The results suggest the formulation we propose is capable of probing all possible types of non-tensorial polarizations. One should observe that using one basis mode, in general, gives a stronger model preference for the true model than using two basis modes and this agrees with our expectation as discussed in Sec. II.3. One should not overinterpret the figures to be stating the expected log10\log_{10}\mathcal{B} to be observed given the relative strength of non-tensorial components and the SNR. The source location of all injections is set to a fixed position arbitrarily chosen as stated in Sec. III.2.1, but the sensitivity to probe for polarizations significantly depends on the source location. If the source locations of the injections are uniformly sampled over the sky sphere, the trend line in the plots would instead appear as a wide band. Fig. 9 shows an example plot of the distribution of log10TV\log_{10}\mathcal{B}_{T}^{V} of vector and tensor injections in the HLV network with randomly sampled injection parameters. The component masses are sampled from a uniform distribution [5,50]M\in[5,50]M_{\odot}. The sky positions of the source are sampled from a uniform sky sphere. The inclination angles ι\iota are sampled from a cosine distribution where ι[0,π]\iota\in[0,\pi]. The coalescence phases are sampled from a uniform distribution [0,2π]\in[0,2\pi]. The polarization angles are sampled from a uniform distribution [0,π]\in[0,\pi]. One should notice that there is a portion of high SNR injections having log10TV0\log_{10}\mathcal{B}_{T}^{V}\approx 0 due to the difficulty to distinguish between vector hypothesis and tensor hypothesis at those sky locations.

Refer to caption
Figure 9: The plots show the distribution of log10TV\log_{10}\mathcal{B}_{T}^{V} which injections are generated with randomly sampled waveform parameters and source locations. The squares denote the results of analysis of vector injections. The triangles denote the results of analysis of tensor injections. The upper panel shows the results with ++ mode as the basis component. The beam pattern matrix of the one-basis-mode analysis is defined in Eq. (36). The lower panel shows the results with ++ and ×\times modes as the basis components. The beam pattern matrix of the two-basis-mode analysis is defined in Eq. (39).

III.3 More realistic non-GR injections

III.3.1 Mock data preparation

We experiment on the waveforms predicted by Rosen’s theory Rosen (1971, 1974) which is a bimetric theory that predicts the existence of all six polarization states. After combining the Eq. 69-71 in Ref. Chatziioannou et al. (2012b), the Fourier transform of the polarization modes in the stationary phase approximation is

h~+(f)=1+cos2ι2kR1/3g~R(2)(f)h~×(f)=icosιkR1/3g~R(2)(f)h~b(f)=sin2ι2kR1/3g~R(2)(f)43𝒢sinιkR1/6g~R(1)(f)h~l(f)=sin2ιkR1/3g~R(2)(f)43𝒢sinιkR1/6g~R(1)(f)h~x(f)=isinιkR1/3g~R(2)(f)43i𝒢kR1/6g~R(1)(f)h~y(f)=sin2ι2kR1/3g~R(2)(f)43𝒢cosιkR1/6g~R(1)(f)\begin{split}&\tilde{h}_{+}(f)=-\frac{1+\cos^{2}\iota}{2}k_{\text{R}}^{-1/3}\tilde{g}_{\text{R}}^{(2)}(f)\\ &\tilde{h}_{\times}(f)=-i\cos\iota k_{\text{R}}^{-1/3}\tilde{g}_{\text{R}}^{(2)}(f)\\ &\tilde{h}_{b}(f)=-\frac{\sin^{2}\iota}{2}k_{\text{R}}^{-1/3}\tilde{g}_{\text{R}}^{(2)}(f)-\frac{4}{3}\mathcal{G}\sin{\iota}k_{\text{R}}^{-1/6}\tilde{g}_{\text{R}}^{(1)}(f)\\ &\tilde{h}_{l}(f)=-\sin^{2}\iota k_{\text{R}}^{-1/3}\tilde{g}_{\text{R}}^{(2)}(f)-\frac{4}{3}\mathcal{G}\sin\iota k_{\text{R}}^{-1/6}\tilde{g}_{\text{R}}^{(1)}(f)\\ &\tilde{h}_{x}(f)=-i\sin\iota k_{\text{R}}^{-1/3}\tilde{g}_{\text{R}}^{(2)}(f)-\frac{4}{3}i\mathcal{G}k_{\text{R}}^{-1/6}\tilde{g}_{\text{R}}^{(1)}(f)\\ &\tilde{h}_{y}(f)=-\frac{\sin{2\iota}}{2}k_{\text{R}}^{-1/3}\tilde{g}_{\text{R}}^{(2)}(f)-\frac{4}{3}\mathcal{G}\cos\iota k_{\text{R}}^{-1/6}\tilde{g}_{\text{R}}^{(1)}(f)\end{split} (59)

where

g~R(2)(f)=kR5/12i5π842D(πf)7/6eiΨR(2)(f)\tilde{g}_{\text{R}}^{(2)}(f)=k_{\text{R}}^{-5/12}i\sqrt{\frac{5\pi}{84}}\frac{\mathcal{M}^{2}}{D}(\pi\mathcal{M}f)^{-7/6}e^{-i\Psi_{\text{R}}^{(2)}(f)} (60)

and

g~R(1)(f)=kR5/12i5π336η1/52D(πf)3/2eiΨR(1)(f).\tilde{g}_{\text{R}}^{(1)}(f)=k_{\text{R}}^{-5/12}i\sqrt{\frac{5\pi}{336}}\eta^{1/5}\frac{\mathcal{M}^{2}}{D}(\pi\mathcal{M}f)^{-3/2}e^{-i\Psi_{\text{R}}^{(1)}(f)}\,. (61)

The equations are presented here for giving readers the intuition of the injections that we choose, and we refer readers to the paper Chatziioannou et al. (2012b) for the definitions of the parameters. One could see that the dipole contribution g~R(1)(f)\tilde{g}_{R}^{(1)}(f) only enters the non-tensorial polarization modes. The strength of the dipole radiation mainly depends on the parameter 𝒢\mathcal{G} which is the difference in the self-gravitational binding energy per unit mass (or the difference in sensitivity Will and Eardley (1977); Will and Zaglauer (1989)) of the binary components. We notice that in the paper Chatziioannou et al. (2012b) the definitions of s1s_{1} and s2s_{2} are not consistent between the expressions 𝒢=s1/m1s2/m2\mathcal{G}=s_{1}/m_{1}-s_{2}/m_{2} and kR=14s1s2/3k_{R}=1-4s_{1}s_{2}/3. The former is the self-gravitational binding energy Will (1977) but the latter is the self-gravitational binding energy per unit mass Will and Eardley (1977). Here we adopt the latter definition, and therefore we have 𝒢=s1s2\mathcal{G}=s_{1}-s_{2}.

Waveforms are generated with a sampling rate fs=2048 Hzf_{s}=2048\text{ Hz} and a frequency lower cut flow=20 Hzf_{\text{low}}=20\text{ Hz}. The component masses are set to m1=m2=20Mm_{1}=m_{2}=20M_{\odot}. The inclination angle is set to π/4\pi/4. The coalescence phase and polarization angle are set to 0. The geocentric GPS time is set to 12821078241282107824. The right ascension and declination of the source location are set to α=2.72\alpha=2.72 and δ=0.36\delta=-0.36 respectively. We experiment on two different cases when the two compact objects have the same sensitivities i.e. s1=s2s_{1}=s_{2} and when they have very different sensitivities i.e. s1s2s_{1}\neq s_{2}. In the former case, the dipole radiation is not excited, and the polarization modes are well described with a single basis mode. In the latter case, the dipole radiation is excited, and the tensorial modes and non-tensorial modes cannot be well described with a single basis mode. We could compute the overlap O(h~a,h~b)O(\tilde{h}_{a},\tilde{h}_{b}) between two polarization modes h~a(f)\tilde{h}_{a}(f) and h~b(f)\tilde{h}_{b}(f) defined by

O(h~a,h~b)=|0h~ah~b(f)𝑑f0|h~a(f)|2𝑑f0|h~b(f)|2𝑑f|O(\tilde{h}_{a},\tilde{h}_{b})=\left|\frac{\int_{0}^{\infty}\tilde{h}_{a}^{*}\tilde{h}_{b}(f)df}{\sqrt{\int_{0}^{\infty}\left|\tilde{h}_{a}(f)\right|^{2}df\int_{0}^{\infty}\left|\tilde{h}_{b}(f)\right|^{2}df}}\right| (62)

to quantify how well the two polarization modes can be described with a single basis mode. The left panels of Fig. 10 and Fig. 11 show the time domain polarization modes with s1=s2=0s_{1}=s_{2}=0 and the overlap between the polarization modes respectively. The right panel of Fig. 10 and Fig. 11 show the time domain polarization modes with s1=0.5s_{1}=0.5 and s2=0s_{2}=0 and the overlap between the polarization modes respectively. The polarization modes are projected onto the HLV network and injected into simulated Gaussian noise with the advanced LIGO and advanced Virgo design sensitivities i.e. aLIGODesignSensitivityP1200087 and AdVDesignSensitivityP1200087 Abbott et al. (2020b) respectively. Similar to the study with the ad hoc injections in Sec. III.2, the injected signals are scaled to 20 different network SNRs equally spaced between 10 and 100. The power of each polarization mode relative to the ++ polarization defined by

Emrel=0|h~m(f)|2𝑑f0|h~+(f)|2𝑑fE^{\text{rel}}_{m}=\frac{\int_{0}^{\infty}\left|\tilde{h}_{m}(f)\right|^{2}df}{\int_{0}^{\infty}\left|\tilde{h}_{+}(f)\right|^{2}df} (63)

where h~m(f)\tilde{h}_{m}(f) is the polarization mode m{+,×,b,l,x,y}m\in\{+,\times,b,l,x,y\} in the frequency domain is summarized in Table 1.

Refer to caption
Figure 10: The plots show the polarization modes of the Rosen waveform. The component masses are m1=m2=20Mm_{1}=m_{2}=20M_{\odot}. The coalescence phase is 0. The inclination angle is π/4\pi/4. The left panel shows the waveforms with sensitivities s1=s2=0s_{1}=s_{2}=0. The right panel shows the waveforms with sensitivities s1=0.5s_{1}=0.5 and s2=0s_{2}=0
Refer to caption
Figure 11: The left panel shows the overlap between the polarization modes when sensitivities are s1=s2=0s_{1}=s_{2}=0. Dipole radiation is not excited, and the polarization modes are linearly dependent with each other. The right panel shows the overlap between the polarization modes when the sensitivities are s1=0.5s_{1}=0.5 and s2=0s_{2}=0. Dipole radiation is excited, and therefore the polarization modes cannot be described with a single basis mode.
s1=s2=0s_{1}=s_{2}=0 s1=0.5s_{1}=0.5, s2=0s_{2}=0
Polarization mode Relative power Relative power
h~+(f)\tilde{h}_{+}(f) 1.001.00 1.001.00
h~×(f)\tilde{h}_{\times}(f) 0.890.89 0.890.89
h~b(f)\tilde{h}_{b}(f) 0.110.11 1.351.35
h~l(f)\tilde{h}_{l}(f) 0.440.44 1.681.68
h~x(f)\tilde{h}_{x}(f) 0.890.89 3.363.36
h~y(f)\tilde{h}_{y}(f) 0.440.44 1.681.68
Table 1: The table summarizes the relative power of the polarization modes with respect to the ++ polarization mode of the Rosen waveform injections.

III.3.2 Effect of the orthogonal polarization component

To begin with, we first discuss how would the orthogonal polarization component affect the detection. To quantify the linear dependency between the polarization modes, we may compute the effective rank Roy and Vetterli (2007) of the matrix of polarization modes

𝒉~=[h~+[1]h~×[1]h~b[1]h~l[1]h~x[1]h~y[1]h~+[2]h~×[2]h~b[2]h~l[2]h~x[2]h~y[2]h~+[K]h~×[K]h~b[K]h~l[K]h~x[K]h~y[K]]\tilde{\boldsymbol{h}}=\begin{bmatrix}\tilde{h}_{+}[1]&\tilde{h}_{\times}[1]&\tilde{h}_{b}[1]&\tilde{h}_{l}[1]&\tilde{h}_{x}[1]&\tilde{h}_{y}[1]\\ \tilde{h}_{+}[2]&\tilde{h}_{\times}[2]&\tilde{h}_{b}[2]&\tilde{h}_{l}[2]&\tilde{h}_{x}[2]&\tilde{h}_{y}[2]\\ \vdots&\vdots&\vdots&\vdots&\vdots&\vdots\\ \tilde{h}_{+}[K]&\tilde{h}_{\times}[K]&\tilde{h}_{b}[K]&\tilde{h}_{l}[K]&\tilde{h}_{x}[K]&\tilde{h}_{y}[K]\\ \end{bmatrix} (64)

where KK is the number of frequency bins. The effective rank for the equal sensitivity injection shown in the left panel of Fig. 10 is 11 which is obvious which implies the polarization modes can be well described with a single basis mode. The effective rank for the unequal sensitivity injection shown in the right panel of Fig. 10 is 1.831.83 which implies the polarization modes cannot be well described with a single basis mode. The results of the analysis are shown in Fig. 12. The upper panel shows the results using one basis mode. ++ mode is chosen to be the basis mode for both T\mathcal{H}_{T} and TVS\mathcal{H}_{TVS}. The lower panel shows the results using two basis modes. ++ mode and ×\times mode are chosen to be the basis modes for T\mathcal{H}_{T} which is also the only choice. ++ mode and vector xx mode are chosen to be the basis modes for TVS\mathcal{H}_{TVS}. The blue dots show the results with the injections of equal sensitivities. In this case the dipole radiation is not excited and there is no orthogonal polarizations as shown in Eq. (59) and the left panel of Fig. 10. The red dots show the results with the injections of unequal sensitivities. As shown in Eq. (59) and the right panel of Fig. 10, the polarization modes can not be well described with a single basis mode. Nevertheless, the red dots in the upper panel still show a general trend favoring TVS\mathcal{H}_{TVS} even when the orthogonal polarization component is significant. A much stronger model preference is observed when the dipole radiation presents in the signal in the L=1L=1 analysis even in this case the orthogonal polarization component is strong.

Refer to caption
Figure 12: The plots show the variation of log10TTVS\log_{10}\mathcal{B}_{T}^{TVS} with varying SNR. The upper panel shows the results with one basis mode. ++ mode is chosen to be the basis mode for both T\mathcal{H}_{T} and TVS\mathcal{H}_{TVS}. The beam pattern matrix of the one-basis-mode analysis is defined in Eq. (36). The lower panel shows the results with two basis modes. ++ mode and ×\times mode are the basis modes for T\mathcal{H}_{T} which is also the only choice. ++ mode and vector xx mode are chosen to be the basis modes for TVS\mathcal{H}_{TVS}. The beam pattern matrix of the two-basis-mode analysis is defined in Eq. (39). The triangles denote the results with the injections of equal sensitivities (s1=s2=0s_{1}=s_{2}=0), and the squares denote the results with the injections of unequal sensitivities (s1=0.5s_{1}=0.5 and s2=0s_{2}=0). The error bars in the upper panel are too small to be seen.

Fig. 13 and Fig. 14 show the plug-in p-value of the injections as discussed in Sec. II.6 to test the presence of uncaptured orthogonal polarizations. The black line labeled as pplugin(DoF2)p_{\text{plugin}}(\text{DoF}-2) in the figures is reference p-value when the null energy attains the highest possible likelihood.

Fig. 13 shows the plug-in p-value of the injections with equal sensitivities. In the upper panel, we could see that the plug-in p-values of TVS\mathcal{H}_{TVS} are consistent with the black line for all SNRs. It implies that there always exists a null operator constructed from a linear combination of 𝒇{+,×,b,l,x,y}\boldsymbol{f}_{\{+,\times,b,l,x,y\}} with one column to produce a residual that is consistent with noise, and no orthogonal polarization component is observed. And indeed the injected polarization modes can be well described with a single basis mode. But for the red dots that correspond to T\mathcal{H}_{T}, the plug-in p-value drops suddenly when the SNR increases through 75\sim 75. This is expected since the injected signals carry tensor, vector, and scalar components, and a null operator constructed from 𝒇{+,x}\boldsymbol{f}_{\{+,x\}} could not completely cancel the signal, but such insufficiency only shows up with a high enough SNR.

Refer to caption
Figure 13: The plot shows the plug-in p-value ppluginp_{\text{plugin}} with the injections of equal sensitivities with different SNRs. The upper panel shows the variation of ppluginp_{\text{plugin}} of the analyses with one basis mode. The bases of T\mathcal{H}_{T} and TVS\mathcal{H}_{TVS} are both chosen to be the ++ mode. The beam pattern matrix of the one-basis-mode analysis is defined in Eq. (36). The lower panel shows the variation of ppluginp_{\text{plugin}} of the analyses with two basis modes. The basis of T\mathcal{H}_{T} is ++ mode and ×\times mode. The basis of TVS\mathcal{H}_{TVS} is chosen to be ++ mode and vector xx mode. The beam pattern matrix of the two-basis-mode analysis is defined in Eq. (39). The triangles denote the plug-in p-value of the analysis with T\mathcal{H}_{T}. The squares denote the plug-in p-value of the analysis with TVS\mathcal{H}_{TVS}. The solid line indicates the p-value of the residual power when it is exactly equal to the mode of the χ2\chi^{2} distribution.

Fig. 14 shows the plug-in p-value of the injections with unequal sensitivities. The upper panel shows the results with one basis mode. The plug-in p-value decreases with a higher SNR for both TVS\mathcal{H}_{TVS} and T\mathcal{H}_{T}. This is expected since the injected signals carry a strong dipole component, and we could not explain the data well with a single basis mode. This suggests the feasibility to use the plug-in p-value as a tool to diagnose the validity of the assumed number of basis modes. A low plug-in p-value also gives us an alarming message if the h~+(f)\tilde{h}_{+}(f) and h~×(f)\tilde{h}_{\times}(f) in the GR waveform model can be well described with a single basis mode, and this would indicate we have observed something that cannot be explained by the GR waveform model. The lower panel shows the results using two basis modes. For the TVS\mathcal{H}_{TVS}, we choose the ++ mode and the vector xx mode as the basis. The plug-in p-values are consistent with pplugin(DoF2)p_{\text{plugin}}(\text{DoF}-2) for all SNRs. As shown in the right panel of Fig. 11, we could choose any one of the tensorial modes and any one of the non-tensorial modes to construct the basis, and the null operator constructed from the basis would be sufficient to cancel the signal. On the other hand, the plug-in p-value for T\mathcal{H}_{T} drops when the SNR is sufficiently high. Even when we allow h~+\tilde{h}_{+} and h~×\tilde{h}_{\times} to take arbitrary forms independently, the null operator could not cancel the signal and it results in an excess amount of residual power.

Refer to caption
Figure 14: The plot shows the plug-in p-value ppluginp_{\text{plugin}} with the injections of unequal sensitivities with different SNRs. The upper panel shows the variation of ppluginp_{\text{plugin}} of the analyses with one basis mode. The bases of T\mathcal{H}_{T} and TVS\mathcal{H}_{TVS} are both chosen to be the ++ mode. The beam pattern matrix of the one-basis-mode analysis is defined in Eq. (36). The lower panel shows the variation of ppluginp_{\text{plugin}} of the analyses with two basis modes. The basis of T\mathcal{H}_{T} is ++ mode and ×\times mode. The basis of TVS\mathcal{H}_{TVS} is chosen to be ++ mode and vector xx mode. The beam pattern matrix of the two-basis-mode analysis is defined in Eq. (39). The triangles denote the plug-in p-value of the analysis with T\mathcal{H}_{T}. The squares denote the plug-in p-value of the analysis with TVS\mathcal{H}_{TVS}. The solid line indicates the p-value of the residual power when it is exactly equal to the mode of the χ2\chi^{2} distribution.

The example shows that the method is capable of capturing the parallel non-tensorial polarization component, and the presence of a significant orthogonal polarization component would not deteriorate the method. We have also demonstrated using the plug-in p-value to test the existence of the orthogonal polarization component.

III.3.3 Ranking among polarization hypotheses

We also perform the analysis with different polarization hypotheses to investigate how would the pipeline rank different hypotheses. The choices of basis and the dimensionality of the parameter space are summarized in Table 2.

Hypothesis Description Mode(s) Basis mode(s) Number of parameters
T,1\mathcal{H}_{T,1} Pure tensorial ++, ×\times ++ 5
V,1\mathcal{H}_{V,1} Pure vectorial xx, yy xx 5
S,1\mathcal{H}_{S,1} Pure scalar bb bb 2
TS,1\mathcal{H}_{TS,1} Tensor-scalar ++, ×\times, bb, ll ++ 9
TV,1\mathcal{H}_{TV,1} Tensor-vector ++, ×\times, xx, yy ++ 9
VS,1\mathcal{H}_{VS,1} Vector-scalar xx, yy, bb, ll xx 9
TVS,1\mathcal{H}_{TVS,1} Tensor-vector-scalar ++, ×\times, xx, yy, bb, ll ++ 13
T,2\mathcal{H}_{T,2} Pure tensorial ++, ×\times ++, ×\times 2
V,2\mathcal{H}_{V,2} Pure vectorial xx, yy xx, yy 2
TS,2\mathcal{H}_{TS,2} Tensor-scalar ++, ×\times, bb, ll ++, bb 11
TV,2\mathcal{H}_{TV,2} Tensor-vector ++, ×\times, xx, yy ++, xx 11
VS,2\mathcal{H}_{VS,2} Vector-scalar xx, yy, bb, ll xx, bb 11
TVS,2\mathcal{H}_{TVS,2} Tensor-vector-scalar ++, ×\times, xx, yy, bb, ll ++, xx 19
Table 2: The table summarizes the choice of basis used in the analysis discussed in Sec. III.3.3 and the number of model parameters of each polarization hypothesis.

Fig. 15 shows the result of the analyses of the injections with equal sensitivities i.e. s1=s2=0s_{1}=s_{2}=0. The plot shows the variation of the log10\log_{10} Bayes factor of different polarization hypotheses against the tensor hypothesis. As the SNR increases, the non-tensor hypotheses are more favored. The upper panel shows the result with one basis mode. Among the hypotheses, V\mathcal{H}_{V}, TV\mathcal{H}_{TV}, VS\mathcal{H}_{VS}, and TVS\mathcal{H}_{TVS} are the most favored in the high SNR cases. When the SNR is between around 4040 and 8080, S\mathcal{H}_{S} is slightly more favored. This is partly because S\mathcal{H}_{S} involves the least number of parameters than other hypotheses as shown in Table 2 and therefore is the least penalized by Ockham’s razor. The another more important reason is related to the overlap between the beam pattern function vectors defined by

𝒟(Ω^;a,b)=|𝒇a(Ω^)𝒇b(Ω^)𝒇a(Ω^)𝒇a(Ω^)𝒇b(Ω^)𝒇b(Ω^)|[0,1].\mathcal{D}(\hat{\Omega};a,b)=\left|\frac{\boldsymbol{f}_{a}(\hat{\Omega})\cdot\boldsymbol{f}_{b}(\hat{\Omega})}{\sqrt{\boldsymbol{f}_{a}(\hat{\Omega})\cdot\boldsymbol{f}_{a}(\hat{\Omega})}\sqrt{\boldsymbol{f}_{b}(\hat{\Omega})\cdot\boldsymbol{f}_{b}(\hat{\Omega})}}\right|\in[0,1]\,. (65)

where Ω^\hat{\Omega} is the sky location of the source, and 𝒇a\boldsymbol{f}_{a} and 𝒇b\boldsymbol{f}_{b} are the beam patten vectors of polarizations aa and bb. Fig. 16. A higher overlap at the sky location Ω^\hat{\Omega} implies it is more difficult to distinguish the polarizations aa and bb when the GW comes from that sky position. Fig. 16 shows the overlap between the beam pattern vectors at the injected source location. We could see that 𝒇b\boldsymbol{f}_{b} has a very significant overlap with 𝒇×\boldsymbol{f}_{\times} and 𝒇x\boldsymbol{f}_{x}. This implies that the signal power from h~×(f)\tilde{h}_{\times}(f) and h~x(f)\tilde{h}_{x}(f) can be significantly reduced with the null operator constructed from solely 𝒇b\boldsymbol{f}_{b}, and this causes some confusion with the other hypotheses. The lower panel shows the result using two basis modes. The VS\mathcal{H}_{VS} is the most favored instead of TVS\mathcal{H}_{TVS} in the high SNR cases. This is again due to the overlap between the beam pattern vectors. One could see from Fig. 16 that 𝒇b\boldsymbol{f}_{b} has an overlap 0.830.83 with 𝒇×\boldsymbol{f}_{\times} and 𝒇y\boldsymbol{f}_{y} has an overlap 0.880.88 with 𝒇+\boldsymbol{f}_{+}. One should be reminded that the polarization modes are linearly dependent when the sensitivities are equal, and therefore we could reduce a significant amount of power from the tensorial modes by the null operator constructed in VS\mathcal{H}_{VS}. Also, the much larger parameter space of TVS\mathcal{H}_{TVS} (19 parameters) compared with VS\mathcal{H}_{VS} (11 parameters) adds a penalty to the evidence.

Refer to caption
Figure 15: The plot shows the variation of log10\log_{10} Bayes factor of different polarization hypotheses against the tensor hypothesis with the Rosen waveform injections with s1=s2=0s_{1}=s_{2}=0 and different SNRs. The upper panel shows the results with one basis mode. The beam pattern matrix of the one-basis-mode analysis is defined in Eq. (36). The lower panel shows the results with two basis modes. The beam pattern matrix of the two-basis-mode analysis is defined in Eq. (39).
Refer to caption
Figure 16: The figure shows the overlap between the beam pattern function vectors 𝒇m\boldsymbol{f}_{m} with the injected source location i.e. α=2.72\alpha=2.72 and δ=0.36\delta=-0.36, polarization angle ψ=0\psi=0 and GPS time 12821078241282107824.

Fig. 17 shows the result of the analyses of the injections with different sensitivities i.e. s1=0.5s_{1}=0.5 and s2=0s_{2}=0. In this case, the dipole radiation is excited, and we have to use at least two basis modes to explain the data. The upper panel shows the result with one basis mode. Even when the orthogonal polarization component is significant, TVS\mathcal{H}_{TVS} still has a very high rank among the hypotheses. TV\mathcal{H}_{TV} is slightly more favored due to the extra penalization on the larger parameter space of TVS\mathcal{H}_{TVS}. The lower panel shows the result with two basis modes. In this case, there is no orthogonal polarization component remains with the best-fit parameters in TVS\mathcal{H}_{TVS} as shown in the lower panel of Fig. 14, and indeed TVS\mathcal{H}_{TVS} is more favored over TV\mathcal{H}_{TV} in the high SNR cases even though it has a much larger parameter space.

Refer to caption
Figure 17: The plot shows the variation of log10\log_{10} Bayes factor of different polarization hypotheses against the tensor hypothesis with the Rosen waveform injections with s1=0.5s_{1}=0.5, s2=0s_{2}=0 and different SNRs. The upper panel shows the results with one basis mode. The beam pattern matrix of the one-basis-mode analysis is defined in Eq. (36). The lower panel shows the results with two basis modes. The beam pattern matrix of the two-basis-mode analysis is defined in Eq. (39).

III.3.4 Discussion

The results in Fig. 17 are in agreement with our discussion in Sec. II.6. The existence of a significant orthogonal polarization component does not deteriorate the capability of the method to detect a GR violation as shown in the upper panel of the figure. Similar to the results in Sec. III.2, one should not overinterpret the figures to be stating the expected deviations that we would observe since the source location of all injections is arbitrarily set to α=2.72\alpha=2.72 and δ=0.36\delta=-0.36, but the distinguishability between different polarization hypotheses significantly depends on the source location. We have shown that confusion between polarization hypotheses can arise when there is a significant overlap between the beam pattern vectors. In terms of identifying the true polarization hypothesis, the most important factor is the overlap between the beam pattern vectors that determines how well we can distinguish between different polarizations. The overlap can be significantly reduced by having a larger detector network in the future.

IV Summary and conclusions

We have presented a null-stream-based generic Bayesian unmodeled framework to probe GW polarizations. We proposed the basis formulation to reformulate the generic null projection along the beam pattern vectors to the null projection along the polarization basis modes. The advantage of this formulation is to guarantee the equal dimensionality of the residuals after performing the null projection for fair model comparison, and we do not need to explicitly assume the waveform of the basis modes. This gives the generality of the framework to probe GW polarizations without requiring modeling non-GR waveform explicitly. This method would be useful when the non-GR waveforms are not well developed.

We first conducted a mock data study with the ad hoc injections and perform the one-basis-mode (L=1L=1) analysis and the two-basis-mode (L=2L=2) analysis. The ad hoc injections are generated by projecting the plus polarization and the cross polarization to the non-tensorial beam pattern functions. In this case, the polarization modes can be well described with a single basis mode. This serves as a sanity check to investigate the capability of the L=1L=1 analysis and the L=2L=2 analysis to detect GW polarizations when there is no uncaptured orthogonal polarization component. We performed the mock data study with tensor, vector, scalar, tensor-scalar, tensor-vector, vector-scalar, and tensor-vector-scalar injections in the HLV 3-detector network at the design sensitivity. The source location is not known and we marginalize over the whole sky sphere in the analysis. We varied the signal-to-noise ratio (SNR) of the injections and we showed that the non-tensor hypotheses are more favored with an increasing SNR and increasing strength of non-tensorial components of the injections. We then conducted a mock data study with the more realistic non-GR waveform i.e. the inspiral waveforms of Rosen’s theory that predicts the existence of all six polarization states. We considered two different cases in which the dipole radiation is excited and not excited respectively. The presence of strong dipole radiation would give rise to a significant orthogonal polarization component in the L=1L=1 analysis. Nevertheless, the L=1L=1 analysis significantly favors TVS\mathcal{H}_{TVS} over T\mathcal{H}_{T}. We also demonstrated the feasibility of using the plug-in p-value to test the presence of an orthogonal polarization component.

Lastly, we investigated how the pipeline ranks different polarization hypotheses with the Rosen waveform injections with equal and unequal sensitivities respectively. We showed that the presence of the orthogonal polarization component contributed by the dipole radiation does not deteriorate the L=1L=1 analysis, and the true polarization hypothesis is one of the top-ranked hypotheses. We showed that there exists some confusion between different polarization hypotheses due to the penalty on the more complicated polarization model and the overlap between the beam pattern vectors. The polarization subspaces spanned by different beam pattern vectors are in general not orthogonal to each other, and the overlap is arguably the most important factor to distinguish between different polarization hypotheses. The overlap depends significantly on the sky location of the source, and in general, it can be reduced with a larger detector network. We should expect to constrain the GW polarization content a lot better in the future by including KAGRA Akutsu et al. (2020); Aso et al. (2013), LIGO India Iyer et al. (2012), Einstein telescope Punturo et al. (2010); Hild et al. (2011), and Cosmic Explorer Abbott et al. (2017c); Reitze et al. (2019) in the joint observing runs.

Our work also demonstrated the feasibility to search for mixed polarizations with a limited number of detectors. Although we need at least M+1M+1 non-coaligned detectors to reconstruct MM independent polarization modes, it is possible to detect the presence of extra polarization modes with a fewer number of detectors. We emphasize that this conclusion does not only apply to null-stream-based methods, and it should also motivate other approaches to search for mixed polarizations in the current LIGO-Virgo 3-detector network.

V Acknowledgement

The authors would like to thank Giancarlo Cella for the comments in preparing the paper. ICFW and TGFL are partially supported by grants from the Research Grants Council of the Hong Kong (Project No. 24304317 and 14306419) and Research Committee of the Chinese University of Hong Kong. PTHP and CVDB are supported by the research program of the Netherlands Organization for Scientific Research (NWO). RKLL and TGFL would also like to gratefully acknowledge the support from the Croucher Foundation in Hong Kong. We are grateful for computational resources provided by Cardiff University, and funded by an STFC grant supporting UK Involvement in the Operation of Advanced LIGO. We are grateful for computational resources provided by the Leonard E Parker Center for Gravitation, Cosmology and Astrophysics at the University of Wisconsin-Milwaukee. We acknowledge the use of IUCAA LDG cluster Sarathi for the computational/numerical work.

Softwares: Eigen Guennebaud et al. (2010), FFTW3 Frigo and Johnson (2005), GSL Gough (2009), HDF5 The HDF Group (2000-2010), HEALPix Gorski et al. (2005), LALSuite LIGO Scientific Collaboration (2018) and MultiNest Feroz and Hobson (2008); Feroz et al. (2009, 2019a) are used to perform analyses. Plots are generated using Matplotlib 416 (2007) and NumPy Harris et al. (2020).

Appendix A Discrete Fourier transform

In this section, we decribe the convention of discrete Fourier transform that we use. Given a discrete-time time series x[n]x[n] of length NN, the discrete Fourier transform x~[k]\tilde{x}[k] is defined as

x~[k]=n=1Nx[n]ei2πnk/NΔt\tilde{x}[k]=\sum_{n=1}^{N}x[n]e^{-i2\pi nk/N}\Delta t (66)

where Δt\Delta t is the sampling interval, and i2=1i^{2}=-1. The inverse transform is defined as

x[n]=k=1Nx~[k]ei2πnk/NΔf{x}[n]=\sum_{k=1}^{N}\tilde{x}[k]e^{i2\pi nk/N}\Delta f (67)

where Δf=1/(NΔt)\Delta f=1/(N\Delta t) is the frequency resolution.

Appendix B Null stream

In this section, we discuss several properties of null stream including the reduced dimensionality of residuals and the role of polarization angle in the generic null stream construction.

B.1 Reduced dimensionality

Null projection removes all data on the hyperplane spanned by the constituent beam pattern function vectors 𝒇m\boldsymbol{f}_{m} used in the construction of the null operator. The resultant null stream, therefore, has a lower dimensionality than the original data. Fig. 18 demonstrates an example of null projection. For the sake of a more intuitive demonstration, the data are displayed in the time domain. One should notice that in the middle panel the signal content is removed and the amplitude of the residual is reduced compared to the strain outputs. This is due to the reduced dimensionality after applying the null operator. The reduced dimensionality could be observed more readily by orienting the residuals along the principal axes. Recall Eq. (29), the null projection is

𝒛~:=(𝑰𝑭w(𝑭w𝑭w)1𝑭w)𝓣(𝒅~w;Δ𝒕).\tilde{\boldsymbol{z}}:=(\boldsymbol{I}-\boldsymbol{F}_{w}(\boldsymbol{F}_{w}^{\dagger}\boldsymbol{F}_{w})^{-1}\boldsymbol{F}_{w}^{\dagger})\boldsymbol{\mathcal{T}}\left(\tilde{\boldsymbol{d}}_{w};\Delta\boldsymbol{t}\right)\,. (68)

We perform the singular value decomposition of each frequency component of 𝑰𝑭w(𝑭w𝑭w)1𝑭w\boldsymbol{I}-\boldsymbol{F}_{w}(\boldsymbol{F}_{w}^{\dagger}\boldsymbol{F}_{w})^{-1}\boldsymbol{F}_{w}^{\dagger}.

(𝑰𝑭w(𝑭w𝑭w)1𝑭w)[k]=𝑼k𝚺k𝑽k(\boldsymbol{I}-\boldsymbol{F}_{w}(\boldsymbol{F}_{w}^{\dagger}\boldsymbol{F}_{w})^{-1}\boldsymbol{F}_{w}^{\dagger})[k]=\boldsymbol{U}_{k}\boldsymbol{\Sigma}_{k}\boldsymbol{V}_{k}^{\dagger} (69)

where 𝑼D×D\boldsymbol{U}\in\mathbb{C}^{D\times D} is a unitary matrix, 𝚺kD×D\boldsymbol{\Sigma}_{k}\in\mathbb{R}^{D\times D} is a diagonal matrix with the singular values as the diagonal entries, 𝑽kD×D\boldsymbol{V}_{k}\in\mathbb{C}^{D\times D} is a unitary matrix, and the subscript kk denotes the decomposition of the kthk^{\text{th}} frequency component of the projector 𝑰𝑭w(𝑭w𝑭w)1𝑭w\boldsymbol{I}-\boldsymbol{F}_{w}(\boldsymbol{F}_{w}^{\dagger}\boldsymbol{F}_{w})^{-1}\boldsymbol{F}_{w}^{\dagger}. The null stream in the principal coordinate system is obtained by applying the rotation matrix 𝑼k\boldsymbol{U}_{k}^{\dagger} i.e.

𝒛~rotated[k]=𝑼kz~[k].\tilde{\boldsymbol{z}}_{\text{rotated}}[k]=\boldsymbol{U}_{k}^{\dagger}\tilde{z}[k]\,. (70)
Refer to caption
Figure 18: The figure shows an example of null stream construction. The upper panel shows the observed strain outputs and the pure tensorial signals in Hanford (H1), Livingston (L1), and Virgo (V1) respectively. The middle panel shows the residual in each detector after applying the correct null operator that is the null stream. The lower panel shows the null stream in the principal coordinate system. The principal axes are the two so-called GW axes on the hyperplane spanned by 𝒇+\boldsymbol{f}_{+} and 𝒇×\boldsymbol{f}_{\times}. The null axis is orthogonal to the GW axes and no tensorial GW signal presents in this dimension.

B.2 Polarization angle

In the most general case where the polarization modes are independent, we may write the GW transient signal 𝒔~\tilde{\boldsymbol{s}} with each of the polarization components as follows

𝒔~tensor=[𝒇+(ψ)𝒇×(ψ)][h~+h~×]=[𝒇+(ψ1)𝒇×(ψ1)]𝑹tensor(ψ2)[h~+h~×]=[𝒇+(ψ1)𝒇×(ψ1)](𝑹tensor(ψ2)[h~+h~×])\begin{split}\tilde{\boldsymbol{s}}_{\text{tensor}}&=\begin{bmatrix}\boldsymbol{f}_{+}(\psi)&\boldsymbol{f}_{\times}(\psi)\end{bmatrix}\begin{bmatrix}\tilde{h}_{+}\\ \tilde{h}_{\times}\end{bmatrix}\\ &=\begin{bmatrix}\boldsymbol{f}_{+}(\psi_{1})&\boldsymbol{f}_{\times}(\psi_{1})\end{bmatrix}\boldsymbol{R}_{\text{tensor}}(\psi_{2})\begin{bmatrix}\tilde{h}_{+}\\ \tilde{h}_{\times}\end{bmatrix}\\ &=\begin{bmatrix}\boldsymbol{f}_{+}(\psi_{1})&\boldsymbol{f}_{\times}(\psi_{1})\end{bmatrix}\left(\boldsymbol{R}_{\text{tensor}}(\psi_{2})\begin{bmatrix}\tilde{h}_{+}\\ \tilde{h}_{\times}\end{bmatrix}\right)\end{split} (71)

where ψ=ψ1+ψ2\psi=\psi_{1}+\psi_{2} and

𝑹tensor(ψ)=[cos2ψsin2ψsin2ψcos2ψ]\boldsymbol{R}_{\text{tensor}}(\psi)=\begin{bmatrix}\cos{2\psi}&-\sin{2\psi}\\ \sin{2\psi}&\cos{2\psi}\end{bmatrix} (72)

is a rotational matrix that represents the rotation of coordinates around the GW-propagating axis. Similarly,

𝒔~vector=[𝒇x(ψ)𝒇y(ψ)][h~xh~y]=[𝒇x(ψ1)𝒇y(ψ1)]𝑹vector(ψ2)[h~xh~y]=[𝒇x(ψ1)𝒇y(ψ1)](𝑹vector(ψ2)[h~xh~y])\begin{split}\tilde{\boldsymbol{s}}_{\text{vector}}&=\begin{bmatrix}\boldsymbol{f}_{x}(\psi)&\boldsymbol{f}_{y}(\psi)\end{bmatrix}\begin{bmatrix}\tilde{h}_{x}\\ \tilde{h}_{y}\end{bmatrix}\\ &=\begin{bmatrix}\boldsymbol{f}_{x}(\psi_{1})&\boldsymbol{f}_{y}(\psi_{1})\end{bmatrix}\boldsymbol{R}_{\text{vector}}(\psi_{2})\begin{bmatrix}\tilde{h}_{x}\\ \tilde{h}_{y}\end{bmatrix}\\ &=\begin{bmatrix}\boldsymbol{f}_{x}(\psi_{1})&\boldsymbol{f}_{y}(\psi_{1})\end{bmatrix}\left(\boldsymbol{R}_{\text{vector}}(\psi_{2})\begin{bmatrix}\tilde{h}_{x}\\ \tilde{h}_{y}\end{bmatrix}\right)\end{split} (73)

where ψ=ψ1+ψ2\psi=\psi_{1}+\psi_{2} and

𝑹vector(ψ)=[cosψsinψsinψcosψ].\boldsymbol{R}_{\text{vector}}(\psi)=\begin{bmatrix}\cos{\psi}&-\sin{\psi}\\ \sin{\psi}&\cos{\psi}\end{bmatrix}\,. (74)

Since the scalar beam pattern function is itself independent of ψ\psi, we do not show it here. We could observe that the signal could be equally well decribed as lying in the subspace spanned by {𝒇m(ψ)}\{\boldsymbol{f}_{m}(\psi)\} for any ψ\psi. This is due to the fact that rotation of the axes does not change the subspace spanned by {𝒇m}\{\boldsymbol{f}_{m}\}. Therefore, the null operator construction in Eq. (28) is independent of ψ\psi.

Appendix C Calibration errors

Following the similar notations used in Ref. Vitale et al. (2012), given the measured strain output d~m(f)\tilde{d}_{m}(f) and the exact strain output d~e(f)\tilde{d}_{e}(f), the errors in amplitude and phase could be accounted by introducing a function K(f)K(f)

d~m(f)=K(f)d~e(f)\tilde{d}_{m}(f)=K(f)\tilde{d}_{e}(f) (75)

where

K(f)=(1+δA(f)A(f))eiδϕ(f).K(f)=\left(1+\frac{\delta A(f)}{A(f)}\right)e^{i\delta\phi(f)}\,. (76)

Then the noise PSD with calibration errors is related to the noise PSD without calibration errors by

Sm(f)=(1+δA(f)A(f))2Se(f).S_{m}(f)=\left(1+\frac{\delta A(f)}{A(f)}\right)^{2}S_{e}(f)\,. (77)

Now we consider the effect of calibration errors in the construction of the null stream. For generality, we write the derivations in the continuous version here, but it is trivial to convert them to the discrete version. After including the calibration errors, the single-detector observation model writes

K(f)d~e(f)=K(f)(𝐅𝐡~(f))+K(f)n~e(f).K(f)\tilde{d}_{e}(f)=K(f)\left(\mathbf{F}\tilde{\mathbf{h}}(f)\right)+K(f)\tilde{n}_{e}(f)\,. (78)

The trivial way to recover the observation model with exact measurements in Eq. (1) is to divide Eq. (78) by K(f)K(f), but K(f)K(f) is not known exactly. In the construction of null stream, we first whiten the data in Eq. (24), however

d~w(f)=(1+δA(f)A(f))eiδϕ(f)d~e(f)12Δf(1+δA(f)A(f))2Se(f)=eiδϕ(f)d~e(f)12ΔfSe(f)\begin{split}&\tilde{d}_{w}(f)\\ &=\frac{\left(1+\frac{\delta A(f)}{A(f)}\right)e^{i\delta\phi(f)}\tilde{d}_{e}(f)}{\sqrt{\frac{1}{2\Delta f}\left(1+\frac{\delta A(f)}{A(f)}\right)^{2}S_{e}(f)}}\\ &=\frac{e^{i\delta\phi(f)}\tilde{d}_{e}(f)}{\sqrt{\frac{1}{2\Delta f}S_{e}(f)}}\end{split} (79)

, and therefore the amplitude error is canceled, and we can conclude only the phase error affects the construction of null stream. The phase error is removed by multiplying the whitened data by eiδϕ(f)e^{-i\delta\phi(f)}. Since the phase error is not known exactly, we use the conventional cubic spline model M. Farr et al. (2014) to model the phase error. Nodal points are equally spaced in logf\log{f}, and a Gaussian prior is placed at each node jj i.e.

p(δϕj)=N(μj,σj2)p(\delta\phi_{j})=N(\mu_{j},\sigma_{j}^{2}) (80)

where μj\mu_{j} and σj2\sigma_{j}^{2} are mean and variance of phase error at node jj which characterize the expected phase error. The integral evaluating the model evidence in Eq. (47) is therefore extended to also marginalize over the phase errors. The calibration error characterization is released as the calibration envelope files by the LIGO Scientific Collaboration and Virgo Collaboration Collaboration and Collaboration (2019).

References

  • Abbott et al. (2016a) B. P. Abbott et al. (LIGO Scientific Collaboration and Virgo Collaboration), “Observation of gravitational waves from a binary black hole merger,” Phys. Rev. Lett. 116, 061102 (2016a).
  • Aasi et al. (2015) J Aasi, B P Abbott, R Abbott, T Abbott, M R Abernathy, K Ackley, C Adams, T Adams, P Addesso,  and et al., “Advanced ligo,” Classical and Quantum Gravity 32, 074001 (2015).
  • Acernese et al. (2014) F Acernese, M Agathos, K Agatsuma, D Aisa, N Allemandou, A Allocca, J Amarni, P Astone, G Balestri, G Ballardin,  and et al., “Advanced virgo: a second-generation interferometric gravitational wave detector,” Classical and Quantum Gravity 32, 024001 (2014).
  • Abbott et al. (2019a) B. P. Abbott et al. (LIGO Scientific Collaboration and Virgo Collaboration), “Gwtc-1: A gravitational-wave transient catalog of compact binary mergers observed by ligo and virgo during the first and second observing runs,” Phys. Rev. X 9, 031040 (2019a).
  • Abbott et al. (2020a) R. Abbott et al., “Gwtc-2: Compact binary coalescences observed by ligo and virgo during the first half of the third observing run,”  (2020a), arXiv:2010.14527 [gr-qc] .
  • Abbott et al. (2019b) B. P. Abbott et al. (The LIGO Scientific Collaboration and the Virgo Collaboration), “Tests of general relativity with the binary black hole signals from the ligo-virgo catalog gwtc-1,” Phys. Rev. D 100, 104036 (2019b).
  • Collaboration et al. (2020) The LIGO Scientific Collaboration, the Virgo Collaboration, et al., “Tests of general relativity with binary black holes from the second ligo-virgo gravitational-wave transient catalog,”  (2020), arXiv:2010.14529 [gr-qc] .
  • Eardley et al. (1973) Douglas M. Eardley, David L. Lee,  and Alan P. Lightman, “Gravitational-wave observations as a tool for testing relativistic gravity,” Phys. Rev. D 8, 3308–3321 (1973).
  • Brans and Dicke (1961) C. Brans and R. H. Dicke, “Mach’s principle and a relativistic theory of gravitation,” Phys. Rev. 124, 925–935 (1961).
  • Rosen (1971) Nathan Rosen, “Theory of gravitation,” Phys. Rev. D 3, 2317–2319 (1971).
  • Rosen (1974) Nathan Rosen, “A theory of gravitation,” Annals of Physics 84, 455–473 (1974).
  • Chatziioannou et al. (2012a) Katerina Chatziioannou, Nicolás Yunes,  and Neil Cornish, “Model-independent test of general relativity: An extended post-einsteinian framework with complete polarization content,” Phys. Rev. D 86, 022004 (2012a).
  • Abbott et al. (2017a) B. P. Abbott et al. (LIGO Scientific Collaboration and Virgo Collaboration), “Gw170814: A three-detector observation of gravitational waves from a binary black hole coalescence,” Phys. Rev. Lett. 119, 141101 (2017a).
  • Abbott et al. (2019c) B. P. Abbott, R. Abbott, T. D. Abbott, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, V. B. Adya,  and et al., “Tests of general relativity with gw170817,” Physical Review Letters 123 (2019c), 10.1103/physrevlett.123.011102.
  • Cornish and Littenberg (2015) Neil J Cornish and Tyson B Littenberg, “Bayeswave: Bayesian inference for gravitational wave bursts and instrument glitches,” Classical and Quantum Gravity 32, 135012 (2015).
  • Abbott et al. (2016b) B. P. Abbott et al. (LIGO Scientific and Virgo Collaborations), “Tests of general relativity with gw150914,” Phys. Rev. Lett. 116, 221101 (2016b).
  • Pang et al. (2020) Peter T. H. Pang, Rico K. L. Lo, Isaac C. F. Wong, Tjonnie G. F. Li,  and Chris Van Den Broeck, “Generic searches for alternative gravitational wave polarizations with networks of interferometric detectors,” Physical Review D 101 (2020), 10.1103/physrevd.101.104055.
  • Abbott et al. (2017b) B. P. Abbott et al., “Multi-messenger observations of a binary neutron star merger,” The Astrophysical Journal 848, L12 (2017b).
  • Nishizawa et al. (2009) Atsushi Nishizawa, Atsushi Taruya, Kazuhiro Hayama, Seiji Kawamura,  and Masa-aki Sakagami, “Probing nontensorial polarizations of stochastic gravitational-wave backgrounds with ground-based laser interferometers,” Physical Review D 79 (2009), 10.1103/physrevd.79.082002.
  • Gürsel and Tinto (1989) Yekta Gürsel and Massimo Tinto, “Near optimal solution to the inverse problem for gravitational-wave bursts,” Phys. Rev. D 40, 3884–3938 (1989).
  • Wen and Schutz (2005) Linqing Wen and Bernard F Schutz, “Coherent network detection of gravitational waves: the redundancy veto,” Classical and Quantum Gravity 22, S1321–S1335 (2005).
  • Ajith et al. (2006) P Ajith, M Hewitson,  and I S Heng, “Null-stream veto for two co-located detectors: implementation issues,” Classical and Quantum Gravity 23, S741–S749 (2006).
  • Chatterji et al. (2006) Shourov Chatterji, Albert Lazzarini, Leo Stein, Patrick J. Sutton, Antony Searle,  and Massimo Tinto, “Coherent network analysis technique for discriminating gravitational-wave bursts from instrumental noise,” Physical Review D 74 (2006), 10.1103/physrevd.74.082005.
  • Sutton et al. (2010) Patrick J Sutton, Gareth Jones, Shourov Chatterji, Peter Kalmus, Isabel Leonor, Stephen Poprocki, Jameson Rollins, Antony Searle, Leo Stein, Massimo Tinto,  and et al., “X-pipeline: an analysis package for autonomous gravitational-wave burst searches,” New Journal of Physics 12, 053034 (2010).
  • Chatziioannou et al. (2012b) Katerina Chatziioannou, Nicolás Yunes,  and Neil Cornish, “Model-independent test of general relativity: An extended post-einsteinian framework with complete polarization content,” Physical Review D 86 (2012b), 10.1103/physrevd.86.022004.
  • Necula et al. (2012) V Necula, S Klimenko,  and G Mitselmakher, “Transient analysis with fast wilson-daubechies time-frequency transform,” Journal of Physics: Conference Series 363, 012032 (2012).
  • Littenberg and Cornish (2015) Tyson B. Littenberg and Neil J. Cornish, “Bayesian inference for spectral estimation of gravitational wave detector noise,” Physical Review D 91 (2015), 10.1103/physrevd.91.084034.
  • Creighton and Anderson (2011) Jolien D. E. Creighton and Warren G. Anderson, Gravitational-wave physics and astronomy: an introduction to theory, experiment and data analysis (Wiley, 2011).
  • Demortier (2007) Luc Demortier, “P values and nuisance parameters,” in PHYSTAT-LHC Workshop on Statistical Issues for LHC Physics (2007) pp. 23–33.
  • Davison and Hinkley (1997) A. C. Davison and D. V. Hinkley, Bootstrap Methods and their Application, Cambridge Series in Statistical and Probabilistic Mathematics (Cambridge University Press, 1997).
  • Cahillane et al. (2017) Craig Cahillane, Joe Betzwieser, Duncan A. Brown, Evan Goetz, Evan D. Hall, Kiwamu Izumi, Shivaraj Kandhasamy, Sudarshan Karki, Jeff S. Kissel, Greg Mendell,  and et al., “Calibration uncertainty for advanced ligo’s first and second observing runs,” Physical Review D 96 (2017), 10.1103/physrevd.96.102001.
  • Accadia et al. (2010) T Accadia, F Acernese, F Antonucci, S Aoudia, K G Arun, P Astone, G Ballardin, F Barone, M Barsuglia, Th S Bauer,  and et al., “Virgo calibration and reconstruction of the gravitationnal wave strain during vsr1,” Journal of Physics: Conference Series 228, 012015 (2010).
  • Vitale et al. (2012) Salvatore Vitale, Walter Del Pozzo, Tjonnie G. F. Li, Chris Van Den Broeck, Ilya Mandel, Ben Aylott,  and John Veitch, “Effect of calibration errors on bayesian parameter estimation for gravitational wave signals from inspiral binary systems in the advanced detectors era,” Physical Review D 85 (2012), 10.1103/physrevd.85.064034.
  • Feroz and Hobson (2008) F. Feroz and M. P. Hobson, “Multimodal nested sampling: an efficient and robust alternative to markov chain monte carlo methods for astronomical data analyses,” Monthly Notices of the Royal Astronomical Society 384, 449–463 (2008).
  • Feroz et al. (2009) F. Feroz, M. P. Hobson,  and M. Bridges, “Multinest: an efficient and robust bayesian inference tool for cosmology and particle physics,” Monthly Notices of the Royal Astronomical Society 398, 1601–1614 (2009).
  • Feroz et al. (2019a) Farhan Feroz, Michael P. Hobson, Ewan Cameron,  and Anthony N. Pettitt, “Importance nested sampling and the multinest algorithm,” The Open Journal of Astrophysics 2 (2019a), 10.21105/astro.1306.2144.
  • Feroz et al. (2019b) Farhan Feroz, Matt Pitkin,  and Will Handley, “Multinest,” https://github.com/farhanferoz/MultiNest (2019b).
  • Husa et al. (2016) Sascha Husa, Sebastian Khan, Mark Hannam, Michael Pürrer, Frank Ohme, Xisco Jiménez Forteza,  and Alejandro Bohé, “Frequency-domain gravitational waves from nonprecessing black-hole binaries. i. new numerical waveforms and anatomy of the signal,” Phys. Rev. D 93, 044006 (2016).
  • Khan et al. (2016) Sebastian Khan, Sascha Husa, Mark Hannam, Frank Ohme, Michael Pürrer, Xisco Jiménez Forteza,  and Alejandro Bohé, “Frequency-domain gravitational waves from nonprecessing black-hole binaries. ii. a phenomenological model for the advanced detector era,” Phys. Rev. D 93, 044007 (2016).
  • Hannam et al. (2014) Mark Hannam, Patricia Schmidt, Alejandro Bohé, Leïla Haegel, Sascha Husa, Frank Ohme, Geraint Pratten,  and Michael Pürrer, “Simple model of complete precessing black-hole-binary gravitational waveforms,” Phys. Rev. Lett. 113, 151101 (2014).
  • Abbott et al. (2020b) B. P. Abbott, R. Abbott, T. D. Abbott, S. Abraham, F. Acernese, K. Ackley, C. Adams, V. B. Adya, C. Affeldt,  and et al., “Prospects for observing and localizing gravitational-wave transients with advanced ligo, advanced virgo and kagra,” Living Reviews in Relativity 23 (2020b), 10.1007/s41114-020-00026-9.
  • Sivia and Skilling (2006) D. Sivia and J. Skilling, Data Analysis: A Bayesian Tutorial, Oxford science publications (OUP Oxford, 2006).
  • Will and Eardley (1977) C. M. Will and D. M. Eardley, “Dipole gravitational radiation in Rosen’s theory of gravity: observable effects in the binary system PSR 1913+16.” ApJ 212, L91–L94 (1977).
  • Will and Zaglauer (1989) Clifford M. Will and Helmut W. Zaglauer, “Gravitational Radiation, Close Binary Systems, and the Brans-Dicke Theory of Gravity,” ApJ 346, 366 (1989).
  • Will (1977) C. M. Will, “Gravitational radiation from binary systems in alternative metric theories of gravity: dipole radiation and the binary pulsar.” ApJ 214, 826–839 (1977).
  • Roy and Vetterli (2007) Olivier Roy and Martin Vetterli, “The effective rank: A measure of effective dimensionality,” in 2007 15th European Signal Processing Conference (2007) pp. 606–610.
  • Akutsu et al. (2020) T. Akutsu et al., “Overview of kagra: Detector design and construction history,”  (2020), arXiv:2005.05574 [physics.ins-det] .
  • Aso et al. (2013) Yoichi Aso, Yuta Michimura, Kentaro Somiya, Masaki Ando, Osamu Miyakawa, Takanori Sekiguchi, Daisuke Tatsumi,  and Hiroaki Yamamoto (The KAGRA Collaboration), “Interferometer design of the kagra gravitational wave detector,” Phys. Rev. D 88, 043007 (2013).
  • Iyer et al. (2012) Bala Iyer, Tarun Souradeep, CS Unnikrishnan, Sanjeev Dhurandhar, Sendhil Raja,  and Anand Sengupta, “Ligo-india, proposal of the consortium for indian initiative in gravitational-wave observations (indigo),” LIGO DCC  (2012).
  • Punturo et al. (2010) M Punturo et al., “The third generation of gravitational wave observatories and their science reach,” Classical and Quantum Gravity 27, 084007 (2010).
  • Hild et al. (2011) S Hild et al., “Sensitivity studies for third-generation gravitational wave observatories,” Classical and Quantum Gravity 28, 094013 (2011).
  • Abbott et al. (2017c) B P Abbott, R Abbott, T D Abbott, M R Abernathy, K Ackley, C Adams, P Addesso, R X Adhikari, V B Adya, C Affeldt,  and et al., “Exploring the sensitivity of next generation gravitational wave detectors,” Classical and Quantum Gravity 34, 044001 (2017c).
  • Reitze et al. (2019) David Reitze et al., “Cosmic explorer: The u.s. contribution to gravitational-wave astronomy beyond ligo,”  (2019), arXiv:1907.04833 [astro-ph.IM] .
  • Guennebaud et al. (2010) Gaël Guennebaud, Benoît Jacob, et al., “Eigen v3,” http://eigen.tuxfamily.org (2010).
  • Frigo and Johnson (2005) M. Frigo and S. G. Johnson, “The design and implementation of fftw3,” Proceedings of the IEEE 93, 216–231 (2005).
  • Gough (2009) Brian Gough, GNU Scientific Library Reference Manual - Third Edition, 3rd ed. (Network Theory Ltd., 2009).
  • The HDF Group (2000-2010) The HDF Group, “Hierarchical data format version 5,”  (2000-2010).
  • Gorski et al. (2005) K. M. Gorski, E. Hivon, A. J. Banday, B. D. Wandelt, F. K. Hansen, M. Reinecke,  and M. Bartelmann, “Healpix: A framework for high‐resolution discretization and fast analysis of data distributed on the sphere,” The Astrophysical Journal 622, 759–771 (2005).
  • LIGO Scientific Collaboration (2018) LIGO Scientific Collaboration, “LIGO Algorithm Library - LALSuite,” free software (GPL) (2018).
  • 416 (2007) “Matplotlib: A 2d graphics environment,” Computing in Science Engineering 9, 90–95 (2007).
  • Harris et al. (2020) Charles R. Harris, K. Jarrod Millman, St’efan J. van der Walt, Ralf Gommers, Pauli Virtanen, David Cournapeau, Eric Wieser, Julian Taylor, Sebastian Berg, Nathaniel J. Smith, Robert Kern, Matti Picus, Stephan Hoyer, Marten H. van Kerkwijk, Matthew Brett, Allan Haldane, Jaime Fern’andez del R’ıo, Mark Wiebe, Pearu Peterson, Pierre G’erard-Marchant, Kevin Sheppard, Tyler Reddy, Warren Weckesser, Hameer Abbasi, Christoph Gohlke,  and Travis E. Oliphant, “Array programming with NumPy,” Nature 585, 357–362 (2020).
  • M. Farr et al. (2014) Will M. Farr, Benjamin Farr,  and Tyson Littenberg, “Modelling calibration errors in cbc waveforms,” LIGO DCC  (2014).
  • Collaboration and Collaboration (2019) LIGO Scientific Collaboration and Virgo Collaboration, “Calibration uncertainty envelope release for gwtc-1,” LIGO DCC  (2019).