Machine-enhanced CP-asymmetries in the electroweak sector
Abstract
The violation of charge conjugation (C) and parity (P) symmetries are a requirement for the observed dominance of matter over antimatter in the Universe. As an established effect of beyond the Standard Model physics, this could point towards additional CP violation in the Higgs-gauge sector. The phenomenological footprint of the associated anomalous couplings can be small, and designing measurement strategies with the highest sensitivity is therefore of the utmost importance in order to maximise the discovery potential of the Large Hadron Collider (LHC). There are, however, very few measurements of CP-sensitive observables in processes that probe the weak-boson self-interactions. In this article, we study the sensitivity to new sources of CP violation for a range of experimentally-accessible electroweak processes, including production, production via photon fusion, electroweak production, electroweak production, and electroweak production. We study simple angular observables as well CP-sensitive observables constructed using the outputs of machine-learning (ML) algorithms. We find that the ML-constructed CP-sensitive observables improve the sensitivity to CP-violating effects by up to a factor of five, depending on the process. We also find that inclusive and electroweak production have the potential to set the best possible constraints on certain CP-odd operators in the Higgs-gauge sector of dimension-six effective field theories.
I Introduction
Ten years after the Higgs boson discovery at CERN ATLAS:2012yve ; CMS:2012qbp , the search for a more complete picture of particle physics than the Standard Model (SM) continues. With mounting pressure on traditionally-motivated new-physics scenarios, measurements and searches increasingly aim to produce model-independent constraints, chiefly through the application of effective field theory methods Weinberg:1978kz . Such approaches imply a plethora of ad hoc anomalous interactions between SM particles and suggest a wide range of possible physics analyses at the Large Hadron Collider (LHC), even when a weak doublet character of the Higgs field is assumed Grzadkowski:2010es ; Baak:2014ora . In contrast, a range of astrophysical and cosmological facts have been established that cannot be explained by the SM alone. These highlight particular subsectors of the EFT deformations as particularly motivated BSM candidates for independent investigations. For instance, there is insufficient CP violation in the SM to explain the observed dominance of matter over anti-matter in the Universe, which could indicate the need for additional CP-violating gauge-Higgs interactions.
The Standard Model Effective Field Theory (SMEFT) extends the SM Lagrangian with CP-odd dimension-six operators, i.e.
(1) |
The Wilson coefficients, , specify the strength of the anomalous interactions that are induced, and is considered as the cut-off scale for the effective theory. Of particular interest are operators that affect the interactions between the Higgs boson and the electroweak bosons, or the self-interactions of the weak bosons, namely
(2) |
and are the field strengths, the tilded quantities refer to the dual field strengths, are the Pauli matrices, and denotes the Higgs doublet field. These interactions not only fully parameterise additional sources of weak CP-violating gauge-Higgs interactions, but they also form a closed set under the dimension-six renormalisation group flow Alonso:2013hga ; Grojean:2013kd ; Englert:2014cva so that their constraints form a theoretically consistent subset of CP violation for matching calculations (e.g. these interactions fully parameterise a broad class of lepton extensions of the SM Bakshi:2021ofj ; Naskar:2022rpg , for a more general discussion see also Degrande:2021zpv ).
The beyond-the-SM matrix-element for a given process following Eq. (1) is given by
(3) |
The dimension-six amplitude, , arises from the interactions induced by the operators of Eq. (2), whereas the SM amplitude, , arises from . The interference between the SM amplitude and the dimension-six amplitude induces asymmetries in appropriately-constructed (C)P-odd observables.
In the Higgs sector, searches for new sources of CP-violation have received considerable attention by the LHC experiments in recent years, with particular emphasis on constructing CP-sensitive observables ATLAS:2018hxb ; ATLAS:2020wny ; ATLAS:2021pkb ; CMS:2021sdq ; ATLAS:2016ifi ; CMS:2019jdw ; ATLAS:2020evk ; CMS:2021nnc . Efforts to maximise the sensitivity are necessary and underway, and observables constructed using matrix-element information are part of the existing analysis strategy ATLAS:2016ifi ; CMS:2019jdw ; ATLAS:2020evk ; CMS:2021nnc . Furthermore, machine-learning (ML) approaches have been proposed as an additional or alternative way to improve sensitivity to the asymmetries caused by new sources of CP violation Brehmer:2017lrt ; Gritsan:2020pib ; Bortolato:2020zcg ; Barman:2021yfh ; Davis:2021tiv ; Bhardwaj:2021ujv . Specifically, these ML approaches can be used to design highly-sensitive physics analyses when the CP-violating effects are difficult to observe. The machine-learning methods are also relatively easy to implement when compared to matrix-element-based techniques.
In comparison to the Higgs sector, surprisingly little attention has been paid to constructing CP-sensitive observables in processes sensitive to weak-boson self-interactions. This is despite the fact that the CP-odd operators in Eq. (2) induce anomalous weak-boson self-interactions and can be probed using measurements of diboson production and weak-boson fusion/scattering. The ATLAS measurement of a CP-sensitive observable in electroweak production ATLAS:2020nzk provided world-leading linearised constraints on and limits on that are competitive with those obtained from the Higgs sector. These constraints were derived using a simple angular observable, the rapidity-ordered azimuthal angle between the two jets (originally proposed in Ref. Plehn:2001nj ), without attempting to use the more sophisticated techniques based on matrix-element information or machine-learning algorithms. Similarly, it was recently proposed that measurements of CP-sensitive observables in production would provide even better sensitivity to the operator than was achieved in the electroweak analysis DasBakshi:2020ejz ; Biekotter:2021int . Again, that projection was based on simple angular observables and did not utilise matrix-element information or machine-learning algorithms.
In this paper, we use machine-learning algorithms to construct CP-sensitive observables for five processes that are sensitive to weak-boson self-interactions: electroweak production, inclusive production, electroweak production, electroweak production, and photon-induced production (). We have two goals in this study. The primary goal is to motivate new analyses at the LHC experiments in order to address the paucity of measurements of CP-sensitive observables in the electroweak sector. A secondary goal is to test the applicability of the neural-net based method developed in Ref. Bhardwaj:2021ujv when constructing CP-sensitive observables in a wider range of processes.
The layout of the paper is as follows. In Sec. II, we give a brief introduction of our simulation framework, where we also provide an overview of the fiducial search regions used for this study. After a discussion of CP-sensitive observables in Sec. III, we outline the analysis selection and limit setting procedure in Sec. IV. Section V is devoted to the discussion of our results; we summarise and conclude in Sec. VI.
II Simulation framework
Throughout, we use MadGraph5_aMC@NLO Alwall:2014hca which interfaced with Pythia8 Sjostrand:2007gs ; Sjostrand:2014zea to simulate events at leading-order precision in QCD. We employ the SmEftSim Brivio:2017btx ; Brivio:2020onw implementation to model the effective interactions of Eq. (2) via the Ufo Degrande:2011ua interface. We limit our analysis to interference effects of Eq. (3) and the simulated events are produced at . “Squared” CP-even dimension-6 effects will not affect the discrimination that we study below and would only change the normalisation of the cross sections as part of the limit setting. Normalisation modifications do also arise from the CP-even counter parts of Eq. (2) amongst other SMEFT interactions. Hence, limiting ourselves to interference effects provides not only a conservative estimate of the CP-sensitivity reach below, but also targets “genuine” CP-violation through designing tell-tale phenomenological discriminators through (ML-generalised) asymmetries. By attributing CP violation to the hard scattering matrix element, we also implicitly assume SM-like decays, hadronisation etc.
We use the NNPDF30NLO (NNPDF23LO) parton distribution functions Ball:2012cx and the default set of parameters that define the Pythia8 setup for parton showering, hadronisation and underlying event activity. For weak-boson fusion and weak-boson scattering processes, it is known that the default parton shower scheme produces too much quark and gluon radiation ATLAS:2019hoc ; Hoche:2021mkv . For EW production, which eventually will be analysed in a fiducial region that rejects events with additional jet activity (see Sec. IV), we simulate the SM and interference contributions using the dipole-recoil scheme for the initial-state radiation. For electroweak (EW) and EW production, we simulate the SM and interference contributions using the default shower scheme and rescale the sample weights by the ratio , where is the SM cross section in the fiducial region of the analysis and ‘dipole’ or ‘default’ label the shower scheme used to generate the sample. In this latter case, we have checked that the cross section ratio remains approximately flat across the kinematic variables that are of interest in the analysis.
III CP-sensitive observables
III.1 Simple angular observables
CP-sensitive observables can be easily constructed using the difference in azimuthal angle between two final particles, i.e.
(4) |
where and are ordered in rapidity such that .
The signed azimuthal angle between the jets, , is clearly P-odd, and has traditionally been used to search for CP-violation in measurements sensitive to weak boson fusion Plehn:2001nj . We therefore utilise for our analysis of electroweak production, electroweak production, and electroweak production. For the inclusive , and electroweak processes, the signed azimuthal angles between charged leptons and photons, and , can be constructed as CP-sensitive observables.
For the electroweak process, there are four charged leptons in the final state. CP-violating effects can be probed using the variable Bolognesi:2012mm ; Gritsan:2016hjl defined by
(5) |
which measures the signed angle between the decay planes of the two bosons. Here, the normal vectors to the -boson decay planes are defined as
(6) |
where represents the momentum of the lepton (or antilepton) that originates from the decay . The is the corresponding momentum of the .
III.2 Observables constructed using neural networks
CP-sensitive observables can also be constructed using neural networks Bhardwaj:2021ujv . Specifically, for a given interference contribution, the event sample can be divided into positively-weighted and negatively-weighted events. The neural network can then be trained to separate the two classes in a binary classification. The SM prediction can be included in the training to simultaneously optimise the separation of the interference contributions from the SM contribution. This is referred to as a multiclass model. We mainly focus on the use of multiclass models in this article.
The CP-sensitive observables can be defined on an event-by-event basis using the trained models, i.e.
(7) |
where and are the probabilities that an event is a positively-weighted or negatively-weighted event, respectively. For a binary classification, . For a multiclass model, , where is the probability that an event is a SM event.
We use TensorFlow 2.3.0 Abadi:2016kic to train the neural networks. The input variables are the transverse momentum (), pseudorapidity (), and azimuthal angle of each visible particle that defines the final state (i.e. leptons, photons, and jets). Lepton flavour and charge are also included, though found to add little sensitivity. In addition, the magnitude () and angle () of the missing transverse momentum are included for production. The choice of hyperparameters is optimised for each network and obtained using the Keras suite keras ; omalley2019kerastuner . The optimisation included the number of layers, the number of nodes, the learning rate and the batch size. A data augmentation procedure was used to prevent the networks learning features that arise from statistical fluctuations. In this procedure, each event is used twice in the training, once with the default input variables and once with a CP-operator applied to each of the input variables. The event weight is multiplied by -1 for the CP-flipped events in the interference sample.
IV Analysis methodology
For each process, we apply the selection cuts used in a recent ATLAS or CMS analysis and, where possible, validate our event generation by comparing the SM fiducial cross section that we obtain from our simulated samples to the theoretical values quoted in the relevant publication. Table 1 presents our fiducial cross sections, which are in good agreement with those reported in the literature after considering differences such as leading-order predictions versus next-to-leading order predictions (in particular in production these can be sizeable Ohnemus:1992jn ; Baur:1993ir ).111For EW production, we only generate the final state and our cross section is therefore a factor of two smaller than the theoretical prediction used in Ref. ATLAS:2020nlt .
Process | Fiducial region | SM cross section | SM yield normalisation |
---|---|---|---|
(limit setting only) | |||
EW () | Ref. ATLAS:2020nzk (signal region) | fb | 3712 ATLAS:2020nzk |
inclusive () | Ref. CMS:2021cxr (section 6) | 294 fb | 102191 CMS:2021cxr |
EW () | Ref. ATLAS:2020nlt | fb | 22.4 ATLAS:2020nlt |
EW () | Ref. ATLAS:2019cbr | fb | 60 ATLAS:2019cbr |
elastic () | Ref. ATLAS:2020iwi (signal region) | fb | 174 ATLAS:2020iwi |
To determine the sensitivity, we construct confidence intervals from each of the CP-odd observables of Sec. III by means of a binned likelihood function
(8) |
Here, labels the bins with denoting the expected number of events in bin assuming the SM-only hypothesis. is the predicted number of events given the SMEFT hypothesis, which is derived from a particular parameter choice in Wilson coefficient space . The likelihood of Eq. (8) is converted to a confidence level via a profile-likelihood test statistic Feldman:1997qc . Through Wilks’ theorem wilks1938 , we can assume this statistic to be distributed according to a distribution with one degree of freedom, from which the 95% confidence level can be obtained. The likelihood function does not include terms that account for systematic uncertainties. This is justified because the constraints are driven by asymmetries in the CP-odd observables, whereas the impact of systematic uncertainties on those distributions would be symmetric. The potential impact of systematic uncertainties is investigated futher in Sec. V.4.
The and in Eq. (8) are directly derived from the simulated samples, after applying normalisation factors to convert the predicted cross sections into event yields. The normalisation factor is defined for each process such that the SM prediction reproduces the number of SM events that were predicted in the relevant fiducial regions of the ATLAS or CMS analysis (see table 1). This normalisation factor is applied to both the SM signal prediction and the interference contribution. For the EW process, the effect of non-EW backgrounds is included in the calculation of and by generating a QCD sample at and normalising that sample to the observed signal-subtracted number of events observed in Ref. ATLAS:2020nzk . For all other processes, the effects of backgrounds are included in the calculation of and by scaling the SM signal sample to reproduce the number of background events reported in the relevant experimental analysis. This latter approach assumes the kinematic properties of the SM signal and the background are sufficiently similar.
The choice of binning can impact the final confidence level and it is therefore convenient to optimise the binning is such a way that the sensitivity is maximised. Typically about 20 bins are chosen per distribution, with bins subsequently merged to ensure that there is at least one SM event per bin. Post-fit, we check that the theoretical prediction remains positive at the values of the Wilson coefficient that correspond to the 95% CL, and merge bins if this is not the case.
V Results
V.1 EW and inclusive production
The differential cross section for EW production as a function of is shown in Fig. 1. Figure 2 shows the differential cross section for inclusive production as a function of . In both cases, the SM prediction is shown in addition to the interference contributions induced by the and operators, with Wilson coefficients set to TeV-2. As expected, the CP-even SM prediction is symmetric about , = 0, whereas the CP-odd interference contributions are all asymmetric with an integral of zero. For these simple angular variables, EW production is most sensitive to the operator, whereas inclusive production is roughly equally sensitive to both operators.


The differential cross section for EW production as a function of the CP-odd observable produced by a multiclass neural network is shown in Fig. 3. The corresponding distribution for inclusive production is shown in Fig. 4. For each operator and process, a neural network is trained to distinguish between the positively-weighted interference contribution, the negatively-weighted interference contribution, and the contribution from the SM. This leads to a different SM distribution depending on the operator being tested. The interference contributions are presented for TeV-2 and TeV-2.


For both processes, the network is effective at separating the positively-weighted and negatively-weighted interference contributions, with the positively-weighted events peaking close to and the negatively-weighted events at . The SM contribution is symmetric and peaked at output values very close to zero. In the case of inclusive production, it is clear that the network has improved the sensitivity over the use of alone, because the interference contributions are much larger relative to the SM prediction.
Process | CP-odd observable | [TeV-2] | [TeV-2] |
---|---|---|---|
[-1.05,1.05] | [-0.081,0.081] | ||
EW | (multi-class) | [-0.83,0.83] | [-0.047,0.047] |
vs | [-0.99,0.99] | [-0.074,0.074] | |
vs | [-1.04,1.04] | [-0.066,0.066] | |
[-0.165,0.165] | [-0.255,0.255] | ||
inclusive | (multi-class) | [-0.049,0.049] | [-0.056,0.056] |
vs | [-0.154,0.154] | [-0.219,0.219] | |
vs | [-0.163,0.163] | [-0.206,0.206] |
To quantify the sensitivity of the different observables, we use the profile-likelihood test outlined in Sec. IV. The constraints obtained for each Wilson coefficient are shown in Tab. 2. For EW production, the expected 95% confidence intervals obtained from fits to the distribution are similar to those reported by the ATLAS Collaboration ATLAS:2020nzk , with up to 30% improvement (for ) that is due to (i) using more finely binned distributions222This is a consequence of the ATLAS measurement being unfolded and requiring more events per bin of the distribution. and (ii) the small effect of missing systematic uncertainties. For inclusive production, the 95% confidence intervals obtained from fits to the distribution are similar to those reported in Ref. Degrande:2021zpv . The good agreement between our results and those previously reported indicates that our simulation (and subsequent event normalisation) can be used to assess the sensitivity to the different CP-sensitive observables.
The CP-sensitive observables constructed from the output of a multiclass neural network provide much better sensitivity than the simple angular observables alone, with 95% confidence intervals reduced by a factor of up to two for EW production and by a factor of 3-5 for inclusive production, depending on the specific dimension-six operator. This improvement is similar to that seen in studies of Higgs boson final states in Ref. Bhardwaj:2021ujv . Using the observable, the EW and inclusive processes provide similar sensitivity to CP-violating effects induced by the operator. The best current experimental constraints on this operator using linearised EFT contributions are those reported by ATLAS using the distribution in EW production, i.e. , a factor of two less sensitive. Thus the use of neural networks to construct the CP-sensitive observable and expanding the CP-sensitive measurements to the inclusive final state will dramatically improve the sensitivity in the future. For the operator, the constraints obtained from inclusive production will be a factor of 17 more precise than can be obtained from measurement of EW production. This is an important result, because the ATLAS collaboration measured using the distribution in EW production, with the SM prediction being outside of the 95% confidence interval.333This slight asymmetry is not considered to be significant when considering the global -value ATLAS:2020nzk . The measurements of CP-sensitive observables such as for inclusive production are therefore critical in searches for CP-violating effects in the Higgs-gauge sector.
The improved constraints obtained with the CP-sensitive observables constructed using neural networks can be investigated using feature importance techniques. In this approach, the importance of each input variable to the trained network is determined, by evaluating the decrease in accuracy that occurs when the values of the input variable are randomly interchanged in the dataset. An example is shown in Fig. 5 for the multiclass network trained on EW events, with the interference contribution produced by the operator. As expected, the most important variables are the azimuthal angles and rapidities of the two jets, which are needed for the network to learn the observable. However, the azimuthal angles and rapidities of the two leptons are also important, which we trace back to an underlying asymmetry in the observable, that the network is also exploiting to distinguish between positive and negative interference effects. Finally, the transverse momentum of the two leptons is also important in the multiclass network. However, in our cross check using a binary network, the lepton transverse momentum is not considered to be important. This means that the multiclass network is using the lepton transverse momentum to distinguish between the SM prediction and the interference contributions.

We can use the information from the feature importance tests to construct double-differential distributions that are more sensitive than the simple angular variables alone. The 95% confidence intervals obtained from (i) a two-dimensional fit to and , as well as (ii) a two-dimensional fit to and , are also shown in Table. 2. For both operators, the constraints are improved using a two-dimensional fit when compared to the use of the simple angular variable alone, but remain inferior to those obtained using a fit to the observable.
For inclusive production, the feature importance tests highlight the importance of the rapidities and azimuthal angles of the lepton and the photon, as expected. However, the azimuthal angle of the missing transverse momentum vector is also found to be important, along with the magnitude of the missing transverse momentum and the transverse momenta of the lepton and photon. The 95% confidence intervals were determined for a variety of two-dimensional fits to double-differential distributions; the constraints obtained using a two-dimensional fit to and is shown in Table. 2, those obtained from a two-dimensional fit to and the difference between the azimuthal angles of the lepton and the missing transverse momentum vector. Again, the constraints on the Wilson coefficients are improved with respect to the simple angular variable alone, but do not recover the full sensitivity of the CP-sensitive observable constructed from the output of the multiclass network.
In summary, these two examples demonstrate that CP-sensitive observables constructed using machine-learning techniques can dramatically improve the LHC experiments’ sensitivity to CP-violating effects in dimension-six effective field theories.
V.2 EW , EW and production
In this section, we turn our attention to the processes sensitive to the interaction between four electroweak bosons, namely electroweak production, electroweak production and production. These scattering processes have a significantly reduced SM cross section compared to inclusive and EW production as shown in Tab. 1, and we can therefore expect these processes to play a less relevant role in constraining the dimension-six operators investigated in this article. Nevertheless, the applicability of machine-learning methods in constructing CP-sensitive observables can still be investigated for each process, which could prove important in the longer term if the LHC experiments start to search for CP-violating effects predicted by dimension-eight effective field theories where multi-gauge boson interactions provide complementary tests to, e.g., -related interactions (see Larios:2000ni ; Grzadkowski:2016lpv ; Belusca-Maito:2017iob ; Corbett:2017ecn ; Hernandez-Juarez:2021mhi ).
The 95% confidence intervals obtained for the Wilson coefficients are presented in Tab. 3, where for each process and operator the constraints obtained using simple angular observables are compared to the constraints obtained using the observable constructed from the output of a multiclass network. These results are obtained by following all of the steps outlined in the previous sections regarding the ML training and limit setting. In general, many of the constraints in Tab. 3 indicate general insensitivity to CP violation as predicted by dimension-six effective field theory and are much less sensitive than those obtained for inclusive production and EW production in Sec. V.1. However, it is also clear that employing the ML-driven observables leads to significant improvement in sensitivity when compared to the simple angular observables alone.
Process | CP-odd observable | [TeV-2] | [TeV-2] | [TeV-2] | [TeV-2] |
---|---|---|---|---|---|
[-3.7,3.7] | [-43,43] | - | - | ||
EW | [-51,51] | [-64,64] | - | - | |
(multi-class) | [-3.0,3.0] | [-12,12] | - | - | |
- | - | [-35,34] | [-1.83,1.83] | ||
EW | - | - | [-105,105] | [-14,14] | |
(multi-class) | - | - | [-17,17] | [-0.76,0.76] | |
[-32,32] | [-14,14] | [-48,48] | [-19,19] | ||
(multi-class) | [-11,11] | [-13,13] | [-43,43] | [-11,11] |
In the case of EW production, the observable is sufficiently sensitive to the CP-violating effects predicted operator and experimental measurements of for this process would provide useful additional information in a global fit. Unsurprisingly, the sensitivity is mainly driven by the distribution, the differential cross section for which is shown in Fig. 6. However, the fits to the ML-constructed observable improves the constraints by a factor of about 2.5. Figure 7 shows the differential cross section as a function of , where the asymmetry is clearly enhanced with respect to the SM prediction when compared to the distribution.


V.3 Extrapolation to High Luminosity LHC
The constraints obtained on the Wilson coefficients will improve further in the High-Luminosity (HL) phase of the LHC, where the expected integrated luminosity will reach 3 ab-1 per experiment. In Table 4 we present the constraints obtained using the variable constructed from a multiclass model. For these results, the binning was reoptimised, following the procedure outlined in Sec. IV and allowing a larger number of bins to reflect the increased event yields expected at the HL-LHC. In general we find that allowing finer binning does not lead to an improvement in the 95% confidence intervals obtained for each Wilson coefficient. The constraints therefore improve by approximately the ratio , where is the integrated luminosity of the dataset (at HL-LHC or LHC as appropriate). This scaling also shows that the sensitivity is dominated by an inclusive selection channeled into the NN observable, without targeting exclusive phase space regions that become populated at the HL-LHC. Systematic uncertainties of the current LHC runs (see below) will therefore qualitatively carry over to the HL-LHC.
Process | [TeV-2] | [TeV-2] | [TeV-2] | [TeV-2] |
---|---|---|---|---|
EW | [-0.18,0.18] | - | - | [-0.010,0.010] |
inclusive | [-0.010,0.010] | - | - | [-0.012,0.012] |
EW | [-0.66,0.66] | [-2.4,2.4] | - | - |
EW | - | - | [-3.9,3.9] | [-0.19,0.19] |
[-2.5,2.5] | [-2.7,2.7] | [-9.6,9.6] | [-2.5,2.5] |
V.4 Impact of systematic uncertainties
The constraints on the Wilson coefficients presented in the previous sections have been estimated without considering the effect of systematic uncertainties. Although such uncertainties can be large, they will be symmetric as function of a given CP-sensitive observable. In the profile likelihood, therefore, any pull on the associated nuisance parameter to account for asymmetric deviations from the SM prediction would improve the agreement between data and theory in only half of the bins in the distribution, with poorer agreement obtained in the other half of the bins. This should reduce the impact of systematic uncertainties on the 95% confidence intervals obtained for CP-odd operators.
To demonstrate this effect, we introduce systematic uncertainties in the likelihood function for EW production and inclusive production. For each of these processes, the likelihood function is modified to include a single Gaussian-constrained nuisance parameter, which is intended to account for the systematic uncertainties in the measurements and the theoretical uncertainties in the SM predictions. The dominant uncertainty for EW production arises from the theoretical modelling of the non-EW process, with sizeable contributions also from experimental (jet-energy-scale) uncertainties vbfzdata . The total uncertainty is about 15% uncertainty at low- and 25% at high-. A weak dependence is observed as a function of the -boson transverse momentum. For inclusive production, the combined experimental and theoretical uncertainty rises from 7% at low photon transverse momentum to about 25% at high transverse momentum wydata . However, the uncertainty is relatively flat as a function of angular observables.
The impact of the systematic uncertainties is first tested for EW production, with the Wilson coefficients constrained using the distribution; we find that the 95% confidence intervals are increased by 1-2% with respect to those presented in Table 2. This is similar to, but slightly smaller than, the effect reported by the ATLAS Collaboration, in which the systematic uncertainties increase the confidence intervals by about 4% (see the auxiliary material of Ref. ATLAS:2020nzk ). It is likely that the smaller impact in our analysis is due to using a simplified model for systematic uncertainties, i.e. that the likelihood is too constraining on the single nuisance parameter. Dropping the Gaussian constraint on the nuisance parameter leads to a slightly larger impact of systematic uncertainties, with the 95% confidence intervals increased by 2-3% with respect to Table 2.
The dependence of the systematic uncertainties as a function of , for both inclusive EW production and production, is not known and so both uniform and linear dependences are tested in the likelihoods. Again, the impact is small, with a 1% increase in the confidence intervals if the nuisance parameter has a Gaussian penalty term, and a 2-3% impact if it does not. From these studies, we conclude that systematic uncertainties will be modest in any future analysis at the LHC.
VI Conclusions
The observed matter-antimatter asymmetry in the Universe requires new physics beyond the Standard Model (SM) and signposts specific areas for phenomenological scrutiny. Specifically, new sources of CP violation are required. Searches for CP violation at the LHC experiments increasingly aim to produce model-independent constraints through the application of effective field theories. Of particular interest is the possibility of CP-violating effects in the Higgs-gauge sector. Although the Higgs sector has received considerable attention along these lines recently, the search for CP violation using processes sensitive to the weak-boson self-interactions is much less explored. These processes however can provide similar sensitivity as the measurements in the Higgs sector, given the gauge structure of the relevant operators in the Standard Model Effective Field Theory (SMEFT). With this in mind, we have investigated a range of purely-electroweak processes in this article, which are sensitive to the underlying nature of the weak-boson self-interactions.
Deviations from the SM are likely to be small, however, showing up predominantly as asymmetries in CP-sensitive observables. Enhancing the new physics reach of the LHC is therefore directly related to formulation of most sensitive BSM discriminators. ‘Traditional’ angular observables, albeit theoretically motivated, might not provide the most sensitive approaches to uncover the anomalous interactions. Many (even termed ‘optimal’) observables have been proposed, typically motivated by considering the angular decompositions of scattering processes. Their choice, however, is not unique. The appropriate choice of observables or observable combinations therefore becomes a question of optimisation that feeds into an enhanced discovery potential at the LHC. This is the natural realm of machine learning (ML), which will perform exactly the task of constructing a tailored CP-sensitive asymmetry optimised to new physics deformations that are parameterised consistently in the SMEFT.
We have employed the machine-learning techniques proposed in Ref. Bhardwaj:2021ujv to simultaneously construct the CP-sensitive observables and optimise the analysis sensitivity for each of the electroweak processes under investigation. We show that the ML-constructed CP-sensitive observables can lead to large sensitivity enhancements in searches for CP violation at the LHC. The ML algorithm (neural network) achieves this improvement using a multi-class model, which discriminates between the positive- and negative- interference contributions that describe the CP-violating effects, as well as between the interference contributions and the SM prediction.
We show that future measurements of ML-based CP-sensitive observables for inclusive production and EW production should each improve the sensitivity to CP-violating effects predicted by the operator in the effective field theory by a factor of two compared to the current best constraints. We also show that the measurement of ML-based CP-sensitive observables for inclusive process can provide a factor of five improvement in sensitivity to the operator over the use of simple angular observables alone. This particular operator in the effective field theory is difficult to constrain; the measurement and technique outlined in this paper is likely to produce the best possible sensitivity.
Although the large improvements in sensitivity are a result of proof-of-principle analyses, they highlight the huge potential that ML-constructed CP-sensitive observables can unleash at the LHC, not only presently with the Run-II and Run-III datasets, but also during the high-luminosity phase in the future.
Acknowledgements — C.E. is supported by the STFC under grant ST/T000945/1, by the Leverhulme Trust under grant RPG-2021-031, and the IPPP Associateship Scheme. A.D.P is supported by the Royal Society and STFC under grants UF160396, ST/S000925/1 and ST/W000601/1. R.H. and A.D.P are supported by the Leverhulme Trust under grant RPG-2020-004.
References
- (1) ATLAS collaboration, G. Aad et al., Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC, Phys. Lett. B 716 (2012) 1–29, [1207.7214].
- (2) CMS collaboration, S. Chatrchyan et al., Observation of a New Boson at a Mass of 125 GeV with the CMS Experiment at the LHC, Phys. Lett. B 716 (2012) 30–61, [1207.7235].
- (3) S. Weinberg, Phenomenological Lagrangians, Physica A 96 (1979) 327–340.
- (4) B. Grzadkowski, M. Iskrzynski, M. Misiak and J. Rosiek, Dimension-Six Terms in the Standard Model Lagrangian, JHEP 10 (2010) 085, [1008.4884].
- (5) Gfitter Group collaboration, M. Baak, J. Cúth, J. Haller, A. Hoecker, R. Kogler, K. Mönig et al., The global electroweak fit at NNLO and prospects for the LHC and ILC, Eur. Phys. J. C 74 (2014) 3046, [1407.3792].
- (6) R. Alonso, E. E. Jenkins, A. V. Manohar and M. Trott, Renormalization Group Evolution of the Standard Model Dimension Six Operators III: Gauge Coupling Dependence and Phenomenology, JHEP 04 (2014) 159, [1312.2014].
- (7) C. Grojean, E. E. Jenkins, A. V. Manohar and M. Trott, Renormalization Group Scaling of Higgs Operators and , JHEP 04 (2013) 016, [1301.2588].
- (8) C. Englert and M. Spannowsky, Effective Theories and Measurements at Colliders, Phys. Lett. B 740 (2015) 8–15, [1408.5147].
- (9) S. D. Bakshi, J. Chakrabortty, C. Englert, M. Spannowsky and P. Stylianou, Landscaping CP-violating BSM scenarios, Nucl. Phys. B 975 (2022) 115676, [2103.15861].
- (10) W. Naskar, S. Prakash and S. U. Rahaman, EFT Diagrammatica II: Tracing the UV origin of bosonic D6 CPV and D8 SMEFT operators, 2205.00910.
- (11) C. Degrande and J. Touchèque, A Reduced basis for CP violation in SMEFT at colliders and its application to Diboson production, 2110.02993.
- (12) ATLAS collaboration, M. Aaboud et al., Measurements of Higgs boson properties in the diphoton decay channel with 36 fb-1 of collision data at TeV with the ATLAS detector, Phys. Rev. D 98 (2018) 052005, [1802.04146].
- (13) ATLAS collaboration, G. Aad et al., Measurements of the Higgs boson inclusive and differential fiducial cross sections in the 4 decay channel at = 13 TeV, Eur. Phys. J. C 80 (2020) 942, [2004.03969].
- (14) ATLAS collaboration, G. Aad et al., Constraints on Higgs boson properties using production in 36.1 fb-1 of =13 TeV collisions with the ATLAS detector, 2109.13808.
- (15) CMS collaboration, A. Tumasyan et al., Analysis of the CP structure of the Yukawa coupling between the Higgs boson and leptons in proton-proton collisions at = 13 TeV, 2110.04836.
- (16) ATLAS collaboration, G. Aad et al., Test of CP Invariance in vector-boson fusion production of the Higgs boson using the Optimal Observable method in the ditau decay channel with the ATLAS detector, Eur. Phys. J. C 76 (2016) 658, [1602.04516].
- (17) CMS collaboration, A. M. Sirunyan et al., Constraints on anomalous couplings from the production of Higgs bosons decaying to lepton pairs, Phys. Rev. D 100 (2019) 112002, [1903.06973].
- (18) ATLAS collaboration, G. Aad et al., Test of CP invariance in vector-boson fusion production of the Higgs boson in the channel in proton-proton collisions at TeV with the ATLAS detector, Phys. Lett. B 805 (2020) 135426, [2002.05315].
- (19) CMS collaboration, A. M. Sirunyan et al., Constraints on anomalous Higgs boson couplings to vector bosons and fermions in its production and decay using the four-lepton final state, Phys. Rev. D 104 (2021) 052004, [2104.12152].
- (20) J. Brehmer, F. Kling, T. Plehn and T. M. P. Tait, Better Higgs-CP Tests Through Information Geometry, Phys. Rev. D 97 (2018) 095017, [1712.02350].
- (21) A. V. Gritsan, J. Roskes, U. Sarica, M. Schulze, M. Xiao and Y. Zhou, New features in the JHU generator framework: constraining Higgs boson properties from on-shell and off-shell production, Phys. Rev. D 102 (2020) 056022, [2002.09888].
- (22) B. Bortolato, J. F. Kamenik, N. Košnik and A. Smolkovič, Optimized probes of -odd effects in the process at hadron colliders, Nucl. Phys. B 964 (2021) 115328, [2006.13110].
- (23) R. K. Barman, D. Gonçalves and F. Kling, Machine Learning the Higgs-Top CP Phase, 2110.07635.
- (24) J. Davis, A. V. Gritsan, L. S. M. Guerra, S. Kyriacou, J. Roskes and M. Schulze, Constraining anomalous Higgs boson couplings to virtual photons, 2109.13363.
- (25) A. Bhardwaj, C. Englert, R. Hankache and A. D. Pilkington, Machine-enhanced CP-asymmetries in the Higgs sector, Phys. Lett. B 832 (2022) 137246, [2112.05052].
- (26) ATLAS collaboration, G. Aad et al., Differential cross-section measurements for the electroweak production of dijets in association with a boson in proton–proton collisions at ATLAS, Eur. Phys. J. C 81 (2021) 163, [2006.15458].
- (27) T. Plehn, D. L. Rainwater and D. Zeppenfeld, Determining the Structure of Higgs Couplings at the LHC, Phys. Rev. Lett. 88 (2002) 051801, [hep-ph/0105325].
- (28) S. Das Bakshi, J. Chakrabortty, C. Englert, M. Spannowsky and P. Stylianou, violation at ATLAS in effective field theory, Phys. Rev. D 103 (2021) 055008, [2009.13394].
- (29) A. Biekötter, P. Gregg, F. Krauss and M. Schönherr, Constraining CP violating operators in charged and neutral triple gauge couplings, Phys. Lett. B 817 (2021) 136311, [2102.01115].
- (30) J. Alwall, R. Frederix, S. Frixione, V. Hirschi, F. Maltoni, O. Mattelaer et al., The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations, JHEP 07 (2014) 079, [1405.0301].
- (31) T. Sjostrand, S. Mrenna and P. Z. Skands, A Brief Introduction to PYTHIA 8.1, Comput. Phys. Commun. 178 (2008) 852–867, [0710.3820].
- (32) T. Sjöstrand, S. Ask, J. R. Christiansen, R. Corke, N. Desai, P. Ilten et al., An introduction to PYTHIA 8.2, Comput. Phys. Commun. 191 (2015) 159–177, [1410.3012].
- (33) I. Brivio, Y. Jiang and M. Trott, The SMEFTsim package, theory and tools, JHEP 12 (2017) 070, [1709.06492].
- (34) I. Brivio, SMEFTsim 3.0 — a practical guide, JHEP 04 (2021) 073, [2012.11343].
- (35) C. Degrande, C. Duhr, B. Fuks, D. Grellscheid, O. Mattelaer and T. Reiter, UFO - The Universal FeynRules Output, Comput. Phys. Commun. 183 (2012) 1201–1214, [1108.2040].
- (36) R. D. Ball et al., Parton distributions with LHC data, Nucl. Phys. B 867 (2013) 244–289, [1207.1303].
- (37) ATLAS collaboration, G. Aad et al., Modelling of the vector boson scattering process in Monte Carlo generators in ATLAS, ATL-PHYS-PUB-2019-004 (2019) .
- (38) S. Höche, S. Mrenna, S. Payne, C. T. Preuss and P. Skands, A Study of QCD Radiation in VBF Higgs Production with Vincia and Pythia, SciPost Phys. 12 (2022) 010, [2106.10987].
- (39) S. Bolognesi, Y. Gao, A. V. Gritsan, K. Melnikov, M. Schulze, N. V. Tran et al., On the spin and parity of a single-produced resonance at the LHC, Phys. Rev. D 86 (2012) 095031, [1208.4018].
- (40) A. V. Gritsan, R. Röntsch, M. Schulze and M. Xiao, Constraining anomalous Higgs boson couplings to the heavy flavor fermions using matrix element techniques, Phys. Rev. D 94 (2016) 055023, [1606.03107].
- (41) M. Abadi et al., TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, 1603.04467.
- (42) F. Chollet, “Keras.” https://github.com/fchollet/keras, 2015.
- (43) T. O’Malley, E. Bursztein, J. Long, F. Chollet, H. Jin, L. Invernizzi et al., “Kerastuner.” https://github.com/keras-team/keras-tuner, 2019.
- (44) J. Ohnemus, Order- calculations of hadronic and production, Phys. Rev. D 47 (1993) 940–955.
- (45) U. Baur, T. Han and J. Ohnemus, QCD corrections to hadronic production with nonstandard couplings, Phys. Rev. D 48 (1993) 5140–5161, [hep-ph/9305314].
- (46) ATLAS collaboration, G. Aad et al., Observation of electroweak production of two jets and a -boson pair, 2004.10612.
- (47) CMS collaboration, A. Tumasyan et al., Measurement of W differential cross sections in proton-proton collisions at = 13 TeV and effective field theory constraints, Phys. Rev. D 105 (2022) 052003, [2111.13948].
- (48) ATLAS collaboration, M. Aaboud et al., Observation of electroweak production of a same-sign boson pair in association with two jets in collisions at TeV with the ATLAS detector, Phys. Rev. Lett. 123 (2019) 161801, [1906.03203].
- (49) ATLAS collaboration, G. Aad et al., Observation of photon-induced production in collisions at TeV using the ATLAS detector, Phys. Lett. B 816 (2021) 136190, [2010.04019].
- (50) G. J. Feldman and R. D. Cousins, A Unified approach to the classical statistical analysis of small signals, Phys. Rev. D 57 (1998) 3873–3889, [physics/9711021].
- (51) S. S. Wilks, The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses, Annals Math. Statist. 9 (1938) 60–62.
- (52) F. Larios, M. A. Perez, G. Tavares-Velasco and J. J. Toscano, Trilinear neutral gauge boson couplings in effective theories, Phys. Rev. D 63 (2001) 113014, [hep-ph/0012180].
- (53) B. Grzadkowski, O. M. Ogreid and P. Osland, CP-Violation in the and vertices at colliders in Two-Higgs-Doublet Models, JHEP 05 (2016) 025, [1603.01388].
- (54) H. Bélusca-Maïto, A. Falkowski, D. Fontes, J. C. Romão and J. a. P. Silva, CP violation in 2HDM and EFT: the vertex, JHEP 04 (2018) 002, [1710.05563].
- (55) T. Corbett, M. J. Dolan, C. Englert and K. Nordström, Anomalous neutral gauge boson interactions and simplified models, Phys. Rev. D 97 (2018) 115040, [1710.07530].
- (56) A. I. Hernández-Juárez, A. Moyotl and G. Tavares-Velasco, Contributions to () couplings from violating flavor changing couplings, Eur. Phys. J. C 81 (2021) 304, [2102.02197].
- (57) ATLAS collaboration. https://doi.org/10.17182/hepdata.94218.
- (58) CMS collaboration. https://doi.org/10.17182/hepdata.115354.