This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Position Reconstruction in the DEAP-3600 Dark Matter Search Experiment

P. Adhikari    R. Ajaj    M. Alpízar-Venegas    P.-A. Amaudruz    J. Anstey    G.R. Araujo    D.J. Auty    M. Baldwin    M. Batygov    B. Beltran    H. Benmansour    M.A. Bigentini    C.E. Bina    J. Bonatt    W.M. Bonivento    M.G. Boulay    B. Broerman    J.F. Bueno    P.M. Burghardt    A. Butcher    M. Cadeddu    B. Cai    M. Cárdenas-Montes    S. Cavuoti    M. Chen    Y. Chen    S. Choudhary    B.T. Cleveland    J.M. Corning    R. Crampton    D. Cranshaw    S. Daugherty    P. DelGobbo    K. Dering    P. Di Stefano    J. DiGioseffo    G. Dolganov    L. Doria    F.A. Duncan11footnotetext: Deceased    M. Dunford    E. Ellingwood    A. Erlandson    S.S. Farahani    N. Fatemighomi    G. Fiorillo    S. Florian    A. Flower    R.J. Ford    R. Gagnon    D. Gahan    D. Gallacher    A. Garai    P. García Abia    S. Garg    P. Giampa    A. Giménez-Alcázar    D. Goeldi    V.V. Golovko    P. Gorel    K. Graham    D.R. Grant    A. Grobov    A.L. Hallin    M. Hamstra    P.J. Harvey    S. Haskins    C. Hearns    J. Hu    J. Hucker    T. Hugues    A. Ilyasov    B. Jigmeddorj    C.J. Jillings    A. Joy    O. Kamaev    G. Kaur    A. Kemp    M. Khoshraftar Yazdi    M. Kuźniak    F. La Zia    M. Lai    S. Langrock    B. Lehnert    A. Leonhardt    J. LePage-Bourbonnais    N. Levashko    J. Lidgard    T. Lindner    M. Lissia    J. Lock    L. Luzzi    I. Machulin    P. Majewski    A. Maru    J. Mason    A.B. McDonald    T. McElroy    T. McGinn    J.B. McLaughlin    R. Mehdiyev    C. Mielnichuk    L. Mirasola    A. Moharana    J. Monroe22footnotetext: Currently at the University of Oxford, Oxford, OX1 3PU, United Kingdom    A. Murray    P. Nadeau    C. Nantais    C. Ng    A.J. Noble    E. O’Dwyer    G. Oliviéro    M. Olszewski    C. Ouellet    S. Pal    D. Papi    B. Park    P. Pasuthip    S.J.M. Peeters    M. Perry    V. Pesudo    E. Picciau    M.-C. Piro    T.R. Pollmann33footnotetext: Currently at Nikhef and the University of Amsterdam, Science Park 1098 XG, Amsterdam, The Netherlands    F. Rad    E.T. Rand    C. Rethmeier    F. Retière    I. Rodríguez García    L. Roszkowski    J.B. Ruhland    R. Santorelli    F.G. Schuckman II    N. Seeburn    S. Seth    V. Shalamova    K. Singhrao    P. Skensved    T. Smirnova    N.J.T. Smith    B. Smith    K. Sobotkiewich    T. Sonley    J. Sosiak    J. Soukup    R. Stainforth    G. Stanic    C. Stone    V. Strickland    M. Stringer    B. Sur    J. Tang44footnotetext: Currently at Sun Yat-sen University, No.135, Xingang Xi Road 510275, Guangzhou, China    E. Vázquez-Jáuregui    L. Veloce    S. Viel    B. Vyas    M. Walczak    J. Walding    M. Waqar    M. Ward    S. Westerdale    J. Willis    R. Wormington    A. Zuñiga-Reyes
(September 7, 2025)
Abstract

In the DEAP-3600 dark matter search experiment, precise reconstruction of the positions of scattering events in liquid argon is key for background rejection and defining a fiducial volume that enhances dark matter candidate events identification. This paper describes three distinct position reconstruction algorithms employed by DEAP-3600, leveraging the spatial and temporal information provided by photomultipliers surrounding a spherical liquid argon vessel. Two of these methods are maximum-likelihood algorithms: the first uses the spatial distribution of detected photoelectrons, while the second incorporates timing information from the detected scintillation light. Additionally, a machine learning approach based on the pattern of photoelectron counts across the photomultipliers is explored.

1 Introduction

A compelling array of astrophysical observations indicates that a form of non-luminous dark matter not described by current physical theories comprises about 27% of the energy density of the Universe [1]. By comparison, it appears that only about 5% of the energy density is constituted by baryonic matter described by the Standard Model of particle physics [2, 3]. Weakly interacting massive particles (WIMPs) are well-motivated dark matter candidates. Direct detection experiments try to observe the scattering of WIMPs with nuclei in massive detectors, located underground to shield them from cosmic backgrounds. The experimental WIMP-nucleus scattering signature is a low-energy deposit (\lesssim100 keV) caused by a nuclear recoil. Several experiments employing different detection media have set increasingly stringent limits on the existence of WIMPs [4].

The DEAP-3600 experiment [5], located 2 km underground at SNOLAB, utilizes liquid argon (LAr) as dark matter detection medium. So far, DEAP-3600 has collected data using a LAr mass of (3269±24)(3269\pm 24) kg [6]. The latest result from DEAP-3600 on WIMP searches is based on a 758 tonne\cdotday exposure collected in 231 live-days [7].

A detailed description of the DEAP-3600 detector can be found in [5], and a summary with the relevant information is reported here. In Figure 1, a schematic design of the detector with its most relevant parts is shown. The detector’s main component is a 5 cm thick spherical acrylic vessel (AV) with an inner radius of 85 cm that can contain up to 3600 kg of LAr. The LAr is viewed by 255 Hamamatsu R5912-HQE high quantum efficiency photomultiplier tubes (PMTs) [8] coupled with 45 cm long, 19 cm diameter acrylic light guides to the AV. The light guides provide shielding against neutrons to the LAr. Between the light guides, filler blocks made of high-density polyethylene and polystyrene contribute to neutron shielding. The inner surface of the AV is coated with a 3μm3\mu\rm{m} thick layer of tetraphenyl butadiene (TPB), deposited in situ and acting as a wavelength shifter [9]. The photon spectrum from argon scintillation is in the vacuum ultraviolet (VUV) region, peaking at 128 nm. This wavelength cannot excite the first atomic argon state, permitting light to travel through the LAr without absorption. When VUV photons reach the inner surface of the AV, they are absorbed by the TPB and re-emitted in the visible spectrum, peaking at around 420 nm. This wavelength allows transmission through the acrylic and is matched by the peak quantum efficiency of the PMTs.

LAr has unique scintillation time characteristics that allow the use of pulse-shape discrimination (PSD) [10, 11] for distinguishing electromagnetic interactions, often called electronic recoils (ER), from nuclear recoil events (NR). PSD allows the suppression of ER events by a factor better than 7.5×1097.5\times 10^{-9} [10] above the threshold used for the WIMP search. Events originating on the surface of the AV that contains the LAr are primarily due to α\alpha decays. These surface-based background events can be effectively rejected by using position reconstruction techniques that define a fiducial volume smaller than the full LAr content, thus excluding regions near the AV walls.

The AV is enclosed in a steel shell, which is immersed in a water tank acting as a veto detector. On the steel shell, 48 outward-facing PMTs are mounted for monitoring the water and vetoing cosmogenically-induced backgrounds.

The inner detector volume is accessed through a steel and acrylic neck connecting to the AV (see Figure. 1). The neck contains a cooling coil that uses liquid nitrogen to cool the gaseous argon. Radioactivity from α\alpha particles on the surfaces of the neck can create scintillation events. Significant shadowing of such events can lead to events reconstructed with a low energy that mimic WIMP recoils if they are mis-reconstructed into the main LAr volume. The presence of surface backgrounds in the AV and in the neck requires robust and precise methods for reconstructing the position of scintillation events.

This paper provides a detailed description of the position reconstruction algorithms employed by DEAP-3600 for volume fiducialization and background rejection. After a brief description of the photon counting and time information, three position reconstruction strategies are described and evaluated. The first two methods employ a maximum-likelihood technique: one method is based on the “hit pattern” of the number of photoelectrons (PE) detected by each PMT in an event, while the other method is based on photons time-of-flight information. The third method uses the hit pattern information as input to a neural network and extends position reconstruction to the detector’s neck region.

Refer to caption
Figure 1: Schematic view of the DEAP-3600 detector (The water tank in which it is immersed is not shown). Variables used in the charge-based hit-pattern position reconstruction (x\vec{x}, ri\vec{r_{i}}, θi\theta_{i}) are indicated.

2 Photon Counting and Time Information

The position reconstruction algorithms are based on the number and arrival times of the PEs detected by each PMT, extracted from the digitized pulse shapes by custom algorithms [10, 12]. The reconstruction exploits an accurate Monte Carlo (MC) simulation based on the software RAT [13], which combines Geant4 [14] simulation and analysis software ROOT [15], with event-based analysis tasks. These algorithms maximally use the information measured by the PMTs, which is key for an accurate position reconstruction.

The DEAP-3600 data acquisition system records waveforms of the PMTs from -2.6 μ\mus to 14.4 μ\mus relative to the trigger time using CAEN V1720 250 MHz digitizers [16]. The raw pulses from the PMTs have a rise time of about 2 ns, and are thus slowed down and amplified using signal conditioning boards [5] to allow the digitizers to optimally determine both pulse height and timing. This results in single PE pulses at the digitizer level that are about 20 ns wide and 20 mV in amplitude. Each PMT waveform has its baseline removed by subtracting an offset calculated by averaging 62 baseline samples. The photon count done in this way is biased because the PMTs have significant afterpulsing, which manifests as additional pulses that don’t correspond to scintillation photons.

A low-bias estimate of the photon count can be obtained using a likelihood fit [10]. With this method, the pulse is modelled as nPEn_{\rm PE} photoelectrons, which is the sum of signals from scintillation photons and signals from afterpulsing. Bayes’ theorem is used to estimate nPEn_{\rm PE} at the time of each pulse, given the pulse charge, the liquid argon scintillation probability density function, the times of preceding pulses, the afterpulsing time and charge probability density function, and the single PE charge distribution of the PMT [17]. This PE estimator is used as input to both the hit pattern algorithm and the machine-learning algorithm.

3 Position Reconstruction Algorithms

In DEAP-3600, two maximum-likelihood algorithms are used to reconstruct the interaction vertex position. Both approaches fit a model where a flash of VUV scintillation light is emitted from a point within the spherical inner volume of the detector and is propagated to the wavelength shifter and the PMTs. The first algorithm is based on the spatial PMT hit patterns which are analyzed to derive the most likely position of the interaction vertex. The second approach is based on the photons’ time-of-flight and leverages the finite speed of light in LAr: the general idea is that PMTs closer to the source produce signals earlier than distant ones.

A third approach uses a supervised machine-learning algorithm based on the spatial hit pattern, with the flexibility to train the method on specific regions of the detector and on different event types.

3.1 Hit Pattern Algorithm

This method is based on the pattern of charges measured by the 255 PMTs around the AV. The position is calculated by maximizing a likelihood function, constructed with a probability distribution determined with a MC simulation of the detector.

For a given simulated event position, the number of photons measured by each of the PMTs is calculated with the full optical model in our MC simulation [18] which includes the refractive index, group velocity, and Rayleigh scattering length for VUV and visible photons as well as their dependence on thermodynamic properties. The simulation also takes into account the solid angle of the PMTs as viewed from the vertex, light absorption, re-emission at the TPB wavelength shifter, diffusion, and reflections at surfaces. The probability of detecting photons by a PMT is a highly non-linear function of the event’s coordinates and energy, which changes rapidly as the interaction point comes closer to the PMT and has to be determined with the simulation. To determine the event’s location x\vec{x}, the following likelihood is maximized with a simplex algorithm:

ln(x)=i=1NPMTsln[P(ni;λi)],\ln\mathcal{L}(\vec{x})=\sum^{\rm{N_{PMTs}}}_{i=1}\ln\left[\rm{P}(n_{i};\,\lambda_{i})\right]\,, (3.1)

where P(ni;λi)P(n_{i};\lambda_{i}) is the Poisson probability of observing nin_{i} PE in PMT ii over the full 10 μ\mus time window. The coordinate system has its origin x=(0,0,0)\vec{x}=(0,0,0) at the center of the spherical AV and the zz axis pointing in the vertical direction towards the detector’s neck. The parameter

λi(|x|,θi,nPE)=nPEλi(|x|,θi),\lambda_{i}\left(|\vec{x}|,\theta_{i},n_{\rm PE}\right)=n_{\rm PE}\lambda_{i}\left(|\vec{x}|,\theta_{i}\right)\,, (3.2)

describes the expected number of PE hitting the PMT ii located at position ri\vec{r}_{i}, given the event position x\vec{x}, an angle θi\theta_{i} between x\vec{x} and ri\vec{r}_{i}, and the total number of detected photons nPEn_{\rm PE} (see Figure 1 for a visual description of the variables). The factorized total number of photons nPEn_{\rm PE} provides an absolute normalization for λi\lambda_{i}. The simulation assumes a completely filled detector and the simulated events are generated on a grid with 20 points along the positive xx axis, 20 along the positive yy axis, and 40 symmetrically on the positive and negative zz axis. This choice results in 20 values for the radius r=x2+y2+z2r=\sqrt{x^{2}+y^{2}+z^{2}}. Assuming the spherical symmetry of the detector, events with x<0x<0 and y<0y<0 are modeled as the corresponding positive coordinate values. The zz direction is probed in both directions to account for the asymmetry introduced by the presence of the detector neck. The points are chosen to minimize their total number while probing the volume uniformly. Since these points lay on the three axes, the influence of individual direction-dependent effects can be tested. After the simulation, we define 20 cubic spline interpolated functions λi(cosθ;rj/r0)\lambda_{i}(\cos\theta;r_{j}/r_{0}) with the index jj corresponding to each of the radius values. Here r=|x|r=|\vec{x}| and r0=85r_{0}=85 cm is the inner radius of the vessel. For a given angle, a second cubic spline λ(r/r0;cosθ)\lambda(r/r_{0};\cos\theta) is defined to interpolate in the radius variable.

3.2 Time-of-Flight Algorithm

The time-of-flight algorithm for position reconstruction calculates the arrival time of a photon at each PMT. The algorithm uses a simplified optical model, illustrated in Figure 2.

Refer to caption
Figure 2: The light propagation model and the coordinate system for the time-of-flight reconstruction algorithm.

For a scintillation interaction event taking place at event time t0t_{0} and position x0\vec{x_{0}} in LAr, the arrival time of a photon at a PMT can be calculated as

ti=t0+Tf(p,x0)+τ,t_{i}=t_{0}+T_{f}(\vec{p},\vec{x_{0}})+\tau\,, (3.3)

in which Tf(p,x0)T_{f}(\vec{p},\vec{x_{0}}) is the time-of-flight from the event vertex x0\vec{x_{0}} to position p\vec{p}, the center of the light guide front surface. The time τ\tau is a random variable based on three physical processes: singlet decay of argon with a time constant of 2.2 ns, TPB re-emission with a time constant of 1.5 ns, and a resolution term of 0.6 ns that includes PMT response, resolution, and variability in the propagation time through the light guides. These values were determined by matching the model to the measured time response based on reconstructed variables. The vector p\vec{p} rather than the physical PMT position is used both for the determination of time calibration constants and for reconstruction. When a VUV photon is wavelength-shifted by TPB at a point x\vec{x^{\prime}} on the acrylic surface, the resulting visible photon can be emitted into a light guide (in which case p=x\vec{p}=\vec{x^{\prime}}) or be emitted back into the inner detector volume and travel with a greater group velocity before reaching a light guide front point p\vec{p}. The time-of-flight can then be calculated,

Tf(p,x0)=|xx0|vuv+|px|vvis,T_{f}(\vec{p},\vec{x_{0}})=\frac{|\vec{x^{\prime}}-\vec{x_{0}}|}{\rm{v_{uv}}}+\frac{|\vec{p}-\vec{x^{\prime}}|}{\rm{v_{vis}}}\,, (3.4)

where vuv\rm{v_{uv}} is the group velocity (133 mm/ns) of the 128 nm VUV light [19] and vvis\rm{v_{vis}} is the group velocity (241 mm/ns) of 420 nm visible light in LAr [20]. A test with the values of group velocities shifted by ±5%\pm 5\% resulted in a negligible effect on the position reconstruction resolution. Reflections and scattering of visible photons in TPB and Rayleigh scattering in the LAr are neglected in this algorithm.

Pt(t;r,sinθ)P_{t}(t;r,\sin\theta) is the probability density function (PDF) that corresponds to the flight time from a particular position, with rr and sinθ\sin\theta defined in Figure 2. The probability of measuring a photon in a PMT-light guide assembly at position p\vec{p} starting from a position x0\vec{x_{0}} between times t1t_{1} and t2t_{2} is described by the following equations:

Pt(t1,t2;p,x0)=1(4π)211dcosθuv02π𝑑ϕuv11dcosθvis02π𝑑ϕvisH(t1,t2,duvvuv+dvisvvis)G(θvis,ϕvis),P_{t}(t_{1},t_{2};\vec{p},\vec{x_{0}})=\frac{1}{(4\pi)^{2}}\intop_{-1}^{1}d\cos\theta_{\mathrm{uv}}\intop_{0}^{2\pi}d\phi_{\mathrm{uv}}\intop_{-1}^{1}d\cos\theta_{\mathrm{vis}}\intop_{0}^{2\pi}d\phi_{\mathrm{vis}}H\left(t_{1,}t_{2},\frac{d_{\mathrm{uv}}}{\rm{v_{uv}}}+\frac{d_{\mathrm{vis}}}{\rm{v_{vis}}}\right)G(\theta_{\mathrm{vis}},\phi_{\mathrm{vis}})\,, (3.5)
r=|x0+duv(sinθuvcosϕuv,sinθuvsinϕuv,cosθuv)|,r=\left|\vec{x_{0}}+d_{\mathrm{uv}}\left(\sin\theta_{\mathrm{uv}}\cos\phi_{\mathrm{uv}},\sin\theta_{\mathrm{uv}}\sin\phi_{\mathrm{uv}},\cos\theta_{\mathrm{uv}}\right)\right|\,, (3.6)
dvis=|x0+duv(sinθuvcosϕuv,sinθuvsinϕuv,cosθuv)p|,d_{\mathrm{vis}}=\left|\vec{x_{0}}+d_{\mathrm{uv}}\left(\sin\theta_{\mathrm{uv}}\cos\phi_{\mathrm{uv}},\sin\theta_{\mathrm{uv}}\sin\phi_{\mathrm{uv}},\cos\theta_{\mathrm{uv}}\right)-\vec{p}\right|\,, (3.7)
H(t1,t2,x)={1t1xt2,0otherwise,H(t_{1},t_{2},x)=\begin{cases}1&t_{1}\leq x\leq t_{2}\,,\\ 0&\mathrm{otherwise}\,,\end{cases} (3.8)
G(θvis,ϕvis)={1rlg>|x0+duv(sinθuvcosϕuv,sinθuvsinϕuv,cosθuv)+dvis(sinθviscosϕvis,sinθvissinϕvis,cosθvis)p|,0otherwise.G(\theta_{\mathrm{vis}},\phi_{\mathrm{vis}})=\begin{cases}1&\begin{aligned} r_{\mathrm{lg}}>|\vec{x_{0}}+&d_{\mathrm{uv}}\left(\sin\theta_{\mathrm{uv}}\cos\phi_{\mathrm{uv}},\sin\theta_{\mathrm{uv}}\sin\phi_{\mathrm{uv}},\cos\theta_{\mathrm{uv}}\right)\\ +&d^{\prime}_{\mathrm{vis}}\left(\sin\theta_{\mathrm{vis}}\cos\phi_{\mathrm{vis}},\sin\theta_{\mathrm{vis}}\sin\phi_{\mathrm{vis}},\cos\theta_{\mathrm{vis}}\right)-\vec{p}|\end{aligned}\,,\\ 0&\mathrm{otherwise}\,.\end{cases} (3.9)

Eq. 3.5 gives the overall probability PtP_{t} and assumes that the scintillation light and the TPB fluorescence emission are isotropic. The functions HH and GG impose the conditions that the time-of-flight must be between t1t_{1} and t2t_{2} and that the light from the TPB fluorescence strikes the light guide of interest. duvd_{\mathrm{uv}} and dvisd_{\mathrm{vis}} are the distances traveled by VUV and visible light, respectively. rlg=9.5cmr_{\mathrm{lg}}=9.5\,\rm{cm} is the radius of the face of the light guide. Eq. 3.6 is solved to determine duvd_{\mathrm{uv}} and to impose the geometry of the TPB surface at a radius of r=85r=85 cm, which marks the termination of the VUV light. Eq. 3.7 is the distance between the TPB and PMT. For the time-of-flight calculation, we approximate that the distance to the center of the light guide can always be calculated. This is an approximation because the time-of-flight in the detector depends on where the photon strikes the face of the light guide. However, the largest uncertainty arises from fluorescence occurring directly in front of the PMT of interest, with a maximum value of approximately 1/3 ns (rlg/vvis)\left(r_{\mathrm{lg}}/\mathrm{v_{vis}}\right). This uncertainty is accounted for in the time resolution of the PMT-light guide system.

For the calculation of the solid angle GG, we use the distance traveled by visible light between the TPB fluorescence and the acrylic vessel at the face of the light guide, dvisd^{\prime}_{\mathrm{vis}}, and check if the point of impact is within the radius (rlg)(r_{\mathrm{lg}}) of the face of the light guide.

The likelihood (t0,x0)\mathcal{L}(t_{0},\vec{x_{0}}) for a given event time t0t_{0} and position x0\vec{x_{0}} is

ln(t0,x0)=i=1NlnPt(tit0;ri,sinθi),\ln\mathcal{L}(t_{0},\vec{x_{0}})=\sum_{i=1}^{N}\ln P_{t}(t_{i}-t_{0};r_{i},\sin\theta_{i}), (3.10)

where we sum over the NN pulses in the first 35 ns, with the detection time tit_{i}, distance rir_{i}, and angle θi\theta_{i}, based on the relevant PMT and vertex position. A time-of-flight correction is applied if the path from vertex to PMT passes through the gas layer on the top of the detector. The likelihood in Eq. 3.10 is maximized using the TMinuit2 fitter in ROOT [15].

Since the calculation of the PDF (PtP_{t}) requires significant time, we employ interpolation techniques to evaluate the PDF while fitting. The detailed calculation is performed at discrete points r=0,1,2,,170cmr=0,1,2,...,170~\rm{cm}, and at sinθ=0,0.01,0.02,,1.0\sin\theta=0,0.01,0.02,...,1.0, and in 0.25 ns time bins. The PDFs are then convolved with a time distribution that accounts for the random variable τ\tau from Eq. 3.3. We then use a cubic spline interpolation in time, followed by a bilinear interpolation in rr and sinθ\sin\theta during the fit.

Refer to caption
Refer to caption
Figure 3: The probability density function is shown as a two-dimensional histogram for the time-of-flight and spatial "cell" ID, with the color scale proportional to the probability density. The variables rr and θ\theta in the definition of cell ID are given in Fig. 2. Left: The histogram includes all cells. Right: The histogram is zoomed to show cells 8400 to 8700, representing three distance values rr = 840 mm, 850 mm, and 860 mm with sinθ\sin\theta varying from 0.00 to 0.99 for each distance value.

The left panel of Figure 3 shows the convolved PDF in a two-dimensional representation where the vertical axis is the time-of-flight, the horizontal axis reports the identification numbers (ID) of spatial cells defined as ID =100×sinθ+floor(r[mm]/10)×100=100\times\sin\theta+\textrm{\tt floor}(r[\textrm{mm}]/10)\times 100 and the colour is the value of the PDF. The right panel shows a zoomed view around rr = 850 mm representing a sinθ\sin\theta range from 0.00 to 0.99. For large values of sinθ\sin\theta, visible light arrives earlier than most VUV light (for cell numbers greater than 8560) because these cells are close to the surface.

3.3 Machine-Learning Algorithm

The non-linearity of the position reconstruction problem in LAr points towards a solution employing machine learning (ML). The good performance of the hit pattern algorithm-based likelihood reconstruction suggests a similar approach using a supervised training algorithm. Another strong motivation for considering a ML approach is the possibility of training it with scintillation events in the LAr volume and events in other parts of the detector. In this context, the ML training includes events originating in the neck of the detector which constitute a relevant background for dark matter searches.

We investigated several algorithms, from Naive Bayes [21] to more complex models based on boosted decision trees [22] and neural networks. Our choice converged on a fully connected feed-forward neural network (FFNN) [23] for its better performance, relative simplicity, and training speed [24].

For the position reconstruction in the DEAP-3600 detector volume, the algorithm must realize a mapping between the 255 input charge values measured by the PMTs and the (x,y,z)(x,y,z) position coordinates as output. In practice, three single-output FFNN models were trained: one for each output Cartesian coordinate. Including time-of-flight information as additional FFNN inputs was also tested, but no significant improvements in performance were observed. The final network architectures of the selected FFNNs is reported in Table 1. The number of layers and neurons was determined by iteratively enlarging the network until no increase in the performance was observed.

Coordinates x,yx,y # Neurons Activation Function
Input Layer 255 ReLU
Hidden Layer 1 255 ReLU
Hidden Layer 2 255 ReLU
Hidden Layer 3 16 ReLU
Output Layer 1 linear
Coordinate zz
Input Layer 255 ReLU
Hidden Layer 1 1024 ReLU
Hidden Layer 2 1024 ReLU
Hidden Layer 3 1024 ReLU
Output Layer 1 linear
Table 1: Architecture of the three FFNNs used for position reconstruction. The coordinates xx and yy are reconstructed with the same FFNN topology. ReLU is the rectifier linear unit function ReLU(x)=max(0,x){\rm ReLU}(x)=\max(0,x).

The training of the FFNNs was performed using MC simulations of 10510^{5} β\beta-decays from 39Ar, 10510^{5} nuclear recoils on 40Ar, and 10510^{5} shadowed α\alpha-decays from 210Po happening on the acrylic argon flow-guides in the neck. These samples are divided into training and validation sets with proportions of 70:3070:30. The training process for each Cartesian coordinate minimizes the mean squared distance (MSD) between the predicted position ypred,iy_{\rm pred,i} and the true simulated position ytrue,iy_{\rm true,i} for each event ii:

MSD=1Ni=1N(ypred,iytrue,i)2,MSD=\frac{1}{N}\sum_{i=1}^{N}\left(y_{{\rm pred},i}-y_{{\rm true},i}\right)^{2}\,, (3.11)

where NN is the number of input events presented to the neural network during the training stage. Training for reconstructing the xx and yy coordinates required 400 epochs, while the zz coordinate required 40 epochs.

4 Performance of the Position Reconstruction Algorithms on Data and Simulations

The DEAP-3600 experiment’s WIMP-search analysis primarily utilized the hit-pattern algorithm for fiducialization, with additional cross-validation against the time-of-flight algorithm to ensure consistency [7]. Both the hit pattern and time-of-flight algorithms apply a likelihood framework based on simulated events within the detector’s LAr region.

In contrast, the ML algorithm was specifically trained on interactions occurring in both the LAr region and the detector’s neck region. It was optimized to maximize discrimination between these two event locations. However, due to the limited availability of high-statistics samples of real neck events, the ML algorithm’s performance was evaluated in this case exclusively using simulated data, benchmarked against the hit-pattern and time-of-flight algorithms.

In the following, we compare the different algorithms in in their ability to reconstruct the position of events in liquid argon.

4.1 Position reconstruction with 39Ar β\beta-Decays and 40Ar Nuclear Recoils

To compare the three algorithms in their ability to reconstruct the position of scattering events from β\beta-decays of 39Ar and 40Ar recoils in LAr, we apply an appropriate selection to data and MC generated events. We consider events that have (i) good quality waveforms recorded by the digitizers, e.g. well-formed baseline voltage and valid trigger time, (ii) physics trigger signals only, excluding calibration/cosmics triggers, (iii) no pulse multiplicity consistent with two or more coincident physics events. To avoid light leakage from a previous event, the number of pulses registered in the first 1600 ns of the waveform must be less than or equal to three and the time between the event and the previous one must be greater than 20 μ\mus. The reconstructed number of PE is selected to be within 90–200 PE, matching the WIMP region of interest (ROI) [7].

The left panels of Figure 4 show the normalized cubed radius distribution of reconstructed Ar39{}^{39}\rm{Ar} ER events (data and MC) and Ar40{}^{40}\rm{Ar} nuclear recoil events (MC) with the three algorithms and the right panels show the zz distributions. The hit pattern algorithm produces a more uniform distribution in the bulk region and is biased near the AV surface. In contrast, the distribution obtained with the time-of-flight algorithm is less smooth in the bulk while less biased near the surface. The enhancement of the distribution is more pronounced for the hit pattern algorithm since the probability distribution used in the likelihood is strongly non-linear when an event is very close to a PMT which collects most of the light.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 4: Distributions of reconstructed angular position variables, normalized reconstructed cubed radius ((r/850mm)3(r/850\,\rm{mm})^{3}) and reconstructed zz, comparing Ar39{}^{39}\rm{Ar} data (solid) and Ar39{}^{39}\rm{Ar} simulation (dotted) and Ar40{}^{40}\rm{Ar} nuclear recoil simulation (dashed), for the hit pattern algorithm (top row), the time-of-flight algorithm (middle row), and the neural network algorithm (bottom row). All the plots are normalized with their respective area.

Figure 5 shows the contained LAr mass for Hit Pattern and Time-of-Flight based algorithms for 39Ar data and MC along with the curve calculated from the geometric volume of the detector.

Refer to caption
Figure 5: Estimates from the hit pattern and time-of-flight based algorithms of the contained mass of LAr within a radius of the reconstructed position. The estimate is based on the fraction of 39Ar decays observed in the 90–200 PE range reconstructing within a given radius. It is assumed that the true positions of the 39Ar nuclei are uniformly distributed throughout the LAr target. The analytical contained LAr mass calculated using the geometric volume and the density is also shown.

Figure 6 shows the uniformity of reconstructed positions from the hit pattern algorithm for Ar39{}^{39}\rm{Ar} data and Ar40{}^{40}\rm{Ar} nuclear recoil MC data.

Refer to caption
Refer to caption
Figure 6: Reconstructed distribution shown in z/(850z/(850 mm) vs (x2+y2)/(850mm)2(x^{2}+y^{2})/(850\,\rm{mm})^{2} comparing Ar39{}^{39}\rm{Ar} data (left) to Ar40{}^{40}\rm{Ar} nuclear recoils (right) MC for the hit pattern position reconstruction. Distributions are normalized by their respective maximum values.

4.2 Position Reconstruction of Neck Events

Time-of-flight and hit pattern position reconstruction algorithms use the hypothesis that an event comes from a single flash of isotropic light in LAr. Thus, the two algorithms should agree when the hypothesis is true, such as for WIMPs or Ar39{}^{39}\rm{Ar} ER events. However, for neck events, the light is emitted above the LAr surface and shadowed by the flow-guides: the result is a very different time-of-flight distributions and a hit pattern concentrated at the bottom of the detector.

Refer to caption
Refer to caption
Figure 7: Difference between the reconstructed vertex positions (left) and vertical positions (right) between the time-based and hit-pattern based algorithms in Ar39{}^{39}\rm{Ar} data, Ar40{}^{40}\rm{Ar} NR simulations, and neck alpha simulations.

Figure 7 shows the difference between the two algorithms’ reconstructed zz positions and vertex positions. Ar39{}^{39}\rm{Ar} data, simulated WIMP-like events (Ar40{}^{40}\rm{Ar} NR events), and simulated alpha events from the neck region are selected according to the WIMP ROI. It is shown that the reconstructed vertical positions agree within 35 mm for 50% of 39Ar events in data and simulated WIMPs. For simulated neck alpha decays, the reconstructed vertical positions of the two algorithms are remarkably different, with the time-of-flight algorithm systematically reconstructing these events much closer to the top of the detector than the hit pattern algorithm.

Refer to caption
(a) Error in xx for events originating in liquid argon.
Refer to caption
(b) Error in zz for events originating in liquid argon.
Refer to caption
(c) Error in xx for events originating in the neck.
Refer to caption
(d) Error in zz for events originating in the neck.
Figure 8: Distributions of the error in reconstructing xx (yy is analogous by symmetry) and zz, estimated in simulated events as the difference between reconstructed and MC "true" position.

In order to compare the three models in reconstructing the position of events in LAr and in the neck, the difference between the MC “true” position xtrue\vec{x}_{\rm{true}} and the reconstructed position xrec\vec{x}_{\rm{rec}} are calculated. Events in LAr are defined as the region 850mm<ztrue550mm-850\,\rm{mm}<z_{\rm{true}}\leq 550\,\rm{mm} while the neck region is ztrue>850mmz_{\rm{true}}>850\,\rm{mm}. The results are presented in Figure 8. The new ML algorithm is comparable to the existing hit pattern and time-of-flight algorithms in reconstructing the position in LAr, while it outperforms the other two in the reconstruction of events in the neck.

4.3 Position Reconstruction with 22Na Source Calibration Data

The possibility to employ a radioactive γ\gamma source offers additional opportunities for testing the position reconstruction algorithms. In particular, the source signal can be precisely tagged, effectively eliminating other background events. Moreover, the source can be placed above the LAr surface for investigating a region which is not covered by 39Ar decays. The DEAP-3600 detector has a calibration polyethylene tube (known as CAL F [5]) running around the steel shell holding the AV. The source is housed in a stainless steel capsule and can slide in the pipe and be placed at 9 pre-determined positions (see Figure 9) with an ex-situ measured uncertainty of 1\sim 1 cm.

The three γ\gammas emitted almost simultaneously by the source allow for a tagging scheme where two photons are detected by LYSO crystals (32 photons/keV light yield) read out by PMTs on both sides of the source, while the third one is directed towards the LAr. The tagging algorithm relies on the energy deposited in the LYSO crystals, particularly on the timing between the event time in the LAr volume and the time when one of the tagging PMTs triggers. More details about this source are given in [5].

Refer to caption
Figure 9: (Left) Picture of the steel shell containing the AV with the LAr. The CAL F pipe for the 22Na source is indicated. (Right) Drawing of the CAL F tube (in red) around the steel shell with the 9 standard positions. The two positions (2 and 4) used in this study and the insertion point are indicated with black arrows.

The energy spectrum of the γ\gammas from the 22Na source, interacting with the LAr volume, shows variations under three different tagging conditions (Figure 10).

  1. 1.

    No tagging by the calibration PMTs: this allows the observation of the 39Ar beta decay spectrum on top of the 22Na events;

  2. 2.

    Tagging by only one of the calibration PMTs: this occurs for instance, when the 1.27 MeV γ\gamma is detected by one of the calibration PMTs, one 511 keV γ\gamma escapes detection, and the other 511 keV γ\gamma reaches the LAr volume, resulting in the observed peak at 511 keV (\sim3200 PE in Figure 10, green spectrum);

  3. 3.

    Double tagging: this represents the most stringent condition, where both 511 keV γ\gammas are tagged by the calibration PMTs, and the 1.27 MeV interacts with the LAr (\sim8000 PE in Figure 10).

The full energy peak of the 1.27 MeV γ\gamma is visible for all the tagging conditions while the 511 keV peak is clearly visible only for the single tag configuration.

Refer to caption
Figure 10: Energy spectrum of the γ\gammas from the 22Na source (in PE) in LAr for different tagging configurations for position 2 (see Figure 9, right).

To study the performance of the hit pattern and time-of-flight reconstruction algorithms, MC simulations were performed with the 22Na source at positions 2 and 4 (Figure 9). The chosen positions are below and above the LAr surface.

A total of 1.2×\times107 and 4.5×\times106 events were generated from the 22Na capsule in position 2 and 4, respectively. The difference in the number of simulated events arises from variations in the number of events that trigger the detector, which depend on their position due to the detector’s geometry

Reconstructed coordinates for positions 2 and 4 are shown in Figures 11 and 12, for data and MC, both reconstructed with the hit-pattern and time-of-flight algorithms (normalizing to the same number of events).

A general good agreement between data and MC is observed for all coordinates but differences between the two algorithms can be noticed. From Figures 11(d) and 12(d), it is clear that the time-of-flight algorithm tends to reconstruct events closer to the center of the detector with respect to the hit pattern-based algorithm.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 11: Reconstructed coordinates and normalized cubed radius, (r/850mm)3(r/850\,\rm{mm})^{3} for events from position 2: data (black) and MC (dashed blue) reconstructed with the hit-pattern algorithm, data (red) and MC (dashed green) reconstructed with the time-of-flight algorithm.
Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 12: Reconstructed coordinates and normalized cubed radius, (r/850mm)3(r/850\,\rm{mm})^{3} for events from position 4: data (black) and MC (dashed blue) reconstructed with the hit-pattern algorithm, data (red) and MC (dashed green) reconstructed with the time-of-flight algorithm.

5 Position Resolution

A data-driven method, first introduced in [7], is used to quantify the position resolution obtained from the three algorithms. According to this method, starting from a sample of 39Ar decays, we create two “pseudo-events” each with about half the original event’s observed PE originating from the same position in the detector. This is done by cycling through all the PMTs that saw light in the event, then through each PMT pulse, and through each PE within pulses. Each such PE unit from the original event can be assigned to the first pseudo-event with 50% probability and independently can be assigned to the second pseudo-event with 50% probability to minimize correlations between the two pseudo-events555In general, a number nn of pseudo-events can be considered, but we consider n=2n=2 for this analysis.. In this way, a PE unit from a pulse can be added to both pseudo-events, one or the other, or neither. On average, each pseudo-event has approximately half the energy of the original event and a similar spatial and time profile.

The original event and pseudo-events can then be processed by the position reconstruction algorithms. Comparisons in reconstructed position can be drawn between the pseudo-events and the original event, since the origin of the scintillation photons in the detector is the same.

For a large sample of events distributed throughout the detector volume and in a range of energies, the distributions of the differences in reconstructed position (Δx,Δy,Δz)(\Delta x,\Delta y,\Delta z) between pairs of pseudo-events provide a data-driven estimate of the position resolution. The resolution estimator δr¯\overline{\delta r} on the event position from the origin of the detector is calculated from the resolution on measured Cartesian coordinates (σeffx,σeffy,σeffz)(\sigma_{\rm{eff}}^{x},\sigma_{\rm{eff}}^{y},\sigma_{\rm{eff}}^{z}) as:

δr¯=(σeffx)2+(σeffy)2+(σeffz)23,\overline{\delta r}=\sqrt{\frac{(\sigma_{\rm{eff}}^{x})^{2}+(\sigma_{\rm{eff}}^{y})^{2}+(\sigma_{\rm{eff}}^{z})^{2}}{3}}\quad, (5.1)

where the Cartesian resolution estimators are obtained from the characteristic widths of the distributions of Δx\Delta x, Δy\Delta y and Δz\Delta z (see Appendix A).

Refer to caption
(a) Hit pattern reconstructed xx, 0<r<1500<r<150 mm.
Refer to caption
(b) Hit pattern reconstructed xx, 750<r<800750<r<800 mm.
Refer to caption
(c) Time-based reconstructed x, 0<r<1500<r<150 mm.
Refer to caption
(d) Time-based reconstructed x, 750<r<800750<r<800 mm.
Refer to caption
(e) Neural network reconstructed xx, 0<r<1500<r<150 mm.
Refer to caption
(f) Neural network reconstructed xx, 750<r<800750<r<800 mm.
Refer to caption
(g) Neural network reconstructed zz, 0<r<1500<r<150 mm.
Refer to caption
(h) Neural network reconstructed zz, 750<r<800750<r<800 mm.
Figure 13: Examples of the distributions of the differences in reconstructed position between pseudo-events in the 100–200 PE range, for the three algorithms. Distributions for two regions, based on the reconstructed radius of the original 39Ar decay event, are shown: near the center of the detector (left), and near the edges (right).
Refer to caption
(a) Likelihood hit pattern-based position reconstruction
Refer to caption
(b) Likelihood time-based position reconstruction
Refer to caption
(c) Neural network PE-based position reconstruction
Figure 14: Resolution of the reconstructed radius measured using the pseudo-event method from the three position reconstruction algorithms, shown in bins of the average PE between the pseudo-events and reconstructed radius of the original event.
Refer to caption
(a) Example 1D distribution - Likelihood PE-based
Refer to caption
(b) Likelihood PE-based position resolution
Refer to caption
(c) Likelihood time-based position resolution
Refer to caption
(d) Neural network PE-based position resolution
Figure 15: In (a), the 2D resolution values per bin were sorted onto a 1D histogram. Then from the distribution, the quantile points corresponding to the median, 1σ\sigma (68%) and 2σ\sigma (95%), are shown as a function of time in (b) through (d). These figures demonstrate the stability over time of the estimated resolution from the three position reconstruction algorithms.

In the majority of cases, these distributions are described by the following function, which has a Gaussian core and an exponential component to account for non-Gaussian tails:

g(Δx)=(1ασ2π)e-(Δxμ)22σ2+α2τe-|Δxμ|τ,g(\Delta x)=\left(\frac{1-\alpha}{\sigma\sqrt{2\pi}}\right)\,e^{\scalebox{1.3}{-$\frac{(\Delta x-\mu)^{2}}{2\sigma^{2}}$}}\,+\,\frac{\alpha}{2\tau}\,e^{\scalebox{1.3}{-$\frac{\left|\Delta x-\mu\right|}{\tau}$}}\,, (5.2)

where the α\alpha parameter varies between 0 (entirely Gaussian) and 1 (entirely exponential). As shown in Appendix A, the characteristic width of this distribution is

σeff=(1α)σ2+2ατ2.\sigma_{\rm{eff}}=\sqrt{(1-\alpha)\sigma^{2}+2\alpha\tau^{2}}\,. (5.3)

In the case of the FFNN reconstructed zz distributions at low rr and PE, a different fit function is required as shown in Fig. 13(g). The fit function chosen to fit this distribution is a sum of four normalized Gaussians:

g(Δz)=(w0σ02π)e-(Δz+μ0)22σ02+(w0σ02π)e-(Δzμ0)22σ02+(w1σ12π)e-(Δz)22σ12+(w2σ22π)e-(Δz)22σ22.g(\Delta z)=\left(\frac{w_{\rm{0}}}{\sigma_{\rm{0}}\sqrt{2\pi}}\right)\,e^{\scalebox{1.3}{-$\frac{(\Delta z+\mu_{\rm{0}})^{2}}{2\sigma_{\rm{0}}^{2}}$}}+\left(\frac{w_{\rm{0}}}{\sigma_{\rm{0}}\sqrt{2\pi}}\right)\,e^{\scalebox{1.3}{-$\frac{(\Delta z-\mu_{\rm{0}})^{2}}{2\sigma_{\rm{0}}^{2}}$}}+\left(\frac{w_{\rm{1}}}{\sigma_{\rm{1}}\sqrt{2\pi}}\right)\,e^{\scalebox{1.3}{-$\frac{(\Delta z)^{2}}{2\sigma_{\rm{1}}^{2}}$}}+\left(\frac{w_{\rm{2}}}{\sigma_{\rm{2}}\sqrt{2\pi}}\right)\,e^{\scalebox{1.3}{-$\frac{(\Delta z)^{2}}{2\sigma_{\rm{2}}^{2}}$}}\,. (5.4)

The distribution contains “outer Gaussians”, which are symmetrically apart from the origin, and have the same scaling w0w_{\rm{0}} and standard deviation σ0\sigma_{\rm{0}}. They correspond to cases where the FFNN misclasifies only one of the two pseudo-events as originating from the neck, and the other from the AV, resulting in a large Δz\Delta z. The two central Gaussians with weights w1w_{\rm{1}} and w2w_{\rm{2}} have their means set to zero. As shown in Appendix A, the characteristic width of this distribution is

σeff=2w0(σ02+μ02)+w1σ12+w2σ22.\sigma_{\rm{eff}}=\sqrt{2w_{\rm{0}}(\sigma_{\rm{0}}^{2}+\mu_{\rm{0}}^{2})+w_{\rm{1}}\sigma_{\rm{1}}^{2}+w_{\rm{2}}\sigma_{\rm{2}}^{2}}. (5.5)

The collection of plots in Figure 13 showcase some examples of the difference distributions of positions between the pseudo-events for each reconstruction algorithm. Distributions are shown for two regions of the detector: near the center and near the edges, for a chosen PE range of 100–200. For the likelihood algorithms, only Δx\Delta x is displayed as example, since Δy\Delta y and Δz\Delta z follow the same behavior. For the neural network, Δx\Delta x and Δy\Delta y share the same profile, but Δz\Delta z is different, so an additional pair of examples is provided for the latter.

Figure 14 shows the position resolution of the three algorithms as a function of the average PE of the pseudo-events and the reconstructed radius of the original event. The position resolution improves as we get closer to the AV surface, and as the number of PE in the event increases.

To quantify how the position reconstruction resolution evolves over time, for each 2D plot, 1D distributions of the resolution values were created. Each of the values were sorted into 100 bins, 5 mm wide each. Next, the locations of the median, 1σ\sigma and 2σ\sigma quantiles were extracted from the distribution. Shown in Figure 15(a) is an example of the 1D distribution with lines added at the locations of these key quantiles. The other parts of Figure 15 show the stability of these position resolution metrics across the second-fill operation of DEAP-3600 in 2016–2020.

6 Summary

The DEAP-3600 experiment developed three complementary algorithms for reconstructing the positions of interactions within LAr: a maximum likelihood algorithm based on PE patterns, a maximum likelihood algorithm utilizing time-of-flight information, and a machine-learning algorithm employing neural networks with PE patterns as input. All three algorithms demonstrate comparable performance for position reconstruction within the LAr volume. However, in the detector’s neck region, only the machine-learning approach provides reliable position resolution, as it was specifically designed and trained on a simulated dataset containing events from both the LAr volume and the neck region.

The successful identification and rejection of background events, particularly shadowed alpha decays occurring in the neck, is critical for enhancing the experiment’s sensitivity to WIMP dark matter signals. The two maximum-likelihood algorithms specialized for reconstructing the event position in the LAr volume were further tested with data from a radioactive 22Na source, confirming their good agreement with simulations.

To further refine the position resolution determination, a fully data-driven method was developed. The dependence of position resolution on both position and energy was studied in detail. This method demonstrated the excellent stability of the position resolution over the data collection period. The capability of reconstructing the position of events in DEAP-3600 significantly enhances the detector’s sensitivity to WIMP dark matter within the region of interest, improving its ability to detect rare nuclear recoils in the bulk LAr volume while rejecting surface backgrounds.

7 Acknowledgements

We thank the Natural Sciences and Engineering Research Council of Canada (NSERC), the Canada Foundation for Innovation (CFI), the Ontario Ministry of Research and Innovation (MRI), and Alberta Advanced Education and Technology (ASRIP), the University of Alberta, Carleton University, Queen’s University, the Canada First Research Excellence Fund through the Arthur B. McDonald Canadian Astroparticle Physics Research Institute, Consejo Nacional de Ciencia y Tecnología Project No. CONACYT CB-2017-2018/A1-S-8960, DGAPA UNAM Grants No. PAPIIT IN108020 and IN105923, and Fundación Marcos Moshinsky, the European Research Council Project (ERC StG 279980), the UK Science and Technology Facilities Council (STFC) (ST/K002570/1 and ST/R002908/1), the Leverhulme Trust (ECF-20130496), the Russian Science Foundation (Grant No. 21-72-10065), the Spanish Ministry of Science and Innovation (PID2019-109374GB-I00) and the Community of Madrid (2018-T2/ TIC-10494), the International Research Agenda Programme AstroCeNT (MAB/2018/7) funded by the Foundation for Polish Science (FNP) from the European Regional Development Fund, and the Polish National Science Centre (2022/47/B/ST2/02015). Studentship support from the Rutherford Appleton Laboratory Particle Physics Division, STFC and SEPNet PhD is acknowledged. We thank SNOLAB and its staff for support through underground space, logistical, and technical services. SNOLAB operations are supported by the CFI and Province of Ontario MRI, with underground access provided by Vale at the Creighton mine site. We thank Vale for their continuing support, including the work of shipping the acrylic vessel underground. We gratefully acknowledge the support of the Digital Research Alliance of Canada, Calcul Québec, the Centre for Advanced Computing at Queen’s University, and the Computational Centre for Particle and Astrophysics (C2PAP) at the Leibniz Supercomputer Centre (LRZ) for providing the computing resources required to undertake this work.

Appendix A Data-Driven Estimation Method for Position Resolution

The spherical radius rr expressed in terms of three Cartesian coordinates x,y,zx,y,z is

r(x,y,z)=x2+y2+z2.r(x,y,z)=\sqrt{x^{2}\,+\,y^{2}\,+\,z^{2}}\quad. (A.1)

From standard error propagation, assuming uncorrelated variables, the error on the radius is

δr=(xδx)2+(yδy)2+(zδz)2r.\delta r=\frac{\sqrt{(x\delta x)^{2}+(y\delta y)^{2}+(z\delta z)^{2}}}{r}\quad. (A.2)

Under the assumption of spherical symmetry, we assume that δx\delta x, δy\delta y and δz\delta z are constant over fixed spherical coordinates θ\theta and ϕ\phi, thus the average resolution δr¯\overline{\delta r} of the reconstructed radius can be obtained by integrating over the solid angle:

δr¯=14πϕ=02πθ=0π(sinθcosϕδx)2+(sinθsinϕδy)2+(cosθδz)2sinθdθdϕ.\overline{\delta r}=\frac{1}{4\pi}\int_{\phi=0}^{2\pi}\int_{\theta=0}^{\pi}\sqrt{(\sin\theta\cos\phi\ \delta x)^{2}\,+\,(\sin\theta\sin\phi\ \delta y)^{2}\,+\,(\cos\theta\ \delta z)^{2}}\,\sin\theta\,d\theta\,d\phi\quad. (A.3)

Performing the integral,

δr¯=(δx)2+(δy)2+(δz)23.\overline{\delta r}=\sqrt{\frac{(\delta x)^{2}+(\delta y)^{2}+(\delta z)^{2}}{3}}\quad. (A.4)

The errors on the Cartesian reconstructed positions are determined from the characteristic widths of the distributions of the distances between pairs of pseudo-events, Δx\Delta x, Δy\Delta y and Δz\Delta z as explained in Section 5. These characteristic widths are derived from the variance of the distribution from the fit function used in each case.

First, expressing the mixture distribution of a Gaussian with exponential tails from Eq. 5.2 as:

g(Δx)=(1α)f1(Δx)+αf2(Δx),g(\Delta x)=(1-\alpha)f_{1}(\Delta x)+\alpha f_{2}(\Delta x)\,, (A.5)

where f1f_{\rm{1}} and f2f_{\rm{2}} are respectively the normalized Gaussian and exponential tail distributions, the expectation value and the variance of g(Δx)g(\Delta x) are respectively:

E[g(Δx)]\displaystyle{\rm E}[g(\Delta x)] =\displaystyle= E[(1α)f1(Δx)+αf2(Δx)]\displaystyle{\rm E}[(1-\alpha)f_{1}(\Delta x)+\alpha f_{2}(\Delta x)] (A.6)
=\displaystyle= (1α)μ+αμ=μ,\displaystyle(1-\alpha)\mu+\alpha\mu=\mu\,,
V[g(Δx)]\displaystyle{\rm V}[g(\Delta x)] =\displaystyle= (1α)x2f1(x)𝑑x+αx2f2(x)𝑑xμ2\displaystyle(1-\alpha)\int x^{2}f_{1}(x)\,dx\ +\alpha\int x^{2}f_{2}(x)\,dx\ -\mu^{2} (A.7)
=\displaystyle= (1α)(σ2+μ2)+α(2τ2+μ2)μ2\displaystyle(1-\alpha)(\sigma^{2}+\mu^{2})+\alpha(2\tau^{2}+\mu^{2})-\mu^{2}
=\displaystyle= (1α)σ2+2ατ2.\displaystyle(1-\alpha)\sigma^{2}+2\alpha\tau^{2}\,.

The characteristic width of the Gaussian-exponential mixture distribution is therefore:

σeff=(1α)σ2+2ατ2.\sigma_{\rm{eff}}=\sqrt{(1-\alpha)\sigma^{2}+2\alpha\tau^{2}}\,. (A.8)

In the case where the mixture distribution is a sum of four Gaussians, a similar process is followed. Starting with Eq. 5.4, the expectation value and the variance are:

E[g(Δx)]\displaystyle{\rm E}[g(\Delta x)] =\displaystyle= iwiμi=w0(μ0+μ0)=0,\displaystyle\sum_{i}w_{i}\mu_{i}=w_{0}(-\mu_{0}+\mu_{0})=0\,, (A.9)
V[g(Δx)]\displaystyle{\rm V}[g(\Delta x)] =\displaystyle= iwi(σi2+μi2)E[g(Δx)]2\displaystyle\sum_{i}w_{i}(\sigma_{i}^{2}+\mu_{i}^{2})-{\rm E}[g(\Delta x)]^{2} (A.10)
=\displaystyle= 2w0(σ02+μ02)+w1σ12+w2σ22.\displaystyle 2w_{0}(\sigma_{0}^{2}+\mu_{0}^{2})+w_{1}\sigma_{1}^{2}+w_{2}\sigma_{2}^{2}\,.

Therefore, in this case the characteristic width is:

σeff\displaystyle\sigma_{\rm{eff}} =\displaystyle= 2w0(σ02+μ02)+w1σ12+w2σ22\displaystyle\sqrt{2w_{0}(\sigma_{0}^{2}+\mu_{0}^{2})+w_{1}\sigma_{1}^{2}+w_{2}\sigma_{2}^{2}} (A.11)

Based on Eq. A.8 or Eq. A.11, the appropriate characteristic widths for δx,δy,δz\delta x,\delta y,\delta z are substituted into Eq. A.4, which becomes:

δr¯=(σeffx)2+(σeffy)2+(σeffz)23.\overline{\delta r}=\sqrt{\frac{(\sigma_{\rm{eff}}^{x})^{2}+(\sigma_{\rm{eff}}^{y})^{2}+(\sigma_{\rm{eff}}^{z})^{2}}{3}}. (A.12)

An additional correction needs to be taken into account to obtain a true estimate of the position resolution. Mathematically, the true resolution is defined as the width of the distribution P(|xrecxtrue|)P\left(\left|\vec{x}_{\rm rec}-\vec{x}_{\rm true}\right|\right) of the difference between the true (xtrue\vec{x}_{\rm true}) and reconstructed (xrec\vec{x}_{\rm rec}) position. In actual data, two pseudo-events are generated by randomly selecting PE units from the original event. Position reconstruction of two pseudo-events provides two fit positions (x1\vec{x}_{1} and x2\vec{x}_{2}), and the distribution of their difference is Q(|x1x2|)Q\left(\left|\vec{x}_{1}-\vec{x}_{2}\right|\right) (as in Figure 13(a)). So far, the distribution QQ, which is different from the distribution PP, is used to calculate the position resolution. If the distribution QQ follows Eq. A.8 or Eq. A.11 then we can assume the distribution PP follows it as well, as long as photon emission and detection are uncorrelated. The following equation shows the relationship between the PP and QQ distributions:

Q(|x1x2|)=Vd3xtrueP(|x1xtrue|)P(|x2xtrue|).Q\left(\left|\vec{x}_{1}-\vec{x}_{2}\right|\right)=\int_{V}d^{3}\vec{x}_{\rm true}P\left(\left|\vec{x}_{1}-\vec{x}_{\rm true}\right|\right)P\left(\left|\vec{x}_{2}-\vec{x}_{\rm true}\right|\right)\,. (A.13)

For distributions where only Eq. A.8 is used, the relationship between the ratio of PP-resolution (σtrue\sigma_{\rm true}) to QQ-resolution (σrec\sigma_{\rm rec}) and the α\alpha parameter is obtained using a toy MC. First, σtrue\sigma_{\rm true} is obtained from the parameters α,σ\alpha,\,\sigma, and τ\tau, varied in a range comparable to the observed data. Then, two random events are generated following Eq. A.8 and afterwards fitted to obtain the effective resolution σfit\sigma_{\rm fit}. The simulation results are shown in Fig. 16. A correction factor of 2/3, obtained when setting α=0.5\alpha=0.5 in the linear α\alpha-based correction, was therefore used to scale the data-driven QQ-resolution values calculated with Eq. A.12 in order to obtain the PP-resolution values shown in Fig. 14 and Fig. 15. The same correction factor was applied to cases involving the four-Gaussian mixture distribution for neural network position resolution calculations.

Refer to caption
Figure 16: Ratio of σtrue\sigma_{\rm{true}} to σfit\sigma_{\rm{fit}} versus the α\alpha parameter in the toy MC.

References