Stochastic Methods for Neutron Transport Equation III: Generational many-to-one and $k_{\texttt{eff}}$

A. M. G. Cox¹¹1 Department of Mathematical Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK. Email: a.g.m.cox@bath.ac.uk, e.l.horton@bath.ac.uk, a.kyprianou@bath.ac.uk , E. L. Horton^∗, A. E. Kyprianou^∗, D. Villemonais²²2Institut Élie Cartan de Lorraine, Bureau 123, Université de Lorraine, 54506, Vandoeuvre-lès-Nancy Cedex, France. Email: denisvillemonais@gmail.com

(August 7, 2025)

Abstract

The Neutron Transport Equation (NTE) describes the flux of neutrons over time through an inhomogeneous fissile medium. In the recent articles [5, 10], a probabilistic solution of the NTE is considered in order to demonstrate a Perron-Frobenius type growth of the solution via its projection onto an associated leading eigenfunction. In [9, 4], further analysis is performed to understand the implications of this growth both in the stochastic sense, as well as from the perspective of Monte-Carlo simulation.

Such Monte-Carlo simulations are prevalent in industrial applications, in particular where regulatory checks are needed in the process of reactor core design. In that setting, however, it turns out that a different notion of growth takes centre stage, which is otherwise characterised by another eigenvalue problem. In that setting, the eigenvalue, sometimes called $k$ -effective (written $k_{\texttt{eff}}$ ), has the physical interpretation as being the ratio of neutrons produced (during fission events) to the number lost (due to absorption in the reactor or leakage at the boundary) per typical fission event.

In this article, we aim to supplement [5, 10, 9, 4], by developing the stochastic analysis of the NTE further to the setting where a rigorous probabilistic interpretation of $k_{\texttt{eff}}$ is given, both in terms of a Perron-Frobenius type analysis as well as via classical operator analysis.

To our knowledge, despite the fact that an extensive engineering literature and industrial Monte-Carlo software is concentrated around the estimation of $k_{\texttt{eff}}$ and its associated eigenfunction, we believe that our work is the first rigorous treatment in the probabilistic sense (which underpins some of the aforesaid Monte-Carlo simulations).

Key words: Neutron Transport Equation, principal eigenvalue, semigroup theory, Perron-Frobenius decomposition

MSC: 82D75, 60J80, 60J75, 60J99

1 Introduction

As described in [10, 9, 5] the NTE is a balance equation for the flux of neutrons across a planar cross-section in an inhomogeneous fissile medium. The backwards form of the equation can be written as follows,

	$\displaystyle\frac{\partial}{\partial t}\psi_{t}(r,\upsilon)$	$\displaystyle=\upsilon\cdot\nabla\psi_{t}(r,\upsilon)-\sigma(r,\upsilon)\psi_{t}(r,\upsilon)$
		$\displaystyle+\sigma_{\texttt{s}}(r,\upsilon)\int_{V}\psi_{t}(r,\upsilon^{\prime})\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}+\sigma_{\texttt{f}}(r,\upsilon)\int_{V}\psi_{t}(r,\upsilon^{\prime})\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime},$		(1.1)

where the flux $\psi_{t}(r,\upsilon)$ is a function of time, $t$ and the configuration variables $(r,\upsilon)\in D\times V$ where $D\subseteq\mathbb{R}^{3}$ is a non-empty, smooth, bounded convex domain such that $\partial D$ has zero Lebesgue measure, and $V=\{\upsilon\in\mathbb{R}^{3}:\upsilon_{\texttt{min}}\leq|\upsilon|\leq\upsilon_{\texttt{max}}\}$ . Furthermore, the other components of (1.1) have the following interpretation:

	$\displaystyle\sigma_{\texttt{s}}(r,\upsilon)$	$\displaystyle:\text{ the rate at which scattering occurs from incoming velocity $\upsilon$,}$
	$\displaystyle\sigma_{\texttt{f}}(r,\upsilon)$	$\displaystyle:\text{ the rate at which fission occurs from incoming velocity $\upsilon$,}$
	$\displaystyle\sigma(r,\upsilon)$	$\displaystyle:\text{ the sum of the rates }\sigma_{\texttt{f}}+\sigma_{\texttt{s}}\text{ and is known as the total cross section,}$
	$\displaystyle\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}$	$\displaystyle:\text{ the scattering yield at velocity $\upsilon^{\prime}$ from incoming velocity }\upsilon,$
		$\displaystyle\hskip 14.22636pt\text{ satisfying }\textstyle{\int_{V}}\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\rm d}\upsilon^{\prime}=1,\text{ and }$
	$\displaystyle\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}$	$\displaystyle:\text{ the neutron yield at velocity $\upsilon^{\prime}$ from fission with incoming velocity }\upsilon,$
		$\displaystyle\hskip 14.22636pt\text{ satisfying }{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}\textstyle{\int_{V}}\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}<\infty.}$

We also enforce the following initial and boundary conditions

\left\{\begin{array}[]{ll}\psi_{0}(r,\upsilon)=g(r,\upsilon)&\text{ for }r\in D,\upsilon\in{V},\\ &\\ \psi_{t}(r,\upsilon)=0&\text{ for }t\geq 0\text{ and }r\in\partial D\text{ if }\upsilon\cdot{\bf n}_{r}>0,\end{array}\right.

(1.2)

where ${\bf n}_{r}$ is the outward unit normal at $r\in\partial D$ and $g:D\times V\to[0,\infty)$ is a bounded, measurable function. Throughout we will rely on the following assumptions in some (but not all) of our results:

(H1):

Cross-sections $\sigma_{\texttt{s}}$ , $\sigma_{\texttt{f}}$ , $\pi_{\texttt{s}}$ and $\pi_{\texttt{f}}$ are uniformly bounded away from infinity.
(H2):

We have $\sigma_{\texttt{s}}\pi_{\texttt{s}}+\sigma_{\texttt{f}}\pi_{\texttt{f}}>0$ on $D\times V\times V$ .
(H3):

There is an open ball $B$ compactly embedded in $D$ such that $\sigma_{\texttt{f}}\pi_{\texttt{f}}>0$ on $B\times V\times V$ .
(H4):

the fission offspring are bounded in number by the constant $N_{\texttt{max}}>1$ .

Note, the assumption (H1) ensures that all activity occurs at a maximum rate. Assumption (H2) ensures that at least some activity occurs, whether it be scattering or fission, together with (H3), it ensures that there is at least some fission as well as scattering. Finally (H4) is a physical constraint that is natural to nuclear fission, typically no more than 3 neutrons are produced during an average fission event. Figure 1 illustrates the complex nature of the in homogeneity in the domain one typically considers.

Refer to caption — Figure 1: The geometry of a nuclear reactor core representing a physical domain $D$ , on to which the different cross-sectional values of $\sigma_{\emph{{s}}},\sigma_{\emph{{f}}},\pi_{\emph{{s}}},\pi_{\emph{{f}}}$ are mapped, also as a function of neutron velocity.

Due to the irregular nature of gradient operator, (1.1) is meaningless in the pointwise sense, so it is often stated in one of two forms. The first is to treat (1.1) as a weak linear partial integro-differential equation (PIDE) in an appropriate Banach space, usually $L_{2}(D\times V)$ , the space of functions $f:D\times V\mapsto[0,\infty)$ which are finite with respect to the norm $\lVert f\rVert_{2}=(\int_{D\times V}f(r,\upsilon){\textnormal{d}}r{\textnormal{d}}\upsilon)^{1/2}$ ); see e.g. [6, 7, 15]. The second is to consider the integral or mild form of (1.1). We refer the reader to [10, 9, 5] and the references therein for a discussion on the various formulations of the NTE and its solution. We will also elaborate on both in the forthcoming discussion.

For both formats of (1.1), the papers [6, 7, 15, 10, 5] dealt with the time-eigenvalue problem and an associated Perron-Frobenius decomposition. More precisely, they give a rigorous stochastic meaning to the asymptotic

\psi_{t}\sim{\rm e}^{\lambda_{*}t}c_{g}\varphi+o({\rm e}^{\lambda_{*}t}),

(1.3)

as $t\to\infty$ , where $\lambda_{*}$ and $\varphi$ are the leading eigenvalue and associated eigenfunction associated to the NTE in the appropriate sense and $c_{g}$ is a constant that depends on the initial data $g$ .

Such an understanding is important as it promotes a number of different Monte-Carlo algorithms that can be used to estimate both the lead eigenvalue $\lambda_{*}$ and the associated non-negative eigenfunction $\varphi$ . The latter can be formulated as an eigenpair in $L_{2}(D\times V)$ satisfying

({\mathcal{T}}+\mathcal{S}+{\mathcal{F}})\varphi=\lambda_{*}\varphi,

(1.4)

on $D\times V$ , where

\left\{\begin{array}[]{rl}{\mathcal{T}}{f}(r,\upsilon)&:=\upsilon\cdot\nabla{f}(r,\upsilon)-\sigma(r,\upsilon)\\ \\ {\mathcal{S}}{f}(r,\upsilon)&:=\sigma_{\texttt{s}}(r,\upsilon)\int_{V}{f}(r,\upsilon^{\prime})\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}\\ \\ {\mathcal{F}}{f}(r,\upsilon)&:=\sigma_{\texttt{f}}(r,\upsilon)\int_{V}{f}(r,\upsilon^{\prime})\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime},\end{array}\right.

(1.5)

Here, we can think of $\lambda_{*}$ as characterising the rate of growth of flux in the system over time.

It turns out that, predominantly in industrial, engineering and (some) physics literature, there is another eigenvalue problem that plays a fundamental role in the design and safety of nuclear reactors; see for example Section 1.5 of [13]. The aforesaid eigenvalue problem involves finding (in any appropriate sense) an eigenpair $k$ and $\phi$ such that

({\mathcal{T}}+\mathcal{S})\phi+\frac{1}{k}{\mathcal{F}}\phi=0.

(1.6)

The leading eigenvalue, which in the nuclear regulation industry is called $k$ -effective, written $k_{\texttt{eff}}$ , has the physical interpretation as being the ratio of neutrons produced (during fission events) to the number lost (due to absorption in the reactor or leakage at the boundary). Another interpretation of $k$ is that it represents the average number of neutrons produced per fission event. It is this second interpretation which we exploit, since $k_{\texttt{eff}}$ acts as a measure of neutrons produced between fission generations.

It is worth noting that the two eigenproblems offer potentially different sets of solutions, however, they agree in terms of criticality. More precisely, in (1.4), the triple $(\mathcal{T},\mathcal{S},\mathcal{F})$ is called critical if the leading eigenvalue, $\lambda_{*}$ , in (1.4) is zero, and otherwise called subcritical (resp. supercritical) if $\lambda_{*}<0$ (resp. $\lambda_{*}>0$ ). In the setting of (1.6), the triple $(\mathcal{T},\mathcal{S},\mathcal{F})$ is called critical if $k_{\texttt{eff}}=1$ and subcritical (resp. supercritical) if $k_{\texttt{eff}}<1$ (resp. $k_{\texttt{eff}}>1$ ).

We note however that in [2], there is a relationship between the two eigenvalues, regardless of the criticality of the system and at criticality, both (1.4) and (1.6) agree.

The main objective of this paper is to put into a rigorous setting the existence of the ‘leading’ solutions to (1.6) in the two main contexts that the NTE (1.1) is understood; that is, the weak linear PIDE context and the probabilistic context. Moreover, in the mild setting, we will build an expectation semigroup, say $(\Psi_{n},n\geq 0)$ , out of a stochastic process such that

\Psi_{n}[g]\sim k_{\texttt{eff}}^{-n}C_{g}\phi+o(k_{\texttt{eff}}^{-n}),

for $g\in L^{+}_{\infty}(D\times V)$ , as $n\to\infty$ , and an appropriate choice of $C_{g}\geq 0$ . (See Theorem 5.1 below.) This also provides a mathematically rigorous underpinning for many of the Monte-Carlo algorithms that are used in industry for computing $k_{\texttt{eff}}$ . We will offer further discussion in this direction at the end of the paper.

The rest of this article is organised as follows. In the next section, we formally introduce the description of (1.1) as a PIDE on a functional space, that is, we describe it as an abstract Cauchy problem. Moreover, we introduce two underlying stochastic processes, both of which can be used to describe the solution to the mild NTE. Also in this section, we introduce a second mild equation, (2.13), whose eigen-solutions give us a sense in which we can characterise solutions to (1.6).

In Section 3, we provide a solution to the newly introduced mild equation (2.13). In addition, we state the main result of this paper (Theorem 3.1) which shows the existence of a lead eigensolution to (2.13).

In Section 4, for comparison, we show how to construct and give meaning to the lead eigensolution to (1.6) in the setting of a functional space. In addition, we show how the two notions of the lead eigensolution, in this and the previous section, agree.

In Section 5, we give the proof of the main result of Section 3. Finally, we conclude in Section 6 with some discussion concerning the relevance of such results to previous work and Monte-Carlo methods.

2 Formulations of the NTE and associated eigenfunctions

As alluded to in the introduction, there are two principal ways in which the NTE is formulated. In this section, we will elaborate on them in a little more mathematical detail for later convenience and context of our main results.

2.1 Abstract Cauchy Problem (ACP)

Following e.g. [6, 7, 15], we want to formulate (1.1) in the function space $L_{2}(D\times V)$ . The so-called (initial-value) abstract Cauchy problem (ACP) takes the form

\dfrac{\partial{{u}}_{t}}{\partial t}=\mathcal{A}{{u}}_{t}\quad\text{ and }\quad{{u}}_{0}=g,

(2.1)

where $\mathcal{A}={\mathcal{T}}+{\mathcal{S}}+{\mathcal{F}}$ and ${{u}}_{t}$ belongs to the space $L_{2}({D}\times V)$ , for $t\geq 0$ (in particular $g\in L_{2}({D}\times V)$ ). Specifically, $({{u}}_{t},t\geq 0)$ is continuously differentiable in the space $L_{2}({D}\times V)$ , meaning there exists a $\dot{{u}}_{t}\in L_{2}({D}\times V)$ , which is time-continuous in $L_{2}({D}\times V)$ with respect to $\lVert\cdot\rVert_{2}$ and such that $\lim_{h\to 0}h^{-1}({u}_{t+h}-{u}_{t})=\dot{{u}}_{t}$ for all $t\geq 0$ . Necessarily, the solution to (2.1) forms a $c_{0}$ -semigroup³³3Recall that a $c_{0}$ -semigroup $(\texttt{V}_{t},t\geq 0)$ also goes by the name of a strongly continuous semigroup and, in the present context, this means it has has the properties that (i) $\texttt{V}_{0}={\rm Id}$ , (ii) $\texttt{V}_{t+s}[g]=\texttt{V}_{t}[\texttt{V}_{s}[g]]$ , for all $s,t\geq 0$ , $g\in L_{2}({D}\times V)$ and (iii) for all $g\in L_{2}({D}\times V)$ , $\lim_{h\to 0}\lVert\texttt{V}_{h}[g]-g\rVert_{2}=0$ .. Moreover, ${\rm Dom}(\mathcal{A}):=\{g\in L_{2}(D\times V):\upsilon\cdot\nabla g\in L_{2}(D\times V)\text{ and }g|_{\partial D^{+}}=0\}$ is the domain of $\mathcal{A}$ and ${{u}}_{t}\in{\rm Dom}(\mathcal{A})$ for all $t\geq 0$ .

Theorem 2.1.

Suppose (H1) holds. For $g\in L_{2}({D}\times V),$ the unique solution to (2.1) is given by $(\texttt{\emph{V}}_{t},t\geq 0)$ , the $c_{0}$ -semigroup generated by $(\mathcal{A},{\rm Dom}(\mathcal{A}))$ , i.e. the orbit $\texttt{\emph{V}}_{t}[g]:=\exp(t\mathcal{A})g$ .

In the ACP setting, the notion of an eigenpair $(\lambda,\varphi)$ is well formulated on $L_{2}(D\times V)$ via (1.4). Equivalently, it means we are looking for $\varphi\in L^{+}_{2}(D\times V)$ and $\lambda$ such that $\texttt{V}_{t}[\varphi]={\rm e}^{\lambda t}\varphi$ on $L^{+}_{2}(D\times V)$ , for all $t\geq 0$ . The sense in which we mean that $\lambda$ is a ‘leading’ eigenvalue roughly boils down it corresponding to the eigenvalue in the spectrum of the operator $\mathcal{A}$ on $L_{2}(D\times V)$ with the largest real part (and, as usual, it is real valued itself), and moreover, its associated eigenfunction $\varphi$ is non-negative. As such, one expects the existence of a non-negative left eigenfunction $\tilde{\varphi}$ (e.g. in the sense that $\langle\tilde{\varphi},\texttt{V}_{t}[g]\rangle={\rm e}^{\lambda t}\langle\tilde{\varphi},g\rangle$ for $t\geq 0$ ) such that

\lVert{\rm e}^{-\lambda t}\texttt{V}_{t}[g]-\langle\tilde{\varphi},g\rangle\varphi\rVert_{2}=o({\rm e}^{-\lambda t}),

(2.2)

as $t\to\infty$ . Here, we use the notation $\langle f,g\rangle=\int_{D\times V}f(r,\upsilon)g(r,\upsilon){\textnormal{d}}r{\textnormal{d}}\upsilon$ , so that $\lVert\cdot\rVert_{2}=\langle\cdot,\cdot\rangle^{1/2}$ . Precise results of this nature can be found in [7, 15, 5].

2.2 Neutron branching process (NBP) and the mild NTE

We recall the neutron branching process (NBP) defined in [10], which at time $t\geq 0$ is represented by a configuration of particles which are specified via their physical location and velocity in $D\times V$ , say $\{(r_{i}(t),\upsilon_{i}(t)):i=1,\dots,N_{t}\}$ , where $N_{t}$ is the number of particles alive at time $t\geq 0$ . The NBP is then given by the empirical distribution of these configurations,

X_{t}(A)=\sum_{i=1}^{N_{t}}\delta_{(r_{i}(t),\upsilon_{i}(t))}(A),\qquad A\in\mathcal{B}(D\times V),\;t\geq 0,

(2.3)

where $\delta$ is the Dirac measure, defined on $\mathcal{B}(D\times V)$ , the Borel subsets of $D\times V$ .

The evolution of $(X_{t},t\geq 0)$ is a stochastic process valued in the space of atomic measures $\mathcal{M}(D\times V):=\{\textstyle{\sum_{i=1}^{n}}\delta_{(r_{i},\upsilon_{i})}:n\in\mathbb{N},(r_{i},\upsilon_{i})\in D\times V,i=1,\cdots,n\}$ which evolves randomly as follows.

A particle positioned at $r$ with velocity $\upsilon$ will continue to move along the trajectory $r+\upsilon t$ , until one of the following things happens.

(i)

The particle leaves the physical domain $D$ , in which case it is instantaneously killed.
(ii)

Independently of all other neutrons, a scattering event occurs when a neutron comes in close proximity to an atomic nucleus and, accordingly, makes an instantaneous change of velocity. For a neutron in the system with position and velocity $(r,\upsilon)$ , if we write $T_{\texttt{s}}$ for the random time that scattering may occur, then independently of any other physical event that may affect the neutron, $\Pr(T_{\texttt{s}}>t)=\exp\{-\textstyle{\int_{0}^{t}}\sigma_{\texttt{s}}(r+\upsilon s,\upsilon){\rm d}s\},$ for $t\geq 0.$

When scattering occurs at space-velocity $(r,\upsilon)$ , the new velocity $\upsilon^{\prime}\in V$ is selected independently with probability $\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}$ .

(iii)

Independently of all other neutrons, a fission event occurs when a neutron smashes into an atomic nucleus. For a neutron in the system with initial position and velocity $(r,\upsilon)$ , if we write $T_{\texttt{f}}$ for the random time that scattering may occur, then, independently of any other physical event that may affect the neutron, $\Pr(T_{\texttt{f}}>t)=\exp\{-\textstyle{\int_{0}^{t}}\sigma_{\texttt{f}}(r+\upsilon s,\upsilon){\rm d}s\},$ for $t\geq 0.$

When fission occurs, the smashing of the atomic nucleus produces lower mass isotopes and releases a random number of neutrons, say $N\geq 0$ , which are ejected from the point of impact with randomly distributed, and possibly correlated, velocities, say $\{\upsilon_{i}:i=1,\cdots,N\}$ . The outgoing velocities are described by the atomic random measure

\mathcal{Z}(A):=\sum_{i=1}^{N}\delta_{\upsilon_{i}}(A),\qquad A\in\mathcal{B}(V).

(2.4)

If such an event occurs at location $r\in\mathbb{R}^{d}$ from a particle with incoming velocity $\upsilon\in{V}$ , we denote by ${\mathcal{P}}_{(r,\upsilon)}$ the law of $\mathcal{Z}$ . The probabilities ${\mathcal{P}}_{(r,\upsilon)}$ are such that, for $\upsilon^{\prime}\in{V}$ , for bounded and measurable $g:V\to[0,\infty)$ ,

\displaystyle\int_{V}g(\upsilon^{\prime})\pi_{\texttt{f}}(r,v,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}

\displaystyle={\mathcal{E}}_{(r,\upsilon)}\left[\int_{V}g(\upsilon^{\prime})\mathcal{Z}({\textnormal{d}}\upsilon^{\prime})\right]=:{\mathcal{E}}_{(r,\upsilon)}[\langle g,\mathcal{Z}\rangle].

(2.5)

Note, the possibility that $\Pr(N=0)>0$ , which will be tantamount to neutron capture (that is, where a neutron slams into a nucleus but no fission results and the neutron is absorbed into the nucleus).

The NBP is thus parameterised by the quantities $\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}}$ and the family of measures ${\mathcal{P}}=({\mathcal{P}}_{(r,\upsilon)},r\in D,\upsilon\in V)$ and accordingly we refer to it as a $(\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}},\mathcal{P})$ -NBP. It is associated to the NTE via the relation (2.5), and, although a $(\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}},\mathcal{P})$ -NBP is uniquely defined, a NBP specified by $(\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}},\pi_{\texttt{f}})$ alone is not. Nonetheless, it is easy to show that for a given $\pi_{\texttt{f}}$ , a $(\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}},\mathcal{P})$ -NBP always exists which satisfies (2.5). See the discussion in Section 2 of [10].

Define

\psi_{t}[g](r,\upsilon):=\mathbb{E}_{\delta_{(r,\upsilon)}}[\langle g,X_{t}\rangle],\qquad t\geq 0,r\in\bar{D},\upsilon\in{V},

(2.6)

where $\mathbb{P}_{\delta_{(r,\upsilon)}}$ is the law of $X$ initiated from a single particle with configuration $(r,\upsilon)$ , and $g\in L^{+}_{\infty}(D\times V)$ , the space of non-negative uniformly bounded measurable functions on $D\times V$ . Here we have made a slight abuse of notation (see $\langle\cdot,\cdot\rangle$ as it appears in (2.5)) and written $\langle g,X_{t}\rangle$ to mean $\textstyle{\int_{D\times V}}g(r,\upsilon)X_{t}({\textnormal{d}}r,{\textnormal{d}}\upsilon)$ . The following result was shown in [5, 10, 7, 6].

Theorem 2.2.

Suppose (H1) and (H2) hold. For $g\in L^{+}_{\infty}(D\times V)$ , the space of non-negative and uniformly bounded measurable functions on $D\times V$ , there exist constants $C_{1},C_{2}>0$ such that $\psi_{t}[g]$ , as given in (2.6), is uniformly bounded by $C_{1}\exp(C_{2}t)$ , for all $t\geq 0$ . Moreover, $(\psi_{t}[g],t\geq 0)$ is the unique solution, which is bounded in time, to the so-called mild equation

\psi_{t}[g]=\emph{{U}}_{t}[g]+\int_{0}^{t}\emph{{U}}_{s}[({\emph{{S}}}+{\emph{{F}}})\psi_{t-s}[g]]{\textnormal{d}}s,\qquad t\geq 0,

(2.7)

for which (1.2) holds, where the deterministic evolution $\emph{{U}}_{t}[g](r,\upsilon)=g(r+\upsilon t,\upsilon)\mathbf{1}_{\{t<\kappa^{D}_{r,\upsilon}\}},t\geq 0,$ with $\kappa_{r,\upsilon}^{D}:=\inf\{t>0:r+\upsilon t\not\in D\},$ represents the advection semigroup associated with a single neutron travelling at velocity $\upsilon$ from $r$ at $t=0$ .

In [5] the below result was shown, which demonstrates the context in which the mild solution to the NTE and the ACP can be seen to coincide.

Theorem 2.3.

Suppose (H1) and (H2) hold. If $g\in L^{+}_{\infty}(D\times V)$ and if $(\psi_{t}[g],t\geq 0)$ is understood as the solution to the mild equation (2.7), then for $t\geq 0$ , $\emph{{V}}_{t}[g]=\psi_{t}[g]$ on $L^{+}_{2}(D\times V)$ , i.e. $\lVert{\texttt{\emph{V}}}_{t}[g]-\psi_{t}[g]\rVert_{2}=0$ .

In the probabilistic setting, the meaning of (1.4) can be interpreted as looking for a pair $\lambda$ and $\varphi$ such that, pointwise on $D\times V$ , $\psi_{t}[\varphi]={\rm e}^{\lambda t}\varphi$ , for $t\geq 0$ . As alluded to in (1.3), we have a similar asymptotic to (2.2), which isolates the eigenpair $(\lambda,\varphi)$ in its limit. The notion of ‘leading’ in the probabilistic setting is less obvious, however, due to Theorem 2.3, the eigenpairs that emerge from the mild setting and the weak linear PIDE setting should in principle agree on $L_{2}(D\times V)$ . This is discussed with greater precision in [5, 10].

2.3 Neutron random walk (NRW)

There is a second stochastic representation of the unique solution to (2.7), which makes use of the so-called neutron random walk (NRW). This process takes values in $D\times V$ and is defined by its scatter rates, $\alpha(r,\upsilon)$ , $r\in D,\upsilon\in V$ , and scatter probability densities $\pi(r,\upsilon,\upsilon^{\prime})$ , $r\in D,\upsilon,\upsilon^{\prime}\in V$ . When issued with a velocity $\upsilon$ , the NRW will propagate linearly with that velocity until either it exits the domain $D$ , in which case it is killed, or at the random time $T_{\texttt{s}}$ a scattering occurs, where $\Pr(T_{\texttt{s}}>t)=\exp\{-\textstyle{\int_{0}^{t}}\alpha(r+\upsilon t,\upsilon){\rm d}s\},$ for $t\geq 0.$ When the scattering event occurs at position-velocity configuration $(r,\upsilon)$ , a new velocity $\upsilon^{\prime}$ is selected with probability $\pi(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}$ . If we denote by $(R,\Upsilon)=((R_{t},\Upsilon_{t}),t\geq 0)$ , the position-velocity of the resulting continuous-time random walk on $D\times V$ with an additional cemetery state for when it leaves the domain $D$ , it is easy to show that $(R,\Upsilon)$ is a Markov process. We call the process $(R,\Upsilon)$ an $\alpha\pi$ -NRW.

Given a NBP defined by $\sigma_{\texttt{s}}$ , $\sigma_{\texttt{f}}$ , $\pi_{\texttt{s}}$ and $\mathcal{P}$ , set

\displaystyle\alpha(r,\upsilon)\pi(r,\upsilon,\upsilon^{\prime})=\sigma_{\texttt{s}}(r,\upsilon)\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime})+\sigma_{\texttt{f}}(r,\upsilon)\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime})\qquad r\in D,\upsilon,\upsilon^{\prime}\in V.

(2.8)

and

\beta(r,\upsilon)=\sigma_{\texttt{f}}(r,\upsilon)\left(\int_{V}\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}-1\right).

(2.9)

The following result, given in [5], gives the so-called many-to-one representation of solution to the NTE in the form (2.7).

Lemma 2.1.

Suppose (H1) and (H2) hold, we have that

\psi_{t}[g](r,\upsilon)=\mathbf{E}_{(r,\upsilon)}\left[{\rm e}^{\int_{0}^{t}\beta(R_{s},\Upsilon_{s}){\rm d}s}g(R_{t},\Upsilon_{t})\mathbf{1}_{\{t<\tau^{D}\}}\right],\qquad t\geq 0,r\in D,\upsilon\in V,

(2.10)

is a second representation of the unique mild solution (in the sense of Theorem 2.2) of the NTE (2.7), where $\tau^{D}=\inf\{t>0:R_{t}\not\in D\}$ and ${\bf P}_{(r,v)}$ for the law of the $\alpha\pi$ -NRW starting from a single neutron with configuration $(r,\upsilon)$ .

2.4 Neutron generational process (NGP)

In order to solve the $k$ -eigenvalue problem (1.6), it turns out that $(\psi_{t},t\geq 0)$ and $(\phi_{t},t\geq 0)$ are not the right objects to work with on account of their time-dependency. We now consider a generational model of the NBP. We can think of each line of descent in the sequence of neutron creation as genealogies. In place of $(X_{t},t\geq 0)$ , we consider the process $(\mathcal{X}_{n},n\geq 0)$ , where, for $n\geq 1$ , $\mathcal{X}_{n}$ is $\mathcal{M}(D\times V)$ -valued and can be written $\mathcal{X}_{n}=\sum_{i=1}^{\mathcal{N}_{n}}\delta_{(r^{(n)}_{i},\upsilon^{(n)}_{i})}$ , where $\{(r^{(n)}_{i},\upsilon^{(n)}_{i}),i=1,\cdots\mathcal{N}_{n}\}$ are the position-velocity configurations of the $\mathcal{N}_{n}$ particles that are $n$ -th in their genealogies to be the result of a fission event. $\mathcal{X}_{0}$ is consistent with $X_{0}$ and is the initial configuration of neutron positions and velocities. As such, for $n\geq 1$ we can think of $\mathcal{X}_{n}$ as the $n$ -th generation of the system and we refer to them as the neutron generational process (NGP). The reader who is more experienced with the theory of branching processes will know $\mathcal{X}_{n}$ to be an example of what is called a stopping line; see [12].

Appealing to the obvious meaning of $\langle g,\mathcal{X}_{n}\rangle$ , define the expectation semigroup $(\Psi_{n},n\geq 0)$ by

\Psi_{n}[g](r,\upsilon)=\mathbb{E}_{\delta_{(r,\upsilon)}}\left[\langle g,\mathcal{X}_{n}\rangle\right],\qquad n\geq 0,r\in D,\upsilon\in V,

(2.11)

with $\Psi_{0}[g]:=g\in L_{\infty}^{+}(D\times V)$ . The main motivation for introducing the NGP is that, just as we have seen that the meaning of (1.4) can be phrased in terms of a multiplicative invariance with respect to the solution of an ACP (2.1) or of the mild equation (2.7), we want to identify the eigen-problem (1.6) in terms of the semigroup above.

To this end, let us introduce the problem of finding a pair $k>0$ and $\phi\in L^{+}_{\infty}(D\times V)$ such that, pointwise,

\Psi_{1}[\phi](r,\upsilon)=k\phi(r,\upsilon),\qquad r\in D,\upsilon\in V.

(2.12)

In the next section we will show the existence of a solution to (2.12) which also plays an important role in the asymptotic behaviour of $\Psi_{n}$ as $n\to\infty$ . Before getting there, let us give a heuristic argument as to why (2.12) is another form of the eigenvalue problem (1.6).

By splitting on the first fission event, $\Psi_{n}$ solves the following mild equation

\Psi_{n}[g](r,\upsilon)=\int_{0}^{\infty}\texttt{Q}_{s}\left[{\mathcal{F}}\Psi_{n-1}[g]\right](r,\upsilon){\rm d}s,\qquad r\in D,\upsilon\in V,g\in L_{\infty}^{+}(D\times V),

(2.13)

where $(\texttt{Q}_{s},s\geq 0)$ is the expectation semigroup associated with the operator $\mathcal{T}+\mathcal{S}$ . More precisely,

\texttt{Q}_{s}[g](r,\upsilon)=\mathbb{E}_{\delta_{(r,\upsilon)}}\left[{\rm e}^{-\int_{0}^{s}\sigma_{\texttt{f}}(R_{u},\Upsilon_{u}){\rm d}u}g(R_{s},\Upsilon_{s})\mathbf{1}_{(s<\tau_{D})}\right],

(2.14)

where $(R_{s},\Upsilon_{s})_{s\geq 0}$ is the $\sigma_{\texttt{s}}\pi_{\texttt{s}}$ -NRW. Then, if the pair $(k,\phi)$ solves (2.12), the strong Markov property along with an iteration implies that

k^{n}\phi(r,\upsilon)=\Psi_{n}[\phi](r,\upsilon),\qquad r\in D,\upsilon\in V.

Using it in (2.13) and dividing through by $k^{n}$ yields

\phi(r,\upsilon)=\int_{0}^{\infty}\texttt{Q}_{s}\left[\frac{1}{k}\mathcal{F}\phi\right](r,\upsilon){\rm d}s.

(2.15)

Now set

V_{t}\coloneqq\int_{0}^{t}\texttt{Q}_{s}\left[g\right](r,\upsilon){\rm d}s.

Then, heuristically speaking, since Q is associated to the generator $\mathcal{T}+\mathcal{S}$ , classical Feynman-Kac theory suggests that $V_{t}$ ‘solves’ the equation

\frac{\partial V_{t}}{\partial t}=(\mathcal{T}+\mathcal{S})V_{t}+g.

with $V_{0}=0$ . Note that $\partial V_{t}/\partial t=\texttt{Q}_{t}[g]$ , which tends to zero as $t\to\infty$ thanks to the transience of $(R,\Upsilon)$ . Hence, taking $g={k}^{-1}\mathcal{F}\phi$ , letting $t\to\infty$ in the above equation, recalling that $(\texttt{Q}_{s},s\geq 0)$ is the expectation semigroup associated with the operator $\mathcal{T}+\mathcal{S}$ , and using the identity (2.15) yields

0=(\mathcal{T}+\mathcal{S})\phi+\frac{1}{k}\mathcal{F}\phi.

3 Probabilistic solution to (1.6)

In this section we state our main result regarding the existence of the eigenvalue and eigenfunction as specified by (2.12). We are once more motivated by the ideas presented in [3] and will use some of the techniques that were further developed in [10].

We start by constructing the many-to-one formula that is associated to the semigroup $(\Psi_{n},n\geq 0)$ in the spirit of the two representations of $(\psi_{t},t\geq 0)$ given in Sections 2.2 and 2.3. In this case it takes a slightly different form to the one in the time-dependent case. For ease of notation, let

m(r,\upsilon)\coloneqq\int_{V}\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\rm d}\upsilon^{\prime},

denote the mean number of offspring generated by a fission event at $(r,\upsilon)$ , and let $(T_{n},n\geq 1)$ denote the time of the scatter event in the $\alpha\pi$ -NRW that corresponds to the $n$ -th fission event in the corresponding NBP, $X$ .

More formally, referring to, (2.8), we can think of the $\alpha\pi$ -NRW at each scatter event as follows. For $k\geq 1$ , when the NRW $(R,\Upsilon)$ scatters for the $k$ -th time at $(r,\upsilon)$ (with rate $\alpha(r,\upsilon)$ ), a coin is tossed and the random variable $\texttt{I}_{k}(r,\upsilon)$ takes the value $1$ with probability ${\sigma_{\texttt{f}}(r,\upsilon)m(r,\upsilon)}/{({\sigma_{\texttt{s}}(r,\upsilon)+\sigma_{\texttt{f}}(r,\upsilon)m(r,\upsilon)})}$ and its new velocity, is selected according to an independent copy of the random variable $\Theta^{\texttt{f}}_{k}(r,\upsilon)$ , whose distribution has probability density $\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime})/m(r,\upsilon)$ . On the other hand, with probability ${\sigma_{\texttt{s}}(r,\upsilon)}/({\sigma_{\texttt{s}}(r,\upsilon)+\sigma_{\texttt{f}}(r,\upsilon)m(r,\upsilon)})$ the random variable $\texttt{I}_{k}(r,\upsilon)$ takes the value $0$ and its new velocity, is selected according to an independent copy of the random variable $\Theta^{\texttt{s}}_{k}(r,\upsilon)$ , whose distribution has probability density $\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime})$ . As such, the velocity immediately after the $n$ -th scatter of the NRW, given that the position-velocity configuration immediately before is $(r,\upsilon)$ , is coded by the random variable

\texttt{I}_{k}(r,\upsilon)\Theta^{\texttt{f}}_{k}(r,\upsilon)+(1-\texttt{I}_{k}(r,\upsilon))\Theta^{\texttt{s}}_{k}(r,\upsilon).

We thus can identify sequentially, $T_{0}=0$ and, for $n\geq 1$ ,

T_{n}=\inf\{t>T_{n-1}:\Upsilon_{t}\neq\Upsilon_{t-}\text{ and }\texttt{I}_{k_{t}}(R_{t},\Upsilon_{t-})=1\},

where $(k_{t},t\geq 0)$ is the process counting the number of scattering events of the NRW up to time $t$ .

Note, for the above construction of indicators to make sense, we should at least have some region of space for which fission can take place. As such the assumption (H3) becomes relevant here. Analogously to Lemma 2.1, we have the following many-to-one formula associated with the NBP.

Lemma 3.1.

Suppose (H1), (H2) and (H3) hold. The solution to (2.13) among the class of expectation semigroups is unique for $g\in L_{\infty}^{+}(D\times V)$ and the semigroup $(\Psi_{n},n\geq 0)$ may alternatively be represented⁴⁴4Here, we define $\prod_{i=1}^{0}\cdot\coloneqq 1$ . as

\Psi_{n}[g](r,\upsilon)=\mathbf{E}_{(r,\upsilon)}\left[\prod_{i=1}^{n}m(R_{T_{i}},\Upsilon_{T_{i}-})g(R_{T_{n}},\Upsilon_{T_{n}})\mathbf{1}_{(T_{n}<\kappa^{D})}\right],\qquad r\in D,\upsilon\in V,n\geq 1,

(3.1)

(with $\Psi_{0}[g]=g$ ), where $(R_{t},\Upsilon_{t})_{t\geq 0}$ is the $\alpha\pi$ -NRW, and

\kappa^{D}\coloneqq\inf\{t>0:R_{t}\notin D\}.

Proof.

We first note that the sequence $(\Psi_{n},n\geq 0)$ as defined in (3.1) is a semigroup since, due to the strong Markov property, we have

	$\displaystyle\Psi_{n+m}[g](r,\upsilon)$
	$\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[\mathbf{E}\left[\prod_{i=1}^{n+m}m(R_{T_{i}},\Upsilon_{T_{i}-})g(R_{T_{n+m}},\Upsilon_{T_{n+m}})\mathbf{1}_{(T_{n+m}<\kappa^{D})}\bigg{\|}\mathcal{F}_{n}\right]\right]$
	$\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[\prod_{i=1}^{n}m(R_{T_{i}},\Upsilon_{T_{i}-})\mathbf{E}_{(R_{T_{n}},\Upsilon_{T_{n}})}\left[\prod_{i=1}^{m}m(R_{T_{i}},\Upsilon_{T_{i}-})g(R_{T_{m}},\Upsilon_{T_{m}})\mathbf{1}_{(T_{m}<\kappa^{D})}\right]\mathbf{1}_{(T_{n}<\kappa^{D})}\right]$
	$\displaystyle=\Psi_{n}[\Psi_{m}[g]](r,\upsilon),\qquad r\in D,\upsilon\in V.$

In order to show that $\Psi_{n}$ as defined in (3.1) does indeed solve (2.13), we consider the process at time $T_{1}$ . Before doing so, we first note that the $\alpha\pi$ -NRW is exactly the same as the $\sigma_{\texttt{s}}\pi_{\texttt{s}}$ -NRW over the time interval $[0,T_{1})$ and, at time $T_{1}$ , the velocity of the former is chosen according to the expectation operator

\tilde{\mathcal{F}}[g](r,\upsilon)\coloneqq\int_{V}g(r,\upsilon^{\prime})\frac{\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime})}{m(r,\upsilon)}{\rm d}\upsilon^{\prime}.

Then, applying the strong Markov property at time $T_{1}$ ,

	$\displaystyle\Psi_{n}[g](r,\upsilon)$	$\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[\prod_{i=1}^{n}m(R_{T_{i}},\Upsilon_{T_{i}-})g(R_{T_{n}},\Upsilon_{T_{n}})\mathbf{1}_{(T_{n}<\kappa^{D})}\right]$
		$\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[m(R_{T_{1}},\Upsilon_{T_{1}-})\tilde{\mathcal{F}}[\Psi_{n-1}[g]](R_{T_{1}},\Upsilon_{T_{1}-})\mathbf{1}_{(T_{1}<\kappa^{D})}\right]$
		$\displaystyle=\int_{0}^{\infty}\mathbf{E}_{(r,\upsilon)}\left[\sigma_{\texttt{f}}(R_{s},\Upsilon_{s}){\rm e}^{-\int_{0}^{s}{\sigma_{\texttt{f}}(R_{u},\Upsilon_{u}){\rm d}u}}m(R_{s},\Upsilon_{s-})\tilde{\mathcal{F}}[\Psi_{n-1}[g]](R_{s},\Upsilon_{s-})\mathbf{1}_{(s<\kappa^{D})}\right]{\rm d}s$
		$\displaystyle=\int_{0}^{\infty}\texttt{Q}_{s}[\mathcal{F}\Psi_{n-1}[g]](r,\upsilon){\rm d}s,$

where the final equality follows from the fact that $m\sigma_{\texttt{f}}\tilde{\mathcal{F}}=\mathcal{F}$ .

It remains to show that (2.13) has a unique solution for $g\in L_{\infty}^{+}(D\times V)$ among the class of expectation semigroups, suppose that $(\Psi_{n}^{\prime},n\geq 0)$ is another such solution with $\Psi_{0}^{\prime}=g\in L^{+}_{\infty}(D\times V)$ . Define $\Phi_{n}=\Psi_{n}-\Psi_{n}^{\prime}$ , for $n\geq 0$ , and note by linearity that $(\Phi_{n},n\geq 0)$ is another expectation semigroup with $\Phi_{0}=0$ . Moreover, by linearity $(\Phi_{n},n\geq 0)$ also solves (2.13). On account of this, it is straightforward to see by induction that if $\Phi_{n}=0$ then $\Phi_{n+1}=0$ . The uniqueness of (2.13) in the class of expectation semigroups thus follows. ∎

The next result will provide the existence of a solution to (2.12) by working directly with a variant of the semigroup $(\Psi_{n},n\geq 0)$ . To this end, note that, under the assumption (H4), for non-negative functions $g$ that are bounded by one, say, we have

\mathbb{E}_{\delta_{(r,\upsilon)}}\left[\langle g,\mathcal{X}_{1}\rangle\right]\leq\|g\|_{\infty}\mathbb{E}_{\delta_{(r,\upsilon)}}\left[\langle 1,\mathcal{X}_{1}\rangle\right]\leq N_{\texttt{max}}.

(3.2)

Dividing both sides of the above inequality yields a sub-Markovian semigroup. Indeed,

$\displaystyle\Psi^{\dagger}_{n}[g](r,\upsilon)$	$\displaystyle\coloneqq N_{\texttt{max}}^{-n}\Psi_{n}[g](r,\upsilon)$
	$\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[\prod_{i=1}^{n}\frac{m(R_{T_{i}},\Upsilon_{T_{i}-})}{N_{\texttt{max}}}g(R_{T_{n}},\Upsilon_{T_{n}})\mathbf{1}_{(T_{n}<\kappa^{D})}\right]$
	$\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[g(R_{T_{n}},\Upsilon_{T_{n}})\mathbf{1}_{(T_{n}<\kappa^{D},\,n<\Gamma)}\right]$
	$\displaystyle\eqqcolon\mathbf{E}_{(r,\upsilon)}^{\dagger}\left[g(R_{T_{n}},\Upsilon_{T_{n}})\right],$	(3.3)

where $\Gamma=\min\{n\geq 0:\texttt{K}_{n}(R_{T_{n}},\Upsilon_{T_{n}-})=1\}$ where, for $n\geq 0$ , $r\in D$ and $\upsilon\in V$ , the random variable $\texttt{K}_{n}(r,\upsilon)$ is an independent indicator random variable which is equal to 0 with probability ${m(r,\upsilon)}/{N_{\texttt{max}}}$ (note, from the assumptions in Section 1, it follows that $\sup_{r\in D,\upsilon\in V}m(r,\upsilon)\leq N_{\texttt{max}}$ ).

We are now ready to state the main result of this section, and indeed the article. As its proof is quite lengthy we will delay it until Section 5. We will need the following stronger assumption of (H3):

(H3)^∗: The fission cross section satisfies $\textstyle{\inf_{r\in D,\upsilon,\upsilon^{\prime}\in V}\sigma_{\texttt{f}}(r,\upsilon)\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime})>0}$ .

Theorem 3.1.

Under the assumptions (H1), (H3)^∗ and (H4), for the semigroup $(\Psi_{n},n\geq 0)$ identified by (2.13), there exist $k_{*}\in\mathbb{R}$ , a positive⁵⁵5To be precise, by a positive eigenfunction, we mean a mapping from $D\times V\to(0,\infty)$ . This does not prevent it being valued zero on $\partial D$ , as $D$ is open. right eigenfunction $\varphi\in L^{+}_{\infty}(D\times V)$ and a left eigenmeasure, $\eta$ , on $D\times V$ , both having associated eigenvalue $k_{*}^{n}$ . Moreover, $k_{*}$ is the leading eigenvalue in the sense that, for all $g\in L^{+}_{\infty}(D\times V)$ ,

\langle\eta,\Psi_{n}[g]\rangle=k_{*}^{n}\langle\eta,g\rangle\quad\text{(resp. }\Psi_{n}[\varphi]=k_{*}^{n}\varphi\text{)}\quad n\geq 0,

(3.4)

and there exists $\gamma>1$ such that, for all $g\in L^{+}_{\infty}(D\times V)$ ,

\sup_{g\in L_{\infty}^{+}(D\times V):\lVert g\rVert_{\infty}\leq 1}\left\|k_{*}^{-n}{\varphi}^{-1}{\Psi_{n}[g]}-\langle\eta,g\rangle\right\|_{\infty}=O(\gamma^{-n})\text{ as }n\rightarrow+\infty.

(3.5)

4 Classical existence of solution to (1.6)

Our objective here is to make rigorous the sense in which solving (2.12) is consistent with solving the eigenvalue problem (1.6) in the classical sense.

We begin by considering the abstract Cauchy problem (ACP) on $L_{2}(D\times V)$ ,

\left\{\begin{array}[]{rl}\dfrac{\partial}{\partial t}u_{t}&=({\mathcal{T}}+{\mathcal{S}})u_{t}\\ u_{0}&=g.\end{array}\right.

(4.1)

Then, just as in the spirit of Theorems 2.1 and 2.3, it is not difficult to show that the operator $(\mathcal{T}+\mathcal{S},{\rm Dom}(\mathcal{T}+\mathcal{S}))$ generates a unique solution to (4.1) via the $c_{0}$ -semigroup $(\mathcal{V}_{t},t\geq 0)$ given by

\mathcal{V}_{t}[g]\coloneqq{\rm exp}(t(\mathcal{T}+\mathcal{S}))g,

on $L_{2}(D\times V)$ (and hence for $g\in L_{2}(D\times V)$ ). Moreover, we have that the expectation semigroup $(\texttt{Q}_{t},t\geq 0)$ agrees with $(\mathcal{V}_{t},t\geq 0)$ on $L_{2}(D\times V)$ , providing $g\in L^{+}_{\infty}(D\times V)$ . This latter claim follows the same idea as the proof of Theorem 8.1 in [5].

The equivalence of the semigroups $(\texttt{Q}_{t},t\geq 0)$ and $(\mathcal{V}_{t},t\geq 0)$ is what we will use to identify a classical (in the $L_{2}$ -sense) and probabilistic meaning to (1.6). We start by showing the classical existence of a solution to (1.6) on $L_{2}(D\times V)$ . We note that this problem has been previously considered in [14, 15]. In [14], the author converted the criticality problem (1.6) into a time-dependent problem in order to exploit the existing theory for time-dependent problems, whereas the methods used in [15, Section 5.11] are similar to those presented in [5]. Another more restrictive version of assumption (H2) is needed, which also implies that (H3) holds:

(H5): We have $\sigma_{\emph{{s}}}(r,\upsilon)\pi_{\emph{{s}}}(r,\upsilon,\upsilon^{\prime})>0$ and $\sigma_{\emph{{f}}}(r,\upsilon)\pi_{\emph{{f}}}(r,\upsilon,\upsilon^{\prime})>0$ on $D\times V\times V$ .

Theorem 4.1.

Suppose that the cross sections $\sigma_{\emph{{f}}}\pi_{\emph{{f}}}$ and $\sigma_{\emph{{s}}}\pi_{\emph{{s}}}$ are piecewise continuous⁶⁶6A function is piecewise continuous if its domain can be divided into an exhaustive finite partition (e.g. polytopes) such that there is continuity in each element of the partition. This is precisely how cross sections are stored in numerical libraries for modelling of nuclear reactor cores.. Further, assume that (H1) and (H5) hold. Then there exist a real eigenvalue $k>0$ and associated eigenfunction $\phi\in L_{2}^{+}(D\times V)$ such that (1.6) holds on $L_{2}(D\times V)$ . Moreover, $k$ can be explicitly identified as

k=\sup\left\{|\lambda|:(\mathcal{T}+\mathcal{S})\phi+\frac{1}{\lambda}\mathcal{F}\phi=0\text{ for some }\phi\in L_{2}(D\times V)\right\}.

(4.2)

Proof.

We start by considering a related eigenvalue problem. First recall from [5] that, due to the transience of $\mathcal{T}$ on $D$ , there exist constants $M_{1},\omega>0$ such that $\|{\rm e}^{t\mathcal{T}}\|\leq M_{1}{\rm e}^{-\omega t}$ for each $t\geq 0$ . Further, since $\mathcal{S}$ is conservative, there exists $M_{2}>0$ such that⁷⁷7We use the standard definition of operator norm, namely $\lVert A\rVert=\sup_{\lVert f\rVert_{2}\leq 1}\lVert\mathcal{A}f\rVert_{2}$ , where, as before, $\lVert\cdot\rVert_{2}$ is the usual norm on $L_{2}(D\times V)$ . $\|{\rm e}^{t\mathcal{S}}\|\leq M_{2}$ , $t\geq 0$ . Hence $\|\mathcal{V}_{t}\|\leq M{\rm e}^{-\omega t},t\geq 0$ , where $M=M_{1}M_{2}$ . Then, classical semigroup theory [18] gives the existence of the resolvent operator $(\lambda\mathcal{I}-(\mathcal{T}+\mathcal{S}))^{-1}$ for all $\lambda$ such that ${\rm Re}\lambda>-\omega$ , where $\mathcal{I}$ is the identity operator on $L_{2}(D\times V)$ . In particular, the resolvent is well-defined for $\lambda=0$ . Hence, the eigenvalue problem (1.6) is equivalent to

-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\phi=k\phi.

(4.3)

Due to the assumptions (H1) and (H5), almost identical arguments to those given in the proof of [5, Proposition 9.1] show that $-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}$ is a positive, compact, irreducible operator. Concluding in the same way as the aforementioned proposition, de Pagter’s theorem [15, Theorem 5.7] implies that its spectral radius, $r(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F})$ , is strictly positive. It follows from the Krein-Rutman Theorem [5, Theorem 9.1] that $k\coloneqq r(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F})\coloneqq\sup\{|\lambda|:-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\phi=\lambda\phi\text{ for some }\phi\in L_{2}(D\times V)\}$ is the leading eigenvalue of the operator $-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}$ with corresponding positive eigenfunction $\phi$ . ∎

In a similar manner to [5], we are able to provide more information about the structure of the spectrum of the operator $-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}$ .

Proposition 4.1.

Under the assumptions of Theorem 4.1, the part of the spectrum given by $\sigma(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F})\cap\{\lambda:{\rm Re}(\lambda)>0\}$ consists of finitely many eigenvalues with finite multiplicities. In particular, $k$ is both algebraically and geometrically simple⁸⁸8An eigenvalue $\lambda$ associated with an operator $A$ is geometrically simple if dim $({\rm ker}(\lambda I-A))=1$ and algebraically simple if $\sup_{k\geq 1}{\rm dim}({\rm ker}(\lambda I-A)^{k})=1$ .

Proof.

We follow the idea of the proof of [15, Theorem 4.13] and consider the invertibility of the operator $\lambda\mathcal{I}+(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}$ by considering the following problem,

\left(\mathcal{I}+\frac{1}{\lambda}(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\right)f=\frac{1}{\lambda}g,

for $\lambda\in\sigma(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F})\cap\{\lambda:{\rm Re}(\lambda)>0\}$ . Note that this latter set is non-empty on account of the previous theorem.

As stated in the proof of Theorem 4.1, the operator $-{\lambda}^{-1}(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}$ is compact in $L_{2}(D\times V)$ so that by Gohberg-Shmulyan’s Theorem [16], $\left(\mathcal{I}+{\lambda}^{-1}(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\right)^{-1}$ exists except for a finite set of discrete degenerate poles. This implies that $\left(\lambda\mathcal{I}+(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\right)^{-1},\lambda\in\sigma(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F})\cap\{\lambda:{\rm Re}(\lambda)>0\}$ exists except for a finite set of eigenvalues with finite multiplicities.

We now prove that $k$ is a simple eigenvalue of the operator $-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}$ . In order to do so, we need to consider the adjoint eigenvalue problem

\mathcal{F}^{\top}({\mathcal{T}}^{\top}+\mathcal{S}^{\top})^{-1}\phi^{\top}=k^{\top}\phi^{\top},

(4.4)

where ${\mathcal{T}}^{\top}$ denotes the adjoint of ${\mathcal{T}}$ , with similar definitions for $\mathcal{F}^{\top}$ and $\mathcal{S}^{\top}$ .

We first note that, since the operator $\mathcal{T}^{\top}+\mathcal{S}^{\top}$ enjoys similar properties to $\mathcal{T}+\mathcal{S}$ , the same methods as those given in the proof of Theorem 4.2 apply to give existence of a leading eigenvalue $k^{\top}$ and corresponding eigenfunction $\phi^{\top}$ . Now, due to [11, p.184], if $\lambda$ is an isolated eigenvalue of $-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}$ , then its complex conjugate, $\bar{\lambda}$ , is an isolated eigenvalue of the adjoint of $-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}$ with the same multiplicity. Equivalently, for each isolated $\lambda$ solving (1.6) with eigenfunction $\phi$ , $\bar{\lambda}$ solves (4.4) with a corresponding eigenfunction $\phi^{\top}$ and has the same multiplicity as $\lambda$ . In particular, since $k$ is real, it follows that the leading eigenvalue associated with (4.4) is also $k$ . These observations along with similar arguments to those presented in [7, Theorem 7(iii)] and [19] yield geometric simplicity of $k$ . Then straightforward adaptations of the arguments in [7, Remark 12] yield algebraic simplicity. ∎

The next result shows that if we can find a solution to (1.6), then it must necessarily agree with the eigensolution constructed in Theorem 3.1 on $L_{2}(D\times V)$ .

Theorem 4.2.

Suppose the assumptions of Theorem 4.1 are in force⁹⁹9Note that these assumptions imply those required for Theorem 3.1., that $(k_{*},\phi_{*})$ solves (2.12) and $(k,\phi)$ denotes the leading eigensolution to (1.6). Then $k=k_{*}$ , and, up to a positive constant multiple, $\phi$ agrees with $\phi_{*}$ on $L_{2}(D\times V)$ .

Proof.

Recall the semigroup, $(\mathcal{V}_{t})_{t\geq 0}$ , generated by $\mathcal{T}+\mathcal{S}$ and note that, due to boundedness of the operator $\mathcal{F}$ , if $g\in L_{p}(D\times V)$ , then $\mathcal{F}g\in L_{p}(D\times V)$ , $p\in[1,\infty]$ . Thanks to [8, Chapter II, Lemma 1.3], $(\mathcal{V}_{t})_{t\geq 0}$ satisfies

\mathcal{V}_{t}[\mathcal{F}g]=(\mathcal{T}+\mathcal{S})\int_{0}^{t}\mathcal{V}_{s}[\mathcal{F}g]{\rm d}s+\mathcal{F}g.

(4.5)

Letting $t\to\infty$ in the above equation, we obtain

0=(\mathcal{T}+\mathcal{S})\int_{0}^{\infty}\mathcal{V}_{s}[\mathcal{F}g]{\rm d}s+\mathcal{F}g,

(4.6)

which follows from the fact that $(\mathcal{T}+\mathcal{S})$ is a transient operator so that $\lim_{t\to\infty}\mathcal{V}_{t}[g]=0$ . Setting $g=\phi_{*}$ in (4.6) and using the fact that $(\texttt{Q}_{s},s\geq 0)$ and $(\mathcal{V}_{s},s\geq 0)$ agree on $L_{2}(D\times V)$ , providing $g\in L^{+}_{\infty}(D\times V)$ , yields

0=(\mathcal{T}+\mathcal{S})\int_{0}^{\infty}\texttt{Q}_{s}[\mathcal{F}\phi_{*}]{\rm d}s+\mathcal{F}\phi_{*}.

(4.7)

Now taking advantage of (2.12) for $\phi_{*}$ , noting in particular (2.13), we have

\int_{0}^{\infty}\texttt{Q}_{s}[\mathcal{F}\phi_{*}]=\Psi_{1}[\phi_{*}]=k_{*}\phi_{*}.

(4.8)

Substituting this into (4.7) shows that $(k_{*},\phi_{*})$ is a solution to (1.6) on $L_{2}(D\times V)$ .

To conclude the proof, we first show that $k_{*}=k$ . Again, consider the adjoint problem (4.4) and note that

	$\displaystyle 0$	$\displaystyle=\langle(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\phi_{},\phi^{\top}\rangle-\langle\mathcal{F}^{\top}(\mathcal{T}^{\top}+\mathcal{S}^{\top})^{-1}\phi^{\top},\phi_{}\rangle$
		$\displaystyle=(k-k_{})\langle\phi^{\top},\phi_{}\rangle.$

Since $\phi_{*}$ and $\phi^{\top}$ are positive, we must have $k_{*}=k$ . Due to simplicity of $k$ from the previous proposition, it follows that $\phi=\phi_{*}$ up to a multiplicative constant. ∎

5 Proof of Theorem 3.1

As previously stated, our methods of proving Theorem 3.1 are motivated by those used in [10, 3]. The main part of the proof comes from [3, Theorem 2.1], which we restate (in the language of the desired application) here for convenience. To this end, recalling the notation in (3.3), define

\texttt{k}=\Gamma\wedge\min\{n\geq 1:T_{n}\geq\kappa^{D}\}.

Theorem 5.1.

Suppose that (H1), (H3)^∗ and (H4) are in force. Suppose that there exists a probability measure $\nu$ on $D\times V$ such that

(A1)

there exist $n_{0}$ , $c_{1}>0$ such that for each $(r,\upsilon)\in D\times V$ ,

$\mathbf{P}_{(r,\upsilon)}((R_{T_{n_{0}}},\Upsilon_{T_{n_{0}}})\in\cdot\;|n_{0}<\texttt{k})\geq c_{1}\nu(\cdot);$
(A2)

there exists a constant $c_{2}>0$ such that for each $(r,\upsilon)\in D\times V$ and for every $n\geq 0$ ,

$\mathbf{P}_{\nu}(n<\texttt{k})\geq c_{2}\mathbf{P}_{(r,\upsilon)}(n<\texttt{k}).$

Then, there exists $k_{c}\in(0,1)$ such that, there exist an eigenmeasure $\eta$ on $D\times V$ and a positive right eigenfunction $\varphi$ of $\Psi_{n}^{\dagger}$ (defined in (3.3)) with eigenvalue $k_{c}^{n}$ , such that $\eta$ is a probability measure and $\varphi\in L^{+}_{\infty}(D\times V)$ , i.e. for all $g\in L_{\infty}(D\times V)$ ,

\eta[\Psi^{\dagger}_{n}[g]]=k_{c}^{n}\eta[g]\quad\text{and}\quad\Psi^{\dagger}_{n}[\varphi]=k_{c}^{n}\varphi\quad n\geq 0.

(5.1)

Moreover, there exist $C,\gamma>0$ such that, for $g\in L^{+}_{\infty}(D\times V)$ and $n$ sufficiently large (independently of $g$ ),

\left\|k_{c}^{-n}\varphi^{-1}\Psi_{n}^{\dagger}[g]-\eta[g]\right\|_{\infty}\leq C\gamma^{-n}\|g\|_{\infty}.

(5.2)

In particular, setting $g\equiv 1$ , as $n\to\infty$ ,

\left\|k_{c}^{-n}\varphi^{-1}\mathbf{P}_{\cdot}(n<\texttt{k})-1\right\|_{\infty}\leq C\gamma^{-n}.

(5.3)

It is then straightforward to conclude that $\eta$ and $\varphi$ are the left eigenmeasure and right eigenfunction corresponding to the eigenvalue $k_{*}=k_{c}N_{\texttt{max}}$ for the semigroup $\Psi_{n}$ .

We now proceed to the proof of Theorem 5.1. We will use the notation $J_{k}$ to denote the $k^{th}$ scatter event of the random walk $(R,\Upsilon)$ under ${\bf P}^{\dagger}$ and recall that $T_{k}$ denotes the scatter event that corresponds to the $k^{th}$ fission event in the original NBP. The basis of our proof relies on the fact that, for each $k\geq 1$ , $T_{k}=J_{k}$ with positive probability.

A fundamental part of the proof of (A1) and (A2) is the following lemma. We refer the reader to [10, Lemma 7.3] for its proof.

Lemma 5.1.

Under the assumptions of Theorem 5.1, for all $r\in D$ and $\upsilon\in V$ , we have

\mathbf{P}_{(r,\upsilon)}^{\dagger}(J_{7}<\emph{{k}},R_{J_{7}}\in{\rm d}z)\leq C\mathbf{1}_{(z\in D)}\,{\rm d}z,

(5.4)

for some constant $C>0$ , and

\mathbf{P}^{\dagger}_{\nu}(J_{1}<\emph{{k}},R_{J_{1}}\in{\rm d}z)\geq c\mathbf{1}_{(z\in D)}\,{\rm d}z,

(5.5)

for another constant $c>0$ , where $\nu$ is Lebesgue measure on $D\times V$ .

Proof of (A1).

In order to prove (A1), we use similar arguments to those presented in the proof of (5.5). To this end, fix $r_{0}\in D$ and suppose $\Upsilon_{0}$ is uniformly distributed on $V$ . Then, due to the assumptions (H1) and (H3)^∗, the techniques used in [10] to prove (5.5) yield

\mathbf{E}_{(r_{0},\Upsilon_{0})}\left[f(R_{J_{1}})\mathbf{1}_{(T_{1}=J_{1})}\right]\geq C_{0}\int_{D}{\rm d}z\mathbf{1}_{([r,z]\subset D)}f(z).

(5.6)

Recall the (deterministic) quantity $\kappa_{r_{0},\upsilon_{0}}^{D}$ , for $r_{0}\in D$ , $\upsilon_{0}\in V$ , defined in Theorem 2.7. Also note that due to (H3)^∗, $\pi$ is bounded below by a constant (see discussion just before Lemma 7.2 of [10]). Using this, along with the strong Markov property, (H1) and (5.6), we have

	$\displaystyle\mathbb{E}_{(r_{0},\upsilon_{0})}^{\dagger}[f(R_{T_{2}},\Upsilon_{T_{2}})\mathds{1}_{(T_{2}=J_{2})}]$	$\displaystyle\geq C_{1}\int_{0}^{\kappa_{r_{0},\upsilon_{0}}^{D}}{\rm d}s{\rm e}^{-\bar{\alpha}s}\underline{\pi}\int_{V}{\rm d}\upsilon_{1}\mathbf{E}_{(r_{0}+\upsilon_{0}s,\upsilon_{1})}^{\dagger}[f(R_{J_{1}},\Upsilon_{J_{1}})\mathds{1}_{(T_{1}=J_{1})}]$
		$\displaystyle\geq C_{2}\kappa_{r_{0},\upsilon_{0}}^{D}\int_{D}{\rm d}r\int_{V}{\rm d}\upsilon f(r,\upsilon).$		(5.7)

Finally, we note that due to (H1) and (H3)^∗,

\mathbf{P}_{(r_{0},\upsilon_{0})}^{\dagger}(T_{2}<\texttt{k})\leq\mathbf{P}^{\dagger}(J_{1}<\texttt{k})\leq\int_{0}^{\kappa_{r_{0},\upsilon_{0}}^{D}}{\rm d}s\bar{\alpha}{\rm e}^{-\underline{\alpha}s}\leq C_{3}\kappa_{r_{0},\upsilon_{0}}^{D}.

(5.8)

Combining this with (5.7) yields (A1) with $\nu$ as Lebesgue measure on $D\times V$ and $n_{0}=2$ . ∎

We now prove (A2). Again, we use a similar method to the one used in [10], however, we state the proof in full to illustrate where the differences occur.

Proof of A2.

Let $n\geq 7$ and note that $T_{n}-J_{7}\geq T_{n}-T_{7}$ . This and the strong Markov property imply

	$\displaystyle\mathbf{P}_{(r,\upsilon)}(n<\texttt{k})$	$\displaystyle\leq\mathbf{E}_{(r,\upsilon)}^{\dagger}\left[\mathbf{P}_{(R_{J_{7}},\Upsilon_{J_{7}})}\left(n-7<\texttt{k}\right)\right]$
		$\displaystyle\leq C^{\prime}\int_{D}\int_{V}\mathbf{P}_{(z,w)}\left(n-7<\texttt{k}\right){\rm d}z{\rm d}w,$		(5.9)

where we have used Lemma 5.1 to obtain the final inequality.

Now suppose $n\geq 1$ . Recalling the measure $\nu$ from (A1), another application of Lemma 5.1 gives

	$\displaystyle\mathbf{P}_{\nu}(n<\texttt{k})$	$\displaystyle=\mathbf{E}_{\nu}^{\dagger}\left[\mathbf{1}_{(J_{1}<\texttt{k})}\mathbf{P}_{(R_{J_{1}},\Upsilon_{J_{1}})}(n<\texttt{k})\right]$
		$\displaystyle\geq c^{\prime}\int_{D}\int_{V}\mathbf{P}_{(z,w)}(n<\texttt{k}){\rm d}z{\rm d}w.$		(5.10)

Then, for $n\geq 8$ , combining (5.9) and (5.10) yields

\mathbf{P}_{(r,\upsilon)}(n<\texttt{k})\leq\frac{C^{\prime}}{c^{\prime}}\mathbf{P}_{\nu}\left(n-7<\texttt{k}\right).

(5.11)

Now recalling $n_{0}$ from (A1), it follows from (A1) that

\mathbf{P}_{\nu}^{\dagger}((R_{T_{n_{0}}},\Upsilon_{T_{n_{0}}})\in\cdot)\geq c_{1}\mathbf{P}_{\nu}(n_{0}<\texttt{k})\nu(\cdot).

(5.12)

Again, due to assumptions (H1) and (H3)^∗,

\mathbf{P}_{\nu}(n_{0}<\texttt{k})\geq\int_{D\times V}{\bf P}_{(r,\upsilon)}^{\dagger}(T_{n_{0}}=J_{n_{0}},n_{0}<\texttt{k})\nu({\rm d}r,{\rm d}\upsilon)\geq K,

(5.13)

for some constant $K>0$ . Then, for $n\geq 8$ , due to (5.12) and (5.13),

	$\displaystyle\mathbf{P}_{\nu}\left(n-7+n_{0}<\texttt{k}\right)$	$\displaystyle=\mathbf{E}_{\nu}\left[\mathbf{1}_{(n_{0}<\texttt{k})}\mathbf{P}_{(R_{T_{n_{0}}},\Upsilon_{T_{n_{0}}})}\left(n-7<\texttt{k}\right)\right]$
		$\displaystyle\geq Kc_{1}\mathbf{P}_{\nu}\left(n-7<\texttt{k}\right).$		(5.14)

Finally, noting that for $n\geq 1$ we have $n-7+4n_{0}\geq n$ , so that

\mathbf{P}_{\nu}(n<\texttt{k})\geq\mathbf{P}_{\nu}\left(n-7+4n_{0}<\texttt{k}\right),

and applying (5.14) four times implies

\mathbf{P}_{\nu}(n<\texttt{k})\geq(Kc_{1})^{4}\mathbf{P}_{\nu}\left(n-7<\texttt{k}\right).

(5.15)

Combining this with (5.11) yields the result. ∎

6 Concluding remarks

We complete this paper with a number of remarks that reflect on the main theorem here and previous work in [5, 10, 9, 4].

6.1 $\lambda$ -, $k$ - and $c$ -eigenvalue problems

There is a third eigenvalue problem associated with the NTE: find $(c,\varphi_{c})$ such that

\displaystyle\mathcal{T}\varphi_{c}+\frac{1}{c}\left(\mathcal{S}+\mathcal{F}\right)\varphi_{c}=0.

The associated mild form of this eigenvalue problem is

\texttt{S}_{t}[\varphi_{c}](r,\upsilon)+\frac{1}{c}\int_{0}^{t}\texttt{S}_{s}[(\mathcal{S}+\mathcal{F})\varphi_{c}](r,\upsilon){\rm d}s=\varphi_{c}(r,\upsilon),

(6.1)

where

\texttt{S}_{t}[g](r,\upsilon)={\rm e}^{-\int_{0}^{t}\sigma(r+\upsilon s,\upsilon){\rm d}s}g(r+\upsilon t,\upsilon)\mathbf{1}_{(t<\kappa_{r,\upsilon}^{D})}.

By considering the semigroup $\Pi_{n}[g](r,\upsilon)=\mathbb{E}_{\delta_{(r,\upsilon)}}[\langle g,\mathbb{X}_{n}\rangle]$ , where $\mathbb{X}_{n}$ is the neutron population at the $n^{th}$ collision (either a scatter or a fission), almost identical proofs to those given in the previous sections yield the existence of the $(c,\varphi_{c})$ , both in the classical sense and the probabilistic one.

In this case, the eigenvalue $c$ can be interpreted as the ratio between neutron production (from both scattering and fission) and neutron loss (due to absorption and leakage). Alternatively, it can be seen as the number of secondary neutrons per collision, rather than only collisions due to fission events.

6.2 Martingale convergence and strong law of large numbers

In a similar fashion to [10], Theorem 3.1 implies that

\mathcal{W}_{n}\coloneqq k^{-n}\frac{\langle\varphi,\mathcal{X}_{n}\rangle}{\langle\varphi,\mu\rangle},

is a non-negative mean one martingale under $\mathbb{P}_{\delta_{(r,\upsilon)}}$ . One could then show that $(\mathcal{W}_{n})_{n\geq 0}$ converges in $L^{2}(\mathbb{P})$ in the supercritical case, and otherwise has a degenerate limit.

One could also reconstruct the arguments presented in [9] to characterise the growth in the supercritical regime to obtain a strong law of large numbers:

\lim_{n\to\infty}k^{-n}\frac{\langle g,\mathcal{X}_{n}\rangle}{\langle\varphi,\mu\rangle}=\langle g,\tilde{\varphi}\rangle\mathcal{W}_{\infty},

where $\mathcal{W}_{\infty}$ is the limit of the martingale $(\mathcal{W}_{n})_{n\geq 0}$ .

We leave these arguments as an exercise to the reader to avoid unnecessary repetition.

6.3 Monte-Carlo considerations

We end this paper with a discussion of the existing Monte Carlo methods for calculating $k_{\texttt{eff}}$ and the associated eigenfunctions, and how we may use the semigroup approach to propose comparable algorithms, similar in style to those presented in [4].

Due to the interpretation of the eigenvalue $k_{\texttt{eff}}$ , most of the existing methods in the numerical analysis and engineering literature are based on iterative methods. For example, several algorithms are given in [17] that demonstrate how to calculate $k_{\texttt{eff}}$ and $\varphi$ . The main idea is to start with a set of $N$ neutrons, distributed in $D\times V$ according to some function $\varphi^{(0)}$ that serves as an initial guess¹⁰¹⁰10In practice, this is usually either the uniform distribution, or the solution to a diffusion approximation of the eigenvalue problem. at $\varphi$ . The system of neutrons then evolves until the first generation of fission events. Letting $\hat{\varphi}^{(1)}$ be the distribution of these first generation neutrons, the first approximation, $\varphi^{(1)}$ , of the eigenfunction $\varphi$ is then obtained by normalising¹¹¹¹11This is usually done by either setting $\varphi^{(1)}=\widetilde{\varphi}^{(1)}/\|\widetilde{\varphi}^{(1)}\|$ or by sampling $N$ neutrons according to $\widetilde{\varphi}^{(1)}$ $\hat{\varphi}^{(1)}$ . At the same time, the eigenvalue $k_{\texttt{eff}}$ is approximated by

k^{(1)}=\frac{\langle\mathbf{1},\mathcal{F}\varphi^{(1)}\rangle}{\langle\mathbf{1},(\mathcal{T}+\mathcal{S})\varphi^{(1)}\rangle},

which corresponds to the ratio of source neutrons for generation $2$ to the number of paths simulated in generation $1$ . The process is then repeated using $\varphi^{(1)}$ as the initial distribution of neutrons, in order to obtain $\varphi^{(2)}$ and $k^{(2)}$ , and so on.

However, some of the methods presented in the literature lead to bias and correlations between the neutrons in successive fission generations. To overcome this problem, the notion of superhistory powering was introduced in [1]. This idea is based on letting the initial set of neutrons evolve for some number, $L$ , of generations until the estimates for $k_{\texttt{eff}}$ and $\varphi$ are computed. It is usual in industry to set $L=10$ .

As we have shown in the previous sections, solving (1.6) is equivalent to look for the leading eigentriple $(k_{*},\varphi,\tilde{\varphi})$ of the semigroup $\Psi_{n}$ . Heuristically, this pertains to finding functions $\varphi$ and $\tilde{\varphi}$ that describe where neutron production (due to fission events) is most prominent, and a parameter $k_{*}$ that describes the average growth of the number of neutrons in the system. We may use the asymptotics (3.5) to inform Monte Carlo methods for the calculation of $k_{*}$ , $\varphi$ and $\tilde{\varphi}$ . Indeed, we have

k_{*}=\lim_{n\to\infty}\frac{1}{n}\log\Psi_{n}[\mathbf{1}](r,\upsilon),

where $\mathbf{1}$ is the constant function with value one. Here, as an expectation, $\Psi_{n}[\mathbf{1}]$ can be approximated by Monte-Carlo simulation.

In order to calculate the eigenfunction, one can manipulate the following asymptotic.

\langle\tilde{\varphi},g\rangle\varphi(r,\upsilon)=\lim_{n\to\infty}\mathbb{E}_{\delta_{(r,\upsilon)}}\left[\frac{1}{n}\sum_{m=1}^{n}k_{*}^{-m}\langle\mathcal{X}_{m},g\rangle\right].

Varying the test function $g$ , while keeping $(r,\upsilon)$ fixed allows us to obtain estimates for $\tilde{\varphi}$ , whereas varying the initial configuration $(r,\upsilon)$ and keeping the test function fixed allows us to estimate $\varphi$ .

Once again, the expectation can be replaced by a Monte-Carlo approximation.

We refer the reader to [4] for a more in-depth discussion of Monte Carlo algorithms based on the above asymptotics, as well as a complexity analysis of their methods. Although the algorithms and efficiency results given in [4] are for time-eigenvalues, cf. (1.4), it is straightforward to see how they may be adapted to fit the current situation (as well as their complexity). Of course, problems such as burn-in and inefficiencies that were encountered in [4] will still be present in the stationary case. We hope to carry out more formal work on this in the future.

Acknowledgements

We are indebted to Geoff Dobson and Paul Smith from the ANSWERS modelling group at Wood for the extensive discussions as well as hosting at their offices in Dorchester. AEK and AMGC are supported by EPSRC grant EP/P009220/1. ELH is supported by a PhD scholarship with funding from industrial partner Wood.

References

[1] R. J. Brissenden and A. R. Garlick. Biases in the estimation of k-eff and its error by monte carlo methods. Ann. nucl. Energy, 13:63–83, 1986.
[2] F. Brown. Fundamentals of monte carlo particle transport. Lecture notes for Monte Carlo course.
[3] N. Champagnat and D. Villemonais. Exponential convergence to quasi-stationary distribution and $Q$ -process. Probab. Theory Related Fields, 164(1-2):243–283, 2016.
[4] A. M. G. Cox, S.C. Harris, E. Horton, A.E. Kyprianou, and M. Wang. Monte carlo methods for the neutron transport equation. Working document, 2018.
[5] A.M.G. Cox, S.C. Harris, E. Horton, and A.E. Kyprianou. Multi-species neutron transport equation. Preprint, 2018.
[6] R. Dautray, M. Cessenat, G. Ledanois, P.-L. Lions, E. Pardoux, and R. Sentis. Méthodes probabilistes pour les équations de la physique. Collection du Commissariat a l’énergie atomique. Eyrolles, Paris, 1989.
[7] R. Dautray and J.-L. Lions. Mathematical analysis and numerical methods for science and technology. Vol. 6. Springer-Verlag, Berlin, 1993. Evolution problems. II, With the collaboration of Claude Bardos, Michel Cessenat, Alain Kavenoky, Patrick Lascaux, Bertrand Mercier, Olivier Pironneau, Bruno Scheurer and Rémi Sentis, Translated from the French by Alan Craig.
[8] K-J. Engel and R. Nagel. A short course on operator semigroups. Universitext. Springer, New York, 2006.
[9] S.C. Harris, E. Horton, and A.E. Kyprianou. Stochastic analysis of the neutron transport equation II: Almost sure growth. Preprint, 2018.
[10] E. Horton, A.E. Kyprianou, and D. Villemonais. Stochastic analysis of the neutron transport equation. Preprint.
[11] T. Kato. Perturbation Theory for Linear Operators. Springer-Verlag, Berlin, 1976.
[12] A. E. Kyprianou. Martingale convergence and the stopped branching random walk. Probab Theory Relat. Fields, 116(3):405–419, 2000.
[13] E. E. Lewis and Jr. W. F. Miller. Computational Methods of Neutron Transport. Wiley & Sons, New York, 1984.
[14] J. Mika. Existence and uniqueness of the solution to the critical problem in neutron transport theory. Stud. Math., 37:213–225, 1971.
[15] M. Mokhtar-Kharroubi. Mathematical topics in neutron transport theory, volume 46 of Series on Advances in Mathematics for Applied Sciences. World Scientific Publishing Co., Inc., River Edge, NJ, 1997. New aspects, With a chapter by M. Choulli and P. Stefanov.
[16] M. Ribarič and I. Vidav. Analytic properties of the inverse $a(z)^{-1}$ of an analytic linear operator valued function a(z). Comm. Pure Appl. Math., 11:219–242, 1958.
[17] F. Scheben and I. G. Graham. Iterative methods for neutron transport eigenvalue problems. SIAM J. Sci. Comput., 33(5):2785–2804, 2011.
[18] J. van Neervan. The Asymptotic Behaviour of Semigroups of Linear Operators. Operator Theory Advances and Applications Vol. 88. Birkhäuser, 1996.
[19] I. Vidav. Existence and uniqueness of nonnegative eigenfunctions of the boltzmann operator. J. Math. Anal. App., 22:144–155, 1968.

Stochastic Methods for Neutron Transport Equation III: Generational many-to-one and keffk_{\texttt{eff}}

Abstract

1 Introduction

2 Formulations of the NTE and associated eigenfunctions

2.1 Abstract Cauchy Problem (ACP)

Theorem 2.1.

2.2 Neutron branching process (NBP) and the mild NTE

Theorem 2.2.

Theorem 2.3.

2.3 Neutron random walk (NRW)

Lemma 2.1.

2.4 Neutron generational process (NGP)

3 Probabilistic solution to (1.6)

Lemma 3.1.

Proof.

Theorem 3.1.

4 Classical existence of solution to (1.6)

Theorem 4.1.

Proof.

Proposition 4.1.

Proof.

Theorem 4.2.

Proof.

5 Proof of Theorem 3.1

Theorem 5.1.

Lemma 5.1.

Proof of (A1).

Proof of A2.

6 Concluding remarks

6.1 λ\lambda-, kk- and cc-eigenvalue problems

6.2 Martingale convergence and strong law of large numbers

6.3 Monte-Carlo considerations

Acknowledgements

References

Stochastic Methods for Neutron Transport Equation III: Generational many-to-one and $k_{\texttt{eff}}$

6.1 $\lambda$ -, $k$ - and $c$ -eigenvalue problems