This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Stochastic Methods for Neutron Transport Equation III: Generational many-to-one and keffk_{\texttt{eff}}

A. M. G. Cox111 Department of Mathematical Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK. Email: a.g.m.cox@bath.ac.uk, e.l.horton@bath.ac.uk, a.kyprianou@bath.ac.uk ,  E. L. Horton, A. E. Kyprianou,  D. Villemonais222Institut Élie Cartan de Lorraine, Bureau 123, Université de Lorraine, 54506, Vandoeuvre-lès-Nancy Cedex, France. Email: denisvillemonais@gmail.com
(August 7, 2025)
Abstract

The Neutron Transport Equation (NTE) describes the flux of neutrons over time through an inhomogeneous fissile medium. In the recent articles [5, 10], a probabilistic solution of the NTE is considered in order to demonstrate a Perron-Frobenius type growth of the solution via its projection onto an associated leading eigenfunction. In [9, 4], further analysis is performed to understand the implications of this growth both in the stochastic sense, as well as from the perspective of Monte-Carlo simulation.

Such Monte-Carlo simulations are prevalent in industrial applications, in particular where regulatory checks are needed in the process of reactor core design. In that setting, however, it turns out that a different notion of growth takes centre stage, which is otherwise characterised by another eigenvalue problem. In that setting, the eigenvalue, sometimes called kk-effective (written keffk_{\texttt{eff}}), has the physical interpretation as being the ratio of neutrons produced (during fission events) to the number lost (due to absorption in the reactor or leakage at the boundary) per typical fission event.

In this article, we aim to supplement [5, 10, 9, 4], by developing the stochastic analysis of the NTE further to the setting where a rigorous probabilistic interpretation of keffk_{\texttt{eff}} is given, both in terms of a Perron-Frobenius type analysis as well as via classical operator analysis.

To our knowledge, despite the fact that an extensive engineering literature and industrial Monte-Carlo software is concentrated around the estimation of keffk_{\texttt{eff}} and its associated eigenfunction, we believe that our work is the first rigorous treatment in the probabilistic sense (which underpins some of the aforesaid Monte-Carlo simulations).

Key words: Neutron Transport Equation, principal eigenvalue, semigroup theory, Perron-Frobenius decomposition

MSC: 82D75, 60J80, 60J75, 60J99

1 Introduction

As described in [10, 9, 5] the NTE is a balance equation for the flux of neutrons across a planar cross-section in an inhomogeneous fissile medium. The backwards form of the equation can be written as follows,

tψt(r,υ)\displaystyle\frac{\partial}{\partial t}\psi_{t}(r,\upsilon) =υψt(r,υ)σ(r,υ)ψt(r,υ)\displaystyle=\upsilon\cdot\nabla\psi_{t}(r,\upsilon)-\sigma(r,\upsilon)\psi_{t}(r,\upsilon)
+σs(r,υ)Vψt(r,υ)πs(r,υ,υ)dυ+σf(r,υ)Vψt(r,υ)πf(r,υ,υ)dυ,\displaystyle+\sigma_{\texttt{s}}(r,\upsilon)\int_{V}\psi_{t}(r,\upsilon^{\prime})\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}+\sigma_{\texttt{f}}(r,\upsilon)\int_{V}\psi_{t}(r,\upsilon^{\prime})\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}, (1.1)

where the flux ψt(r,υ)\psi_{t}(r,\upsilon) is a function of time, tt and the configuration variables (r,υ)D×V(r,\upsilon)\in D\times V where D3D\subseteq\mathbb{R}^{3} is a non-empty, smooth, bounded convex domain such that D\partial D has zero Lebesgue measure, and V={υ3:υmin|υ|υmax}V=\{\upsilon\in\mathbb{R}^{3}:\upsilon_{\texttt{min}}\leq|\upsilon|\leq\upsilon_{\texttt{max}}\}. Furthermore, the other components of (1.1) have the following interpretation:

σs(r,υ)\displaystyle\sigma_{\texttt{s}}(r,\upsilon) : the rate at which scattering occurs from incoming velocity υ,\displaystyle:\text{ the rate at which scattering occurs from incoming velocity $\upsilon$,}
σf(r,υ)\displaystyle\sigma_{\texttt{f}}(r,\upsilon) : the rate at which fission occurs from incoming velocity υ,\displaystyle:\text{ the rate at which fission occurs from incoming velocity $\upsilon$,}
σ(r,υ)\displaystyle\sigma(r,\upsilon) : the sum of the rates σf+σs and is known as the total cross section,\displaystyle:\text{ the sum of the rates }\sigma_{\texttt{f}}+\sigma_{\texttt{s}}\text{ and is known as the total cross section,}
πs(r,υ,υ)dυ\displaystyle\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime} : the scattering yield at velocity υ from incoming velocity υ,\displaystyle:\text{ the scattering yield at velocity $\upsilon^{\prime}$ from incoming velocity }\upsilon,
 satisfying Vπs(r,υ,υ)dυ=1, and\displaystyle\hskip 14.22636pt\text{ satisfying }\textstyle{\int_{V}}\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\rm d}\upsilon^{\prime}=1,\text{ and }
πf(r,υ,υ)dυ\displaystyle\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime} : the neutron yield at velocity υ from fission with incoming velocity υ,\displaystyle:\text{ the neutron yield at velocity $\upsilon^{\prime}$ from fission with incoming velocity }\upsilon,
 satisfying Vπf(r,υ,υ)dυ<.\displaystyle\hskip 14.22636pt\text{ satisfying }{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}\textstyle{\int_{V}}\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}<\infty.}

We also enforce the following initial and boundary conditions

{ψ0(r,υ)=g(r,υ) for rD,υV,ψt(r,υ)=0 for t0 and rD if υ𝐧r>0,\left\{\begin{array}[]{ll}\psi_{0}(r,\upsilon)=g(r,\upsilon)&\text{ for }r\in D,\upsilon\in{V},\\ &\\ \psi_{t}(r,\upsilon)=0&\text{ for }t\geq 0\text{ and }r\in\partial D\text{ if }\upsilon\cdot{\bf n}_{r}>0,\end{array}\right. (1.2)

where 𝐧r{\bf n}_{r} is the outward unit normal at rDr\in\partial D and g:D×V[0,)g:D\times V\to[0,\infty) is a bounded, measurable function. Throughout we will rely on the following assumptions in some (but not all) of our results:

  • (H1):

    Cross-sections σs\sigma_{\texttt{s}}, σf\sigma_{\texttt{f}}, πs\pi_{\texttt{s}} and πf\pi_{\texttt{f}} are uniformly bounded away from infinity.

  • (H2):

    We have σsπs+σfπf>0\sigma_{\texttt{s}}\pi_{\texttt{s}}+\sigma_{\texttt{f}}\pi_{\texttt{f}}>0 on D×V×VD\times V\times V.

  • (H3):

    There is an open ball BB compactly embedded in DD such that σfπf>0\sigma_{\texttt{f}}\pi_{\texttt{f}}>0 on B×V×VB\times V\times V.

  • (H4):

    the fission offspring are bounded in number by the constant Nmax>1N_{\texttt{max}}>1.

Note, the assumption (H1) ensures that all activity occurs at a maximum rate. Assumption (H2) ensures that at least some activity occurs, whether it be scattering or fission, together with (H3), it ensures that there is at least some fission as well as scattering. Finally (H4) is a physical constraint that is natural to nuclear fission, typically no more than 3 neutrons are produced during an average fission event. Figure 1 illustrates the complex nature of the in homogeneity in the domain one typically considers.

Refer to caption
Figure 1: The geometry of a nuclear reactor core representing a physical domain DD, on to which the different cross-sectional values of σs,σf,πs,πf\sigma_{\emph{{s}}},\sigma_{\emph{{f}}},\pi_{\emph{{s}}},\pi_{\emph{{f}}} are mapped, also as a function of neutron velocity.

Due to the irregular nature of gradient operator, (1.1) is meaningless in the pointwise sense, so it is often stated in one of two forms. The first is to treat (1.1) as a weak linear partial integro-differential equation (PIDE) in an appropriate Banach space, usually L2(D×V)L_{2}(D\times V), the space of functions f:D×V[0,)f:D\times V\mapsto[0,\infty) which are finite with respect to the norm f2=(D×Vf(r,υ)drdυ)1/2\lVert f\rVert_{2}=(\int_{D\times V}f(r,\upsilon){\textnormal{d}}r{\textnormal{d}}\upsilon)^{1/2}); see e.g. [6, 7, 15]. The second is to consider the integral or mild form of (1.1). We refer the reader to [10, 9, 5] and the references therein for a discussion on the various formulations of the NTE and its solution. We will also elaborate on both in the forthcoming discussion.


For both formats of (1.1), the papers [6, 7, 15, 10, 5] dealt with the time-eigenvalue problem and an associated Perron-Frobenius decomposition. More precisely, they give a rigorous stochastic meaning to the asymptotic

ψteλtcgφ+o(eλt),\psi_{t}\sim{\rm e}^{\lambda_{*}t}c_{g}\varphi+o({\rm e}^{\lambda_{*}t}), (1.3)

as tt\to\infty, where λ\lambda_{*} and φ\varphi are the leading eigenvalue and associated eigenfunction associated to the NTE in the appropriate sense and cgc_{g} is a constant that depends on the initial data gg.


Such an understanding is important as it promotes a number of different Monte-Carlo algorithms that can be used to estimate both the lead eigenvalue λ\lambda_{*} and the associated non-negative eigenfunction φ\varphi. The latter can be formulated as an eigenpair in L2(D×V)L_{2}(D\times V) satisfying

(𝒯+𝒮+)φ=λφ,({\mathcal{T}}+\mathcal{S}+{\mathcal{F}})\varphi=\lambda_{*}\varphi, (1.4)

on D×VD\times V, where

{𝒯f(r,υ):=υf(r,υ)σ(r,υ)𝒮f(r,υ):=σs(r,υ)Vf(r,υ)πs(r,υ,υ)dυf(r,υ):=σf(r,υ)Vf(r,υ)πf(r,υ,υ)dυ,\left\{\begin{array}[]{rl}{\mathcal{T}}{f}(r,\upsilon)&:=\upsilon\cdot\nabla{f}(r,\upsilon)-\sigma(r,\upsilon)\\ \\ {\mathcal{S}}{f}(r,\upsilon)&:=\sigma_{\texttt{s}}(r,\upsilon)\int_{V}{f}(r,\upsilon^{\prime})\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}\\ \\ {\mathcal{F}}{f}(r,\upsilon)&:=\sigma_{\texttt{f}}(r,\upsilon)\int_{V}{f}(r,\upsilon^{\prime})\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime},\end{array}\right. (1.5)

Here, we can think of λ\lambda_{*} as characterising the rate of growth of flux in the system over time.

It turns out that, predominantly in industrial, engineering and (some) physics literature, there is another eigenvalue problem that plays a fundamental role in the design and safety of nuclear reactors; see for example Section 1.5 of [13]. The aforesaid eigenvalue problem involves finding (in any appropriate sense) an eigenpair kk and ϕ\phi such that

(𝒯+𝒮)ϕ+1kϕ=0.({\mathcal{T}}+\mathcal{S})\phi+\frac{1}{k}{\mathcal{F}}\phi=0. (1.6)

The leading eigenvalue, which in the nuclear regulation industry is called kk-effective, written keffk_{\texttt{eff}}, has the physical interpretation as being the ratio of neutrons produced (during fission events) to the number lost (due to absorption in the reactor or leakage at the boundary). Another interpretation of kk is that it represents the average number of neutrons produced per fission event. It is this second interpretation which we exploit, since keffk_{\texttt{eff}} acts as a measure of neutrons produced between fission generations.


It is worth noting that the two eigenproblems offer potentially different sets of solutions, however, they agree in terms of criticality. More precisely, in (1.4), the triple (𝒯,𝒮,)(\mathcal{T},\mathcal{S},\mathcal{F}) is called critical if the leading eigenvalue, λ\lambda_{*}, in (1.4) is zero, and otherwise called subcritical (resp. supercritical) if λ<0\lambda_{*}<0 (resp. λ>0\lambda_{*}>0). In the setting of (1.6), the triple (𝒯,𝒮,)(\mathcal{T},\mathcal{S},\mathcal{F}) is called critical if keff=1k_{\texttt{eff}}=1 and subcritical (resp. supercritical) if keff<1k_{\texttt{eff}}<1 (resp. keff>1k_{\texttt{eff}}>1).

We note however that in [2], there is a relationship between the two eigenvalues, regardless of the criticality of the system and at criticality, both (1.4) and (1.6) agree.


The main objective of this paper is to put into a rigorous setting the existence of the ‘leading’ solutions to (1.6) in the two main contexts that the NTE (1.1) is understood; that is, the weak linear PIDE context and the probabilistic context. Moreover, in the mild setting, we will build an expectation semigroup, say (Ψn,n0)(\Psi_{n},n\geq 0), out of a stochastic process such that

Ψn[g]keffnCgϕ+o(keffn),\Psi_{n}[g]\sim k_{\texttt{eff}}^{-n}C_{g}\phi+o(k_{\texttt{eff}}^{-n}),

for gL+(D×V)g\in L^{+}_{\infty}(D\times V), as nn\to\infty, and an appropriate choice of Cg0C_{g}\geq 0. (See Theorem 5.1 below.) This also provides a mathematically rigorous underpinning for many of the Monte-Carlo algorithms that are used in industry for computing keffk_{\texttt{eff}}. We will offer further discussion in this direction at the end of the paper.


The rest of this article is organised as follows. In the next section, we formally introduce the description of (1.1) as a PIDE on a functional space, that is, we describe it as an abstract Cauchy problem. Moreover, we introduce two underlying stochastic processes, both of which can be used to describe the solution to the mild NTE. Also in this section, we introduce a second mild equation, (2.13), whose eigen-solutions give us a sense in which we can characterise solutions to (1.6).


In Section 3, we provide a solution to the newly introduced mild equation (2.13). In addition, we state the main result of this paper (Theorem 3.1) which shows the existence of a lead eigensolution to (2.13).


In Section 4, for comparison, we show how to construct and give meaning to the lead eigensolution to (1.6) in the setting of a functional space. In addition, we show how the two notions of the lead eigensolution, in this and the previous section, agree.


In Section 5, we give the proof of the main result of Section 3. Finally, we conclude in Section 6 with some discussion concerning the relevance of such results to previous work and Monte-Carlo methods.

2 Formulations of the NTE and associated eigenfunctions

As alluded to in the introduction, there are two principal ways in which the NTE is formulated. In this section, we will elaborate on them in a little more mathematical detail for later convenience and context of our main results.

2.1 Abstract Cauchy Problem (ACP)

Following e.g. [6, 7, 15], we want to formulate (1.1) in the function space L2(D×V)L_{2}(D\times V). The so-called (initial-value) abstract Cauchy problem (ACP) takes the form

utt=𝒜ut and u0=g,\dfrac{\partial{{u}}_{t}}{\partial t}=\mathcal{A}{{u}}_{t}\quad\text{ and }\quad{{u}}_{0}=g, (2.1)

where 𝒜=𝒯+𝒮+\mathcal{A}={\mathcal{T}}+{\mathcal{S}}+{\mathcal{F}} and ut{{u}}_{t} belongs to the space L2(D×V)L_{2}({D}\times V), for t0t\geq 0 (in particular gL2(D×V)g\in L_{2}({D}\times V)). Specifically, (ut,t0)({{u}}_{t},t\geq 0) is continuously differentiable in the space L2(D×V)L_{2}({D}\times V), meaning there exists a u˙tL2(D×V)\dot{{u}}_{t}\in L_{2}({D}\times V), which is time-continuous in L2(D×V)L_{2}({D}\times V) with respect to 2\lVert\cdot\rVert_{2} and such that limh0h1(ut+hut)=u˙t\lim_{h\to 0}h^{-1}({u}_{t+h}-{u}_{t})=\dot{{u}}_{t} for all t0t\geq 0. Necessarily, the solution to (2.1) forms a c0c_{0}-semigroup333Recall that a c0c_{0}-semigroup (Vt,t0)(\texttt{V}_{t},t\geq 0) also goes by the name of a strongly continuous semigroup and, in the present context, this means it has has the properties that (i) V0=Id\texttt{V}_{0}={\rm Id}, (ii) Vt+s[g]=Vt[Vs[g]]\texttt{V}_{t+s}[g]=\texttt{V}_{t}[\texttt{V}_{s}[g]], for all s,t0s,t\geq 0, gL2(D×V)g\in L_{2}({D}\times V) and (iii) for all gL2(D×V)g\in L_{2}({D}\times V), limh0Vh[g]g2=0\lim_{h\to 0}\lVert\texttt{V}_{h}[g]-g\rVert_{2}=0.. Moreover, Dom(𝒜):={gL2(D×V):υgL2(D×V) and g|D+=0}{\rm Dom}(\mathcal{A}):=\{g\in L_{2}(D\times V):\upsilon\cdot\nabla g\in L_{2}(D\times V)\text{ and }g|_{\partial D^{+}}=0\} is the domain of 𝒜\mathcal{A} and utDom(𝒜){{u}}_{t}\in{\rm Dom}(\mathcal{A}) for all t0t\geq 0.


Theorem 2.1.

Suppose (H1) holds. For gL2(D×V),g\in L_{2}({D}\times V), the unique solution to (2.1) is given by (Vt,t0)(\texttt{\emph{V}}_{t},t\geq 0), the c0c_{0}-semigroup generated by (𝒜,Dom(𝒜))(\mathcal{A},{\rm Dom}(\mathcal{A})), i.e. the orbit Vt[g]:=exp(t𝒜)g\texttt{\emph{V}}_{t}[g]:=\exp(t\mathcal{A})g.

In the ACP setting, the notion of an eigenpair (λ,φ)(\lambda,\varphi) is well formulated on L2(D×V)L_{2}(D\times V) via (1.4). Equivalently, it means we are looking for φL2+(D×V)\varphi\in L^{+}_{2}(D\times V) and λ\lambda such that Vt[φ]=eλtφ\texttt{V}_{t}[\varphi]={\rm e}^{\lambda t}\varphi on L2+(D×V)L^{+}_{2}(D\times V), for all t0t\geq 0. The sense in which we mean that λ\lambda is a ‘leading’ eigenvalue roughly boils down it corresponding to the eigenvalue in the spectrum of the operator 𝒜\mathcal{A} on L2(D×V)L_{2}(D\times V) with the largest real part (and, as usual, it is real valued itself), and moreover, its associated eigenfunction φ\varphi is non-negative. As such, one expects the existence of a non-negative left eigenfunction φ~\tilde{\varphi} (e.g. in the sense that φ~,Vt[g]=eλtφ~,g\langle\tilde{\varphi},\texttt{V}_{t}[g]\rangle={\rm e}^{\lambda t}\langle\tilde{\varphi},g\rangle for t0t\geq 0) such that

eλtVt[g]φ~,gφ2=o(eλt),\lVert{\rm e}^{-\lambda t}\texttt{V}_{t}[g]-\langle\tilde{\varphi},g\rangle\varphi\rVert_{2}=o({\rm e}^{-\lambda t}), (2.2)

as tt\to\infty. Here, we use the notation f,g=D×Vf(r,υ)g(r,υ)drdυ\langle f,g\rangle=\int_{D\times V}f(r,\upsilon)g(r,\upsilon){\textnormal{d}}r{\textnormal{d}}\upsilon, so that 2=,1/2\lVert\cdot\rVert_{2}=\langle\cdot,\cdot\rangle^{1/2}. Precise results of this nature can be found in [7, 15, 5].

2.2 Neutron branching process (NBP) and the mild NTE

We recall the neutron branching process (NBP) defined in [10], which at time t0t\geq 0 is represented by a configuration of particles which are specified via their physical location and velocity in D×VD\times V, say {(ri(t),υi(t)):i=1,,Nt}\{(r_{i}(t),\upsilon_{i}(t)):i=1,\dots,N_{t}\}, where NtN_{t} is the number of particles alive at time t0t\geq 0. The NBP is then given by the empirical distribution of these configurations,

Xt(A)=i=1Ntδ(ri(t),υi(t))(A),A(D×V),t0,X_{t}(A)=\sum_{i=1}^{N_{t}}\delta_{(r_{i}(t),\upsilon_{i}(t))}(A),\qquad A\in\mathcal{B}(D\times V),\;t\geq 0, (2.3)

where δ\delta is the Dirac measure, defined on (D×V)\mathcal{B}(D\times V), the Borel subsets of D×VD\times V.

The evolution of (Xt,t0)(X_{t},t\geq 0) is a stochastic process valued in the space of atomic measures (D×V):={i=1nδ(ri,υi):n,(ri,υi)D×V,i=1,,n}\mathcal{M}(D\times V):=\{\textstyle{\sum_{i=1}^{n}}\delta_{(r_{i},\upsilon_{i})}:n\in\mathbb{N},(r_{i},\upsilon_{i})\in D\times V,i=1,\cdots,n\} which evolves randomly as follows.


A particle positioned at rr with velocity υ\upsilon will continue to move along the trajectory r+υtr+\upsilon t, until one of the following things happens.

  1. (i)

    The particle leaves the physical domain DD, in which case it is instantaneously killed.

  2. (ii)

    Independently of all other neutrons, a scattering event occurs when a neutron comes in close proximity to an atomic nucleus and, accordingly, makes an instantaneous change of velocity. For a neutron in the system with position and velocity (r,υ)(r,\upsilon), if we write TsT_{\texttt{s}} for the random time that scattering may occur, then independently of any other physical event that may affect the neutron, Pr(Ts>t)=exp{0tσs(r+υs,υ)ds},\Pr(T_{\texttt{s}}>t)=\exp\{-\textstyle{\int_{0}^{t}}\sigma_{\texttt{s}}(r+\upsilon s,\upsilon){\rm d}s\}, for t0.t\geq 0.

    When scattering occurs at space-velocity (r,υ)(r,\upsilon), the new velocity υV\upsilon^{\prime}\in V is selected independently with probability πs(r,υ,υ)dυ\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}.

  3. (iii)

    Independently of all other neutrons, a fission event occurs when a neutron smashes into an atomic nucleus. For a neutron in the system with initial position and velocity (r,υ)(r,\upsilon), if we write TfT_{\texttt{f}} for the random time that scattering may occur, then, independently of any other physical event that may affect the neutron, Pr(Tf>t)=exp{0tσf(r+υs,υ)ds},\Pr(T_{\texttt{f}}>t)=\exp\{-\textstyle{\int_{0}^{t}}\sigma_{\texttt{f}}(r+\upsilon s,\upsilon){\rm d}s\}, for t0.t\geq 0.

    When fission occurs, the smashing of the atomic nucleus produces lower mass isotopes and releases a random number of neutrons, say N0N\geq 0, which are ejected from the point of impact with randomly distributed, and possibly correlated, velocities, say {υi:i=1,,N}\{\upsilon_{i}:i=1,\cdots,N\}. The outgoing velocities are described by the atomic random measure

    𝒵(A):=i=1Nδυi(A),A(V).\mathcal{Z}(A):=\sum_{i=1}^{N}\delta_{\upsilon_{i}}(A),\qquad A\in\mathcal{B}(V). (2.4)

    If such an event occurs at location rdr\in\mathbb{R}^{d} from a particle with incoming velocity υV\upsilon\in{V}, we denote by 𝒫(r,υ){\mathcal{P}}_{(r,\upsilon)} the law of 𝒵\mathcal{Z}. The probabilities 𝒫(r,υ){\mathcal{P}}_{(r,\upsilon)} are such that, for υV\upsilon^{\prime}\in{V}, for bounded and measurable g:V[0,)g:V\to[0,\infty),

    Vg(υ)πf(r,v,υ)dυ\displaystyle\int_{V}g(\upsilon^{\prime})\pi_{\texttt{f}}(r,v,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime} =(r,υ)[Vg(υ)𝒵(dυ)]=:(r,υ)[g,𝒵].\displaystyle={\mathcal{E}}_{(r,\upsilon)}\left[\int_{V}g(\upsilon^{\prime})\mathcal{Z}({\textnormal{d}}\upsilon^{\prime})\right]=:{\mathcal{E}}_{(r,\upsilon)}[\langle g,\mathcal{Z}\rangle]. (2.5)

    Note, the possibility that Pr(N=0)>0\Pr(N=0)>0, which will be tantamount to neutron capture (that is, where a neutron slams into a nucleus but no fission results and the neutron is absorbed into the nucleus).

The NBP is thus parameterised by the quantities σs,πs,σf\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}} and the family of measures 𝒫=(𝒫(r,υ),rD,υV){\mathcal{P}}=({\mathcal{P}}_{(r,\upsilon)},r\in D,\upsilon\in V) and accordingly we refer to it as a (σs,πs,σf,𝒫)(\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}},\mathcal{P})-NBP. It is associated to the NTE via the relation (2.5), and, although a (σs,πs,σf,𝒫)(\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}},\mathcal{P})-NBP is uniquely defined, a NBP specified by (σs,πs,σf,πf)(\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}},\pi_{\texttt{f}}) alone is not. Nonetheless, it is easy to show that for a given πf\pi_{\texttt{f}}, a (σs,πs,σf,𝒫)(\sigma_{\texttt{s}},\pi_{\texttt{s}},\sigma_{\texttt{f}},\mathcal{P})-NBP always exists which satisfies (2.5). See the discussion in Section 2 of [10].


Define

ψt[g](r,υ):=𝔼δ(r,υ)[g,Xt],t0,rD¯,υV,\psi_{t}[g](r,\upsilon):=\mathbb{E}_{\delta_{(r,\upsilon)}}[\langle g,X_{t}\rangle],\qquad t\geq 0,r\in\bar{D},\upsilon\in{V}, (2.6)

where δ(r,υ)\mathbb{P}_{\delta_{(r,\upsilon)}} is the law of XX initiated from a single particle with configuration (r,υ)(r,\upsilon), and gL+(D×V)g\in L^{+}_{\infty}(D\times V), the space of non-negative uniformly bounded measurable functions on D×VD\times V. Here we have made a slight abuse of notation (see ,\langle\cdot,\cdot\rangle as it appears in (2.5)) and written g,Xt\langle g,X_{t}\rangle to mean D×Vg(r,υ)Xt(dr,dυ)\textstyle{\int_{D\times V}}g(r,\upsilon)X_{t}({\textnormal{d}}r,{\textnormal{d}}\upsilon). The following result was shown in [5, 10, 7, 6].

Theorem 2.2.

Suppose (H1) and (H2) hold. For gL+(D×V)g\in L^{+}_{\infty}(D\times V), the space of non-negative and uniformly bounded measurable functions on D×VD\times V, there exist constants C1,C2>0C_{1},C_{2}>0 such that ψt[g]\psi_{t}[g], as given in (2.6), is uniformly bounded by C1exp(C2t)C_{1}\exp(C_{2}t), for all t0t\geq 0. Moreover, (ψt[g],t0)(\psi_{t}[g],t\geq 0) is the unique solution, which is bounded in time, to the so-called mild equation

ψt[g]=Ut[g]+0tUs[(S+F)ψts[g]]ds,t0,\psi_{t}[g]=\emph{{U}}_{t}[g]+\int_{0}^{t}\emph{{U}}_{s}[({\emph{{S}}}+{\emph{{F}}})\psi_{t-s}[g]]{\textnormal{d}}s,\qquad t\geq 0, (2.7)

for which (1.2) holds, where the deterministic evolution Ut[g](r,υ)=g(r+υt,υ)𝟏{t<κr,υD},t0,\emph{{U}}_{t}[g](r,\upsilon)=g(r+\upsilon t,\upsilon)\mathbf{1}_{\{t<\kappa^{D}_{r,\upsilon}\}},t\geq 0, with κr,υD:=inf{t>0:r+υtD},\kappa_{r,\upsilon}^{D}:=\inf\{t>0:r+\upsilon t\not\in D\}, represents the advection semigroup associated with a single neutron travelling at velocity υ\upsilon from rr at t=0t=0.

In [5] the below result was shown, which demonstrates the context in which the mild solution to the NTE and the ACP can be seen to coincide.

Theorem 2.3.

Suppose (H1) and (H2) hold. If gL+(D×V)g\in L^{+}_{\infty}(D\times V) and if (ψt[g],t0)(\psi_{t}[g],t\geq 0) is understood as the solution to the mild equation (2.7), then for t0t\geq 0, Vt[g]=ψt[g]\emph{{V}}_{t}[g]=\psi_{t}[g] on L2+(D×V)L^{+}_{2}(D\times V), i.e. Vt[g]ψt[g]2=0\lVert{\texttt{\emph{V}}}_{t}[g]-\psi_{t}[g]\rVert_{2}=0.

In the probabilistic setting, the meaning of (1.4) can be interpreted as looking for a pair λ\lambda and φ\varphi such that, pointwise on D×VD\times V, ψt[φ]=eλtφ\psi_{t}[\varphi]={\rm e}^{\lambda t}\varphi, for t0t\geq 0. As alluded to in (1.3), we have a similar asymptotic to (2.2), which isolates the eigenpair (λ,φ)(\lambda,\varphi) in its limit. The notion of ‘leading’ in the probabilistic setting is less obvious, however, due to Theorem 2.3, the eigenpairs that emerge from the mild setting and the weak linear PIDE setting should in principle agree on L2(D×V)L_{2}(D\times V). This is discussed with greater precision in [5, 10].

2.3 Neutron random walk (NRW)

There is a second stochastic representation of the unique solution to (2.7), which makes use of the so-called neutron random walk (NRW). This process takes values in D×VD\times V and is defined by its scatter rates, α(r,υ)\alpha(r,\upsilon), rD,υVr\in D,\upsilon\in V, and scatter probability densities π(r,υ,υ)\pi(r,\upsilon,\upsilon^{\prime}), rD,υ,υVr\in D,\upsilon,\upsilon^{\prime}\in V. When issued with a velocity υ\upsilon, the NRW will propagate linearly with that velocity until either it exits the domain DD, in which case it is killed, or at the random time TsT_{\texttt{s}} a scattering occurs, where Pr(Ts>t)=exp{0tα(r+υt,υ)ds},\Pr(T_{\texttt{s}}>t)=\exp\{-\textstyle{\int_{0}^{t}}\alpha(r+\upsilon t,\upsilon){\rm d}s\}, for t0.t\geq 0. When the scattering event occurs at position-velocity configuration (r,υ)(r,\upsilon), a new velocity υ\upsilon^{\prime} is selected with probability π(r,υ,υ)dυ\pi(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}. If we denote by (R,Υ)=((Rt,Υt),t0)(R,\Upsilon)=((R_{t},\Upsilon_{t}),t\geq 0), the position-velocity of the resulting continuous-time random walk on D×VD\times V with an additional cemetery state for when it leaves the domain DD, it is easy to show that (R,Υ)(R,\Upsilon) is a Markov process. We call the process (R,Υ)(R,\Upsilon) an απ\alpha\pi-NRW.


Given a NBP defined by σs\sigma_{\texttt{s}}, σf\sigma_{\texttt{f}}, πs\pi_{\texttt{s}} and 𝒫\mathcal{P}, set

α(r,υ)π(r,υ,υ)=σs(r,υ)πs(r,υ,υ)+σf(r,υ)πf(r,υ,υ)rD,υ,υV.\displaystyle\alpha(r,\upsilon)\pi(r,\upsilon,\upsilon^{\prime})=\sigma_{\texttt{s}}(r,\upsilon)\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime})+\sigma_{\texttt{f}}(r,\upsilon)\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime})\qquad r\in D,\upsilon,\upsilon^{\prime}\in V. (2.8)

and

β(r,υ)=σf(r,υ)(Vπf(r,υ,υ)dυ1).\beta(r,\upsilon)=\sigma_{\texttt{f}}(r,\upsilon)\left(\int_{V}\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\textnormal{d}}\upsilon^{\prime}-1\right). (2.9)

The following result, given in [5], gives the so-called many-to-one representation of solution to the NTE in the form (2.7).

Lemma 2.1.

Suppose (H1) and (H2) hold, we have that

ψt[g](r,υ)=𝐄(r,υ)[e0tβ(Rs,Υs)dsg(Rt,Υt)𝟏{t<τD}],t0,rD,υV,\psi_{t}[g](r,\upsilon)=\mathbf{E}_{(r,\upsilon)}\left[{\rm e}^{\int_{0}^{t}\beta(R_{s},\Upsilon_{s}){\rm d}s}g(R_{t},\Upsilon_{t})\mathbf{1}_{\{t<\tau^{D}\}}\right],\qquad t\geq 0,r\in D,\upsilon\in V, (2.10)

is a second representation of the unique mild solution (in the sense of Theorem 2.2) of the NTE (2.7), where τD=inf{t>0:RtD}\tau^{D}=\inf\{t>0:R_{t}\not\in D\} and 𝐏(r,v){\bf P}_{(r,v)} for the law of the απ\alpha\pi-NRW starting from a single neutron with configuration (r,υ)(r,\upsilon).

2.4 Neutron generational process (NGP)

In order to solve the kk-eigenvalue problem (1.6), it turns out that (ψt,t0)(\psi_{t},t\geq 0) and (ϕt,t0)(\phi_{t},t\geq 0) are not the right objects to work with on account of their time-dependency. We now consider a generational model of the NBP. We can think of each line of descent in the sequence of neutron creation as genealogies. In place of (Xt,t0)(X_{t},t\geq 0), we consider the process (𝒳n,n0)(\mathcal{X}_{n},n\geq 0), where, for n1n\geq 1, 𝒳n\mathcal{X}_{n} is (D×V)\mathcal{M}(D\times V)-valued and can be written 𝒳n=i=1𝒩nδ(ri(n),υi(n))\mathcal{X}_{n}=\sum_{i=1}^{\mathcal{N}_{n}}\delta_{(r^{(n)}_{i},\upsilon^{(n)}_{i})}, where {(ri(n),υi(n)),i=1,𝒩n}\{(r^{(n)}_{i},\upsilon^{(n)}_{i}),i=1,\cdots\mathcal{N}_{n}\} are the position-velocity configurations of the 𝒩n\mathcal{N}_{n} particles that are nn-th in their genealogies to be the result of a fission event. 𝒳0\mathcal{X}_{0} is consistent with X0X_{0} and is the initial configuration of neutron positions and velocities. As such, for n1n\geq 1 we can think of 𝒳n\mathcal{X}_{n} as the nn-th generation of the system and we refer to them as the neutron generational process (NGP). The reader who is more experienced with the theory of branching processes will know 𝒳n\mathcal{X}_{n} to be an example of what is called a stopping line; see [12].


Appealing to the obvious meaning of g,𝒳n\langle g,\mathcal{X}_{n}\rangle, define the expectation semigroup (Ψn,n0)(\Psi_{n},n\geq 0) by

Ψn[g](r,υ)=𝔼δ(r,υ)[g,𝒳n],n0,rD,υV,\Psi_{n}[g](r,\upsilon)=\mathbb{E}_{\delta_{(r,\upsilon)}}\left[\langle g,\mathcal{X}_{n}\rangle\right],\qquad n\geq 0,r\in D,\upsilon\in V, (2.11)

with Ψ0[g]:=gL+(D×V)\Psi_{0}[g]:=g\in L_{\infty}^{+}(D\times V). The main motivation for introducing the NGP is that, just as we have seen that the meaning of (1.4) can be phrased in terms of a multiplicative invariance with respect to the solution of an ACP (2.1) or of the mild equation (2.7), we want to identify the eigen-problem (1.6) in terms of the semigroup above.


To this end, let us introduce the problem of finding a pair k>0k>0 and ϕL+(D×V)\phi\in L^{+}_{\infty}(D\times V) such that, pointwise,

Ψ1[ϕ](r,υ)=kϕ(r,υ),rD,υV.\Psi_{1}[\phi](r,\upsilon)=k\phi(r,\upsilon),\qquad r\in D,\upsilon\in V. (2.12)

In the next section we will show the existence of a solution to (2.12) which also plays an important role in the asymptotic behaviour of Ψn\Psi_{n} as nn\to\infty. Before getting there, let us give a heuristic argument as to why (2.12) is another form of the eigenvalue problem (1.6).


By splitting on the first fission event, Ψn\Psi_{n} solves the following mild equation

Ψn[g](r,υ)=0Qs[Ψn1[g]](r,υ)ds,rD,υV,gL+(D×V),\Psi_{n}[g](r,\upsilon)=\int_{0}^{\infty}\texttt{Q}_{s}\left[{\mathcal{F}}\Psi_{n-1}[g]\right](r,\upsilon){\rm d}s,\qquad r\in D,\upsilon\in V,g\in L_{\infty}^{+}(D\times V), (2.13)

where (Qs,s0)(\texttt{Q}_{s},s\geq 0) is the expectation semigroup associated with the operator 𝒯+𝒮\mathcal{T}+\mathcal{S}. More precisely,

Qs[g](r,υ)=𝔼δ(r,υ)[e0sσf(Ru,Υu)dug(Rs,Υs)𝟏(s<τD)],\texttt{Q}_{s}[g](r,\upsilon)=\mathbb{E}_{\delta_{(r,\upsilon)}}\left[{\rm e}^{-\int_{0}^{s}\sigma_{\texttt{f}}(R_{u},\Upsilon_{u}){\rm d}u}g(R_{s},\Upsilon_{s})\mathbf{1}_{(s<\tau_{D})}\right], (2.14)

where (Rs,Υs)s0(R_{s},\Upsilon_{s})_{s\geq 0} is the σsπs\sigma_{\texttt{s}}\pi_{\texttt{s}}-NRW. Then, if the pair (k,ϕ)(k,\phi) solves (2.12), the strong Markov property along with an iteration implies that

knϕ(r,υ)=Ψn[ϕ](r,υ),rD,υV.k^{n}\phi(r,\upsilon)=\Psi_{n}[\phi](r,\upsilon),\qquad r\in D,\upsilon\in V.

Using it in (2.13) and dividing through by knk^{n} yields

ϕ(r,υ)=0Qs[1kϕ](r,υ)ds.\phi(r,\upsilon)=\int_{0}^{\infty}\texttt{Q}_{s}\left[\frac{1}{k}\mathcal{F}\phi\right](r,\upsilon){\rm d}s. (2.15)

Now set

Vt0tQs[g](r,υ)ds.V_{t}\coloneqq\int_{0}^{t}\texttt{Q}_{s}\left[g\right](r,\upsilon){\rm d}s.

Then, heuristically speaking, since Q is associated to the generator 𝒯+𝒮\mathcal{T}+\mathcal{S}, classical Feynman-Kac theory suggests that VtV_{t} ‘solves’ the equation

Vtt=(𝒯+𝒮)Vt+g.\frac{\partial V_{t}}{\partial t}=(\mathcal{T}+\mathcal{S})V_{t}+g.

with V0=0V_{0}=0. Note that Vt/t=Qt[g]\partial V_{t}/\partial t=\texttt{Q}_{t}[g], which tends to zero as tt\to\infty thanks to the transience of (R,Υ)(R,\Upsilon). Hence, taking g=k1ϕg={k}^{-1}\mathcal{F}\phi, letting tt\to\infty in the above equation, recalling that (Qs,s0)(\texttt{Q}_{s},s\geq 0) is the expectation semigroup associated with the operator 𝒯+𝒮\mathcal{T}+\mathcal{S}, and using the identity (2.15) yields

0=(𝒯+𝒮)ϕ+1kϕ.0=(\mathcal{T}+\mathcal{S})\phi+\frac{1}{k}\mathcal{F}\phi.

3 Probabilistic solution to (1.6)

In this section we state our main result regarding the existence of the eigenvalue and eigenfunction as specified by (2.12). We are once more motivated by the ideas presented in [3] and will use some of the techniques that were further developed in [10].


We start by constructing the many-to-one formula that is associated to the semigroup (Ψn,n0)(\Psi_{n},n\geq 0) in the spirit of the two representations of (ψt,t0)(\psi_{t},t\geq 0) given in Sections 2.2 and 2.3. In this case it takes a slightly different form to the one in the time-dependent case. For ease of notation, let

m(r,υ)Vπf(r,υ,υ)dυ,m(r,\upsilon)\coloneqq\int_{V}\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime}){\rm d}\upsilon^{\prime},

denote the mean number of offspring generated by a fission event at (r,υ)(r,\upsilon), and let (Tn,n1)(T_{n},n\geq 1) denote the time of the scatter event in the απ\alpha\pi-NRW that corresponds to the nn-th fission event in the corresponding NBP, XX.

More formally, referring to, (2.8), we can think of the απ\alpha\pi-NRW at each scatter event as follows. For k1k\geq 1, when the NRW (R,Υ)(R,\Upsilon) scatters for the kk-th time at (r,υ)(r,\upsilon) (with rate α(r,υ)\alpha(r,\upsilon)), a coin is tossed and the random variable Ik(r,υ)\texttt{I}_{k}(r,\upsilon) takes the value 11 with probability σf(r,υ)m(r,υ)/(σs(r,υ)+σf(r,υ)m(r,υ)){\sigma_{\texttt{f}}(r,\upsilon)m(r,\upsilon)}/{({\sigma_{\texttt{s}}(r,\upsilon)+\sigma_{\texttt{f}}(r,\upsilon)m(r,\upsilon)})} and its new velocity, is selected according to an independent copy of the random variable Θkf(r,υ)\Theta^{\texttt{f}}_{k}(r,\upsilon), whose distribution has probability density πf(r,υ,υ)/m(r,υ)\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime})/m(r,\upsilon). On the other hand, with probability σs(r,υ)/(σs(r,υ)+σf(r,υ)m(r,υ)){\sigma_{\texttt{s}}(r,\upsilon)}/({\sigma_{\texttt{s}}(r,\upsilon)+\sigma_{\texttt{f}}(r,\upsilon)m(r,\upsilon)}) the random variable Ik(r,υ)\texttt{I}_{k}(r,\upsilon) takes the value 0 and its new velocity, is selected according to an independent copy of the random variable Θks(r,υ)\Theta^{\texttt{s}}_{k}(r,\upsilon), whose distribution has probability density πs(r,υ,υ)\pi_{\texttt{s}}(r,\upsilon,\upsilon^{\prime}). As such, the velocity immediately after the nn-th scatter of the NRW, given that the position-velocity configuration immediately before is (r,υ)(r,\upsilon), is coded by the random variable

Ik(r,υ)Θkf(r,υ)+(1Ik(r,υ))Θks(r,υ).\texttt{I}_{k}(r,\upsilon)\Theta^{\texttt{f}}_{k}(r,\upsilon)+(1-\texttt{I}_{k}(r,\upsilon))\Theta^{\texttt{s}}_{k}(r,\upsilon).

We thus can identify sequentially, T0=0T_{0}=0 and, for n1n\geq 1,

Tn=inf{t>Tn1:ΥtΥt and Ikt(Rt,Υt)=1},T_{n}=\inf\{t>T_{n-1}:\Upsilon_{t}\neq\Upsilon_{t-}\text{ and }\texttt{I}_{k_{t}}(R_{t},\Upsilon_{t-})=1\},

where (kt,t0)(k_{t},t\geq 0) is the process counting the number of scattering events of the NRW up to time tt.

Note, for the above construction of indicators to make sense, we should at least have some region of space for which fission can take place. As such the assumption (H3) becomes relevant here. Analogously to Lemma 2.1, we have the following many-to-one formula associated with the NBP.

Lemma 3.1.

Suppose (H1), (H2) and (H3) hold. The solution to (2.13) among the class of expectation semigroups is unique for gL+(D×V)g\in L_{\infty}^{+}(D\times V) and the semigroup (Ψn,n0)(\Psi_{n},n\geq 0) may alternatively be represented444Here, we define i=101\prod_{i=1}^{0}\cdot\coloneqq 1. as

Ψn[g](r,υ)=𝐄(r,υ)[i=1nm(RTi,ΥTi)g(RTn,ΥTn)𝟏(Tn<κD)],rD,υV,n1,\Psi_{n}[g](r,\upsilon)=\mathbf{E}_{(r,\upsilon)}\left[\prod_{i=1}^{n}m(R_{T_{i}},\Upsilon_{T_{i}-})g(R_{T_{n}},\Upsilon_{T_{n}})\mathbf{1}_{(T_{n}<\kappa^{D})}\right],\qquad r\in D,\upsilon\in V,n\geq 1, (3.1)

(with Ψ0[g]=g\Psi_{0}[g]=g), where (Rt,Υt)t0(R_{t},\Upsilon_{t})_{t\geq 0} is the απ\alpha\pi-NRW, and

κDinf{t>0:RtD}.\kappa^{D}\coloneqq\inf\{t>0:R_{t}\notin D\}.
Proof.

We first note that the sequence (Ψn,n0)(\Psi_{n},n\geq 0) as defined in (3.1) is a semigroup since, due to the strong Markov property, we have

Ψn+m[g](r,υ)\displaystyle\Psi_{n+m}[g](r,\upsilon)
=𝐄(r,υ)[𝐄[i=1n+mm(RTi,ΥTi)g(RTn+m,ΥTn+m)𝟏(Tn+m<κD)|n]]\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[\mathbf{E}\left[\prod_{i=1}^{n+m}m(R_{T_{i}},\Upsilon_{T_{i}-})g(R_{T_{n+m}},\Upsilon_{T_{n+m}})\mathbf{1}_{(T_{n+m}<\kappa^{D})}\bigg{|}\mathcal{F}_{n}\right]\right]
=𝐄(r,υ)[i=1nm(RTi,ΥTi)𝐄(RTn,ΥTn)[i=1mm(RTi,ΥTi)g(RTm,ΥTm)𝟏(Tm<κD)]𝟏(Tn<κD)]\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[\prod_{i=1}^{n}m(R_{T_{i}},\Upsilon_{T_{i}-})\mathbf{E}_{(R_{T_{n}},\Upsilon_{T_{n}})}\left[\prod_{i=1}^{m}m(R_{T_{i}},\Upsilon_{T_{i}-})g(R_{T_{m}},\Upsilon_{T_{m}})\mathbf{1}_{(T_{m}<\kappa^{D})}\right]\mathbf{1}_{(T_{n}<\kappa^{D})}\right]
=Ψn[Ψm[g]](r,υ),rD,υV.\displaystyle=\Psi_{n}[\Psi_{m}[g]](r,\upsilon),\qquad r\in D,\upsilon\in V.

In order to show that Ψn\Psi_{n} as defined in (3.1) does indeed solve (2.13), we consider the process at time T1T_{1}. Before doing so, we first note that the απ\alpha\pi-NRW is exactly the same as the σsπs\sigma_{\texttt{s}}\pi_{\texttt{s}}-NRW over the time interval [0,T1)[0,T_{1}) and, at time T1T_{1}, the velocity of the former is chosen according to the expectation operator

~[g](r,υ)Vg(r,υ)πf(r,υ,υ)m(r,υ)dυ.\tilde{\mathcal{F}}[g](r,\upsilon)\coloneqq\int_{V}g(r,\upsilon^{\prime})\frac{\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime})}{m(r,\upsilon)}{\rm d}\upsilon^{\prime}.

Then, applying the strong Markov property at time T1T_{1},

Ψn[g](r,υ)\displaystyle\Psi_{n}[g](r,\upsilon) =𝐄(r,υ)[i=1nm(RTi,ΥTi)g(RTn,ΥTn)𝟏(Tn<κD)]\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[\prod_{i=1}^{n}m(R_{T_{i}},\Upsilon_{T_{i}-})g(R_{T_{n}},\Upsilon_{T_{n}})\mathbf{1}_{(T_{n}<\kappa^{D})}\right]
=𝐄(r,υ)[m(RT1,ΥT1)~[Ψn1[g]](RT1,ΥT1)𝟏(T1<κD)]\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[m(R_{T_{1}},\Upsilon_{T_{1}-})\tilde{\mathcal{F}}[\Psi_{n-1}[g]](R_{T_{1}},\Upsilon_{T_{1}-})\mathbf{1}_{(T_{1}<\kappa^{D})}\right]
=0𝐄(r,υ)[σf(Rs,Υs)e0sσf(Ru,Υu)dum(Rs,Υs)~[Ψn1[g]](Rs,Υs)𝟏(s<κD)]ds\displaystyle=\int_{0}^{\infty}\mathbf{E}_{(r,\upsilon)}\left[\sigma_{\texttt{f}}(R_{s},\Upsilon_{s}){\rm e}^{-\int_{0}^{s}{\sigma_{\texttt{f}}(R_{u},\Upsilon_{u}){\rm d}u}}m(R_{s},\Upsilon_{s-})\tilde{\mathcal{F}}[\Psi_{n-1}[g]](R_{s},\Upsilon_{s-})\mathbf{1}_{(s<\kappa^{D})}\right]{\rm d}s
=0Qs[Ψn1[g]](r,υ)ds,\displaystyle=\int_{0}^{\infty}\texttt{Q}_{s}[\mathcal{F}\Psi_{n-1}[g]](r,\upsilon){\rm d}s,

where the final equality follows from the fact that mσf~=m\sigma_{\texttt{f}}\tilde{\mathcal{F}}=\mathcal{F}.


It remains to show that (2.13) has a unique solution for gL+(D×V)g\in L_{\infty}^{+}(D\times V) among the class of expectation semigroups, suppose that (Ψn,n0)(\Psi_{n}^{\prime},n\geq 0) is another such solution with Ψ0=gL+(D×V)\Psi_{0}^{\prime}=g\in L^{+}_{\infty}(D\times V). Define Φn=ΨnΨn\Phi_{n}=\Psi_{n}-\Psi_{n}^{\prime}, for n0n\geq 0, and note by linearity that (Φn,n0)(\Phi_{n},n\geq 0) is another expectation semigroup with Φ0=0\Phi_{0}=0. Moreover, by linearity (Φn,n0)(\Phi_{n},n\geq 0) also solves (2.13). On account of this, it is straightforward to see by induction that if Φn=0\Phi_{n}=0 then Φn+1=0\Phi_{n+1}=0. The uniqueness of (2.13) in the class of expectation semigroups thus follows. ∎

The next result will provide the existence of a solution to (2.12) by working directly with a variant of the semigroup (Ψn,n0)(\Psi_{n},n\geq 0). To this end, note that, under the assumption (H4), for non-negative functions gg that are bounded by one, say, we have

𝔼δ(r,υ)[g,𝒳1]g𝔼δ(r,υ)[1,𝒳1]Nmax.\mathbb{E}_{\delta_{(r,\upsilon)}}\left[\langle g,\mathcal{X}_{1}\rangle\right]\leq\|g\|_{\infty}\mathbb{E}_{\delta_{(r,\upsilon)}}\left[\langle 1,\mathcal{X}_{1}\rangle\right]\leq N_{\texttt{max}}. (3.2)

Dividing both sides of the above inequality yields a sub-Markovian semigroup. Indeed,

Ψn[g](r,υ)\displaystyle\Psi^{\dagger}_{n}[g](r,\upsilon) NmaxnΨn[g](r,υ)\displaystyle\coloneqq N_{\texttt{max}}^{-n}\Psi_{n}[g](r,\upsilon)
=𝐄(r,υ)[i=1nm(RTi,ΥTi)Nmaxg(RTn,ΥTn)𝟏(Tn<κD)]\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[\prod_{i=1}^{n}\frac{m(R_{T_{i}},\Upsilon_{T_{i}-})}{N_{\texttt{max}}}g(R_{T_{n}},\Upsilon_{T_{n}})\mathbf{1}_{(T_{n}<\kappa^{D})}\right]
=𝐄(r,υ)[g(RTn,ΥTn)𝟏(Tn<κD,n<Γ)]\displaystyle=\mathbf{E}_{(r,\upsilon)}\left[g(R_{T_{n}},\Upsilon_{T_{n}})\mathbf{1}_{(T_{n}<\kappa^{D},\,n<\Gamma)}\right]
𝐄(r,υ)[g(RTn,ΥTn)],\displaystyle\eqqcolon\mathbf{E}_{(r,\upsilon)}^{\dagger}\left[g(R_{T_{n}},\Upsilon_{T_{n}})\right], (3.3)

where Γ=min{n0:Kn(RTn,ΥTn)=1}\Gamma=\min\{n\geq 0:\texttt{K}_{n}(R_{T_{n}},\Upsilon_{T_{n}-})=1\} where, for n0n\geq 0, rDr\in D and υV\upsilon\in V, the random variable Kn(r,υ)\texttt{K}_{n}(r,\upsilon) is an independent indicator random variable which is equal to 0 with probability m(r,υ)/Nmax{m(r,\upsilon)}/{N_{\texttt{max}}} (note, from the assumptions in Section 1, it follows that suprD,υVm(r,υ)Nmax\sup_{r\in D,\upsilon\in V}m(r,\upsilon)\leq N_{\texttt{max}}).


We are now ready to state the main result of this section, and indeed the article. As its proof is quite lengthy we will delay it until Section 5. We will need the following stronger assumption of (H3):

(H3): The fission cross section satisfies infrD,υ,υVσf(r,υ)πf(r,υ,υ)>0\textstyle{\inf_{r\in D,\upsilon,\upsilon^{\prime}\in V}\sigma_{\texttt{f}}(r,\upsilon)\pi_{\texttt{f}}(r,\upsilon,\upsilon^{\prime})>0}.

Theorem 3.1.

Under the assumptions (H1), (H3) and (H4), for the semigroup (Ψn,n0)(\Psi_{n},n\geq 0) identified by (2.13), there exist kk_{*}\in\mathbb{R}, a positive555To be precise, by a positive eigenfunction, we mean a mapping from D×V(0,)D\times V\to(0,\infty). This does not prevent it being valued zero on D\partial D, as DD is open. right eigenfunction φL+(D×V)\varphi\in L^{+}_{\infty}(D\times V) and a left eigenmeasure, η\eta, on D×VD\times V, both having associated eigenvalue knk_{*}^{n}. Moreover, kk_{*} is the leading eigenvalue in the sense that, for all gL+(D×V)g\in L^{+}_{\infty}(D\times V),

η,Ψn[g]=knη,g(resp. Ψn[φ]=knφ)n0,\langle\eta,\Psi_{n}[g]\rangle=k_{*}^{n}\langle\eta,g\rangle\quad\text{(resp. }\Psi_{n}[\varphi]=k_{*}^{n}\varphi\text{)}\quad n\geq 0, (3.4)

and there exists γ>1\gamma>1 such that, for all gL+(D×V)g\in L^{+}_{\infty}(D\times V),

supgL+(D×V):g1knφ1Ψn[g]η,g=O(γn) as n+.\sup_{g\in L_{\infty}^{+}(D\times V):\lVert g\rVert_{\infty}\leq 1}\left\|k_{*}^{-n}{\varphi}^{-1}{\Psi_{n}[g]}-\langle\eta,g\rangle\right\|_{\infty}=O(\gamma^{-n})\text{ as }n\rightarrow+\infty. (3.5)

4 Classical existence of solution to (1.6)

Our objective here is to make rigorous the sense in which solving (2.12) is consistent with solving the eigenvalue problem (1.6) in the classical sense.


We begin by considering the abstract Cauchy problem (ACP) on L2(D×V)L_{2}(D\times V),

{tut=(𝒯+𝒮)utu0=g.\left\{\begin{array}[]{rl}\dfrac{\partial}{\partial t}u_{t}&=({\mathcal{T}}+{\mathcal{S}})u_{t}\\ u_{0}&=g.\end{array}\right. (4.1)

Then, just as in the spirit of Theorems 2.1 and 2.3, it is not difficult to show that the operator (𝒯+𝒮,Dom(𝒯+𝒮))(\mathcal{T}+\mathcal{S},{\rm Dom}(\mathcal{T}+\mathcal{S})) generates a unique solution to (4.1) via the c0c_{0}-semigroup (𝒱t,t0)(\mathcal{V}_{t},t\geq 0) given by

𝒱t[g]exp(t(𝒯+𝒮))g,\mathcal{V}_{t}[g]\coloneqq{\rm exp}(t(\mathcal{T}+\mathcal{S}))g,

on L2(D×V)L_{2}(D\times V) (and hence for gL2(D×V)g\in L_{2}(D\times V)). Moreover, we have that the expectation semigroup (Qt,t0)(\texttt{Q}_{t},t\geq 0) agrees with (𝒱t,t0)(\mathcal{V}_{t},t\geq 0) on L2(D×V)L_{2}(D\times V), providing gL+(D×V)g\in L^{+}_{\infty}(D\times V). This latter claim follows the same idea as the proof of Theorem 8.1 in [5].


The equivalence of the semigroups (Qt,t0)(\texttt{Q}_{t},t\geq 0) and (𝒱t,t0)(\mathcal{V}_{t},t\geq 0) is what we will use to identify a classical (in the L2L_{2}-sense) and probabilistic meaning to (1.6). We start by showing the classical existence of a solution to (1.6) on L2(D×V)L_{2}(D\times V). We note that this problem has been previously considered in [14, 15]. In [14], the author converted the criticality problem (1.6) into a time-dependent problem in order to exploit the existing theory for time-dependent problems, whereas the methods used in [15, Section 5.11] are similar to those presented in [5]. Another more restrictive version of assumption (H2) is needed, which also implies that (H3) holds:


(H5): We have σs(r,υ)πs(r,υ,υ)>0\sigma_{\emph{{s}}}(r,\upsilon)\pi_{\emph{{s}}}(r,\upsilon,\upsilon^{\prime})>0 and σf(r,υ)πf(r,υ,υ)>0\sigma_{\emph{{f}}}(r,\upsilon)\pi_{\emph{{f}}}(r,\upsilon,\upsilon^{\prime})>0 on D×V×VD\times V\times V.

Theorem 4.1.

Suppose that the cross sections σfπf\sigma_{\emph{{f}}}\pi_{\emph{{f}}} and σsπs\sigma_{\emph{{s}}}\pi_{\emph{{s}}} are piecewise continuous666A function is piecewise continuous if its domain can be divided into an exhaustive finite partition (e.g. polytopes) such that there is continuity in each element of the partition. This is precisely how cross sections are stored in numerical libraries for modelling of nuclear reactor cores.. Further, assume that (H1) and (H5) hold. Then there exist a real eigenvalue k>0k>0 and associated eigenfunction ϕL2+(D×V)\phi\in L_{2}^{+}(D\times V) such that (1.6) holds on L2(D×V)L_{2}(D\times V). Moreover, kk can be explicitly identified as

k=sup{|λ|:(𝒯+𝒮)ϕ+1λϕ=0 for some ϕL2(D×V)}.k=\sup\left\{|\lambda|:(\mathcal{T}+\mathcal{S})\phi+\frac{1}{\lambda}\mathcal{F}\phi=0\text{ for some }\phi\in L_{2}(D\times V)\right\}. (4.2)
Proof.

We start by considering a related eigenvalue problem. First recall from [5] that, due to the transience of 𝒯\mathcal{T} on DD, there exist constants M1,ω>0M_{1},\omega>0 such that et𝒯M1eωt\|{\rm e}^{t\mathcal{T}}\|\leq M_{1}{\rm e}^{-\omega t} for each t0t\geq 0. Further, since 𝒮\mathcal{S} is conservative, there exists M2>0M_{2}>0 such that777We use the standard definition of operator norm, namely A=supf21𝒜f2\lVert A\rVert=\sup_{\lVert f\rVert_{2}\leq 1}\lVert\mathcal{A}f\rVert_{2}, where, as before, 2\lVert\cdot\rVert_{2} is the usual norm on L2(D×V)L_{2}(D\times V). et𝒮M2\|{\rm e}^{t\mathcal{S}}\|\leq M_{2}, t0t\geq 0. Hence 𝒱tMeωt,t0\|\mathcal{V}_{t}\|\leq M{\rm e}^{-\omega t},t\geq 0, where M=M1M2M=M_{1}M_{2}. Then, classical semigroup theory [18] gives the existence of the resolvent operator (λ(𝒯+𝒮))1(\lambda\mathcal{I}-(\mathcal{T}+\mathcal{S}))^{-1} for all λ\lambda such that Reλ>ω{\rm Re}\lambda>-\omega, where \mathcal{I} is the identity operator on L2(D×V)L_{2}(D\times V). In particular, the resolvent is well-defined for λ=0\lambda=0. Hence, the eigenvalue problem (1.6) is equivalent to

(𝒯+𝒮)1ϕ=kϕ.-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\phi=k\phi. (4.3)

Due to the assumptions (H1) and (H5), almost identical arguments to those given in the proof of [5, Proposition 9.1] show that (𝒯+𝒮)1-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F} is a positive, compact, irreducible operator. Concluding in the same way as the aforementioned proposition, de Pagter’s theorem [15, Theorem 5.7] implies that its spectral radius, r((𝒯+𝒮)1)r(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}), is strictly positive. It follows from the Krein-Rutman Theorem [5, Theorem 9.1] that kr((𝒯+𝒮)1)sup{|λ|:(𝒯+𝒮)1ϕ=λϕ for some ϕL2(D×V)}k\coloneqq r(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F})\coloneqq\sup\{|\lambda|:-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\phi=\lambda\phi\text{ for some }\phi\in L_{2}(D\times V)\} is the leading eigenvalue of the operator (𝒯+𝒮)1-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F} with corresponding positive eigenfunction ϕ\phi. ∎

In a similar manner to [5], we are able to provide more information about the structure of the spectrum of the operator (𝒯+𝒮)1-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}.

Proposition 4.1.

Under the assumptions of Theorem 4.1, the part of the spectrum given by σ((𝒯+𝒮)1){λ:Re(λ)>0}\sigma(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F})\cap\{\lambda:{\rm Re}(\lambda)>0\} consists of finitely many eigenvalues with finite multiplicities. In particular, kk is both algebraically and geometrically simple888An eigenvalue λ\lambda associated with an operator AA is geometrically simple if dim(ker(λIA))=1({\rm ker}(\lambda I-A))=1 and algebraically simple if supk1dim(ker(λIA)k)=1\sup_{k\geq 1}{\rm dim}({\rm ker}(\lambda I-A)^{k})=1.

Proof.

We follow the idea of the proof of [15, Theorem 4.13] and consider the invertibility of the operator λ+(𝒯+𝒮)1\lambda\mathcal{I}+(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F} by considering the following problem,

(+1λ(𝒯+𝒮)1)f=1λg,\left(\mathcal{I}+\frac{1}{\lambda}(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\right)f=\frac{1}{\lambda}g,

for λσ((𝒯+𝒮)1){λ:Re(λ)>0}\lambda\in\sigma(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F})\cap\{\lambda:{\rm Re}(\lambda)>0\}. Note that this latter set is non-empty on account of the previous theorem.


As stated in the proof of Theorem 4.1, the operator λ1(𝒯+𝒮)1-{\lambda}^{-1}(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F} is compact in L2(D×V)L_{2}(D\times V) so that by Gohberg-Shmulyan’s Theorem [16], (+λ1(𝒯+𝒮)1)1\left(\mathcal{I}+{\lambda}^{-1}(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\right)^{-1} exists except for a finite set of discrete degenerate poles. This implies that (λ+(𝒯+𝒮)1)1,λσ((𝒯+𝒮)1){λ:Re(λ)>0}\left(\lambda\mathcal{I}+(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\right)^{-1},\lambda\in\sigma(-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F})\cap\{\lambda:{\rm Re}(\lambda)>0\} exists except for a finite set of eigenvalues with finite multiplicities.

We now prove that kk is a simple eigenvalue of the operator (𝒯+𝒮)1-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}. In order to do so, we need to consider the adjoint eigenvalue problem

(𝒯+𝒮)1ϕ=kϕ,\mathcal{F}^{\top}({\mathcal{T}}^{\top}+\mathcal{S}^{\top})^{-1}\phi^{\top}=k^{\top}\phi^{\top}, (4.4)

where 𝒯{\mathcal{T}}^{\top} denotes the adjoint of 𝒯{\mathcal{T}}, with similar definitions for \mathcal{F}^{\top} and 𝒮\mathcal{S}^{\top}.


We first note that, since the operator 𝒯+𝒮\mathcal{T}^{\top}+\mathcal{S}^{\top} enjoys similar properties to 𝒯+𝒮\mathcal{T}+\mathcal{S}, the same methods as those given in the proof of Theorem 4.2 apply to give existence of a leading eigenvalue kk^{\top} and corresponding eigenfunction ϕ\phi^{\top}. Now, due to [11, p.184], if λ\lambda is an isolated eigenvalue of (𝒯+𝒮)1-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}, then its complex conjugate, λ¯\bar{\lambda}, is an isolated eigenvalue of the adjoint of (𝒯+𝒮)1-(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F} with the same multiplicity. Equivalently, for each isolated λ\lambda solving (1.6) with eigenfunction ϕ\phi, λ¯\bar{\lambda} solves (4.4) with a corresponding eigenfunction ϕ\phi^{\top} and has the same multiplicity as λ\lambda. In particular, since kk is real, it follows that the leading eigenvalue associated with (4.4) is also kk. These observations along with similar arguments to those presented in [7, Theorem 7(iii)] and [19] yield geometric simplicity of kk. Then straightforward adaptations of the arguments in [7, Remark 12] yield algebraic simplicity. ∎

The next result shows that if we can find a solution to (1.6), then it must necessarily agree with the eigensolution constructed in Theorem 3.1 on L2(D×V)L_{2}(D\times V).

Theorem 4.2.

Suppose the assumptions of Theorem 4.1 are in force999Note that these assumptions imply those required for Theorem 3.1., that (k,ϕ)(k_{*},\phi_{*}) solves (2.12) and (k,ϕ)(k,\phi) denotes the leading eigensolution to (1.6). Then k=kk=k_{*}, and, up to a positive constant multiple, ϕ\phi agrees with ϕ\phi_{*} on L2(D×V)L_{2}(D\times V).

Proof.

Recall the semigroup, (𝒱t)t0(\mathcal{V}_{t})_{t\geq 0}, generated by 𝒯+𝒮\mathcal{T}+\mathcal{S} and note that, due to boundedness of the operator \mathcal{F}, if gLp(D×V)g\in L_{p}(D\times V), then gLp(D×V)\mathcal{F}g\in L_{p}(D\times V), p[1,]p\in[1,\infty]. Thanks to [8, Chapter II, Lemma 1.3], (𝒱t)t0(\mathcal{V}_{t})_{t\geq 0} satisfies

𝒱t[g]=(𝒯+𝒮)0t𝒱s[g]ds+g.\mathcal{V}_{t}[\mathcal{F}g]=(\mathcal{T}+\mathcal{S})\int_{0}^{t}\mathcal{V}_{s}[\mathcal{F}g]{\rm d}s+\mathcal{F}g. (4.5)

Letting tt\to\infty in the above equation, we obtain

0=(𝒯+𝒮)0𝒱s[g]ds+g,0=(\mathcal{T}+\mathcal{S})\int_{0}^{\infty}\mathcal{V}_{s}[\mathcal{F}g]{\rm d}s+\mathcal{F}g, (4.6)

which follows from the fact that (𝒯+𝒮)(\mathcal{T}+\mathcal{S}) is a transient operator so that limt𝒱t[g]=0\lim_{t\to\infty}\mathcal{V}_{t}[g]=0. Setting g=ϕg=\phi_{*} in (4.6) and using the fact that (Qs,s0)(\texttt{Q}_{s},s\geq 0) and (𝒱s,s0)(\mathcal{V}_{s},s\geq 0) agree on L2(D×V)L_{2}(D\times V), providing gL+(D×V)g\in L^{+}_{\infty}(D\times V), yields

0=(𝒯+𝒮)0Qs[ϕ]ds+ϕ.0=(\mathcal{T}+\mathcal{S})\int_{0}^{\infty}\texttt{Q}_{s}[\mathcal{F}\phi_{*}]{\rm d}s+\mathcal{F}\phi_{*}. (4.7)

Now taking advantage of (2.12) for ϕ\phi_{*}, noting in particular (2.13), we have

0Qs[ϕ]=Ψ1[ϕ]=kϕ.\int_{0}^{\infty}\texttt{Q}_{s}[\mathcal{F}\phi_{*}]=\Psi_{1}[\phi_{*}]=k_{*}\phi_{*}. (4.8)

Substituting this into (4.7) shows that (k,ϕ)(k_{*},\phi_{*}) is a solution to (1.6) on L2(D×V)L_{2}(D\times V).


To conclude the proof, we first show that k=kk_{*}=k. Again, consider the adjoint problem (4.4) and note that

0\displaystyle 0 =(𝒯+𝒮)1ϕ,ϕ(𝒯+𝒮)1ϕ,ϕ\displaystyle=\langle(\mathcal{T}+\mathcal{S})^{-1}\mathcal{F}\phi_{*},\phi^{\top}\rangle-\langle\mathcal{F}^{\top}(\mathcal{T}^{\top}+\mathcal{S}^{\top})^{-1}\phi^{\top},\phi_{*}\rangle
=(kk)ϕ,ϕ.\displaystyle=(k-k_{*})\langle\phi^{\top},\phi_{*}\rangle.

Since ϕ\phi_{*} and ϕ\phi^{\top} are positive, we must have k=kk_{*}=k. Due to simplicity of kk from the previous proposition, it follows that ϕ=ϕ\phi=\phi_{*} up to a multiplicative constant. ∎

5 Proof of Theorem 3.1

As previously stated, our methods of proving Theorem 3.1 are motivated by those used in [10, 3]. The main part of the proof comes from [3, Theorem 2.1], which we restate (in the language of the desired application) here for convenience. To this end, recalling the notation in (3.3), define

k=Γmin{n1:TnκD}.\texttt{k}=\Gamma\wedge\min\{n\geq 1:T_{n}\geq\kappa^{D}\}.
Theorem 5.1.

Suppose that (H1), (H3) and (H4) are in force. Suppose that there exists a probability measure ν\nu on D×VD\times V such that

  1. (A1)

    there exist n0n_{0}, c1>0c_{1}>0 such that for each (r,υ)D×V(r,\upsilon)\in D\times V,

    𝐏(r,υ)((RTn0,ΥTn0)|n0<k)c1ν();\mathbf{P}_{(r,\upsilon)}((R_{T_{n_{0}}},\Upsilon_{T_{n_{0}}})\in\cdot\;|n_{0}<\texttt{k})\geq c_{1}\nu(\cdot);
  2. (A2)

    there exists a constant c2>0c_{2}>0 such that for each (r,υ)D×V(r,\upsilon)\in D\times V and for every n0n\geq 0,

    𝐏ν(n<k)c2𝐏(r,υ)(n<k).\mathbf{P}_{\nu}(n<\texttt{k})\geq c_{2}\mathbf{P}_{(r,\upsilon)}(n<\texttt{k}).

Then, there exists kc(0,1)k_{c}\in(0,1) such that, there exist an eigenmeasure η\eta on D×VD\times V and a positive right eigenfunction φ\varphi of Ψn\Psi_{n}^{\dagger} (defined in (3.3)) with eigenvalue kcnk_{c}^{n}, such that η\eta is a probability measure and φL+(D×V)\varphi\in L^{+}_{\infty}(D\times V), i.e. for all gL(D×V)g\in L_{\infty}(D\times V),

η[Ψn[g]]=kcnη[g]andΨn[φ]=kcnφn0.\eta[\Psi^{\dagger}_{n}[g]]=k_{c}^{n}\eta[g]\quad\text{and}\quad\Psi^{\dagger}_{n}[\varphi]=k_{c}^{n}\varphi\quad n\geq 0. (5.1)

Moreover, there exist C,γ>0C,\gamma>0 such that, for gL+(D×V)g\in L^{+}_{\infty}(D\times V) and nn sufficiently large (independently of gg),

kcnφ1Ψn[g]η[g]Cγng.\left\|k_{c}^{-n}\varphi^{-1}\Psi_{n}^{\dagger}[g]-\eta[g]\right\|_{\infty}\leq C\gamma^{-n}\|g\|_{\infty}. (5.2)

In particular, setting g1g\equiv 1, as nn\to\infty,

kcnφ1𝐏(n<k)1Cγn.\left\|k_{c}^{-n}\varphi^{-1}\mathbf{P}_{\cdot}(n<\texttt{k})-1\right\|_{\infty}\leq C\gamma^{-n}. (5.3)

It is then straightforward to conclude that η\eta and φ\varphi are the left eigenmeasure and right eigenfunction corresponding to the eigenvalue k=kcNmaxk_{*}=k_{c}N_{\texttt{max}} for the semigroup Ψn\Psi_{n}.


We now proceed to the proof of Theorem 5.1. We will use the notation JkJ_{k} to denote the kthk^{th} scatter event of the random walk (R,Υ)(R,\Upsilon) under 𝐏{\bf P}^{\dagger} and recall that TkT_{k} denotes the scatter event that corresponds to the kthk^{th} fission event in the original NBP. The basis of our proof relies on the fact that, for each k1k\geq 1, Tk=JkT_{k}=J_{k} with positive probability.


A fundamental part of the proof of (A1) and (A2) is the following lemma. We refer the reader to [10, Lemma 7.3] for its proof.

Lemma 5.1.

Under the assumptions of Theorem 5.1, for all rDr\in D and υV\upsilon\in V, we have

𝐏(r,υ)(J7<k,RJ7dz)C𝟏(zD)dz,\mathbf{P}_{(r,\upsilon)}^{\dagger}(J_{7}<\emph{{k}},R_{J_{7}}\in{\rm d}z)\leq C\mathbf{1}_{(z\in D)}\,{\rm d}z, (5.4)

for some constant C>0C>0, and

𝐏ν(J1<k,RJ1dz)c𝟏(zD)dz,\mathbf{P}^{\dagger}_{\nu}(J_{1}<\emph{{k}},R_{J_{1}}\in{\rm d}z)\geq c\mathbf{1}_{(z\in D)}\,{\rm d}z, (5.5)

for another constant c>0c>0, where ν\nu is Lebesgue measure on D×VD\times V.

Proof of (A1).

In order to prove (A1), we use similar arguments to those presented in the proof of (5.5). To this end, fix r0Dr_{0}\in D and suppose Υ0\Upsilon_{0} is uniformly distributed on VV. Then, due to the assumptions (H1) and (H3), the techniques used in [10] to prove (5.5) yield

𝐄(r0,Υ0)[f(RJ1)𝟏(T1=J1)]C0Ddz𝟏([r,z]D)f(z).\mathbf{E}_{(r_{0},\Upsilon_{0})}\left[f(R_{J_{1}})\mathbf{1}_{(T_{1}=J_{1})}\right]\geq C_{0}\int_{D}{\rm d}z\mathbf{1}_{([r,z]\subset D)}f(z). (5.6)

Recall the (deterministic) quantity κr0,υ0D\kappa_{r_{0},\upsilon_{0}}^{D}, for r0Dr_{0}\in D, υ0V\upsilon_{0}\in V, defined in Theorem 2.7. Also note that due to (H3), π\pi is bounded below by a constant (see discussion just before Lemma 7.2 of [10]). Using this, along with the strong Markov property, (H1) and (5.6), we have

𝔼(r0,υ0)[f(RT2,ΥT2)𝟙(T2=J2)]\displaystyle\mathbb{E}_{(r_{0},\upsilon_{0})}^{\dagger}[f(R_{T_{2}},\Upsilon_{T_{2}})\mathds{1}_{(T_{2}=J_{2})}] C10κr0,υ0Ddseα¯sπ¯Vdυ1𝐄(r0+υ0s,υ1)[f(RJ1,ΥJ1)𝟙(T1=J1)]\displaystyle\geq C_{1}\int_{0}^{\kappa_{r_{0},\upsilon_{0}}^{D}}{\rm d}s{\rm e}^{-\bar{\alpha}s}\underline{\pi}\int_{V}{\rm d}\upsilon_{1}\mathbf{E}_{(r_{0}+\upsilon_{0}s,\upsilon_{1})}^{\dagger}[f(R_{J_{1}},\Upsilon_{J_{1}})\mathds{1}_{(T_{1}=J_{1})}]
C2κr0,υ0DDdrVdυf(r,υ).\displaystyle\geq C_{2}\kappa_{r_{0},\upsilon_{0}}^{D}\int_{D}{\rm d}r\int_{V}{\rm d}\upsilon f(r,\upsilon). (5.7)

Finally, we note that due to (H1) and (H3),

𝐏(r0,υ0)(T2<k)𝐏(J1<k)0κr0,υ0Ddsα¯eα¯sC3κr0,υ0D.\mathbf{P}_{(r_{0},\upsilon_{0})}^{\dagger}(T_{2}<\texttt{k})\leq\mathbf{P}^{\dagger}(J_{1}<\texttt{k})\leq\int_{0}^{\kappa_{r_{0},\upsilon_{0}}^{D}}{\rm d}s\bar{\alpha}{\rm e}^{-\underline{\alpha}s}\leq C_{3}\kappa_{r_{0},\upsilon_{0}}^{D}. (5.8)

Combining this with (5.7) yields (A1) with ν\nu as Lebesgue measure on D×VD\times V and n0=2n_{0}=2. ∎

We now prove (A2). Again, we use a similar method to the one used in [10], however, we state the proof in full to illustrate where the differences occur.

Proof of A2.

Let n7n\geq 7 and note that TnJ7TnT7T_{n}-J_{7}\geq T_{n}-T_{7}. This and the strong Markov property imply

𝐏(r,υ)(n<k)\displaystyle\mathbf{P}_{(r,\upsilon)}(n<\texttt{k}) 𝐄(r,υ)[𝐏(RJ7,ΥJ7)(n7<k)]\displaystyle\leq\mathbf{E}_{(r,\upsilon)}^{\dagger}\left[\mathbf{P}_{(R_{J_{7}},\Upsilon_{J_{7}})}\left(n-7<\texttt{k}\right)\right]
CDV𝐏(z,w)(n7<k)dzdw,\displaystyle\leq C^{\prime}\int_{D}\int_{V}\mathbf{P}_{(z,w)}\left(n-7<\texttt{k}\right){\rm d}z{\rm d}w, (5.9)

where we have used Lemma 5.1 to obtain the final inequality.


Now suppose n1n\geq 1. Recalling the measure ν\nu from (A1), another application of Lemma 5.1 gives

𝐏ν(n<k)\displaystyle\mathbf{P}_{\nu}(n<\texttt{k}) =𝐄ν[𝟏(J1<k)𝐏(RJ1,ΥJ1)(n<k)]\displaystyle=\mathbf{E}_{\nu}^{\dagger}\left[\mathbf{1}_{(J_{1}<\texttt{k})}\mathbf{P}_{(R_{J_{1}},\Upsilon_{J_{1}})}(n<\texttt{k})\right]
cDV𝐏(z,w)(n<k)dzdw.\displaystyle\geq c^{\prime}\int_{D}\int_{V}\mathbf{P}_{(z,w)}(n<\texttt{k}){\rm d}z{\rm d}w. (5.10)

Then, for n8n\geq 8, combining  (5.9) and (5.10) yields

𝐏(r,υ)(n<k)Cc𝐏ν(n7<k).\mathbf{P}_{(r,\upsilon)}(n<\texttt{k})\leq\frac{C^{\prime}}{c^{\prime}}\mathbf{P}_{\nu}\left(n-7<\texttt{k}\right). (5.11)

Now recalling n0n_{0} from (A1), it follows from (A1) that

𝐏ν((RTn0,ΥTn0))c1𝐏ν(n0<k)ν().\mathbf{P}_{\nu}^{\dagger}((R_{T_{n_{0}}},\Upsilon_{T_{n_{0}}})\in\cdot)\geq c_{1}\mathbf{P}_{\nu}(n_{0}<\texttt{k})\nu(\cdot). (5.12)

Again, due to assumptions (H1) and (H3),

𝐏ν(n0<k)D×V𝐏(r,υ)(Tn0=Jn0,n0<k)ν(dr,dυ)K,\mathbf{P}_{\nu}(n_{0}<\texttt{k})\geq\int_{D\times V}{\bf P}_{(r,\upsilon)}^{\dagger}(T_{n_{0}}=J_{n_{0}},n_{0}<\texttt{k})\nu({\rm d}r,{\rm d}\upsilon)\geq K, (5.13)

for some constant K>0K>0. Then, for n8n\geq 8, due to (5.12) and (5.13),

𝐏ν(n7+n0<k)\displaystyle\mathbf{P}_{\nu}\left(n-7+n_{0}<\texttt{k}\right) =𝐄ν[𝟏(n0<k)𝐏(RTn0,ΥTn0)(n7<k)]\displaystyle=\mathbf{E}_{\nu}\left[\mathbf{1}_{(n_{0}<\texttt{k})}\mathbf{P}_{(R_{T_{n_{0}}},\Upsilon_{T_{n_{0}}})}\left(n-7<\texttt{k}\right)\right]
Kc1𝐏ν(n7<k).\displaystyle\geq Kc_{1}\mathbf{P}_{\nu}\left(n-7<\texttt{k}\right). (5.14)

Finally, noting that for n1n\geq 1 we have n7+4n0nn-7+4n_{0}\geq n, so that

𝐏ν(n<k)𝐏ν(n7+4n0<k),\mathbf{P}_{\nu}(n<\texttt{k})\geq\mathbf{P}_{\nu}\left(n-7+4n_{0}<\texttt{k}\right),

and applying (5.14) four times implies

𝐏ν(n<k)(Kc1)4𝐏ν(n7<k).\mathbf{P}_{\nu}(n<\texttt{k})\geq(Kc_{1})^{4}\mathbf{P}_{\nu}\left(n-7<\texttt{k}\right). (5.15)

Combining this with (5.11) yields the result. ∎

6 Concluding remarks

We complete this paper with a number of remarks that reflect on the main theorem here and previous work in [5, 10, 9, 4].

6.1 λ\lambda-, kk- and cc-eigenvalue problems

There is a third eigenvalue problem associated with the NTE: find (c,φc)(c,\varphi_{c}) such that

𝒯φc+1c(𝒮+)φc=0.\displaystyle\mathcal{T}\varphi_{c}+\frac{1}{c}\left(\mathcal{S}+\mathcal{F}\right)\varphi_{c}=0.

The associated mild form of this eigenvalue problem is

St[φc](r,υ)+1c0tSs[(𝒮+)φc](r,υ)ds=φc(r,υ),\texttt{S}_{t}[\varphi_{c}](r,\upsilon)+\frac{1}{c}\int_{0}^{t}\texttt{S}_{s}[(\mathcal{S}+\mathcal{F})\varphi_{c}](r,\upsilon){\rm d}s=\varphi_{c}(r,\upsilon), (6.1)

where

St[g](r,υ)=e0tσ(r+υs,υ)dsg(r+υt,υ)𝟏(t<κr,υD).\texttt{S}_{t}[g](r,\upsilon)={\rm e}^{-\int_{0}^{t}\sigma(r+\upsilon s,\upsilon){\rm d}s}g(r+\upsilon t,\upsilon)\mathbf{1}_{(t<\kappa_{r,\upsilon}^{D})}.

By considering the semigroup Πn[g](r,υ)=𝔼δ(r,υ)[g,𝕏n]\Pi_{n}[g](r,\upsilon)=\mathbb{E}_{\delta_{(r,\upsilon)}}[\langle g,\mathbb{X}_{n}\rangle], where 𝕏n\mathbb{X}_{n} is the neutron population at the nthn^{th} collision (either a scatter or a fission), almost identical proofs to those given in the previous sections yield the existence of the (c,φc)(c,\varphi_{c}), both in the classical sense and the probabilistic one.


In this case, the eigenvalue cc can be interpreted as the ratio between neutron production (from both scattering and fission) and neutron loss (due to absorption and leakage). Alternatively, it can be seen as the number of secondary neutrons per collision, rather than only collisions due to fission events.

6.2 Martingale convergence and strong law of large numbers

In a similar fashion to [10], Theorem 3.1 implies that

𝒲nknφ,𝒳nφ,μ,\mathcal{W}_{n}\coloneqq k^{-n}\frac{\langle\varphi,\mathcal{X}_{n}\rangle}{\langle\varphi,\mu\rangle},

is a non-negative mean one martingale under δ(r,υ)\mathbb{P}_{\delta_{(r,\upsilon)}}. One could then show that (𝒲n)n0(\mathcal{W}_{n})_{n\geq 0} converges in L2()L^{2}(\mathbb{P}) in the supercritical case, and otherwise has a degenerate limit.


One could also reconstruct the arguments presented in [9] to characterise the growth in the supercritical regime to obtain a strong law of large numbers:

limnkng,𝒳nφ,μ=g,φ~𝒲,\lim_{n\to\infty}k^{-n}\frac{\langle g,\mathcal{X}_{n}\rangle}{\langle\varphi,\mu\rangle}=\langle g,\tilde{\varphi}\rangle\mathcal{W}_{\infty},

where 𝒲\mathcal{W}_{\infty} is the limit of the martingale (𝒲n)n0(\mathcal{W}_{n})_{n\geq 0}.


We leave these arguments as an exercise to the reader to avoid unnecessary repetition.

6.3 Monte-Carlo considerations

We end this paper with a discussion of the existing Monte Carlo methods for calculating keffk_{\texttt{eff}} and the associated eigenfunctions, and how we may use the semigroup approach to propose comparable algorithms, similar in style to those presented in [4].


Due to the interpretation of the eigenvalue keffk_{\texttt{eff}}, most of the existing methods in the numerical analysis and engineering literature are based on iterative methods. For example, several algorithms are given in [17] that demonstrate how to calculate keffk_{\texttt{eff}} and φ\varphi. The main idea is to start with a set of NN neutrons, distributed in D×VD\times V according to some function φ(0)\varphi^{(0)} that serves as an initial guess101010In practice, this is usually either the uniform distribution, or the solution to a diffusion approximation of the eigenvalue problem. at φ\varphi. The system of neutrons then evolves until the first generation of fission events. Letting φ^(1)\hat{\varphi}^{(1)} be the distribution of these first generation neutrons, the first approximation, φ(1)\varphi^{(1)}, of the eigenfunction φ\varphi is then obtained by normalising111111This is usually done by either setting φ(1)=φ~(1)/φ~(1)\varphi^{(1)}=\widetilde{\varphi}^{(1)}/\|\widetilde{\varphi}^{(1)}\| or by sampling NN neutrons according to φ~(1)\widetilde{\varphi}^{(1)} φ^(1)\hat{\varphi}^{(1)}. At the same time, the eigenvalue keffk_{\texttt{eff}} is approximated by

k(1)=𝟏,φ(1)𝟏,(𝒯+𝒮)φ(1),k^{(1)}=\frac{\langle\mathbf{1},\mathcal{F}\varphi^{(1)}\rangle}{\langle\mathbf{1},(\mathcal{T}+\mathcal{S})\varphi^{(1)}\rangle},

which corresponds to the ratio of source neutrons for generation 22 to the number of paths simulated in generation 11. The process is then repeated using φ(1)\varphi^{(1)} as the initial distribution of neutrons, in order to obtain φ(2)\varphi^{(2)} and k(2)k^{(2)}, and so on.

However, some of the methods presented in the literature lead to bias and correlations between the neutrons in successive fission generations. To overcome this problem, the notion of superhistory powering was introduced in [1]. This idea is based on letting the initial set of neutrons evolve for some number, LL, of generations until the estimates for keffk_{\texttt{eff}} and φ\varphi are computed. It is usual in industry to set L=10L=10.


As we have shown in the previous sections, solving (1.6) is equivalent to look for the leading eigentriple (k,φ,φ~)(k_{*},\varphi,\tilde{\varphi}) of the semigroup Ψn\Psi_{n}. Heuristically, this pertains to finding functions φ\varphi and φ~\tilde{\varphi} that describe where neutron production (due to fission events) is most prominent, and a parameter kk_{*} that describes the average growth of the number of neutrons in the system. We may use the asymptotics (3.5) to inform Monte Carlo methods for the calculation of kk_{*}, φ\varphi and φ~\tilde{\varphi}. Indeed, we have

k=limn1nlogΨn[𝟏](r,υ),k_{*}=\lim_{n\to\infty}\frac{1}{n}\log\Psi_{n}[\mathbf{1}](r,\upsilon),

where 𝟏\mathbf{1} is the constant function with value one. Here, as an expectation, Ψn[𝟏]\Psi_{n}[\mathbf{1}] can be approximated by Monte-Carlo simulation.

In order to calculate the eigenfunction, one can manipulate the following asymptotic.

φ~,gφ(r,υ)=limn𝔼δ(r,υ)[1nm=1nkm𝒳m,g].\langle\tilde{\varphi},g\rangle\varphi(r,\upsilon)=\lim_{n\to\infty}\mathbb{E}_{\delta_{(r,\upsilon)}}\left[\frac{1}{n}\sum_{m=1}^{n}k_{*}^{-m}\langle\mathcal{X}_{m},g\rangle\right].

Varying the test function gg, while keeping (r,υ)(r,\upsilon) fixed allows us to obtain estimates for φ~\tilde{\varphi}, whereas varying the initial configuration (r,υ)(r,\upsilon) and keeping the test function fixed allows us to estimate φ\varphi.

Once again, the expectation can be replaced by a Monte-Carlo approximation.

We refer the reader to [4] for a more in-depth discussion of Monte Carlo algorithms based on the above asymptotics, as well as a complexity analysis of their methods. Although the algorithms and efficiency results given in [4] are for time-eigenvalues, cf. (1.4), it is straightforward to see how they may be adapted to fit the current situation (as well as their complexity). Of course, problems such as burn-in and inefficiencies that were encountered in [4] will still be present in the stationary case. We hope to carry out more formal work on this in the future.

Acknowledgements

We are indebted to Geoff Dobson and Paul Smith from the ANSWERS modelling group at Wood for the extensive discussions as well as hosting at their offices in Dorchester. AEK and AMGC are supported by EPSRC grant EP/P009220/1. ELH is supported by a PhD scholarship with funding from industrial partner Wood.

References

  • [1] R. J. Brissenden and A. R. Garlick. Biases in the estimation of k-eff and its error by monte carlo methods. Ann. nucl. Energy, 13:63–83, 1986.
  • [2] F. Brown. Fundamentals of monte carlo particle transport. Lecture notes for Monte Carlo course.
  • [3] N. Champagnat and D. Villemonais. Exponential convergence to quasi-stationary distribution and QQ-process. Probab. Theory Related Fields, 164(1-2):243–283, 2016.
  • [4] A. M. G. Cox, S.C. Harris, E. Horton, A.E. Kyprianou, and M. Wang. Monte carlo methods for the neutron transport equation. Working document, 2018.
  • [5] A.M.G. Cox, S.C. Harris, E. Horton, and A.E. Kyprianou. Multi-species neutron transport equation. Preprint, 2018.
  • [6] R. Dautray, M. Cessenat, G. Ledanois, P.-L. Lions, E. Pardoux, and R. Sentis. Méthodes probabilistes pour les équations de la physique. Collection du Commissariat a l’énergie atomique. Eyrolles, Paris, 1989.
  • [7] R. Dautray and J.-L. Lions. Mathematical analysis and numerical methods for science and technology. Vol. 6. Springer-Verlag, Berlin, 1993. Evolution problems. II, With the collaboration of Claude Bardos, Michel Cessenat, Alain Kavenoky, Patrick Lascaux, Bertrand Mercier, Olivier Pironneau, Bruno Scheurer and Rémi Sentis, Translated from the French by Alan Craig.
  • [8] K-J. Engel and R. Nagel. A short course on operator semigroups. Universitext. Springer, New York, 2006.
  • [9] S.C. Harris, E. Horton, and A.E. Kyprianou. Stochastic analysis of the neutron transport equation II: Almost sure growth. Preprint, 2018.
  • [10] E. Horton, A.E. Kyprianou, and D. Villemonais. Stochastic analysis of the neutron transport equation. Preprint.
  • [11] T. Kato. Perturbation Theory for Linear Operators. Springer-Verlag, Berlin, 1976.
  • [12] A. E. Kyprianou. Martingale convergence and the stopped branching random walk. Probab Theory Relat. Fields, 116(3):405–419, 2000.
  • [13] E. E. Lewis and Jr. W. F. Miller. Computational Methods of Neutron Transport. Wiley & Sons, New York, 1984.
  • [14] J. Mika. Existence and uniqueness of the solution to the critical problem in neutron transport theory. Stud. Math., 37:213–225, 1971.
  • [15] M. Mokhtar-Kharroubi. Mathematical topics in neutron transport theory, volume 46 of Series on Advances in Mathematics for Applied Sciences. World Scientific Publishing Co., Inc., River Edge, NJ, 1997. New aspects, With a chapter by M. Choulli and P. Stefanov.
  • [16] M. Ribarič and I. Vidav. Analytic properties of the inverse a(z)1a(z)^{-1} of an analytic linear operator valued function a(z). Comm. Pure Appl. Math., 11:219–242, 1958.
  • [17] F. Scheben and I. G. Graham. Iterative methods for neutron transport eigenvalue problems. SIAM J. Sci. Comput., 33(5):2785–2804, 2011.
  • [18] J. van Neervan. The Asymptotic Behaviour of Semigroups of Linear Operators. Operator Theory Advances and Applications Vol. 88. Birkhäuser, 1996.
  • [19] I. Vidav. Existence and uniqueness of nonnegative eigenfunctions of the boltzmann operator. J. Math. Anal. App., 22:144–155, 1968.