The Kramers turnover in terms of a macro-state projection on phase space

\name Luca Donati^a and Christof Schütte^b and Marcus Weber^c CONTACT Luca Donati. Email: donati@zib.de. Address: Zuse Institute Berlin, Takustraße 7, 14195 Berlin. Telephone: +49 30 841 85 403. ^a,b,cZuse Institute Berlin, Takustraße 7, 14195 Berlin;
^a,bFreie Universität Berlin, Department of Mathematics and Computer Science, Arnimallee 22, 14195 Berlin.

Abstract

We have investigated how Langevin dynamics is affected by the friction coefficient using the novel algorithm ISOKANN, which combines the transfer operator approach with modern machine learning techniques. ISOKANN describes the dynamics in terms of an invariant subspace projection of the Koopman operator defined in the entire state space, avoiding approximations due to dimensionality reduction and discretization. Our results are consistent with the Kramers turnover and show that in the low and moderate friction regimes, metastable macro-states and transition rates are defined in phase space, not only in position space.

keywords:

Langevin dynamics, Kramers turnover, reaction rate theory, ISOKANN, PCCA+, machine learning

1 Introduction

The dynamics of high-dimensional chemical systems can be modeled as one-dimensional Langevin dynamics governed by the stochastic differential equation

\begin{dcases}\dot{q}_{t}&=\frac{p_{t}}{m}\\ \dot{p}_{t}&=-\frac{d}{dq}V(q)-\gamma p_{t}+\sqrt{2k_{B}T\gamma m}\,\eta_{t}\,,\end{dcases}

(1)

where $q_{t}$ and $p_{t}$ denote the position and the momentum of the system at time $t$ on a one-dimensional relevant coordinate, $V:\mathbb{R}\rightarrow\mathbb{R}$ is the potential of mean force acting on the system along the relevant coordinate, and $m$ is the effective mass of the system, which represents how much inertia the system has along the relevant coordinate. The system is coupled to a thermostat at temperature $T$ , with Boltzmann constant $k_{B}$ , through the friction coefficient $\gamma$ and the stochastic force $\eta_{t}$ defined as Gaussian white noise with $\langle\eta_{t}\rangle=0$ and $\langle\eta_{t},\,\eta_{t^{\prime}}\rangle=\delta(t-t^{\prime})$ , where $\delta(t-t^{\prime})$ is the Dirac delta function. One possible field of application are molecular processes that exhibit metastability, where the energy function $V(q)$ presents minima and maxima with energy barriers that exceed the thermal energy $k_{B}T$ .

In this context, a fundamental problem is the calculation of transition rates between potential minima, more precise, between macro-states of the system. Indeed, transitions between the macro-states represent the most interesting biochemical processes in many applications, e.g. the folding of an amino acid chain or the binding/unbinding process between a receptor and a ligand. However, they are rare events, i.e. they occur on very large time scales compared to reference time scales such as the oscillation times of hydrogen atoms. Consequently, they are difficult to simulate and analyze, e.g. by means of Molecular Dynamics (MD) simulations.

Over the last century and a half, various theories and methods have been developed to solve this problem and find analytical solutions for calculating rates. The first approximate formula of the problem dates back to Arrhenius, who derived in 1884 [1] the proportionality equation

\displaystyle k\propto e^{-\beta E_{b}}\,,

(2)

where $k$ denotes the escape rate, $\beta=\nicefrac{{1}}{{k_{B}T}}$ is the inverse of the thermal energy, and $E_{b}$ is the height of the energy barrier, also known as activation energy of the reaction. Later, further theories were developed that apply to different contexts of chemistry and physics. Particularly noteworthy is the work of Kramers, who in 1940 [2] studied transition rates for one-dimensional systems driven by the Langevin equation and derived three formulas that apply to low, moderate and high friction regimes. The three formulas well reproduce the so called Kramers turnover, a curve describing the transition rate as a function of $\gamma$ : the transition rate is linear with the coefficient $\gamma$ at low friction, then, having reached a plateau, the rate decays inversely to $\gamma$ . However, Kramers’ theory remains incomplete in some aspects that were only later resolved. For example, Langer derived a formula for multidimensional systems that operates in high friction regime [3], Chandler derived a formula that takes into account non-Markovian effects [4], Mel’nikov and Meshkov have found an expression that improves the prediction in the transition from low to moderate friction [5], and, Pollak, Grabert and Hänggi found a single expression that covers the entire friction range using a normal mode approach [6]. These and other methods, which we do not mention for the sake of brevity, fall into the category of model-based methods, i.e based on the physical model of the system under investigation.

In this paper, we study the dependence of Langevin dynamics on the friction coefficient $\gamma$ using its representation in terms of the Koopman operator [7, 8], which allows to transform the nonlinear problem defined in eq. 1 into a linear problem. The price of this is that the finite-dimensional dynamics in phase space is transformed into an infinite-dimensional problem in the space of observable functions [9, 10]. For this reason, we seek invariant subspaces of the Koopman operator with finite dimensions. We use the ISOKANN algorithm [11], a data-driven method that identifies membership functions that constitute a basis of an invariant subspace of the Koopman operator preserving the Markovianity of the projected process. The peculiarity of ISOKANN is that it does not require the identification and the discretization of reaction coordinates, instead, membership functions can be estimated on states of the full space by means of machine learning techniques such as neural networks, overcoming the problem of the curse of dimensionality.

Membership functions are a generalization of ordinary crisp sets and characterize the metastable macro-states of the system preserving the time scales of the micro system when projected onto the macro-states [12, 13, 14]. Using ISOKANN, we estimated the phase space membership functions and calculated the rates, which represent transitions on the phase space. In this way, we reproduced a rate curve as a function of friction, which is analogous to the Kramers turnover. However, our results show that in low and moderate friction regimes, the rates in position space are an approximation due to the loss of Markovianity of the dynamics defined in eq. 1. To obtain a correct representation of dynamics, even in low and moderate friction, it is necessary to take momenta into account. Thus, with this work we intend to open up a new perspective in reaction rate theory. This is possible through the use of increasingly advanced machine learning techniques, such as ISOKANN, which allows for the estimation of rates as functions of the entire phase space, preventing errors induced by discretization or dimensionality reduction.

2 Theory

We briefly introduce the operator theory that is needed to project Langevin dynamics onto macro-states [15].

2.1 Transfer operator approach

The dynamics of a stochastic process solution of the Langevin equation defined in eq. 1 is equivalently described by the dynamics of the time-dependent probability density $\rho_{t}(x)$ solution of the partial differential equation

\displaystyle\frac{\partial\rho_{t}(x)}{\partial t}=\mathcal{Q}^{*}\rho_{t}(x)\,,

(3)

where the operator $\mathcal{Q}^{*}$ defines the Fokker-Planck equation, or forward Kolmogorov equation, and $x=(q,p)\in\Gamma\subset\mathbb{R}^{2}$ represents the state of the system in the phase space. The solution of eq. 3 is formally written as

	$\displaystyle\rho_{t+\tau}(x)$	$\displaystyle=$	$\displaystyle\exp\left(\mathcal{Q}^{*}\,\tau\right)\rho_{t}(x)$		(4)
		$\displaystyle=$	$\displaystyle\mathcal{P}_{\tau}\rho_{t}(x)\,,$		(5)

where $\mathcal{P}_{\tau}$ denotes the propagator of probability densities with stationary density

\displaystyle\lim_{t\rightarrow+\infty}\rho_{t}(x)=\pi(x)\,,

(6)

defined by the Boltzmann distribution

\displaystyle\pi(x):=\pi(q,p)=\frac{1}{Z}\exp\left(-\beta\left(\frac{p^{2}}{2m}+V(q)\right)\right)\,,

(7)

where $Z$ is a normalization constant.

Besides considering the evolution of probability densities, it is useful to study the evolution of observable functions $f(x)$ . To this end, we introduce the infinitesimal generator $\mathcal{Q}$ , adjoint of the operator $\mathcal{Q}^{*}$ that defines the backward Kolmogorov equation

\displaystyle\frac{\partial f_{t}(x)}{\partial t}=\mathcal{Q}f_{t}(x)\,.

(8)

Analogously to eq. 5, we can write a formal solution of eq. 8 as

$\displaystyle f_{t+\tau}(x)$	$\displaystyle=$	$\displaystyle\exp\left(\mathcal{Q}\,\tau\right)f_{t}(x)$	(9)
	$\displaystyle=$	$\displaystyle\mathcal{K}_{\tau}f_{t}(x)$	(10)
	$\displaystyle=$	$\displaystyle\mathbb{E}\left[f(x_{t+\tau})\|x_{t}=x\right]\,,$	(11)

where we introduced the Koopman operator $\mathcal{K}_{\tau}$ which propagates the expectation value of observable functions.

2.2 Rates from membership functions

Consider the $\tau$ -dependent eigenvalues $\lambda_{\tau,i}$ and the associated eigenfunctions $\Psi_{i}$ of the Koopman operator $\mathcal{Q}$ such that

\displaystyle\mathcal{Q}\Psi_{i}=\lambda_{\tau,i}\Psi_{i}\,.

(12)

If the dynamics is ergodic and not periodic, then the first eigenfunction is constant $\Psi_{1}=1$ and it is associated to the non-degenerate eigenvalue $\lambda_{\tau,1}=1$ . In reversible dynamics, the subsequent $n_{c}$ dominant eigenfunctions $\Psi=\{\Psi_{2},...,\Psi_{n_{c}}\}$ , associated to sorted and negative eigenvalues $\lambda_{\tau,2}>\dots>\lambda_{\tau,nc}$ , exhibit positive and negative values, which allows for the identification of metastable macro-states. In the non-reversible case, real-valued functions which span an invariant subspace of the Koopman operator can be applied instead of eigenfunctions. Each point in state space is represented by a vector which comprises of the values of these finitely many ( $n_{c}$ ) functions. These points can be mapped into a $(n_{c}-1)$ -simplex whose vertices represent the metastable states whereas the edges represent the transitions. The algorithm PCCA+ [12, 13], by means of a linear transformation, transforms the simplex into a standard simplex, i.e. a simplex whose vertices are unit vectors. Accordingly, the set of dominant eigenfunctions is transformed into a set of membership functions $\chi=\left(\chi_{1},\chi_{2},\dots,\chi_{n_{c}}\right)^{\top}$ , with $\chi_{i}:\Gamma\rightarrow[0,1]$ , $\forall i=1,2,\dots,n_{c}$ , such that $\sum_{i}\chi_{i}=1$ . Membership functions characterise the membership of a state $x$ in the macrostates of the system and by exploiting the linearity of the Koopman operator the exit rate from a macrostate is estimated as

\displaystyle\kappa=-\frac{1}{\tau}\log(a_{1})\left(1+\frac{a_{2}}{a_{1}-1}\right)\,,

(13)

where $a_{1}$ and $a_{2}$ are obtained solving the linear regression problem

\displaystyle\min\limits_{a_{1},a_{2}}\|\mathcal{K}_{\tau}\chi(x)-a_{1}\chi(x)+a_{2}\|.

(14)

For a complete discussion about the $\chi$ -exit rates and the derivation of eq. 13, we refer to [16].

2.3 ISOKANN

The calculation of rates according to eqs. 13 and 14 requires the membership function $\chi$ and the propagated membership function $\chi_{t}=\mathcal{K}_{\tau}\chi(x)$ . We use ISOKANN [11, 17], an iterative algorithm which modifies the Von-Mises-Algorithm [18] as iteration scheme

\displaystyle f_{k+1}

\displaystyle=

\displaystyle\frac{\mathcal{K}_{\tau}f_{k}}{\lVert\mathcal{K}_{\tau}f_{k}\rVert}\,,

(15)

where the initial function $f_{0}$ is an arbitrary function and $\|\cdot\|$ is the supreme norm. As $k\rightarrow\infty$ , eq. 15 converges to the first eigenfunction of the Koopman operator:

\displaystyle\lim_{k\rightarrow\infty}f_{k+1}=\Psi_{1}=1.

(16)

In fact, by applying $\mathcal{K}_{\tau}$ iteratively to a function $f$ , one obtains the same result as applying the operator with lag time $\tau$ tending to infinity, i.e. a constant function.

Consider now a two-metastable system, as the model studied by Kramers, then the Koopman operator has two dominant eigenfunctions $\Psi_{1}$ and $\Psi_{2}$ , and the membership functions are written as

\displaystyle\begin{cases}\chi_{1}=b_{1}\Psi_{1}+b_{2}\Psi_{2}\\ \chi_{2}=1-\chi_{1}\end{cases}\,,

(17)

with $b_{1}$ and $b_{2}$ appropriate coefficients. We introduce a linear transformation $S$ to prevent the convergence of the Koopman operator to $\Psi_{1}=1$ , and retrieve information regarding the eigenfunction $\Psi_{2}$ such that $\Psi_{1}$ and $\Psi_{2}$ span an invariant subspace of the Koopman operator. For this purpose, we choose as $S$ the shift-scale function

\displaystyle S\mathcal{K}_{\tau}f_{k}=\frac{\mathcal{K}_{\tau}f_{k}-\min\left(\mathcal{K}_{\tau}f_{k}\right)}{\max\left(\mathcal{K}_{\tau}f_{k}\right)-\min\left(\mathcal{K}_{\tau}f_{k}\right)}\,,

(18)

that guarantees that $f_{k}:\Gamma\rightarrow[0,1]$ . The algorithm defined in eq. 15 is rewritten as

\displaystyle f_{k+1}

\displaystyle=

\displaystyle S\mathcal{K}_{\tau}f_{k}\,,

(19)

and converges to one of the two membership function:

\displaystyle\lim_{k\rightarrow\infty}f_{k+1}

\displaystyle=

\displaystyle\chi_{i}\ \quad i=1\ \rm or\ 2.

(20)

In general, we do not have an analytical representation of the Koopman operator or do not discretize the entire state space to retrieve its matrix representation. However, we can calculate the action of the Koopman operator on observable functions applied to specific states in space $\Gamma$ . Exploiting the ergodic property, we approximate the expectation in eq. 11 as a time average:

	$\displaystyle f_{t+\tau}(x)$	$\displaystyle\approx$	$\displaystyle\bar{f}(x_{\tau})$		(21)
		$\displaystyle=$	$\displaystyle\frac{1}{N}\sum_{n=1}^{N}f(x_{\tau,n})\,,$		(22)

where $x_{\tau,n}$ are the final states of $N$ trajectories, solutions of eq. 1, starting at $x_{0}=x$ . Thus, eq. 19 is rewritten as

\displaystyle\bar{f}_{k+1}(x_{0})

\displaystyle=

\displaystyle S\bar{f}_{k}(x_{\tau})\,,

(23)

Regarding the choice of the initial function $f_{0}$ , a wide range of options is available. The function should be an interpolating function that can be trained at each iteration until it converges to one of the membership function. For higher dimensional systems, the use of neural networks is recommended, as was used in ref. [11]. However, for low-dimensional systems, other interpolation techniques may be used, e.g. spline functions or radial basis functions.

3 Results

As an illustrative example, we considered the Langevin dynamics of a fictitious particle of mass $m=1\,\mathrm{amu}$ which moves in a one-dimensional potential energy function

\displaystyle V(q)

\displaystyle=

\displaystyle 10(q^{2}-1)^{2}\ \mathrm{kJ\,mol^{-1}}\,.

(24)

The function is characterized by two wells with minima at $q_{A}=-1\,\mathrm{nm}$ and $q_{C}=1\,\mathrm{nm}$ , and a height barrier of $10\,\mathrm{kJ\,mol^{-1}}$ at $q_{B}=0\,\mathrm{nm}$ as illustrated in fig. 1-(a). For our numerical experiments, we used standard thermodynamic parameters: the temperature of the system was $T=300\,\mathrm{K}$ and the molar Boltzmann constant was $k_{B}=8.314\times 10^{-3}\,\mathrm{kJ\,K^{-1}\,mol^{-1}}$ .

Refer to caption — Figure 1: (a) Potential energy function (solid line) and harmonic approximation at the bottom of the wells and at the top of the barrier (dashed lines); (b) Phase space with energy levels (black contour lines and yellow dashed line denoting the $K_{B}T$ value) and three trajectories carried out with friction coefficients: $\gamma=0.1\,\mathrm{ps^{-1}}$ (blue), $\gamma=2.2\,\mathrm{ps^{-1}}$ (red), $\gamma=30.0\,\mathrm{ps^{-1}}$ (green).

3.0.1 Classic Kramers turnover

In order to reproduce the classic Kramers turnover, we selected 25 friction coefficient values $\gamma$ between $0.1\,\mathrm{ps^{-1}}$ and $30.0\,\mathrm{ps^{-1}}$ and we solved the Langevin eq. 1 using the Brünger, Brooks and Karplus (BBK) integrator scheme [19] with an integrator timestep $\Delta t=0.005\,\mathrm{ps}$ . For each value of $\gamma$ , we ran 500 simulations starting at the bottom of the left well of the potential with an initial momentum randomly drawn from the Boltzmann distribution. After the particle reached the bottom of the right well, the simulations were stopped and we calculated the mean time, i.e. the Mean First Passage Time (MFPT) $\langle\tau_{fp}\rangle$ [20], from which we obtained the transition rate as

\displaystyle k_{A\rightarrow C}=\frac{1}{\langle\tau_{fp}\rangle}\,.

(25)

The results of this numerical experiment are reported in fig. 2-(a) as black squares. If the friction is very low ( $\gamma\approx 0.1\,\mathrm{ps}^{-1}$ ), the dynamics of the system (eq. 1) is almost deterministic, and the system, unless it has enough initial momentum, is trapped in the well with an extremely low probability of escape. Correspondingly, the MFPT is very large and the value of the rate tends to zero. However, increasing the friction by a small amount ( $\gamma\approx 1.5\,\mathrm{ps}^{-1}$ ) the system gets enough thermal energy through the random force and increases the probability to escape from the well. In fact, for low values of $\gamma$ , the stochastic force $\sqrt{2k_{B}T\gamma m}\,\eta_{t}$ , which is weighted by the square root of the friction coefficient, dominates the friction force $-\gamma p_{t}$ , which is linear with the friction coefficient. Thus, we observe a rapid and linear increase in rates up to a maximum of $k_{A\rightarrow C}\approx 0.02\,\mathrm{ps}^{-1}$ . Beyond the threshold of $\gamma\approx 1.5\,\mathrm{ps}^{-1}$ , the system enters what is called the moderate friction regime. Here, the friction force dominates the Langevin equation, and the probability of escaping the well, despite the high thermal energy, decays as $k_{A\rightarrow C}\propto\sqrt{1+\nicefrac{{1}}{{\gamma^{2}}}}$ . For higher values of the friction coefficient ( $\gamma>20\,\mathrm{ps}^{-1}$ ), the friction term is so strong that the average acceleration of the system tends to zero. The dynamics is overdamped and the transition rate decays as $k_{A\rightarrow C}\propto\nicefrac{{1}}{{\gamma}}$ .

The three friction regimes, here qualitatively described, were formalized by Kramers in 1940 [2]. He assumed a two-metastable system governed by the Langevin dynamics with thermal energy $k_{B}T\ll E_{b}^{+}=V(q_{B})-V(q_{A})$ , so as to ensure metastability. In addition, he required that the left well and the top of the barrier of the potential $V(q)$ are approximated by harmonic potentials with angular frequencies

\displaystyle\omega_{A}=\sqrt{\frac{1}{m}\left.\frac{d^{2}V}{dq^{2}}\right|_{q_{A}}}\,,\ \mathrm{and}\ \omega_{B}=\sqrt{\frac{1}{m}\left.\frac{d^{2}V}{dq^{2}}\right|_{q_{B}}}\,.

(26)

Under these conditions, Kramers derived a transition rate formula for the low friction regime ( $\gamma<\omega_{B}$ )

k_{A\rightarrow C}=\frac{1}{2}\gamma\beta E_{b}^{+}\exp\left[-\beta E_{b}^{+}\right]\,,

(27)

for the moderate friction regime ( $\gamma>\omega_{B}$ )

\displaystyle k_{A\rightarrow C}

\displaystyle=

\displaystyle\frac{\gamma}{\omega_{B}}\left(\sqrt{\frac{1}{4}+\frac{\omega_{B}^{2}}{\gamma^{2}}}-\frac{1}{2}\right)\cdot\frac{\omega_{A}}{2\pi}\exp\left[-\beta E_{b}^{+}\right]\,,

(28)

and the high friction regime ( $\gamma\gg\omega_{B}$ )

\displaystyle k_{A\rightarrow C}

\displaystyle=

\displaystyle\frac{\omega_{B}}{\gamma}\cdot\frac{\omega_{A}}{2\pi}\exp\left[-\beta E_{b}^{+}\right]\,.

(29)

Note that Kramers defines the three regimes by comparing the coefficient of friction with the angular frequencies of the harmonic potentials that approximate the potential. In fact, the transition probability also depends on the curvature near the pit and barrier. The prediction of Kramers’ formulas, reported in fig. 2-(a), is excellent, it is only around the threshold separating the low and moderate friction regimes that the model becomes inaccurate. For more details about Kramers theory, we recommend Refs. [21, 22].

3.0.2 Kramers turnover of membership functions in phase space

In the second numerical experiment, we estimated the transition rates applying ISOKANN to the same setting of the first experiment. We generated 1000 random initial points $x_{0}=(q_{0},p_{0})$ from a uniform distribution over the phase space defined by the $q$ -range $[-2.0,\,2.0]$ nm and the $p$ -range $[-10.0,\,10.0]$ amu nm ps^-1, and for each state we simulated $N=100$ trajectories of length $\tau=7$ ps, corresponding to 1400 timesteps using a timestep integrator $\Delta t=0.005$ ps. The ISOKANN algorithm has been applied for 20 iterations using multiquadratic radial basis functions (RBF) [23], which are computationally undemanding and only require a few parameters to optimize during training, resulting in faster convergence. We considered two cases:

•

the membership functions $\chi_{A}(q)$ and $\chi_{C}(q)$ , and the transition rate $k_{\chi_{A}\rightarrow\chi_{C}}$ between the macro-states of the position space;
•

the membership functions $\hat{\chi}_{A}(q,p)$ and $\hat{\chi}_{C}(q,p)$ defined on the two-dimensional phase space, and the transition rate $\hat{k}_{\chi_{A}\rightarrow\chi_{C}}$ between macro-states of the phase space. Note that from now on, each quantity marked with the symbol $\hat{\cdot}$ denotes a quantity measured over the entire phase space.

The two rates, as functions of the friction coefficient $\gamma$ , are reported in fig. 2-(b), respectively as blue upside down triangles and red circles. Both curves show a turnover similar to the rate $k_{A\rightarrow C}$ reported in fig. 2-(a): rates have an ascending profile for very low range values, then, having reached the maximum ( $k_{\chi_{A}\rightarrow\chi_{C}}\approx 0.4\,\mathrm{ps}^{-1}$ and $\hat{k}_{\chi_{A}\rightarrow\chi_{C}}\approx 0.2\,\mathrm{ps}^{-1}$ ), descend slowly. However, while the values of the rate $\hat{k}_{\chi_{A}\rightarrow\chi_{C}}$ in phase space are overlapping with those of the Kramers rate $k_{A\rightarrow C}$ (although they are different physical quantities), the rate $k_{\chi_{A}\rightarrow\chi_{C}}$ defined in position space turns out to be higher in the low friction region but converges to the values of $\hat{k}_{\chi_{A}\rightarrow\chi_{C}}$ in the high friction regime. To understand these results, it is useful to take a look at the membership functions obtained from ISOKANN and shown in fig. 3, where figures (d,e,f) on the second row and (g,h,i) on the third row are respectively the membership functions in the phase space and the position space, for low, moderate and high friction.

In fig. 3-(a) (low friction regime), the membership functions of the macro-states only have significant values for those states whose total energy $E=\nicefrac{{p^{2}}}{{2m}}+V(q)$ is less than the height of the barrier. The points with a total energy exceeding the barrier are depicted in white, indicating that they have an equal probability of belonging to either $\chi_{A}$ or $\chi_{C}$ , approximately 0.5. This occurs because trajectories originating from this area undergo periodic oscillations in phase space, visiting both wells as depicted in fig. 1-(b) by the blue trajectory. Correspondingly, in fig. 3-(d), we show the membership function values as a projection of $\hat{\chi}_{A}(q,p)$ and $\hat{\chi}_{C}(q,p)$ onto the position space. The apparent noise is due to the fact that the membership functions on phase space are not constant along the axis of momenta. Therefore, when friction is low, the membership values projected to position space are not functions in a strict sense and do not describe position-based macro-states. In fig. 3-(b) (moderate friction regime), the membership functions draw concentric spirals that terminate in the minima of phase space respectively. This may seem counterintuitive, but observing how a trajectory behaves in the moderate friction regime helps to interpret the membership functions correctly. In fig. 1-(b), the red trajectory starts at position $q=-1\,\mathrm{nm}$ and momentum $p=-8\,\mathrm{amu\,nm\,ps^{-1}}$ , and reaches the right-hand minimum by following a clockwise trajectory. Similarly, if we start trajectories that are far from the central region of the phase space, we would observe spiral patterns that match with the membership functions. From this figure, we deduce that in the moderate friction regime, the effective barrier, i.e. the transition region, is not at $q=0$ , but between the two spirals. In particular, in the central box $[-1,\,1]\times[-5,\,5]$ of the phase space, the barrier corresponds to a diagonal line which is not parallel to the momentum axis. The reason is that if $q=0$ and $p>0$ , the system reaches the right well with low probability of recrossing. Conversely, if $q=0$ and $p<0$ , the system reaches the left region. Along the white diagonal the system is in an unstable equilibrium, i.e. the system has the same probability of reaching one of the wells, and ISOKANN assigns equal probability of membership to the two macro-states. In fig. 3-(c) (high friction regime), membership functions are independent from momentum space. The two regions $q<0$ and $q>0$ are assigned to the macro-states regardless of the momentum and the transition region is almost a vertical line. The projections $\chi_{A}(q)$ and $\chi_{C}(q)$ onto the position space also appear well defined in fig. 3-(f). This occurs because as the friction is very high, the momentum is quickly damped and it does not provide enough energy to overcome the barrier as shown by the green trajectory in fig. 1-(b). In the high friction regime, only the thermal noise can provide enough energy to jump over the barrier.

3.0.3 Results validation

In order to validate our results, we constructed a reference solution by means of a grid-based technique similar to Ulam’s method [24] which allows to discretize the operator $\mathcal{K}_{\tau}$ defined in eq. 11 into a transition probability matrix $\mathbf{K}(\tau)$ , cf. [25]. Given a discretization of the phase space $\Gamma$ into $M$ disjoint subsets $\Gamma_{i}$ , with $i=1,\dots,M$ , and a set of $N$ simulations of length $\tau$ started in a random position of the subset $\Gamma_{i}$ , then the entries of the matrix $\mathbf{K}(\tau)$ are written as

\displaystyle k_{ij}(\tau)=\frac{1}{N}\sum_{n=1}^{N}\mathbf{1}_{\Gamma_{j}}(x_{i,n}^{\tau})

(30)

where $\mathbf{1}_{\Gamma_{j}}$ is the indicator function

\displaystyle\mathbf{1}_{\Gamma_{j}}(x)=\begin{dcases}1~&\text{ if }~x\in\Gamma_{j}~,\\ 0~&\text{ if }~x\notin\Gamma_{j}~\,,\end{dcases}

(31)

and $x_{i,n}^{\tau}$ is the final state of the $n$ th simulation started in $\Gamma_{i}$ . In practice, one counts how many times a simulation starting in $\Gamma_{i}$ ends in $\Gamma_{j}$ and divides by the number of simulations to obtain an estimation of the transition probability. Afterward, an approximation of the infinitesimal generator, sometimes referred to as pseudogenerator, is obtained as

\displaystyle\widetilde{\mathbf{Q}}=\frac{\mathbf{K}(\tau)-\mathbf{I}}{\tau}\,,

(32)

where $\mathbf{I}$ denotes an identity matrix of the same size as $\mathbf{K}{\tau}$ . Then the membership functions $\hat{\chi}(q,p)$ are calculated applying PCCA+ to $\widetilde{\mathbf{Q}}$ and the coarse-grained rate matrix between macro-states is calculated as a Galerkin projection of $\widetilde{\mathbf{Q}}$ onto the membership functions:

\displaystyle\widetilde{\mathbf{Q}}_{c}

\displaystyle=

\displaystyle(\chi^{\top}\mathrm{diag}(\pi)\chi)^{-1}\chi^{\top}\mathrm{diag}(\pi)\widetilde{\mathbf{Q}}\chi\,

(33)

In eq. 33, $\mathrm{diag}(\pi)$ denotes an $M\times M$ diagonal matrix, whose diagonal entries are the entries of the Boltzmann distribution $\pi(q,p)$ (eq. 7) evaluated at the centers of subsets $\Gamma_{i}$ . Assuming a two-metastable system, the rate matrix $\widetilde{\mathbf{Q}}$ has size $2\times 2$ :

\displaystyle\widetilde{\mathbf{Q}}_{c}=\begin{pmatrix}-\widetilde{q}_{\chi_{A}\rightarrow\chi_{C}}&\widetilde{q}_{\chi_{A}\rightarrow\chi_{C}}\\ \widetilde{q}_{\chi_{C}\rightarrow\chi_{A}}&-\widetilde{q}_{\chi_{C}\rightarrow\chi_{A}}\\ \end{pmatrix}\,,

(34)

with $\widetilde{q}_{\chi_{A}\rightarrow\chi_{C}},\,\widetilde{q}_{\chi_{C}\rightarrow\chi_{A}}>0$ representing the transition rates between the macro-states. For the sole case of a bimetastable system, these rates are equivalent to the exit rates defined in eq. 13.

Here, we discretized the $q$ -range $[-2.0,\,2.0]$ nm in $80$ intervals of the same length $\Delta q=0.05$ nm, and the $p$ -range $[-10.0,\,10.0]$ nm in $70$ intervals of the same length $\Delta p=0.29$ nm. The transition rates estimated by PCCA+ are reported in fig. 2-(b) as black squares, while the membership functions are reported in fig. 3-(a,b,c). For each subset, we ran 500 simulations of length 7 ps, with an integrator timestep of 0.005 ps for a total of 1400 timesteps. There is excellent agreement between ISOKANN and the method based on the discretization of the phase space: both methods recreate the Kramers turnover and show the same patterns for the membership functions.

4 Discussion and conclusion

In this article, we studied the effect of the friction coefficient of Langevin dynamics on metastable macro-states of the phase space and calculated the transition rate.

For this purpose, we used the ISOKANN algorithm [11], which identifies macro-states by means of membership functions that form a basis function of an invariant subspace of the Koopman operator. In this subspace, the Koopman operator produces a linear dynamical system of finite dimensions that preserves the Markovianity of Langevin dynamics and can be used to determine kinetic observables such as transition rates.

We investigated a one-dimensional artificial potential, representing a bimetastable system, and reproduced the Kramers turnover. However, differently from the original Kramers work, the transition rate we estimated represents transitions between macro-states in phase space. Our results show that including both the positions and the momenta in defining the macro-states is necessary. Indeed, neglecting the momentum in the low and moderate friction regime introduces non-Markovian effects that are not properly captured by the position-dependent membership functions. In contrast, in the high friction regime, the velocity is instantaneously damped, and the macro-states can be defined as functions of only the position space.

This approach to estimating transition rates can be extended to highly dimensional problems. The typical strategy requires solving the fundamental equations of motion, projecting the dynamics on a small number of relevant coordinates, and discretizing the low-dimensional model to create a matrix representation of the Koopman operator, as is done with Markov State Models [26, 27, 28, 29] or Square Root Approximation of the infinitesimal generator [30, 31, 32]. The price of these techniques is the introduction of assumptions, such as the Markovianity of the projected dynamics, that can lead to significant errors [33]. In contrast, ISOKANN does not require dimensionality reduction or space discretization, and the measured rates can be considered the best representation of the system’s dynamics, net of approximations introduced a priori, e.g., when the equations of motion are numerically solved. Thus, the dimensionality of the system poses no limits to ISOKANN on a theoretical level. However, the implementation of ISOKANN for studying high-dimensional systems is more involved. Here, considering the low-dimensionality of the system, we used radial basis functions, but for higher dimensional systems, we suggest more advanced interpolating functions such as feed-forward neural networks, which allow the use of all system coordinates, including momenta. Another aspect to be taken into account is the choice of the mathematical representation of the molecular system. Indeed, neural networks are not invariant with respect to translations and rotations when Cartesian coordinates are used as input data. Thus, Cartesian coordinates must be transformed to a suitable set of input coordinates, for example pairwise distances, internal coordinates or atom-centred symmetry functions [34].

In summary, with this work, we have shown that ISOKANN is a valid tool for the study of dynamical systems that avoids the subspace projection of transfer operators. Here, we have focused on the classic Kramers problem, studying how macro-states are defined in phase space and highlighting the importance of considering momenta in rate calculation. Nevertheless, ISOKANN’s flexibility and modern machine learning techniques allow for the study of even more complex systems.

Acknowledgments

This research has been funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) through the Cluster of Excellence MATH+, project AA1-15: “Math-powered drug-design” and the Collaborative Research Center CRC 1114, Projects B03 and B05.

References

[1] S. Arrhenius, Z. Phys. Chem. 4U (1), 96–116 (1889).
[2] H.A. Kramers, Physica 7 (4), 284–304 (1940).
[3] J. Langer, Ann. Phys. 54 (2), 258–275 (1969).
[4] D. Chandler, The Journal of Chemical Physics 68 (6), 2959–2970 (1978).
[5] V.I. Mel’nikov and S.V. Meshkov, The Journal of Chemical Physics 85 (2), 1018–1027 (1986).
[6] E. Pollak, H. Grabert and P. Hänggi, The Journal of Chemical Physics 91 (7), 4073–4087 (1989).
[7] B. Koopman, Proc. Natl. Acad. Sci. USA 17 (5), 315—318 (1931).
[8] B. Koopman and J. Neumann, Proc. Natl. Acad. Sci. USA 3 (18), 255–63 (1932).
[9] L.A. Baxter, Probability in the Engineering and Informational Sciences 10 (2), 311–313 (1996).
[10] S. Klus, F. Nüske, P. Koltai, H. Wu, I. Kevrekidis, C. Schütte and F. Noé, J. Nonlinear Sci. 28 (2018).
[11] R.J. Rabben, S. Ray and M. Weber, J. Chem. Phys. 153 (11), 114109 (2020).
[12] P. Deuflhard and M. Weber, Linear Algebra Appl. 398, 161–184 (2004).
[13] M. Weber, Ph. D. thesis, FU Berlin, 2006.
[14] M. Weber, Computation 6 (1) (2018).
[15] C. Schütte, W. Huisinga and P. Deuflhard, in Ergodic Theory, Analysis, and Efficient Simulation of Dynamical Systems, edited by B. Fiedler (Springer, Berlin, 2001), pp. 191–223.
[16] M. Weber and N. Ernst, arXiv preprint arXiv:1708.00679 (2017).
[17] A. Sikorski, E.R. Borrell and M. Weber, arXiv preprint arXiv:2301.00065 (2022).
[18] R. von Mises and H. Pollaczek-Geiringer, Zamm-zeitschrift Fur Angewandte Mathematik Und Mechanik 9, 152–164 (1929).
[19] A. Brünger, C.L. Brooks and M. Karplus, Chem. Phys. Lett. 105 (5), 495–500 (1984).
[20] L. Pontryagin, A. Andronov and A. Vitt, Zh. Eksp. Teor. Fiz 3, 165 (1933).
[21] P. Hänggi, P. Talkner and M. Borkovec, Rev. Mod. Phys. 62, 251–341 (1990).
[22] B. Peters, Reaction Rate Theory and Rare Events, 1st ed. (, , 2017).
[23] P. Virtanen, R. Gommers, T.E. Oliphant, M. Haberland, T. Reddy, D. Cournapeau, E. Burovski, P. Peterson, W. Weckesser, J. Bright, S.J. van der Walt, M. Brett, J. Wilson, K.J. Millman, N. Mayorov, A.R.J. Nelson, E. Jones, R. Kern, E. Larson, C.J. Carey, İ. Polat, Y. Feng, E.W. Moore, J. VanderPlas, D. Laxalde, J. Perktold, R. Cimrman, I. Henriksen, E.A. Quintero, C.R. Harris, A.M. Archibald, A.H. Ribeiro, F. Pedregosa, P. van Mulbregt and SciPy 1.0 Contributors, Nat. Methods 17, 261–272 (2020).
[24] S. Ulam, A Collection of Mathematical Problems Interscience tracts in pure and applied mathematics (, , 1960).
[25] C. Schütte, S. Klus and C. Hartmann, Acta Numerica 32, 517–673 (2023).
[26] G.R. Bowman, V.S. Pande and F. Noé, editors, An Introduction to Markov State Models and Their Application to Long Timescale Molecular Simulation, Vol. 797 of Advances in Experimental Medicine and Biology (Springer, Heidelberg, 2014).
[27] W. Wang, S. Cao, L. Zhu and X. Huang, Wiley Interdisciplinary Reviews: Computational Molecular Science 8 (1), e1343 (2018).
[28] B.E. Husic and V.S. Pande, J. Am. Chem. Soc. 140 (7), 2386–2396 (2018).
[29] B.G. Keller, S. Aleksic and L. Donati, in Biomolecular Simulations in Structure-based Drug Discovery, edited by F. L. Gervasio (Wiley-Interscience, Weinheim, 2018), p. 67.
[30] L. Donati, M. Heida, B.G. Keller and M. Weber, J. Phys. Condens. Matter 30, 425201 (2018).
[31] L. Donati, M. Weber and B.G. Keller, J. Phys. Condens. Matter 33, 115902 (2021).
[32] L. Donati, M. Weber and B.G. Keller, J. Math. Phys. 63 (12), 123306 (2022).
[33] J.H. Prinz, H. Wu, M. Sarich, B. Keller, M. Senne, M. Held, J.D. Chodera, C. Schütte and F. Noé, J. Chem. Phys. 134, 174105 (2011).
[34] J. Behler, J. Chem. Phys. 134 (7), 074106 (2011).