This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The Kramers turnover in terms of a macro-state projection on phase space

\name Luca Donatia and Christof Schütteb and Marcus Weberc CONTACT Luca Donati. Email: donati@zib.de. Address: Zuse Institute Berlin, Takustraße 7, 14195 Berlin. Telephone: +49 30 841 85 403. a,b,cZuse Institute Berlin, Takustraße 7, 14195 Berlin;
a,bFreie Universität Berlin, Department of Mathematics and Computer Science, Arnimallee 22, 14195 Berlin.
Abstract

We have investigated how Langevin dynamics is affected by the friction coefficient using the novel algorithm ISOKANN, which combines the transfer operator approach with modern machine learning techniques. ISOKANN describes the dynamics in terms of an invariant subspace projection of the Koopman operator defined in the entire state space, avoiding approximations due to dimensionality reduction and discretization. Our results are consistent with the Kramers turnover and show that in the low and moderate friction regimes, metastable macro-states and transition rates are defined in phase space, not only in position space.

keywords:
Langevin dynamics, Kramers turnover, reaction rate theory, ISOKANN, PCCA+, machine learning

1 Introduction

The dynamics of high-dimensional chemical systems can be modeled as one-dimensional Langevin dynamics governed by the stochastic differential equation

{q˙t=ptmp˙t=ddqV(q)γpt+2kBTγmηt,\begin{dcases}\dot{q}_{t}&=\frac{p_{t}}{m}\\ \dot{p}_{t}&=-\frac{d}{dq}V(q)-\gamma p_{t}+\sqrt{2k_{B}T\gamma m}\,\eta_{t}\,,\end{dcases} (1)

where qtq_{t} and ptp_{t} denote the position and the momentum of the system at time tt on a one-dimensional relevant coordinate, V:V:\mathbb{R}\rightarrow\mathbb{R} is the potential of mean force acting on the system along the relevant coordinate, and mm is the effective mass of the system, which represents how much inertia the system has along the relevant coordinate. The system is coupled to a thermostat at temperature TT, with Boltzmann constant kBk_{B}, through the friction coefficient γ\gamma and the stochastic force ηt\eta_{t} defined as Gaussian white noise with ηt=0\langle\eta_{t}\rangle=0 and ηt,ηt=δ(tt)\langle\eta_{t},\,\eta_{t^{\prime}}\rangle=\delta(t-t^{\prime}), where δ(tt)\delta(t-t^{\prime}) is the Dirac delta function. One possible field of application are molecular processes that exhibit metastability, where the energy function V(q)V(q) presents minima and maxima with energy barriers that exceed the thermal energy kBTk_{B}T.

In this context, a fundamental problem is the calculation of transition rates between potential minima, more precise, between macro-states of the system. Indeed, transitions between the macro-states represent the most interesting biochemical processes in many applications, e.g. the folding of an amino acid chain or the binding/unbinding process between a receptor and a ligand. However, they are rare events, i.e. they occur on very large time scales compared to reference time scales such as the oscillation times of hydrogen atoms. Consequently, they are difficult to simulate and analyze, e.g. by means of Molecular Dynamics (MD) simulations.

Over the last century and a half, various theories and methods have been developed to solve this problem and find analytical solutions for calculating rates. The first approximate formula of the problem dates back to Arrhenius, who derived in 1884 [1] the proportionality equation

keβEb,\displaystyle k\propto e^{-\beta E_{b}}\,, (2)

where kk denotes the escape rate, β=1/kBT\beta=\nicefrac{{1}}{{k_{B}T}} is the inverse of the thermal energy, and EbE_{b} is the height of the energy barrier, also known as activation energy of the reaction. Later, further theories were developed that apply to different contexts of chemistry and physics. Particularly noteworthy is the work of Kramers, who in 1940 [2] studied transition rates for one-dimensional systems driven by the Langevin equation and derived three formulas that apply to low, moderate and high friction regimes. The three formulas well reproduce the so called Kramers turnover, a curve describing the transition rate as a function of γ\gamma: the transition rate is linear with the coefficient γ\gamma at low friction, then, having reached a plateau, the rate decays inversely to γ\gamma. However, Kramers’ theory remains incomplete in some aspects that were only later resolved. For example, Langer derived a formula for multidimensional systems that operates in high friction regime [3], Chandler derived a formula that takes into account non-Markovian effects [4], Mel’nikov and Meshkov have found an expression that improves the prediction in the transition from low to moderate friction [5], and, Pollak, Grabert and Hänggi found a single expression that covers the entire friction range using a normal mode approach [6]. These and other methods, which we do not mention for the sake of brevity, fall into the category of model-based methods, i.e based on the physical model of the system under investigation.

In this paper, we study the dependence of Langevin dynamics on the friction coefficient γ\gamma using its representation in terms of the Koopman operator [7, 8], which allows to transform the nonlinear problem defined in eq. 1 into a linear problem. The price of this is that the finite-dimensional dynamics in phase space is transformed into an infinite-dimensional problem in the space of observable functions [9, 10]. For this reason, we seek invariant subspaces of the Koopman operator with finite dimensions. We use the ISOKANN algorithm [11], a data-driven method that identifies membership functions that constitute a basis of an invariant subspace of the Koopman operator preserving the Markovianity of the projected process. The peculiarity of ISOKANN is that it does not require the identification and the discretization of reaction coordinates, instead, membership functions can be estimated on states of the full space by means of machine learning techniques such as neural networks, overcoming the problem of the curse of dimensionality.

Membership functions are a generalization of ordinary crisp sets and characterize the metastable macro-states of the system preserving the time scales of the micro system when projected onto the macro-states [12, 13, 14]. Using ISOKANN, we estimated the phase space membership functions and calculated the rates, which represent transitions on the phase space. In this way, we reproduced a rate curve as a function of friction, which is analogous to the Kramers turnover. However, our results show that in low and moderate friction regimes, the rates in position space are an approximation due to the loss of Markovianity of the dynamics defined in eq. 1. To obtain a correct representation of dynamics, even in low and moderate friction, it is necessary to take momenta into account. Thus, with this work we intend to open up a new perspective in reaction rate theory. This is possible through the use of increasingly advanced machine learning techniques, such as ISOKANN, which allows for the estimation of rates as functions of the entire phase space, preventing errors induced by discretization or dimensionality reduction.

2 Theory

We briefly introduce the operator theory that is needed to project Langevin dynamics onto macro-states [15].

2.1 Transfer operator approach

The dynamics of a stochastic process solution of the Langevin equation defined in eq. 1 is equivalently described by the dynamics of the time-dependent probability density ρt(x)\rho_{t}(x) solution of the partial differential equation

ρt(x)t=𝒬ρt(x),\displaystyle\frac{\partial\rho_{t}(x)}{\partial t}=\mathcal{Q}^{*}\rho_{t}(x)\,, (3)

where the operator 𝒬\mathcal{Q}^{*} defines the Fokker-Planck equation, or forward Kolmogorov equation, and x=(q,p)Γ2x=(q,p)\in\Gamma\subset\mathbb{R}^{2} represents the state of the system in the phase space. The solution of eq. 3 is formally written as

ρt+τ(x)\displaystyle\rho_{t+\tau}(x) =\displaystyle= exp(𝒬τ)ρt(x)\displaystyle\exp\left(\mathcal{Q}^{*}\,\tau\right)\rho_{t}(x) (4)
=\displaystyle= 𝒫τρt(x),\displaystyle\mathcal{P}_{\tau}\rho_{t}(x)\,, (5)

where 𝒫τ\mathcal{P}_{\tau} denotes the propagator of probability densities with stationary density

limt+ρt(x)=π(x),\displaystyle\lim_{t\rightarrow+\infty}\rho_{t}(x)=\pi(x)\,, (6)

defined by the Boltzmann distribution

π(x):=π(q,p)=1Zexp(β(p22m+V(q))),\displaystyle\pi(x):=\pi(q,p)=\frac{1}{Z}\exp\left(-\beta\left(\frac{p^{2}}{2m}+V(q)\right)\right)\,, (7)

where ZZ is a normalization constant.

Besides considering the evolution of probability densities, it is useful to study the evolution of observable functions f(x)f(x). To this end, we introduce the infinitesimal generator 𝒬\mathcal{Q}, adjoint of the operator 𝒬\mathcal{Q}^{*} that defines the backward Kolmogorov equation

ft(x)t=𝒬ft(x).\displaystyle\frac{\partial f_{t}(x)}{\partial t}=\mathcal{Q}f_{t}(x)\,. (8)

Analogously to eq. 5, we can write a formal solution of eq. 8 as

ft+τ(x)\displaystyle f_{t+\tau}(x) =\displaystyle= exp(𝒬τ)ft(x)\displaystyle\exp\left(\mathcal{Q}\,\tau\right)f_{t}(x) (9)
=\displaystyle= 𝒦τft(x)\displaystyle\mathcal{K}_{\tau}f_{t}(x) (10)
=\displaystyle= 𝔼[f(xt+τ)|xt=x],\displaystyle\mathbb{E}\left[f(x_{t+\tau})|x_{t}=x\right]\,, (11)

where we introduced the Koopman operator 𝒦τ\mathcal{K}_{\tau} which propagates the expectation value of observable functions.

2.2 Rates from membership functions

Consider the τ\tau-dependent eigenvalues λτ,i\lambda_{\tau,i} and the associated eigenfunctions Ψi\Psi_{i} of the Koopman operator 𝒬\mathcal{Q} such that

𝒬Ψi=λτ,iΨi.\displaystyle\mathcal{Q}\Psi_{i}=\lambda_{\tau,i}\Psi_{i}\,. (12)

If the dynamics is ergodic and not periodic, then the first eigenfunction is constant Ψ1=1\Psi_{1}=1 and it is associated to the non-degenerate eigenvalue λτ,1=1\lambda_{\tau,1}=1. In reversible dynamics, the subsequent ncn_{c} dominant eigenfunctions Ψ={Ψ2,,Ψnc}\Psi=\{\Psi_{2},...,\Psi_{n_{c}}\}, associated to sorted and negative eigenvalues λτ,2>>λτ,nc\lambda_{\tau,2}>\dots>\lambda_{\tau,nc}, exhibit positive and negative values, which allows for the identification of metastable macro-states. In the non-reversible case, real-valued functions which span an invariant subspace of the Koopman operator can be applied instead of eigenfunctions. Each point in state space is represented by a vector which comprises of the values of these finitely many (ncn_{c}) functions. These points can be mapped into a (nc1)(n_{c}-1)-simplex whose vertices represent the metastable states whereas the edges represent the transitions. The algorithm PCCA+ [12, 13], by means of a linear transformation, transforms the simplex into a standard simplex, i.e. a simplex whose vertices are unit vectors. Accordingly, the set of dominant eigenfunctions is transformed into a set of membership functions χ=(χ1,χ2,,χnc)\chi=\left(\chi_{1},\chi_{2},\dots,\chi_{n_{c}}\right)^{\top}, with χi:Γ[0,1]\chi_{i}:\Gamma\rightarrow[0,1], i=1,2,,nc\forall i=1,2,\dots,n_{c}, such that iχi=1\sum_{i}\chi_{i}=1. Membership functions characterise the membership of a state xx in the macrostates of the system and by exploiting the linearity of the Koopman operator the exit rate from a macrostate is estimated as

κ=1τlog(a1)(1+a2a11),\displaystyle\kappa=-\frac{1}{\tau}\log(a_{1})\left(1+\frac{a_{2}}{a_{1}-1}\right)\,, (13)

where a1a_{1} and a2a_{2} are obtained solving the linear regression problem

mina1,a2𝒦τχ(x)a1χ(x)+a2.\displaystyle\min\limits_{a_{1},a_{2}}\|\mathcal{K}_{\tau}\chi(x)-a_{1}\chi(x)+a_{2}\|. (14)

For a complete discussion about the χ\chi-exit rates and the derivation of eq. 13, we refer to [16].

2.3 ISOKANN

The calculation of rates according to eqs. 13 and 14 requires the membership function χ\chi and the propagated membership function χt=𝒦τχ(x)\chi_{t}=\mathcal{K}_{\tau}\chi(x). We use ISOKANN [11, 17], an iterative algorithm which modifies the Von-Mises-Algorithm [18] as iteration scheme

fk+1\displaystyle f_{k+1} =\displaystyle= 𝒦τfk𝒦τfk,\displaystyle\frac{\mathcal{K}_{\tau}f_{k}}{\lVert\mathcal{K}_{\tau}f_{k}\rVert}\,, (15)

where the initial function f0f_{0} is an arbitrary function and \|\cdot\| is the supreme norm. As kk\rightarrow\infty, eq. 15 converges to the first eigenfunction of the Koopman operator:

limkfk+1=Ψ1=1.\displaystyle\lim_{k\rightarrow\infty}f_{k+1}=\Psi_{1}=1. (16)

In fact, by applying 𝒦τ\mathcal{K}_{\tau} iteratively to a function ff, one obtains the same result as applying the operator with lag time τ\tau tending to infinity, i.e. a constant function.

Consider now a two-metastable system, as the model studied by Kramers, then the Koopman operator has two dominant eigenfunctions Ψ1\Psi_{1} and Ψ2\Psi_{2}, and the membership functions are written as

{χ1=b1Ψ1+b2Ψ2χ2=1χ1,\displaystyle\begin{cases}\chi_{1}=b_{1}\Psi_{1}+b_{2}\Psi_{2}\\ \chi_{2}=1-\chi_{1}\end{cases}\,, (17)

with b1b_{1} and b2b_{2} appropriate coefficients. We introduce a linear transformation SS to prevent the convergence of the Koopman operator to Ψ1=1\Psi_{1}=1, and retrieve information regarding the eigenfunction Ψ2\Psi_{2} such that Ψ1\Psi_{1} and Ψ2\Psi_{2} span an invariant subspace of the Koopman operator. For this purpose, we choose as SS the shift-scale function

S𝒦τfk=𝒦τfkmin(𝒦τfk)max(𝒦τfk)min(𝒦τfk),\displaystyle S\mathcal{K}_{\tau}f_{k}=\frac{\mathcal{K}_{\tau}f_{k}-\min\left(\mathcal{K}_{\tau}f_{k}\right)}{\max\left(\mathcal{K}_{\tau}f_{k}\right)-\min\left(\mathcal{K}_{\tau}f_{k}\right)}\,, (18)

that guarantees that fk:Γ[0,1]f_{k}:\Gamma\rightarrow[0,1]. The algorithm defined in eq. 15 is rewritten as

fk+1\displaystyle f_{k+1} =\displaystyle= S𝒦τfk,\displaystyle S\mathcal{K}_{\tau}f_{k}\,, (19)

and converges to one of the two membership function:

limkfk+1\displaystyle\lim_{k\rightarrow\infty}f_{k+1} =\displaystyle= χii=1or 2.\displaystyle\chi_{i}\ \quad i=1\ \rm or\ 2. (20)

In general, we do not have an analytical representation of the Koopman operator or do not discretize the entire state space to retrieve its matrix representation. However, we can calculate the action of the Koopman operator on observable functions applied to specific states in space Γ\Gamma. Exploiting the ergodic property, we approximate the expectation in eq. 11 as a time average:

ft+τ(x)\displaystyle f_{t+\tau}(x) \displaystyle\approx f¯(xτ)\displaystyle\bar{f}(x_{\tau}) (21)
=\displaystyle= 1Nn=1Nf(xτ,n),\displaystyle\frac{1}{N}\sum_{n=1}^{N}f(x_{\tau,n})\,, (22)

where xτ,nx_{\tau,n} are the final states of NN trajectories, solutions of eq. 1, starting at x0=xx_{0}=x. Thus, eq. 19 is rewritten as

f¯k+1(x0)\displaystyle\bar{f}_{k+1}(x_{0}) =\displaystyle= Sf¯k(xτ),\displaystyle S\bar{f}_{k}(x_{\tau})\,, (23)

Regarding the choice of the initial function f0f_{0}, a wide range of options is available. The function should be an interpolating function that can be trained at each iteration until it converges to one of the membership function. For higher dimensional systems, the use of neural networks is recommended, as was used in ref. [11]. However, for low-dimensional systems, other interpolation techniques may be used, e.g. spline functions or radial basis functions.

3 Results

As an illustrative example, we considered the Langevin dynamics of a fictitious particle of mass m=1amum=1\,\mathrm{amu} which moves in a one-dimensional potential energy function

V(q)\displaystyle V(q) =\displaystyle= 10(q21)2kJmol1.\displaystyle 10(q^{2}-1)^{2}\ \mathrm{kJ\,mol^{-1}}\,. (24)

The function is characterized by two wells with minima at qA=1nmq_{A}=-1\,\mathrm{nm} and qC=1nmq_{C}=1\,\mathrm{nm}, and a height barrier of 10kJmol110\,\mathrm{kJ\,mol^{-1}} at qB=0nmq_{B}=0\,\mathrm{nm} as illustrated in fig. 1-(a). For our numerical experiments, we used standard thermodynamic parameters: the temperature of the system was T=300KT=300\,\mathrm{K} and the molar Boltzmann constant was kB=8.314×103kJK1mol1k_{B}=8.314\times 10^{-3}\,\mathrm{kJ\,K^{-1}\,mol^{-1}}.

Refer to caption
Figure 1: (a) Potential energy function (solid line) and harmonic approximation at the bottom of the wells and at the top of the barrier (dashed lines); (b) Phase space with energy levels (black contour lines and yellow dashed line denoting the KBTK_{B}T value) and three trajectories carried out with friction coefficients: γ=0.1ps1\gamma=0.1\,\mathrm{ps^{-1}} (blue), γ=2.2ps1\gamma=2.2\,\mathrm{ps^{-1}} (red), γ=30.0ps1\gamma=30.0\,\mathrm{ps^{-1}} (green).

3.0.1 Classic Kramers turnover

In order to reproduce the classic Kramers turnover, we selected 25 friction coefficient values γ\gamma between 0.1ps10.1\,\mathrm{ps^{-1}} and 30.0ps130.0\,\mathrm{ps^{-1}} and we solved the Langevin eq. 1 using the Brünger, Brooks and Karplus (BBK) integrator scheme [19] with an integrator timestep Δt=0.005ps\Delta t=0.005\,\mathrm{ps}. For each value of γ\gamma, we ran 500 simulations starting at the bottom of the left well of the potential with an initial momentum randomly drawn from the Boltzmann distribution. After the particle reached the bottom of the right well, the simulations were stopped and we calculated the mean time, i.e. the Mean First Passage Time (MFPT) τfp\langle\tau_{fp}\rangle [20], from which we obtained the transition rate as

kAC=1τfp.\displaystyle k_{A\rightarrow C}=\frac{1}{\langle\tau_{fp}\rangle}\,. (25)

The results of this numerical experiment are reported in fig. 2-(a) as black squares. If the friction is very low (γ0.1ps1\gamma\approx 0.1\,\mathrm{ps}^{-1}), the dynamics of the system (eq. 1) is almost deterministic, and the system, unless it has enough initial momentum, is trapped in the well with an extremely low probability of escape. Correspondingly, the MFPT is very large and the value of the rate tends to zero. However, increasing the friction by a small amount (γ1.5ps1\gamma\approx 1.5\,\mathrm{ps}^{-1}) the system gets enough thermal energy through the random force and increases the probability to escape from the well. In fact, for low values of γ\gamma, the stochastic force 2kBTγmηt\sqrt{2k_{B}T\gamma m}\,\eta_{t}, which is weighted by the square root of the friction coefficient, dominates the friction force γpt-\gamma p_{t}, which is linear with the friction coefficient. Thus, we observe a rapid and linear increase in rates up to a maximum of kAC0.02ps1k_{A\rightarrow C}\approx 0.02\,\mathrm{ps}^{-1}. Beyond the threshold of γ1.5ps1\gamma\approx 1.5\,\mathrm{ps}^{-1}, the system enters what is called the moderate friction regime. Here, the friction force dominates the Langevin equation, and the probability of escaping the well, despite the high thermal energy, decays as kAC1+1/γ2k_{A\rightarrow C}\propto\sqrt{1+\nicefrac{{1}}{{\gamma^{2}}}}. For higher values of the friction coefficient (γ>20ps1\gamma>20\,\mathrm{ps}^{-1}), the friction term is so strong that the average acceleration of the system tends to zero. The dynamics is overdamped and the transition rate decays as kAC1/γk_{A\rightarrow C}\propto\nicefrac{{1}}{{\gamma}}.

The three friction regimes, here qualitatively described, were formalized by Kramers in 1940 [2]. He assumed a two-metastable system governed by the Langevin dynamics with thermal energy kBTEb+=V(qB)V(qA)k_{B}T\ll E_{b}^{+}=V(q_{B})-V(q_{A}), so as to ensure metastability. In addition, he required that the left well and the top of the barrier of the potential V(q)V(q) are approximated by harmonic potentials with angular frequencies

ωA=1md2Vdq2|qA,andωB=1md2Vdq2|qB.\displaystyle\omega_{A}=\sqrt{\frac{1}{m}\left.\frac{d^{2}V}{dq^{2}}\right|_{q_{A}}}\,,\ \mathrm{and}\ \omega_{B}=\sqrt{\frac{1}{m}\left.\frac{d^{2}V}{dq^{2}}\right|_{q_{B}}}\,. (26)

Under these conditions, Kramers derived a transition rate formula for the low friction regime (γ<ωB\gamma<\omega_{B})

kAC=12γβEb+exp[βEb+],k_{A\rightarrow C}=\frac{1}{2}\gamma\beta E_{b}^{+}\exp\left[-\beta E_{b}^{+}\right]\,, (27)

for the moderate friction regime (γ>ωB\gamma>\omega_{B})

kAC\displaystyle k_{A\rightarrow C} =\displaystyle= γωB(14+ωB2γ212)ωA2πexp[βEb+],\displaystyle\frac{\gamma}{\omega_{B}}\left(\sqrt{\frac{1}{4}+\frac{\omega_{B}^{2}}{\gamma^{2}}}-\frac{1}{2}\right)\cdot\frac{\omega_{A}}{2\pi}\exp\left[-\beta E_{b}^{+}\right]\,, (28)

and the high friction regime (γωB\gamma\gg\omega_{B})

kAC\displaystyle k_{A\rightarrow C} =\displaystyle= ωBγωA2πexp[βEb+].\displaystyle\frac{\omega_{B}}{\gamma}\cdot\frac{\omega_{A}}{2\pi}\exp\left[-\beta E_{b}^{+}\right]\,. (29)

Note that Kramers defines the three regimes by comparing the coefficient of friction with the angular frequencies of the harmonic potentials that approximate the potential. In fact, the transition probability also depends on the curvature near the pit and barrier. The prediction of Kramers’ formulas, reported in fig. 2-(a), is excellent, it is only around the threshold separating the low and moderate friction regimes that the model becomes inaccurate. For more details about Kramers theory, we recommend Refs. [21, 22].

3.0.2 Kramers turnover of membership functions in phase space

In the second numerical experiment, we estimated the transition rates applying ISOKANN to the same setting of the first experiment. We generated 1000 random initial points x0=(q0,p0)x_{0}=(q_{0},p_{0}) from a uniform distribution over the phase space defined by the qq-range [2.0, 2.0][-2.0,\,2.0] nm and the pp-range [10.0, 10.0][-10.0,\,10.0] amu nm ps-1, and for each state we simulated N=100N=100 trajectories of length τ=7\tau=7 ps, corresponding to 1400 timesteps using a timestep integrator Δt=0.005\Delta t=0.005 ps. The ISOKANN algorithm has been applied for 20 iterations using multiquadratic radial basis functions (RBF) [23], which are computationally undemanding and only require a few parameters to optimize during training, resulting in faster convergence. We considered two cases:

  • the membership functions χA(q)\chi_{A}(q) and χC(q)\chi_{C}(q), and the transition rate kχAχCk_{\chi_{A}\rightarrow\chi_{C}} between the macro-states of the position space;

  • the membership functions χ^A(q,p)\hat{\chi}_{A}(q,p) and χ^C(q,p)\hat{\chi}_{C}(q,p) defined on the two-dimensional phase space, and the transition rate k^χAχC\hat{k}_{\chi_{A}\rightarrow\chi_{C}} between macro-states of the phase space. Note that from now on, each quantity marked with the symbol ^\hat{\cdot} denotes a quantity measured over the entire phase space.

The two rates, as functions of the friction coefficient γ\gamma, are reported in fig. 2-(b), respectively as blue upside down triangles and red circles. Both curves show a turnover similar to the rate kACk_{A\rightarrow C} reported in fig. 2-(a): rates have an ascending profile for very low range values, then, having reached the maximum (kχAχC0.4ps1k_{\chi_{A}\rightarrow\chi_{C}}\approx 0.4\,\mathrm{ps}^{-1} and k^χAχC0.2ps1\hat{k}_{\chi_{A}\rightarrow\chi_{C}}\approx 0.2\,\mathrm{ps}^{-1}), descend slowly. However, while the values of the rate k^χAχC\hat{k}_{\chi_{A}\rightarrow\chi_{C}} in phase space are overlapping with those of the Kramers rate kACk_{A\rightarrow C} (although they are different physical quantities), the rate kχAχCk_{\chi_{A}\rightarrow\chi_{C}} defined in position space turns out to be higher in the low friction region but converges to the values of k^χAχC\hat{k}_{\chi_{A}\rightarrow\chi_{C}} in the high friction regime. To understand these results, it is useful to take a look at the membership functions obtained from ISOKANN and shown in fig. 3, where figures (d,e,f) on the second row and (g,h,i) on the third row are respectively the membership functions in the phase space and the position space, for low, moderate and high friction.

Refer to caption
Figure 2: (a) Classic Kramers turnover: transition rate kACk_{A\rightarrow C} estimated by numerical experiment (black squares), Kramers’ formulas for low (blue), moderate (red) and high (green) friction coefficient γ\gamma. The dashed vertical lines denote the threshold between friction regimes; (b) Kramers turnover between membership functions: transition rate k^χAχC\hat{k}_{\chi_{A}\rightarrow\chi_{C}} estimated by grid-based method (black squares) and ISOKANN (red circles), transition rate kχAχCk_{\chi_{A}\rightarrow\chi_{C}} estimated by ISOKANN (blue upside down triangles).

In fig. 3-(a) (low friction regime), the membership functions of the macro-states only have significant values for those states whose total energy E=p2/2m+V(q)E=\nicefrac{{p^{2}}}{{2m}}+V(q) is less than the height of the barrier. The points with a total energy exceeding the barrier are depicted in white, indicating that they have an equal probability of belonging to either χA\chi_{A} or χC\chi_{C}, approximately 0.5. This occurs because trajectories originating from this area undergo periodic oscillations in phase space, visiting both wells as depicted in fig. 1-(b) by the blue trajectory. Correspondingly, in fig. 3-(d), we show the membership function values as a projection of χ^A(q,p)\hat{\chi}_{A}(q,p) and χ^C(q,p)\hat{\chi}_{C}(q,p) onto the position space. The apparent noise is due to the fact that the membership functions on phase space are not constant along the axis of momenta. Therefore, when friction is low, the membership values projected to position space are not functions in a strict sense and do not describe position-based macro-states. In fig. 3-(b) (moderate friction regime), the membership functions draw concentric spirals that terminate in the minima of phase space respectively. This may seem counterintuitive, but observing how a trajectory behaves in the moderate friction regime helps to interpret the membership functions correctly. In fig. 1-(b), the red trajectory starts at position q=1nmq=-1\,\mathrm{nm} and momentum p=8amunmps1p=-8\,\mathrm{amu\,nm\,ps^{-1}}, and reaches the right-hand minimum by following a clockwise trajectory. Similarly, if we start trajectories that are far from the central region of the phase space, we would observe spiral patterns that match with the membership functions. From this figure, we deduce that in the moderate friction regime, the effective barrier, i.e. the transition region, is not at q=0q=0, but between the two spirals. In particular, in the central box [1, 1]×[5, 5][-1,\,1]\times[-5,\,5] of the phase space, the barrier corresponds to a diagonal line which is not parallel to the momentum axis. The reason is that if q=0q=0 and p>0p>0, the system reaches the right well with low probability of recrossing. Conversely, if q=0q=0 and p<0p<0, the system reaches the left region. Along the white diagonal the system is in an unstable equilibrium, i.e. the system has the same probability of reaching one of the wells, and ISOKANN assigns equal probability of membership to the two macro-states. In fig. 3-(c) (high friction regime), membership functions are independent from momentum space. The two regions q<0q<0 and q>0q>0 are assigned to the macro-states regardless of the momentum and the transition region is almost a vertical line. The projections χA(q)\chi_{A}(q) and χC(q)\chi_{C}(q) onto the position space also appear well defined in fig. 3-(f). This occurs because as the friction is very high, the momentum is quickly damped and it does not provide enough energy to overcome the barrier as shown by the green trajectory in fig. 1-(b). In the high friction regime, only the thermal noise can provide enough energy to jump over the barrier.

Refer to caption
Figure 3: (a,b,c) Membership functions χ^(q,p)\hat{\chi}(q,p) for γ=0.1, 2.0, 30ps1\gamma=0.1,\,2.0,\,30\,\mathrm{ps^{-1}} estimated by grid-based model: The blue-white-red colour gradient represents values in the range of 0 to 1. The membership functions are complimentary: χ1+χ2=1\chi_{1}+\chi_{2}=1, then the blue points represent the macro-state χ1\chi_{1} and the red points represent χ2\chi_{2}. The white points can be regarded as transitive regions. (d,e,f) Membership functions χ^(q,p)\hat{\chi}(q,p) for γ=0.1, 2.0, 30ps1\gamma=0.1,\,2.0,\,30\,\mathrm{ps^{-1}} estimated by ISOKANN; (g,h,i) Projection of the membership functions to position space.

3.0.3 Results validation

In order to validate our results, we constructed a reference solution by means of a grid-based technique similar to Ulam’s method [24] which allows to discretize the operator 𝒦τ\mathcal{K}_{\tau} defined in eq. 11 into a transition probability matrix 𝐊(τ)\mathbf{K}(\tau), cf. [25]. Given a discretization of the phase space Γ\Gamma into MM disjoint subsets Γi\Gamma_{i}, with i=1,,Mi=1,\dots,M, and a set of NN simulations of length τ\tau started in a random position of the subset Γi\Gamma_{i}, then the entries of the matrix 𝐊(τ)\mathbf{K}(\tau) are written as

kij(τ)=1Nn=1N𝟏Γj(xi,nτ)\displaystyle k_{ij}(\tau)=\frac{1}{N}\sum_{n=1}^{N}\mathbf{1}_{\Gamma_{j}}(x_{i,n}^{\tau}) (30)

where 𝟏Γj\mathbf{1}_{\Gamma_{j}} is the indicator function

𝟏Γj(x)={1 if xΓj,0 if xΓj,\displaystyle\mathbf{1}_{\Gamma_{j}}(x)=\begin{dcases}1~&\text{ if }~x\in\Gamma_{j}~,\\ 0~&\text{ if }~x\notin\Gamma_{j}~\,,\end{dcases} (31)

and xi,nτx_{i,n}^{\tau} is the final state of the nnth simulation started in Γi\Gamma_{i}. In practice, one counts how many times a simulation starting in Γi\Gamma_{i} ends in Γj\Gamma_{j} and divides by the number of simulations to obtain an estimation of the transition probability. Afterward, an approximation of the infinitesimal generator, sometimes referred to as pseudogenerator, is obtained as

𝐐~=𝐊(τ)𝐈τ,\displaystyle\widetilde{\mathbf{Q}}=\frac{\mathbf{K}(\tau)-\mathbf{I}}{\tau}\,, (32)

where 𝐈\mathbf{I} denotes an identity matrix of the same size as 𝐊τ\mathbf{K}{\tau}. Then the membership functions χ^(q,p)\hat{\chi}(q,p) are calculated applying PCCA+ to 𝐐~\widetilde{\mathbf{Q}} and the coarse-grained rate matrix between macro-states is calculated as a Galerkin projection of 𝐐~\widetilde{\mathbf{Q}} onto the membership functions:

𝐐~c\displaystyle\widetilde{\mathbf{Q}}_{c} =\displaystyle= (χdiag(π)χ)1χdiag(π)𝐐~χ\displaystyle(\chi^{\top}\mathrm{diag}(\pi)\chi)^{-1}\chi^{\top}\mathrm{diag}(\pi)\widetilde{\mathbf{Q}}\chi\, (33)

In eq. 33, diag(π)\mathrm{diag}(\pi) denotes an M×MM\times M diagonal matrix, whose diagonal entries are the entries of the Boltzmann distribution π(q,p)\pi(q,p) (eq. 7) evaluated at the centers of subsets Γi\Gamma_{i}. Assuming a two-metastable system, the rate matrix 𝐐~\widetilde{\mathbf{Q}} has size 2×22\times 2:

𝐐~c=(q~χAχCq~χAχCq~χCχAq~χCχA),\displaystyle\widetilde{\mathbf{Q}}_{c}=\begin{pmatrix}-\widetilde{q}_{\chi_{A}\rightarrow\chi_{C}}&\widetilde{q}_{\chi_{A}\rightarrow\chi_{C}}\\ \widetilde{q}_{\chi_{C}\rightarrow\chi_{A}}&-\widetilde{q}_{\chi_{C}\rightarrow\chi_{A}}\\ \end{pmatrix}\,, (34)

with q~χAχC,q~χCχA>0\widetilde{q}_{\chi_{A}\rightarrow\chi_{C}},\,\widetilde{q}_{\chi_{C}\rightarrow\chi_{A}}>0 representing the transition rates between the macro-states. For the sole case of a bimetastable system, these rates are equivalent to the exit rates defined in eq. 13.

Here, we discretized the qq-range [2.0, 2.0][-2.0,\,2.0] nm in 8080 intervals of the same length Δq=0.05\Delta q=0.05 nm, and the pp-range [10.0, 10.0][-10.0,\,10.0] nm in 7070 intervals of the same length Δp=0.29\Delta p=0.29 nm. The transition rates estimated by PCCA+ are reported in fig. 2-(b) as black squares, while the membership functions are reported in fig. 3-(a,b,c). For each subset, we ran 500 simulations of length 7 ps, with an integrator timestep of 0.005 ps for a total of 1400 timesteps. There is excellent agreement between ISOKANN and the method based on the discretization of the phase space: both methods recreate the Kramers turnover and show the same patterns for the membership functions.

4 Discussion and conclusion

In this article, we studied the effect of the friction coefficient of Langevin dynamics on metastable macro-states of the phase space and calculated the transition rate.

For this purpose, we used the ISOKANN algorithm [11], which identifies macro-states by means of membership functions that form a basis function of an invariant subspace of the Koopman operator. In this subspace, the Koopman operator produces a linear dynamical system of finite dimensions that preserves the Markovianity of Langevin dynamics and can be used to determine kinetic observables such as transition rates.

We investigated a one-dimensional artificial potential, representing a bimetastable system, and reproduced the Kramers turnover. However, differently from the original Kramers work, the transition rate we estimated represents transitions between macro-states in phase space. Our results show that including both the positions and the momenta in defining the macro-states is necessary. Indeed, neglecting the momentum in the low and moderate friction regime introduces non-Markovian effects that are not properly captured by the position-dependent membership functions. In contrast, in the high friction regime, the velocity is instantaneously damped, and the macro-states can be defined as functions of only the position space.

This approach to estimating transition rates can be extended to highly dimensional problems. The typical strategy requires solving the fundamental equations of motion, projecting the dynamics on a small number of relevant coordinates, and discretizing the low-dimensional model to create a matrix representation of the Koopman operator, as is done with Markov State Models [26, 27, 28, 29] or Square Root Approximation of the infinitesimal generator [30, 31, 32]. The price of these techniques is the introduction of assumptions, such as the Markovianity of the projected dynamics, that can lead to significant errors [33]. In contrast, ISOKANN does not require dimensionality reduction or space discretization, and the measured rates can be considered the best representation of the system’s dynamics, net of approximations introduced a priori, e.g., when the equations of motion are numerically solved. Thus, the dimensionality of the system poses no limits to ISOKANN on a theoretical level. However, the implementation of ISOKANN for studying high-dimensional systems is more involved. Here, considering the low-dimensionality of the system, we used radial basis functions, but for higher dimensional systems, we suggest more advanced interpolating functions such as feed-forward neural networks, which allow the use of all system coordinates, including momenta. Another aspect to be taken into account is the choice of the mathematical representation of the molecular system. Indeed, neural networks are not invariant with respect to translations and rotations when Cartesian coordinates are used as input data. Thus, Cartesian coordinates must be transformed to a suitable set of input coordinates, for example pairwise distances, internal coordinates or atom-centred symmetry functions [34].

In summary, with this work, we have shown that ISOKANN is a valid tool for the study of dynamical systems that avoids the subspace projection of transfer operators. Here, we have focused on the classic Kramers problem, studying how macro-states are defined in phase space and highlighting the importance of considering momenta in rate calculation. Nevertheless, ISOKANN’s flexibility and modern machine learning techniques allow for the study of even more complex systems.

Acknowledgments

This research has been funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) through the Cluster of Excellence MATH+, project AA1-15: “Math-powered drug-design” and the Collaborative Research Center CRC 1114, Projects B03 and B05.

References

  • [1] S. Arrhenius, Z. Phys. Chem. 4U (1), 96–116 (1889).
  • [2] H.A. Kramers, Physica 7 (4), 284–304 (1940).
  • [3] J. Langer, Ann. Phys. 54 (2), 258–275 (1969).
  • [4] D. Chandler, The Journal of Chemical Physics 68 (6), 2959–2970 (1978).
  • [5] V.I. Mel’nikov and S.V. Meshkov, The Journal of Chemical Physics 85 (2), 1018–1027 (1986).
  • [6] E. Pollak, H. Grabert and P. Hänggi, The Journal of Chemical Physics 91 (7), 4073–4087 (1989).
  • [7] B. Koopman, Proc. Natl. Acad. Sci. USA 17 (5), 315—318 (1931).
  • [8] B. Koopman and J. Neumann, Proc. Natl. Acad. Sci. USA 3 (18), 255–63 (1932).
  • [9] L.A. Baxter, Probability in the Engineering and Informational Sciences 10 (2), 311–313 (1996).
  • [10] S. Klus, F. Nüske, P. Koltai, H. Wu, I. Kevrekidis, C. Schütte and F. Noé, J. Nonlinear Sci. 28 (2018).
  • [11] R.J. Rabben, S. Ray and M. Weber, J. Chem. Phys. 153 (11), 114109 (2020).
  • [12] P. Deuflhard and M. Weber, Linear Algebra Appl. 398, 161–184 (2004).
  • [13] M. Weber, Ph. D. thesis, FU Berlin, 2006.
  • [14] M. Weber, Computation 6 (1) (2018).
  • [15] C. Schütte, W. Huisinga and P. Deuflhard, in Ergodic Theory, Analysis, and Efficient Simulation of Dynamical Systems, edited by B. Fiedler (Springer, Berlin, 2001), pp. 191–223.
  • [16] M. Weber and N. Ernst, arXiv preprint arXiv:1708.00679 (2017).
  • [17] A. Sikorski, E.R. Borrell and M. Weber, arXiv preprint arXiv:2301.00065 (2022).
  • [18] R. von Mises and H. Pollaczek-Geiringer, Zamm-zeitschrift Fur Angewandte Mathematik Und Mechanik 9, 152–164 (1929).
  • [19] A. Brünger, C.L. Brooks and M. Karplus, Chem. Phys. Lett. 105 (5), 495–500 (1984).
  • [20] L. Pontryagin, A. Andronov and A. Vitt, Zh. Eksp. Teor. Fiz 3, 165 (1933).
  • [21] P. Hänggi, P. Talkner and M. Borkovec, Rev. Mod. Phys. 62, 251–341 (1990).
  • [22] B. Peters, Reaction Rate Theory and Rare Events, 1st ed. (, , 2017).
  • [23] P. Virtanen, R. Gommers, T.E. Oliphant, M. Haberland, T. Reddy, D. Cournapeau, E. Burovski, P. Peterson, W. Weckesser, J. Bright, S.J. van der Walt, M. Brett, J. Wilson, K.J. Millman, N. Mayorov, A.R.J. Nelson, E. Jones, R. Kern, E. Larson, C.J. Carey, İ. Polat, Y. Feng, E.W. Moore, J. VanderPlas, D. Laxalde, J. Perktold, R. Cimrman, I. Henriksen, E.A. Quintero, C.R. Harris, A.M. Archibald, A.H. Ribeiro, F. Pedregosa, P. van Mulbregt and SciPy 1.0 Contributors, Nat. Methods 17, 261–272 (2020).
  • [24] S. Ulam, A Collection of Mathematical Problems Interscience tracts in pure and applied mathematics (, , 1960).
  • [25] C. Schütte, S. Klus and C. Hartmann, Acta Numerica 32, 517–673 (2023).
  • [26] G.R. Bowman, V.S. Pande and F. Noé, editors, An Introduction to Markov State Models and Their Application to Long Timescale Molecular Simulation, Vol. 797 of Advances in Experimental Medicine and Biology (Springer, Heidelberg, 2014).
  • [27] W. Wang, S. Cao, L. Zhu and X. Huang, Wiley Interdisciplinary Reviews: Computational Molecular Science 8 (1), e1343 (2018).
  • [28] B.E. Husic and V.S. Pande, J. Am. Chem. Soc. 140 (7), 2386–2396 (2018).
  • [29] B.G. Keller, S. Aleksic and L. Donati, in Biomolecular Simulations in Structure-based Drug Discovery, edited by F. L. Gervasio (Wiley-Interscience, Weinheim, 2018), p. 67.
  • [30] L. Donati, M. Heida, B.G. Keller and M. Weber, J. Phys. Condens. Matter 30, 425201 (2018).
  • [31] L. Donati, M. Weber and B.G. Keller, J. Phys. Condens. Matter 33, 115902 (2021).
  • [32] L. Donati, M. Weber and B.G. Keller, J. Math. Phys. 63 (12), 123306 (2022).
  • [33] J.H. Prinz, H. Wu, M. Sarich, B. Keller, M. Senne, M. Held, J.D. Chodera, C. Schütte and F. Noé, J. Chem. Phys. 134, 174105 (2011).
  • [34] J. Behler, J. Chem. Phys. 134 (7), 074106 (2011).