Information Thermodynamics of the Transition-Path Ensemble

Miranda D. Louwerse mdlouwer@sfu.ca Department of Chemistry, Simon Fraser University, Burnaby, British Columbia V5A1S6, Canada David A. Sivak dsivak@sfu.ca Department of Physics, Simon Fraser University, Burnaby, British Columbia V5A1S6, Canada

Abstract

The reaction coordinate describing a transition between reactant and product is a fundamental concept in the theory of chemical reactions. Within transition-path theory, a quantitative definition of the reaction coordinate is found in the committor, which is the probability that a trajectory initiated from a given microstate first reaches the product before the reactant. Here we develop an information-theoretic origin for the committor and show how selecting transition paths from a long ergodic equilibrium trajectory induces entropy production which exactly equals the information that system dynamics provide about the reactivity of trajectories. This equality of entropy production and dynamical information generation also holds at the level of arbitrary individual coordinates, providing parallel measures of the coordinate’s relevance to the reaction, each of which is maximized by the committor.

^†^†preprint: APS/123-QED

Understanding the mechanism for a transition between metastable states of a system is of fundamental interest to the natural sciences. Reaction theories seek to derive the rate constant from underlying system dynamics and have led to increased insight into the reaction mechanism, the sequence of elementary steps by which a reaction occurs. A notable example is transition-state theory and its extensions Eyring (1935); Evans and Polanyi (1935); Kramers (1940), which conceptualize the activated complex (or transition-state species) as a key dynamical intermediate and makes use of its properties (e.g., free energy relative to the reactant) to derive an approximate rate constant for large classes of reactions. The transition state is one identifiable state along the reaction coordinate, a one-dimensional collective variable that preserves all quantitative and qualitative aspects of a reaction under projection of the multidimensional dynamics Peters (2016); Bolhuis and Dellago (2015).

Motivated by rare-event sampling methods Dellago et al. (1998), transition-path theory E and Vanden-Eijnden (2006) was developed to quantitatively describe the entire reaction and determine its rate constant, without assumptions of metastability for the reactant and product or any specific details of the reaction mechanism (e.g., the presence of a single transition state). This statistical description relies on the definition of the committor function $q_{\bm{\phi}}$ (also called the commitment or splitting probability), the probability that a trajectory initiated from microstate $\bm{\phi}$ reaches the product before returning to the reactant. The committor maps the state space onto the interval $q_{\bm{\phi}}\in[0,1]$ and has been called the “true” or “ideal” one-dimensional reaction coordinate Peters (2016); E and Vanden-Eijnden (2010); Li and Ma (2014); Peters et al. (2013); Banushkina and Krivov (2016). The committor allows calculation of the reaction rate from a one-dimensional description Berezhkovskii and Szabo (2013) and identifies the transition-state ensemble as states making up the $q_{\bm{\phi}}=0.5$ isocommittor surface Berezhkovskii and Szabo (2005).

In this Letter, we derive a novel information-theoretic justification of the committor as the reaction coordinate. We show how selecting the transition-path ensemble (the set of trajectories from reactant to product) from a long ergodic equilibrium trajectory results in entropy production that precisely equals the information generated by system dynamics about the reactivity of trajectories.

The components of entropy production and information generation due to an arbitrary system coordinate are also equal; this reveals equivalent thermodynamic and information-theoretic measures of the suitability of low-dimensional collective variables that encode information relevant for describing reaction mechanisms. The committor is a single coordinate that preserves all system entropy production and distills all system information about reactivity, giving further support for its role as the reaction coordinate.

Information-theoretic formulation of the committor as reaction coordinate.—Consider a multidimensional system $\bm{\Phi}$ evolving according to Markovian dynamics governed by the master equation Zwanzig (2001), $\mathrm{d}_{t}p(\bm{\phi})=\sum_{\bm{\phi}^{\prime}}T_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime})$ , where $T_{\bm{\phi}\bm{\phi}^{\prime}}$ is the transition rate from state $\bm{\phi}^{\prime}\to\bm{\phi}$ and $p(\bm{\phi})$ is the probability of state $\bm{\phi}$ . We assume the transition rates obey detailed balance Zwanzig (2001) and the system is in equilibrium with its environment so that $p(\bm{\phi})=\pi(\bm{\phi})$ , the equilibrium probability of $\bm{\phi}$ . We study the transition-path ensemble (TPE), the set of trajectories that leave one subset of states $A\in\bm{\Phi}$ and next visit a distinct subset $B\in\bm{\Phi}\setminus A$ before $A$ . In most applications, $A$ and $B$ are metastable states separated by a dynamical barrier; following Refs. Metzner et al. (2009); Vanden-Eijnden (2014), we only assume that $A$ and $B$ do not overlap and lack direct transitions, i.e., $T_{\bm{\phi}\bm{\phi}^{\prime}}=0$ for $\bm{\phi}^{\prime}\in A$ and $\bm{\phi}\in B$ .

The TPE can be formed by selecting from a long ergodic equilibrium supertrajectory the trajectory segments that leave $A$ and reach $B$ before $A$ . Transition paths are therefore selected based on the trajectory outcome $S_{\rm{+}}$ (the next mesostate ( $A$ or $B$ ) visited by the system) and origin $S_{\rm{-}}$ (the mesostate most recently visited by the system). This partitions the supertrajectory into four trajectory subensembles, each with particular $\bm{s}\equiv(s_{\rm{-}},s_{\rm{+}})$ : The forward (reverse) transition-path ensemble is the set of trajectory segments with $\bm{s}=(A,B)$ [ $\bm{s}=(B,A)$ ], and the stationary subensemble from $A\to A$ ( $B\to B$ ) has $\bm{s}=(A,A)$ [ $\bm{s}=(B,B)$ ], as depicted in Fig. 1. Every trajectory segment in the forward TPE has a corresponding equally probable time-reversed trajectory segment in the reverse TPE.

At any time during the equilibrium supertrajectory, we define random variables $\bm{\Phi}$ and $\bm{S}$ , respectively, denoting the current system state and trajectory subensemble, with $p(\bm{\phi},\bm{s})$ the joint distribution that the system is currently in state $\bm{\phi}$ and is currently on a trajectory segment with respective origin and outcome $\bm{s}=\{s_{\rm{-}},s_{\rm{+}}\}$ . Since the system dynamics are Markovian, the trajectory outcome and origin are conditionally independent given current state $\bm{\phi}$ , so the joint distribution can be factored as $p(\bm{\phi},\bm{s})=\pi(\bm{\phi})p(s_{\rm{+}}|\bm{\phi})p(s_{\rm{-}}|\bm{\phi})$ Metzner et al. (2009). The conditional probabilities of trajectory outcome and origin given current state $\bm{\phi}$ are


$\displaystyle p(S_{\rm{+}}=B\,\|\,\bm{\phi})$	$\displaystyle=q^{+}_{\bm{\phi}}$	(1a)
$\displaystyle p(S_{\rm{-}}=A\,\|\,\bm{\phi})$	$\displaystyle=q^{-}_{\bm{\phi}}\>.$	(1b)

Here $q^{+}_{\bm{\phi}}$ is the forward committor, the probability that the system currently in state $\bm{\phi}$ will next reach $B$ before $A$ , and $q^{-}_{\bm{\phi}}$ is the backward committor, the probability that the system (currently in $\bm{\phi}$ ) was more recently in mesostate $A$ than in $B$ . The committors obey boundary conditions $q^{+}_{\bm{\phi}}=0$ and $q^{-}_{\bm{\phi}}=1$ for $\bm{\phi}\in A$ , and $q^{+}_{\bm{\phi}}=1$ and $q^{-}_{\bm{\phi}}=0$ for $\bm{\phi}\in B$ . Since the system is in equilibrium and the transition rates obey detailed balance, $q^{-}_{\bm{\phi}}=1-q^{+}_{\bm{\phi}}$ Metzner et al. (2009), a single committor (without loss of generality, the forward committor $q^{+}_{\bm{\phi}}$ ) provides information about both the outcome and origin of the trajectory segment, so we refer to $q^{+}_{\bm{\phi}}$ as the reaction coordinate.

Refer to caption — Figure 1: Partitioning a long ergodic equilibrium supertrajectory into subensembles based on trajectory outcome $S_{\rm{+}}$ and origin $S_{\rm{-}}$ . Contours: example double-well potential energy. Heat map: probability distribution $p(\bm{\phi}|\bm{s})$ of system state conditioned on trajectory subensemble $\bm{s}$ . Solid curves: representative trajectories from each subensemble. The forward (reverse) TPE in the top-right (bottom-left) panel has net flux of trajectories from $A\to B$ ( $B\to A$ ). The top-left (bottom-right) panel shows the stationary subensemble from $A\to A$ ( $B\to B$ ).

During the equilibrium supertrajectory, the system continually evolves from $A$ to $B$ and $B$ to $A$ , completing a unidirectional cycle through each subensemble with stochastic transition times depending on underlying microscopic dynamics. Transition-path theory Vanden-Eijnden (2014); Berezhkovskii and Szabo (2019); E and Vanden-Eijnden (2006) derives quantitative properties (reaction rate and free-energy difference) of the $A\to B$ reaction from the equilibrium probability flux of subensemble transitions

\displaystyle\nu_{\bm{S}}=\sum_{\bm{\phi}\notin A,\bm{\phi}^{\prime}\in A}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})q^{+}_{\bm{\phi}}

(2)

and the respective marginal probabilities $p(s_{\rm{+}})$ and $p(s_{\rm{-}})$ :


$\displaystyle k_{AB}$	$\displaystyle=$		$\displaystyle\frac{\nu_{\bm{S}}}{p(S_{\rm{-}}=A)}$	$\displaystyle=$		$\displaystyle\frac{\nu_{\bm{S}}}{p(S_{\rm{+}}=A)}$	(3a)
$\displaystyle\quad k_{BA}$	$\displaystyle=$		$\displaystyle\frac{\nu_{\bm{S}}}{p(S_{\rm{-}}=B)}$	$\displaystyle=$		$\displaystyle\frac{\nu_{\bm{S}}}{p(S_{\rm{+}}=B)}$	(3b)
$\displaystyle\beta\Delta F_{AB}$	$\displaystyle=$	$\displaystyle\ln\,$	$\displaystyle\frac{p(S_{\rm{-}}=A)}{p(S_{\rm{-}}=B)}$	$\displaystyle=$	$\displaystyle\ln\,$	$\displaystyle\frac{p(S_{\rm{+}}=A)}{p(S_{\rm{+}}=B)}\>,$	(3c)

where $k_{AB}$ ( $k_{BA}$ ) is the rate constant for the $A\to B$ ( $B\to A$ ) transition and $\Delta F_{AB}\equiv F_{B}-F_{A}$ is the free-energy difference. Mesoscopic reaction properties are therefore derived from information about the subensembles, specifically the proportion of time spent in each subensemble and how frequently the subensemble switches.

The reaction coordinate should be maximally informative about the current subensemble. This is precisely quantified by mutual information, a nonlinear statistical measure of the relationship between two random variables, specifically quantifying the reduction of uncertainty (given by Shannon entropy $H(X)\equiv-\sum_{x}p(x)\ln p(x)$ ) about one random variable from measuring another Cover and Thomas (2006):

I(\bm{S};\bm{\Phi})\equiv\sum_{\bm{\phi},\bm{s}}p(\bm{\phi},\bm{s})\ln\frac{p(\bm{\phi},\bm{s})}{\pi(\bm{\phi})p(\bm{s})}\>,

(4)

where $p(\bm{s})=\sum_{\bm{\phi}}p(\bm{\phi},\bm{s})$ is the marginal probability that the system is currently on a trajectory segment with outcome and origin $\bm{s}=(s_{\rm{-}},s_{\rm{+}})$ . Operationally, $p(\bm{s})$ can be estimated from the proportion of time $\tau_{\bm{s}}$ spent in subensemble $\bm{s}$ during an equilibrium supertrajectory of length $\tau$ , $p(\bm{s})=\lim_{\tau\to\infty}\tau_{\bm{s}}/\tau$ . If the committor depends only on a one-dimensional coordinate $X\in\bm{\Phi}$ (i.e. $q_{\bm{\phi}}=q_{x}$ ), then $X$ is a sufficient statistic for the mutual information between trajectory subensemble and full system state, i.e., $I(\bm{S};\bm{\Phi})=I(\bm{S};X)$ . In this sense, the committor is the “optimal” reaction coordinate, since it is maximally informative about the trajectory subensemble given a measurement of system state. This is our first major result.

Physically, the trajectory outcome and origin (and hence the committors) represent uncertainty in the state of the environment. Classical mechanics assumes a constant-energy universe (system $\bm{\Phi}$ plus environment $\bm{\Psi}$ ) governed by deterministic dynamics so that the outcome and origin of the trajectory initiated from a given state of system and environment are deterministic (and can be determined by integrating the state of the universe forward and backward in time until the system reaches $A$ or $B$ ), i.e., $p(\bm{s}|\bm{\phi},\bm{\psi})$ is either 0 or 1. This partitions the state space of the universe into four quadrants corresponding to each trajectory subensemble, with each state $(\bm{\phi},\bm{\psi})$ belonging to only one subensemble; thus the uncertainty about the trajectory subensemble given a state of the universe is zero, $H(\bm{S}|\bm{\Phi},\bm{\Psi})\equiv-\sum_{\bm{\phi},\bm{\psi},\bm{s}}p(\bm{\phi},\bm{\psi},\bm{s})\ln p(\bm{s}|\bm{\phi},\bm{\psi})=0$ . In this case, the mutual information between the universe and trajectory subensemble is the uncertainty about the trajectory subensemble, $I(\bm{S};\bm{\Phi},\bm{\Psi})=H(\bm{S})-H(\bm{S}|\bm{\Phi},\bm{\Psi})=H(\bm{S})$ ; the measurement of the state of the universe fully determines the trajectory outcome and origin.

However, we typically do not resolve the microstate of the environment, instead coarse-graining its interaction with the system into friction and fluctuations Zwanzig (2001). Measurement of the system state alone does not fully determine the trajectory outcome and origin, which become random variables with positive conditional Shannon entropy $H(\bm{S}|\bm{\Phi})\equiv-\sum_{\bm{s},\bm{\phi}}p(\bm{\phi},\bm{s})\ln p(\bm{s}|\bm{\phi})>0$ reflecting uncertainty in the state of the environment that is relevant to classification of the current subensemble.

Transition-path thermodynamics.—The joint dynamics of $(\bm{\Phi},\bm{S})$ is given by the master equation

\mathrm{d}_{t}p(\bm{\phi},\bm{s})=\sum_{\bm{\phi}^{\prime},\bm{s}^{\prime}}T^{\bm{s}\bm{s}^{\prime}}_{\bm{\phi}\bm{\phi}^{\prime}}\,p(\bm{\phi}^{\prime},\bm{s}^{\prime})

(5)

where the $(\bm{\phi}^{\prime},\bm{s}^{\prime})\to(\bm{\phi},\bm{s})$ transition rate is (see Supplemental Material I SM )

T^{\bm{s}\bm{s}^{\prime}}_{\bm{\phi}\bm{\phi}^{\prime}}=\begin{cases}T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}\equiv T_{\bm{\phi}\bm{\phi}^{\prime}}\frac{p(s_{\rm{+}}|\bm{\phi})}{p(s_{\rm{+}}|\bm{\phi}^{\prime})},&\bm{s}^{\prime}=\bm{s}\\ T_{\bm{\phi}\bm{\phi}^{\prime}}p(S_{\rm{+}}=B|\bm{\phi})\,,&\begin{cases}\bm{\phi}^{\prime}\in A\>{\rm{,}}\>\bm{\phi}\notin A\>\rm{,}\\ \bm{s}^{\prime}=(A,A)\>{\rm{,}}\>\bm{s}=(A,B)\\ \end{cases}\\ \vspace{-2.5ex}\\ T_{\bm{\phi}\bm{\phi}^{\prime}}p(S_{\rm{+}}=A|\bm{\phi})\,,&\begin{cases}\bm{\phi}^{\prime}\in B\>{\rm{,}}\>\bm{\phi}\notin B\>\rm{,}\\ \bm{s}^{\prime}=(B,B)\>{\rm{,}}\>\bm{s}=(B,A)\\ \end{cases}\\ \vspace{-2.5ex}\\ T_{\bm{\phi}\bm{\phi}^{\prime}}/p(S_{\rm{+}}=A|\bm{\phi}^{\prime})\,,&\begin{cases}\bm{\phi}^{\prime}\notin A\>{\rm{,}}\>\bm{\phi}\in A\>\rm{,}\\ \bm{s}^{\prime}=(B,A)\>{\rm{,}}\>\bm{s}=(A,A)\\ \end{cases}\\ \vspace{-2.5ex}\\ T_{\bm{\phi}\bm{\phi}^{\prime}}/p(S_{\rm{+}}=B|\bm{\phi}^{\prime})\,,&\begin{cases}\bm{\phi}^{\prime}\notin B\>{\rm{,}}\>\bm{\phi}\in B\>\rm{,}\\ \bm{s}^{\prime}=(A,B)\>{\rm{,}}\>\bm{s}=(B,B)\\ \end{cases}\\ -\sum\limits_{\begin{subarray}{c}\bm{\phi}^{\prime\prime}\neq\bm{\phi}^{\prime}\\ \bm{s}^{\prime\prime}\neq\bm{s}^{\prime}\end{subarray}}\,T_{\bm{\phi}^{\prime\prime}\bm{\phi}^{\prime}}^{\bm{s}^{\prime\prime}\bm{s}^{\prime}},&\bm{\phi}=\bm{\phi}^{\prime}\>{\rm{,}}\>\bm{s}=\bm{s}^{\prime}\\ 0&\rm{otherwise}\end{cases}\>.

(6)

The top transition does not change the subensemble, and biases transitions within subensemble $\bm{s}$ toward states with higher probability of trajectory outcome $s_{\rm{+}}$ . The middle four transitions switch subensembles and are unidirectional, contributing to the probability flux $\nu_{\bm{S}}$ [Eq. (2) and Supplemental Material Eq. (S9)]. These joint dynamics are Markovian: Since the underlying system dynamics are Markovian, the transition rates (6) do not depend on the trajectory origin $s_{\rm{-}}$ , and the outcome $s_{\rm{+}}$ does not induce dependence on earlier system states. Considered alone, system dynamics are at equilibrium and microscopically reversible; adding the trajectory outcome and origin variables (that are not functions of system state and explicitly depend on the past and future) breaks time-reversal symmetry, producing absolutely irreversible trajectory-subensemble transitions and time-asymmetric system transitions within a given subensemble.

To quantify the time asymmetry for a particular $\bm{\phi}^{\prime}\to\bm{\phi}$ transition in subensemble $\bm{s}$ , we combine (6) Bayes’ rule, and the equilibrium detailed-balance relation $T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})=T_{\bm{\phi}^{\prime}\bm{\phi}}\pi(\bm{\phi})$ to derive a local detailed-balance relation,

\frac{T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime}|\bm{s})}{T^{\bm{s}}_{\bm{\phi}^{\prime}\bm{\phi}}p(\bm{\phi}|\bm{s})}=\frac{p(s_{\rm{-}}|\bm{\phi}^{\prime})p(s_{\rm{+}}|\bm{\phi})}{p(s_{\rm{-}}|\bm{\phi})p(s_{\rm{+}}|\bm{\phi}^{\prime})}\ .

(7)

The $A\to A$ (and analogously $B\to B$ ) stationary subensemble has $s_{\rm{+}}=s_{\rm{-}}=A$ , and due to system detailed balance $p(S_{\rm{-}}=A|\bm{\phi})=p(S_{\rm{+}}=A|\bm{\phi})$ , so the rhs is unity and detailed balance holds for transitions within stationary subensembles. The reactive subensembles (forward or reverse TPE) have different trajectory outcome and origin so the rhs side differs from unity, leading to a detailed-balance-breaking flux (and hence entropy production) along particular transitions within these subensembles.

Within a fixed subensemble $\bm{s}$ , the net trajectory flux is


$\displaystyle J^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}$	$\displaystyle=T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime},\bm{s})-T^{\bm{s}}_{\bm{\phi}^{\prime}\bm{\phi}}p(\bm{\phi},\bm{s})$	(8a)
	$\displaystyle=\big{[}p(s_{\rm{-}}\|\bm{\phi}^{\prime})p(s_{\rm{+}}\|\bm{\phi})-p(s_{\rm{-}}\|\bm{\phi})p(s_{\rm{+}}\|\bm{\phi}^{\prime})\big{]}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})\>.$	(8b)

The second equality follows from $p(\bm{\phi},\bm{s})=p(\bm{s}|\bm{\phi})\pi(\bm{\phi})$ ; the conditional independence of $s_{\rm{+}}$ and $s_{\rm{-}}$ given state $\bm{\phi}$ , i.e., $p(s_{\rm{+}},s_{\rm{-}}|\bm{\phi})=p(s_{\rm{+}}|\bm{\phi})p(s_{\rm{-}}|\bm{\phi})$ ; and substitution for $T^{\bm{s}}_{\bm{\phi}^{\prime}\bm{\phi}}$ using (6). The stationary subensembles ( $A\to A$ and $B\to B$ ) have no net flux because each trajectory segment and its time-reversed counterpart occur at equal rates within the same subensemble. In contrast, the forward and reverse TPEs have net trajectory flux since each transition path and its time-reversed counterpart occur in different subensembles. (Our procedure of effectively replicating the state space and introducing opposing fluxes in the replicas by modification of the transition rates bears similarity to nonreversible Markov chains obeying skew detailed balance used to speed convergence to a stationary distribution Turitsyn et al. (2011).)

We decompose (see Supplemental Material II SM ) the change in joint entropy $H(\bm{\Phi},\bm{S})\equiv-\sum_{\bm{\phi},\bm{s}}p(\bm{\phi},\bm{s})\ln p(\bm{\phi},\bm{s})$ at steady state Busiello et al. (2020); Esposito (2012) into


$\displaystyle 0$	$\displaystyle=\mathrm{d}_{t}H(\bm{\Phi},\bm{S})$	(9a)
	$\displaystyle=\underbrace{\sum_{\bm{s}}p(\bm{s})\sum_{\bm{\phi},\bm{\phi}^{\prime}}T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime}\|\bm{s})\ln\frac{T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime}\|\bm{s})}{T^{\bm{s}}_{\bm{\phi}^{\prime}\bm{\phi}}p(\bm{\phi}\|\bm{s})}}_{\langle\dot{\Sigma}\rangle}$
	$\displaystyle\quad-2\ \underbrace{\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}}}T^{s_{\rm{+}}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime},s_{\rm{+}})\ln\frac{p(s_{\rm{+}}\|\bm{\phi})}{p(s_{\rm{+}}\|\bm{\phi}^{\prime})}}_{\dot{I}^{\bm{\Phi}}(S_{\rm{+}};\bm{\Phi})}\>,$	(9b)

where $\langle\dot{\Sigma}\rangle=\sum_{\bm{s}}p(\bm{s})\dot{\Sigma}_{\bm{s}}$ is the subensemble-weighted average of the irreversible entropy production rate $\dot{\Sigma}_{\bm{s}}$ conditioned on subensemble $\bm{s}$ , which quantifies the time irreversibility of system dynamics within that subensemble. $\dot{I}^{\bm{\Phi}}(S_{\rm{+}};\bm{\Phi})\geq 0$ is the rate of change in mutual information between the trajectory outcome and system state due to system dynamics in a fixed subensemble Horowitz and Esposito (2014). Rearranging Eq. (9b) gives (see Supplemental Material III SM ):


$\displaystyle 0\leq\langle\dot{\Sigma}\rangle$	$\displaystyle=2\dot{I}^{\bm{\Phi}}(S_{\rm{+}};\bm{\Phi})$	(10a)
	$\displaystyle=\dot{I}^{\bm{\Phi}}(S_{\rm{+}};\bm{\Phi})-\dot{I}^{\bm{\Phi}}(S_{\rm{-}};\bm{\Phi})\>,$	(10b)

where $\dot{I}^{\bm{\Phi}}(S_{\rm{-}};\bm{\Phi})\leq 0$ is the rate of change in mutual information between trajectory origin and system state due to $\bm{\Phi}$ dynamics in a fixed subensemble. These information rates reflect the dependence of the trajectory outcome and origin variables on the past and future states of the system: As the system evolves, uncertainty about the outcome $S_{\rm{+}}$ diminishes, increasing the information the current system state carries about $S_{\rm{+}}$ , while uncertainty (given current system state $\bm{\Phi}$ ) about the origin $S_{\rm{-}}$ increases, decreasing information $\bm{\Phi}$ carries about the $S_{\rm{-}}$ .

Since the stationary subensembles have no entropy production ( $\dot{\Sigma}_{\bm{s}=(A,A)}=\dot{\Sigma}_{\bm{s}=(B,B)}=0$ ), Eq. (10a) reduces to an equation for a single subensemble,

0\leq p_{\rm{R}}\dot{\Sigma}_{\rm{R}}=\dot{I}^{\bm{\Phi}}(S_{\rm{+}};\bm{\Phi})\>.

(11)

This equates the rate $\dot{I}^{\bm{\Phi}}(S_{\rm{+}};\bm{\Phi})$ of generating information about the outcome with the product of the entropy production rate of a reactive subensemble $\dot{\Sigma}_{\rm{R}}=\dot{\Sigma}_{\bm{s}=(A,B)}=\dot{\Sigma}_{\bm{s}=(B,A)}$ and that subensemble’s marginal probability $p_{\rm{R}}=p(\bm{S}=(A,B))=p(\bm{S}=(B,A))$ . Although the supertrajectory is at equilibrium with no entropy production, $\dot{\Sigma}_{\rm{R}}$ physically represents the dissipation that would be necessary in a system evolving according to the TPE’s detailed-balance-breaking transition rates (top line of (6) for $s_{\rm{-}}\neq s_{\rm{+}}$ ). Equation (11) is our second major result: The entropy production in a reactive subensemble equals the information generated about the reactivity of trajectories.

When the state space $\bm{\Phi}$ is continuous, we derive (see Supplemental Material IV SM ) a Fisher-information metric $\mathcal{I}(\bm{\phi})$ that imposes an information geometry on the state space Amari (2016); Nielsen (2020). The metric measures distance on the reaction coordinate (committor) as the system evolves and thereby defines a reaction-coordinate length $\mathcal{L}_{AB}$ . From this, the TPE entropy production is

\dot{\Sigma}_{\rm{R}}\approx\frac{\mathcal{L}^{2}_{AB}}{2\tau_{\rm{R}}}\>,

(12)

where $\tau_{\rm{R}}$ is the mean duration of a transition path. This relates the TPE entropy production to the squared length between $A$ and $B$ along the reaction coordinate.

Bipartite dynamics.—We now demonstrate how the TPE entropy production quantitatively measures the relevance of an arbitrary coordinate to the reaction. We assume bipartite dynamics Hartich et al. (2014); Barato et al. (2013), essentially that instantaneous transitions only happen in either a one-dimensional coordinate $X$ or in all other degrees of freedom $\bm{Y}$ making up the system state $\bm{\Phi}=(X,\bm{Y})$ :

T_{xx^{\prime},\bm{y}\bm{y}^{\prime}}=\begin{cases}T_{xx^{\prime},\bm{y}}&x\neq x^{\prime}\>{\rm{,}}\>\bm{y}=\bm{y}^{\prime}\\ T_{x,\bm{y}\bm{y}^{\prime}}&x=x^{\prime}\>{\rm{,}}\>\bm{y}\neq\bm{y}^{\prime}\\ -\sum\limits_{\begin{subarray}{c}x^{\prime\prime}\neq x^{\prime}\\ \bm{y}^{\prime\prime}\neq\bm{y}^{\prime}\end{subarray}}\,T_{x^{\prime\prime}x^{\prime},\bm{y}^{\prime\prime}\bm{y}^{\prime}}&x=x^{\prime}\>{\rm{,}}\>\bm{y}=\bm{y}^{\prime}\\ 0&\rm{otherwise}\end{cases}\>.

(13)

Dynamics that do not obey the bipartite assumption introduce further complications in unambiguously partitioning the entropy production between coordinates Chetrite et al. (2019).

Combining Eqs. (1), (8a), and (9b) gives the full TPE entropy production as a function of the forward committor,

p_{\rm{R}}\dot{\Sigma}_{\rm{R}}=\tfrac{1}{2}\sum_{\bm{\phi},\bm{\phi}^{\prime}}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})(q^{+}_{\bm{\phi}}-q^{+}_{\bm{\phi}^{\prime}})\ln\frac{q^{+}_{\bm{\phi}}(1-q^{+}_{\bm{\phi}^{\prime}})}{q^{+}_{\bm{\phi}^{\prime}}(1-q^{+}_{\bm{\phi}})}\>,

(14)

which splits into contributions from the two transition types:


	$\displaystyle p_{\rm{R}}\dot{\Sigma}_{\rm{R}}=p_{\rm{R}}\dot{\Sigma}_{\rm{R}}^{X}+p_{\rm{R}}\dot{\Sigma}_{\rm{R}}^{\bm{Y}}$		(15a)
	$\displaystyle=\tfrac{1}{2}\sum_{x,x^{\prime},\bm{y}}T_{xx^{\prime},\bm{y}}\pi(x^{\prime},\bm{y})(q^{+}_{x\bm{y}}-q^{+}_{x^{\prime}\bm{y}})\ln{\frac{q^{+}_{x^{\prime}\bm{y}}(1-q^{+}_{x\bm{y}})}{q^{+}_{x\bm{y}}(1-q^{+}_{x^{\prime}\bm{y}})}}$		(15b)
	$\displaystyle\quad+\tfrac{1}{2}\sum_{x,\bm{y},\bm{y}^{\prime}}T_{x,\bm{y}\bm{y}^{\prime}}\pi(x,\bm{y}^{\prime})(q^{+}_{x\bm{y}}-q^{+}_{x\bm{y}^{\prime}})\ln{\frac{q^{+}_{x\bm{y}}(1-q^{+}_{x\bm{y}^{\prime}})}{q^{+}_{x\bm{y}^{\prime}}(1-q^{+}_{x\bm{y}})}}\>.$

The same decomposition holds for the information rate Horowitz and Esposito (2014), so that TPE entropy production due to $X$ dynamics is equal to the information rate (due to $X$ dynamics) between $\bm{\Phi}$ and $S_{\rm{+}}$ :

p_{\rm{R}}\dot{\Sigma}_{\rm{R}}^{X}=\dot{I}^{X}(S_{\rm{+}};\bm{\Phi})\>.

(16)

This is our third major result: The entropy production due to dynamics of coordinate $X$ equals the mutual information generated by $X$ dynamics, thereby quantifying the relevance of $X$ transitions to identifying the current subensemble and highlighting those transitions that are “correlated” with reactive trajectories and therefore important to the reaction mechanism.

In particular, for $X^{*}$ determining the committor and $\bm{Y}^{*}$ orthogonal degrees of freedom that are therefore not relevant to the reaction ( $q_{x\bm{y}}=q_{x}$ ), the entropy production rate due to $\bm{Y}^{*}$ dynamics is [simplifying Eq. (15b)]:


$\displaystyle\dot{\Sigma}_{\rm{R}}^{\bm{Y}^{*}}$	$\displaystyle=\sum_{x,\bm{y},\bm{y}^{\prime}}T_{x,\bm{y}\bm{y}^{\prime}}\pi(x,\bm{y}^{\prime})(q^{+}_{x}-q^{+}_{x})\ln\frac{q^{+}_{x}(1-q^{+}_{x})}{q^{+}_{x}(1-q^{+}_{x})}$	(17a)
	$\displaystyle=0\>.$	(17b)

Therefore $\dot{\Sigma}_{\rm{R}}^{X^{*}}=\dot{\Sigma}_{\rm{R}}$ . This is additional confirmation that the committor is the reaction coordinate, in that it provides a thermodynamically complete coarse-grained representation of the transition-path ensemble, fully accounting for its entropy production Esposito (2012).

We illustrate with overdamped dynamics in a double-well energy landscape [Fig. 2(a), details in Supplemental Material V SM ]. To exemplify the typical situation where the reaction coordinate is not known a priori and coordinates are thus chosen based on convenience or intuition, fixed system coordinates $(x,y)$ lie at an angle $\theta$ to the correct reaction coordinate, the linear coordinate passing through both energy minima. For $\theta=0^{\circ}$ , $X$ is the reaction coordinate, $Y$ is an orthogonal bath mode Li and Ma (2016), and $X$ dynamics fully capture the TPE entropy production without $Y$ contribution. Figure 2(b) shows that as the underlying energy landscape is rotated relative to system coordinates, the $X$ -coordinate entropy production decreases and $Y$ -coordinate entropy production increases, with equal contribution at $\theta=45^{\circ}$ . The entropy production for each coordinate is proportional to the squared Euclidean distance between $A$ and $B$ projected onto each coordinate, $\dot{\Sigma}_{\rm{R}}^{X}(\theta)\propto\cos^{2}\theta$ and $\dot{\Sigma}_{\rm{R}}^{Y}(\theta)\propto\sin^{2}\theta$ .

Discussion.—We have derived the information thermodynamics of a system undergoing reactions between distinct state-space subsets $A$ and $B$ , making a fundamental connection between transition-path theory, information theory, and stochastic thermodynamics. Partitioning a long ergodic equilibrium trajectory into reactive and nonreactive subensembles results in entropy production for system dynamics in the reactive subensembles (physically representing the dissipation needed to implement the detailed-balance-breaking transition rates of the TPE), which in turn identifies transitions that are relevant to the overall reaction mechanism. This rigorous equality between TPE entropy production and informativeness of dynamics also holds for an arbitrary coordinate, revealing parallel stochastic-thermodynamic and information-theoretic measures of the relevance of collective variables to the system reaction, that are each maximized by the committor.

This work has implications for the identification of important collective variables and analysis of reaction mechanisms. While the committor provides a microscopically detailed reaction coordinate that maps each system microstate to a scalar value, it does not immediately identify physically meaningful collective variables (e.g., internal molecular coordinates) that are relevant to the reaction Bolhuis and Dellago (2015); Peters (2016); Johnson and Hummer (2012). Our results have indicated that relevant coordinates are identified by entropy production in the transition-path ensemble; thus partitioning the entropy production between multiple relevant collective variables for which one has physical intuition can provide a low-dimensional model that allows increased insight into the reaction mechanism.

More concretely, this connection we have established between transition-path theory and stochastic thermodynamics suggests a novel method for rigorously grounded inference of reaction coordinates: generate an ensemble of transition paths using transition-path sampling Dellago et al. (1998); Bolhuis et al. (2002) or related algorithms Van Erp et al. (2003); Allen et al. (2005); Faradjian and Elber (2004); estimate entropy production along chosen coordinates Seifert (2019); Li et al. (2019a); Skinner and Dunkel (2021) or identify linear combinations of coordinates producing the most entropy using dissipative components analysis Gnesotto et al. (2020); use these most dissipative coordinates to enhance sampling of transition paths; and through further iteration identify system coordinates producing the most entropy in the transition-path ensemble and hence of most relevance to the reaction.

Machine-learning approaches to solve for high-dimensional committor coordinates Khoo et al. (2019); Li et al. (2019b); Rotskoff et al. or find low-dimensional reaction models that retain predictive power Ma and Dinner (2005); Wang and Tiwary (2021) are active areas of research Wang et al. (2020). The information-theoretic and thermodynamic perspectives on reactive trajectories described in this Letter provide guidance to the development of data-intensive automated methods to infer these models and their corresponding reaction mechanisms.

This work was supported by Natural Sciences and Engineering Research Council of Canada (NSERC) Canada Graduate Scholarships Masters and Doctoral (MDL), an NSERC Discovery Grant (DAS), and a Tier-II Canada Research Chair (DAS). The authors thank Jannik Ehrich (SFU Physics) for enlightening feedback on the manuscript.

References

Eyring (1935) H. Eyring, J. Chem. Phys. 3, 107 (1935).
Evans and Polanyi (1935) M. G. Evans and M. Polanyi, Trans. Faraday Soc. 31, 875 (1935).
Kramers (1940) H. A. Kramers, Physica 7, 284 (1940).
Peters (2016) B. Peters, Annu. Rev. Phys. Chem 67, 669 (2016).
Bolhuis and Dellago (2015) P. G. Bolhuis and C. Dellago, Eur. Phys. J. Spec. Top. 224, 2409 (2015).
Dellago et al. (1998) C. Dellago, P. G. Bolhuis, F. S. Csajka, and D. Chandler, J. Chem. Phys. 108, 1964 (1998).
E and Vanden-Eijnden (2006) W. E and E. Vanden-Eijnden, J. Stat. Phys. 123, 503 (2006).
E and Vanden-Eijnden (2010) W. E and E. Vanden-Eijnden, Annu. Rev. Phys. Chem 61, 391 (2010).
Li and Ma (2014) W. Li and A. Ma, Mol Simul. 40, 784 (2014).
Peters et al. (2013) B. Peters, P. G. Bolhuis, R. G. Mullen, and J.-E. Shea, J. Chem. Phys. 138, 054106 (2013).
Banushkina and Krivov (2016) P. V. Banushkina and S. V. Krivov, WIREs Comput Mol Sci 6, 748 (2016).
Berezhkovskii and Szabo (2013) A. M. Berezhkovskii and A. Szabo, J. Phys. Chem. B 117, 13115 (2013).
Berezhkovskii and Szabo (2005) A. Berezhkovskii and A. Szabo, J. Chem. Phys. 122, 014503 (2005).
Zwanzig (2001) R. Zwanzig, Nonequilibrium statistical mechanics (Oxford University Press, New York, 2001).
Metzner et al. (2009) P. Metzner, C. Schutte, and E. Vanden-Eijnden, SIAM Multiscale Model. Simul. 7, 1192 (2009).
Vanden-Eijnden (2014) E. Vanden-Eijnden, in An introduction to Markov state models their application to long timescale molecular simulation, edited by G. R. Bowman, V. S. Pande, and F. Noe (Springer, 2014), chap. 7, pp. 91–100.
Berezhkovskii and Szabo (2019) A. M. Berezhkovskii and A. Szabo, J. Chem. Phys 150, 054106 (2019).
Cover and Thomas (2006) T. M. Cover and J. A. Thomas, Elements of information theory (John Wiley & Sons, Inc., Hoboken, New Jersey, 2006), 2nd ed.
(19) See Supplemental Material at [URL will be inserted by publisher].
Turitsyn et al. (2011) K. S. Turitsyn, M. Chertkov, and M. Vucelja, Physica D 240, 410 (2011).
Busiello et al. (2020) D. M. Busiello, D. Gupta, and A. Maritan, Phys. Rev. Res. 2, 1 (2020).
Esposito (2012) M. Esposito, Phys. Rev. E 85, 041125 (2012).
Horowitz and Esposito (2014) J. M. Horowitz and M. Esposito, Phys. Rev. X 4, 031015 (2014).
Amari (2016) S.-I. Amari, Information geometry and its applications (Springer, Tokyo, 2016), 1st ed.
Nielsen (2020) F. Nielsen, Entropy 22, 1100 (2020).
Hartich et al. (2014) D. Hartich, A. Barato, and U. Seifert, J. Stat. Mech Theory Exp. 2014, 02016 (2014).
Barato et al. (2013) A. C. Barato, D. Hartich, and U. Seifert, J Stat Phys 153 (2013).
Chetrite et al. (2019) R. Chetrite, M. L. Rosinberg, T. Sagawa, and G. Tarjus, J. Stat. Mech. 21, 114002 (2019).
Li and Ma (2016) W. Li and A. Ma, J. Chem. Phys. 144, 114103 (2016).
Johnson and Hummer (2012) M. E. Johnson and G. Hummer, J. Phys. Chem. B 116, 8573 (2012).
Bolhuis et al. (2002) P. G. Bolhuis, D. Chandler, C. Dellago, and P. L. Geissler, Annu. Rev. Phys. Chem 53, 291 (2002).
Van Erp et al. (2003) T. S. Van Erp, D. Moroni, and P. G. Bolhuis, J. Chem. Phys. 118, 6617 (2003).
Allen et al. (2005) R. J. Allen, P. B. Warren, and P. R. Ten Wolde, Phys. Rev. Lett. 94, 018104 (2005).
Faradjian and Elber (2004) A. K. Faradjian and R. Elber, J. Chem. Phys. 120, 10880 (2004).
Seifert (2019) U. Seifert, Annu. Rev. Condens. Matter Phys. 10, 171 (2019).
Li et al. (2019a) J. M. Li, Junang andHorowitz, T. R. Gingrich, and N. Fakhri, Nature Communications 10 (2019a).
Skinner and Dunkel (2021) D. J. Skinner and J. Dunkel, PNAS 118 (2021).
Gnesotto et al. (2020) F. S. Gnesotto, G. Gradziuk, P. Ronceray, and C. P. Broedersz, Nat. Commun. 11, 5378 (2020).
Khoo et al. (2019) Y. Khoo, J. Lu, and L. Ying, Res. Math. Sci. 6, 1 (2019).
Li et al. (2019b) Q. Li, B. Lin, and W. Ren, J. Chem. Phys. 151, 54112 (2019b).
(41) G. M. Rotskoff, A. R. Mitchell, and E. Vanden-Eijnden, arXiv:2008.06334v2.
Ma and Dinner (2005) A. Ma and A. R. Dinner, J. Phys. Chem. B 109, 6769 (2005).
Wang and Tiwary (2021) Y. Wang and P. Tiwary, J. Chem. Phys. 154, 134111 (2021).
Wang et al. (2020) Y. Wang, J. M. L. Ribeiro, and P. Tiwary, Curr. Opin. Struct. Biol. 61, 139 (2020).
Ito (2018) S. Ito, Phys. Rev. Lett. 121, 30605 (2018).
Ruppeiner (1979) G. Ruppeiner, Phys. Rev. A 20, 1608 (1979).
Crooks (2007) G. E. Crooks, Phys. Rev. Lett. 99, 100602 (2007).
Metropolis et al. (1953) N. Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller, and E. Teller, J. Chem. Phys. 21, 1087 (1953).

Supplemental Material for “Information Thermodynamics of the Transition-Path Ensemble”

I Joint transition rates

Following Vanden-Eijnden (2014), the transition probability over time $\mathrm{d}t$ for a $\bm{\phi}^{\prime}\to\bm{\phi}$ transition given the trajectory remains in subensemble $\bm{s}$ is


$\displaystyle T^{\bm{s}=\bm{s}^{\prime}}_{\bm{\phi}\bm{\phi}^{\prime}}\mathrm{d}t$	$\displaystyle=p(\bm{\phi},s_{\rm{+}},s_{\rm{-}}\|\bm{\phi}^{\prime},s_{\rm{+}}^{\prime},s_{\rm{-}}^{\prime})$	(S1a)
	$\displaystyle=p(s_{\rm{+}},s_{\rm{-}}\|\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}}^{\prime},s_{\rm{-}}^{\prime})\ p(\bm{\phi}\|\bm{\phi}^{\prime},s_{\rm{+}}^{\prime},s_{\rm{-}}^{\prime})$	(S1b)
	$\displaystyle=p(\bm{\phi}\|\bm{\phi}^{\prime},s_{\rm{+}}^{\prime},s_{\rm{-}}^{\prime})$	(S1c)
	$\displaystyle=p(\bm{\phi}\|\bm{\phi}^{\prime},s_{\rm{+}}^{\prime})$	(S1d)
	$\displaystyle=\frac{p(\bm{\phi},s_{\rm{+}}^{\prime}\|\bm{\phi}^{\prime})}{p(s_{\rm{+}}^{\prime}\|\bm{\phi}^{\prime})}$	(S1e)
	$\displaystyle=\frac{p(s_{\rm{+}}^{\prime}\|\bm{\phi})\,p(\bm{\phi}\|\bm{\phi}^{\prime})}{p(s_{\rm{+}}^{\prime}\|\bm{\phi}^{\prime})}\>.$	(S1f)

(S1b) splits the joint probability into conditional and marginal probabilities. In (S1c), we recognize that $p(s_{\rm{+}},s_{\rm{-}}|\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}}^{\prime},s_{\rm{-}}^{\prime})=1$ when the subensemble doesn’t change. In (S1d), we use the Markov property to eliminate the dependence of the next state on trajectory origin. In (S1e), we express the conditional probability as the ratio of joint and marginal probabilities. In (S1f), we recognize that the trajectory outcome depends only on the most recent state $\bm{\phi}$ and drop the dependency on $\bm{\phi}^{\prime}$ . Finally, we recall that for these transitions $s_{\rm{+}}=s_{\rm{+}}^{\prime}$ and re-express the transition probability as a transition rate,

\displaystyle T^{\bm{s}=\bm{s}^{\prime}}_{\bm{\phi}\bm{\phi}^{\prime}}=\frac{p(s_{\rm{+}}|\bm{\phi})}{p(s_{\rm{+}}|\bm{\phi}^{\prime})}T_{\bm{\phi}\bm{\phi}^{\prime}}\>.

(S2)

We similarly derive transition rates for the four sets of transitions that change subensemble, where either the trajectory origin or outcome changes while the other is constant. The transition probability in time $\mathrm{d}t$ for a $\bm{\phi}^{\prime}\to\bm{\phi}$ transition out of $A$ where the subensemble changes from $\bm{s}^{\prime}=(A,A)\to\bm{s}=(A,B)$ is:


$\displaystyle T^{\bm{s}=(A,B),\bm{s}^{\prime}=(A,A)}_{\bm{\phi}\bm{\phi}^{\prime}}\mathrm{d}t$	$\displaystyle=p(\bm{\phi},S_{\rm{+}}=B,S_{\rm{-}}=A\|\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A,S_{\rm{-}}^{\prime}=A)$	(S3a)
	$\displaystyle=p(S_{\rm{+}}=B,S_{\rm{-}}=A\|\bm{\phi},\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A,S_{\rm{-}}^{\prime}=A)\ p(\bm{\phi}\|\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A,S_{\rm{-}}^{\prime}=A)$	(S3b)
	$\displaystyle=p(S_{\rm{+}}=B\|\bm{\phi},\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A,S_{\rm{-}}^{\prime}=A)\ p(\bm{\phi}\|\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A,S_{\rm{-}}^{\prime}=A)$	(S3c)
	$\displaystyle=p(S_{\rm{+}}=B\|\bm{\phi})\ p(\bm{\phi}\|\bm{\phi}^{\prime})\ .$	(S3d)

(S3c) uses the fact that $S_{\rm{-}}=A$ when $\bm{\phi}^{\prime}\in A$ . (S3d) uses the Markov property to simplify the conditional distributions. We then re-express the transition probability as a transition rate

\displaystyle T^{\bm{s}=(A,B),\bm{s}^{\prime}=(A,A)}_{\bm{\phi}\bm{\phi}^{\prime}}=p(S_{\rm{+}}=B|\bm{\phi})\,T_{\bm{\phi}\bm{\phi}^{\prime}}\>.

(S4)

Similarly, the transition rate for a $\bm{\phi}^{\prime}\to\bm{\phi}$ transition where the subensemble outcome changes from $\bm{s}^{\prime}=(B,B)\to\bm{s}^{\prime}=(B,A)$ is

\displaystyle T^{\bm{s}=(B,A),\bm{s}^{\prime}=(B,B)}_{\bm{\phi}\bm{\phi}^{\prime}}=p(S_{\rm{+}}=A|\bm{\phi})\,T_{\bm{\phi}\bm{\phi}^{\prime}}\>.

(S5)

The trajectory origin changes when the system finishes a transition path at the boundary of $A$ or $B$ . The probability for a $\bm{\phi}^{\prime}\to\bm{\phi}$ transition into $A$ where the subensemble changes from $\bm{s}^{\prime}=(B,A)\to\bm{s}=(A,A)$ is:


$\displaystyle T^{\bm{s}=(A,A),\bm{s}^{\prime}=(B,A)}_{\bm{\phi}\bm{\phi}^{\prime}}\mathrm{d}t$	$\displaystyle=p(\bm{\phi},S_{\rm{+}}=A,S_{\rm{-}}=A\|\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A,S_{\rm{-}}^{\prime}=B)$	(S6a)
	$\displaystyle=p(S_{\rm{+}}=A,S_{\rm{-}}=A\|\bm{\phi},\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A,S_{\rm{-}}^{\prime}=B)\ p(\bm{\phi}\|\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A,S_{\rm{-}}^{\prime}=B)$	(S6b)
	$\displaystyle=p(\bm{\phi}\|\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A)$	(S6c)
	$\displaystyle=\frac{p(\bm{\phi},S_{\rm{+}}^{\prime}=A\|\bm{\phi}^{\prime})}{p(S_{\rm{+}}^{\prime}=A\|\bm{\phi}^{\prime})}$	(S6d)
	$\displaystyle=\frac{p(S_{\rm{+}}^{\prime}=A\|\bm{\phi},\bm{\phi}^{\prime})\,p(\bm{\phi}\|\bm{\phi}^{\prime})}{p(S_{\rm{+}}^{\prime}=A\|\bm{\phi}^{\prime})}$	(S6e)
	$\displaystyle=\frac{p(\bm{\phi}\|\bm{\phi}^{\prime})}{p(S_{\rm{+}}^{\prime}=A\|\bm{\phi}^{\prime})}\>.$	(S6f)

(S6c) recognizes that $S_{\rm{+}}=A$ and $S_{\rm{-}}=A$ for $\bm{\phi}\in A$ and uses the Markov property to eliminate dependence on $S_{\rm{-}}^{\prime}$ in $p(\bm{\phi}|\bm{\phi}^{\prime},S_{\rm{+}}^{\prime}=A,S_{\rm{-}}^{\prime}=A)$ . (S6f) uses $S_{\rm{+}}^{\prime}=A$ for $\bm{\phi}\in A$ . Finally, we re-express the transition probability as the transition rate

\displaystyle T^{\bm{s}=(A,A),\bm{s}^{\prime}=(B,A)}_{\bm{\phi}\bm{\phi}^{\prime}}=\frac{1}{p(S_{\rm{+}}=A|\bm{\phi}^{\prime})}T_{\bm{\phi}\bm{\phi}^{\prime}}\>,

(S7)

and similarly derive the transition rate for a $\bm{\phi}^{\prime}\to\bm{\phi}$ transition where the system enters $B$ and finishes a forward TPE ( $\bm{s}^{\prime}=(A,B)\to\bm{s}=(B,B)$ ) as

\displaystyle T^{\bm{s}=(B,B),\bm{s}^{\prime}=(A,B)}_{\bm{\phi}\bm{\phi}^{\prime}}=\frac{1}{p(S_{\rm{+}}=B|\bm{\phi}^{\prime})}T_{\bm{\phi}\bm{\phi}^{\prime}}\>.

(S8)

These rates are explicitly written for each transition in (2), and when averaged over all such transitions yield the unidirectional probability flux $\nu_{\bm{S}}$ between trajectory subensembles (3),


$\displaystyle\nu_{\bm{S}}$	$\displaystyle=\sum_{\bm{\phi}\notin A,\bm{\phi}^{\prime}\in A}p(S_{\rm{+}}=B\|\bm{\phi})\,T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})$	(S9a)
	$\displaystyle=\sum_{\bm{\phi}\notin B,\bm{\phi}^{\prime}\in B}p(S_{\rm{+}}=A\|\bm{\phi})\,T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})$	(S9b)
	$\displaystyle=\sum_{\bm{\phi}\in A,\bm{\phi}^{\prime}\notin A}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})\,p(S_{\rm{-}}=B\|\bm{\phi}^{\prime})$	(S9c)
	$\displaystyle=\sum_{\bm{\phi}\in B,\bm{\phi}^{\prime}\notin B}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})\,p(S_{\rm{-}}=A\|\bm{\phi}^{\prime})\>.$	(S9d)

Note that each RHS of (S9) has an implicit conditional probability for the other element of the trajectory subsensemble, each of which equals unity on the relevant system subspace, e.g., $p(S_{\rm{-}}=A|\bm{\phi}^{\prime})=1$ for $\bm{\phi}^{\prime}\in A$ in (S9a).

II Entropy production for joint dynamics in $(\bm{\Phi},\bm{S})$

We decompose the change in joint entropy (9a) into three terms Busiello et al. (2020)


	$\displaystyle 0$	$\displaystyle=\mathrm{d}_{t}H(\bm{\Phi},\bm{S})=\underbrace{\sum_{\bm{\phi},\bm{\phi}^{\prime},\bm{s}}T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime},\bm{s})\ln\frac{T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime},\bm{s})}{T^{\bm{s}}_{\bm{\phi}^{\prime}\bm{\phi}}p(\bm{\phi},\bm{s})}}_{\dot{H}^{\rm{irr}}(\bm{\Phi},\bm{S})}-\underbrace{\sum_{\bm{\phi},\bm{\phi}^{\prime},\bm{s}}T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime},\bm{s})\ln\frac{T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}}{T^{\bm{s}}_{\bm{\phi}^{\prime}\bm{\phi}}}}_{\dot{H}^{\rm{env}}(\bm{\Phi},\bm{S})}+\underbrace{\sum_{\bm{\phi},\bm{\phi}^{\prime},\bm{s}\neq\bm{s}^{\prime}}T^{\bm{s}\bm{s}^{\prime}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime},\bm{s}^{\prime})\ln\frac{p(\bm{\phi}^{\prime},\bm{s}^{\prime})}{p(\bm{\phi},\bm{s})}}_{\dot{H}^{\rm{sub}}(\bm{\Phi},\bm{S})}\>,$		(S10a)

where $\dot{H}^{\rm{irr}}(\bm{\Phi},\bm{S})$ is the irreversible entropy production and $\dot{H}^{\rm{env}}(\bm{\Phi},\bm{S})$ the environmental entropy change for transitions that do not change the subensemble, and $\dot{H}^{\rm{sub}}(\bm{\Phi},\bm{S})$ is the change in joint entropy due to transitions that change the subensemble.

The transitions that change the trajectory subensemble do not change joint entropy:


$\displaystyle\dot{H}$	${}^{\rm{sub}}(\bm{\Phi},\bm{S})$
	$\displaystyle=\sum_{\bm{\phi}\notin A,\bm{\phi}^{\prime}\in A}T_{\bm{\phi}^{\prime}\bm{\phi}}\pi(\bm{\phi})p(S_{\rm{-}}=B\|\bm{\phi})\ln\frac{p(\bm{\phi},\bm{S}=(B,A))}{\pi(\bm{\phi}^{\prime})}-\sum_{\bm{\phi}\notin A,\bm{\phi}^{\prime}\in A}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})p(S_{\rm{+}}=B\|\bm{\phi})\ln\frac{p(\bm{\phi},\bm{S}=(A,B))}{\pi(\bm{\phi}^{\prime})}$	(S11a)
	$\displaystyle\quad+\sum_{\bm{\phi}\notin B,\bm{\phi}^{\prime}\in B}T_{\bm{\phi}^{\prime}\bm{\phi}}\pi(\bm{\phi})p(S_{\rm{-}}=A\|\bm{\phi})\ln\frac{p(\bm{\phi},\bm{S}=(A,B))}{\pi(\bm{\phi}^{\prime})}-\sum_{\bm{\phi}\notin B,\bm{\phi}^{\prime}\in B}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})p(S_{\rm{+}}=A\|\bm{\phi})\ln\frac{p(\bm{\phi},\bm{S}=(B,A))}{\pi(\bm{\phi}^{\prime})}$
	$\displaystyle=0\>.$	(S11b)

In (S11a), the first and second terms and the third and fourth terms cancel because of the equilibrium relationships $p(S_{\rm{+}}=A|\bm{\phi})=p(S_{\rm{-}}=A|\bm{\phi})$ , $p(S_{\rm{+}}=B|\bm{\phi})=p(S_{\rm{-}}=B|\bm{\phi})$ , and $p(\phi,\bm{S}=(B,A))=p(\phi,\bm{S}=(A,B))$ .

We express the irreversible entropy production $\dot{H}^{\rm{irr}}(\bm{\Phi},\bm{S})$ in terms of the irreversible entropy production of dynamics given fixed subensemble $\bm{s}$ , $\dot{\Sigma}_{\bm{s}}$ Esposito (2012):


$\displaystyle\dot{H}^{\rm{irr}}(\bm{\Phi},\bm{S})$	$\displaystyle=\sum_{\bm{s}}p(\bm{s})\sum_{\bm{\phi},\bm{\phi}^{\prime}}T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime}\|\bm{s})\ln\frac{T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime}\|\bm{s})}{T^{\bm{s}}_{\bm{\phi}^{\prime}\bm{\phi}}p(\bm{\phi}\|\bm{s})}$	(S12a)
	$\displaystyle=\sum_{\bm{s}}p(\bm{s})\dot{\Sigma}_{\bm{s}}$	(S12b)
	$\displaystyle\equiv\langle\dot{\Sigma}\rangle\ .$	(S12c)

Finally, the environmental entropy change is


$\displaystyle\dot{H}^{\rm{env}}(\bm{\Phi},\bm{S})$	$\displaystyle=2\sum_{\bm{\phi},\bm{\phi}^{\prime},\bm{s}}T^{\bm{s}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime},\bm{s})\ln\frac{p(s_{\rm{+}}\|\bm{\phi})}{p(s_{\rm{+}}\|\bm{\phi}^{\prime})}$	(S13a)
	$\displaystyle=2\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}}}T^{s_{\rm{+}}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime},s_{\rm{+}})\ln\frac{p(s_{\rm{+}}\|\bm{\phi})}{p(s_{\rm{+}}\|\bm{\phi}^{\prime})}$	(S13b)
	$\displaystyle=2\dot{I}^{\bm{\Phi}}(S_{\rm{+}};\bm{\Phi})\>,$	(S13c)

where we get (S13b) by summing over the trajectory origin.

III Mutual information rates between system state and either trajectory outcome or trajectory origin have equal magnitude

The rate of change in mutual information between system state and trajectory outcome due to $\bm{\Phi}$ dynamics is of equal magnitude and opposite sign from the rate of change in mutual information between system state and trajectory origin due to $\bm{\Phi}$ dynamics:


$\displaystyle\dot{I}^{\bm{\Phi}}(S_{\rm{+}};\bm{\Phi})$	$\displaystyle=\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}}}T^{s_{\rm{+}}}_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime},s_{\rm{+}})\ln\frac{p(s_{\rm{+}}\|\bm{\phi})}{p(s_{\rm{+}}\|\bm{\phi}^{\prime})}$	(S14a)
	$\displaystyle=\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}}}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})p(s_{\rm{+}}\|\bm{\phi})\ln\frac{p(s_{\rm{+}}\|\bm{\phi})}{p(s_{\rm{+}}\|\bm{\phi}^{\prime})}$	(S14b)
	$\displaystyle=\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{-}}}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})p(s_{\rm{-}}\|\bm{\phi})\ln\frac{p(s_{\rm{-}}\|\bm{\phi})}{p(s_{\rm{-}}\|\bm{\phi}^{\prime})}$	(S14c)
	$\displaystyle=\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{-}}}T_{\bm{\phi}^{\prime}\bm{\phi}}\pi(\bm{\phi})p(s_{\rm{-}}\|\bm{\phi})\ln\frac{p(s_{\rm{-}}\|\bm{\phi})}{p(s_{\rm{-}}\|\bm{\phi}^{\prime})}$	(S14d)
	$\displaystyle=-\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{-}}}T_{\bm{\phi}^{\prime}\bm{\phi}}p(\bm{\phi},s_{\rm{-}})\ln\frac{p(s_{\rm{-}}\|\bm{\phi}^{\prime})}{p(s_{\rm{-}}\|\bm{\phi})}$	(S14e)
	$\displaystyle=-\dot{I}^{\bm{\Phi}}(S_{\rm{-}};\bm{\Phi})\>.$	(S14f)

In (S14b) we expand the joint transition rate using (6); (S14c) relates outcome and origin probabilities using equilibrium relations $p(S_{\rm{+}}=A|\bm{\phi})=p(S_{\rm{-}}=A|\bm{\phi})$ , $p(S_{\rm{+}}=B|\bm{\phi})=p(S_{\rm{-}}=B|\bm{\phi})$ ; and (S14d) uses detailed balance, $T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})=T_{\bm{\phi}^{\prime}\bm{\phi}}\pi(\bm{\phi})$ .

IV Thermodynamic metric

Here we assume that the state space $\bm{\Phi}$ is continuous, and the master equation $\mathrm{d}_{t}p(\bm{\phi})=\sum_{\bm{\phi}^{\prime}}T_{\bm{\phi}\bm{\phi}^{\prime}}p(\bm{\phi}^{\prime})$ represents a discrete approximation of its dynamics. We rearrange the difference in mutual information rates (10b) to obtain the transition-weighted relative entropy $D[p(\bm{s}|\bm{\phi}^{\prime})||p(\bm{s}|\bm{\phi})]\equiv\sum_{\bm{s}}p(\bm{s}|\bm{\phi}^{\prime})\ln p(\bm{s}|\bm{\phi}^{\prime})/p(\bm{s}|\bm{\phi})$ between the conditional subensemble distributions $p(\bm{s}|\bm{\phi}^{\prime})$ and $p(\bm{s}|\bm{\phi})$ before and after the transition, respectively, then expand in small state changes $\bm{\phi}-\bm{\phi}^{\prime}$ :


$\displaystyle p_{\rm{R}}\dot{\Sigma}_{\rm{R}}$	$\displaystyle=\dot{I}^{\bm{\Phi}}(S_{\rm{+}};\bm{\Phi})-\dot{I}^{\bm{\Phi}}(S_{\rm{-}};\bm{\Phi})$	(S15a)
	$\displaystyle=\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}}}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})p(s_{\rm{+}}\|\bm{\phi})\ln\frac{p(s_{\rm{+}}\|\bm{\phi})}{p(s_{\rm{+}}\|\bm{\phi}^{\prime})}-\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{-}}}T_{\bm{\phi}^{\prime}\bm{\phi}}\pi(\bm{\phi})p(s_{\rm{-}}\|\bm{\phi})\ln\frac{p(s_{\rm{-}}\|\bm{\phi}^{\prime})}{p(s_{\rm{-}}\|\bm{\phi})}$	(S15b)
	$\displaystyle=\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}},s_{\rm{-}}}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})p(s_{\rm{+}}\|\bm{\phi})p(s_{\rm{-}}\|\bm{\phi})\ln\frac{p(s_{\rm{+}}\|\bm{\phi})}{p(s_{\rm{+}}\|\bm{\phi}^{\prime})}-\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}},s_{\rm{-}}}T_{\bm{\phi}^{\prime}\bm{\phi}}\pi(\bm{\phi})p(s_{\rm{-}}\|\bm{\phi})p(s_{\rm{+}}\|\bm{\phi})\ln\frac{p(s_{\rm{-}}\|\bm{\phi}^{\prime})}{p(s_{\rm{-}}\|\bm{\phi})}$	(S15c)
	$\displaystyle=\sum_{\bm{\phi},\bm{\phi}^{\prime},s_{\rm{+}},s_{\rm{-}}}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})p(s_{\rm{+}}\|\bm{\phi})p(s_{\rm{-}}\|\bm{\phi})\ln\frac{p(s_{\rm{+}}\|\bm{\phi})p(s_{\rm{-}}\|\bm{\phi})}{p(s_{\rm{+}}\|\bm{\phi}^{\prime})p(s_{\rm{-}}\|\bm{\phi}^{\prime})}$	(S15d)
	$\displaystyle=\sum_{\bm{\phi},\bm{\phi}^{\prime}}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})D[p(\bm{s}\|\bm{\phi}^{\prime})\|\|p(\bm{s}\|\bm{\phi})]$	(S15e)
	$\displaystyle\approx\sum_{\bm{\phi},\bm{\phi}^{\prime}}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})\tfrac{1}{2}\sum_{i,j}(\phi_{i}-\phi^{\prime}_{i})\mathcal{I}_{ij}(\bm{\phi}^{\prime})(\phi_{j}-\phi^{\prime}_{j})\>.$	(S15f)

In (S15c), we multiply each term by unity ( $\sum_{s_{\rm{-}}}p(s_{\rm{-}}|\bm{\phi})$ and $\sum_{s_{\rm{+}}}p(s_{\rm{+}}|\bm{\phi})$ respectively), then use detailed balance ( $T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})=T_{\bm{\phi}^{\prime}\bm{\phi}}\pi(\bm{\phi})$ ) to combine terms in (S15d). $\phi_{i}$ is the $i$ th component of the state-space vector and $\mathcal{I}_{ij}(\bm{\phi})$ is the Fisher information of the trajectory outcome/origin distribution at state $\bm{\phi}$ ,


$\displaystyle\mathcal{I}_{ij}(\bm{\phi})$	$\displaystyle\equiv\sum_{\bm{s}}p\left(\bm{s}\|\bm{\phi}\right)\frac{\partial\ln p(\bm{s}\|\bm{\phi})}{\partial\bm{\phi}_{i}}\frac{\partial\ln p(\bm{s}\|\bm{\phi})}{\partial\bm{\phi}_{j}}$	(S16a)
	$\displaystyle=\sum_{\bm{s}}\frac{1}{p\left(\bm{s}\|\bm{\phi}\right)}\frac{\partial p(\bm{s}\|\bm{\phi})}{\partial\bm{\phi}_{i}}\frac{\partial p(\bm{s}\|\bm{\phi})}{\partial\bm{\phi}_{j}}$	(S16b)
	$\displaystyle=\frac{1}{(1-q^{+}_{\bm{\phi}})q^{+}_{\bm{\phi}}}\frac{\partial(1-q^{+}_{\bm{\phi}})q^{+}_{\bm{\phi}}}{\partial\bm{\phi}_{i}}\frac{\partial(1-q^{+}_{\bm{\phi}})q^{+}_{\bm{\phi}}}{\partial\bm{\phi}_{j}}+\frac{1}{(1-q^{+}_{\bm{\phi}})^{2}}\frac{\partial(1-q^{+}_{\bm{\phi}})^{2}}{\partial\bm{\phi}_{i}}\frac{\partial(1-q^{+}_{\bm{\phi}})^{2}}{\partial\bm{\phi}_{j}}$
	$\displaystyle\quad+\frac{1}{(q^{+}_{\bm{\phi}})^{2}}\frac{\partial(q^{+}_{\bm{\phi}})^{2}}{\partial\bm{\phi}_{i}}\frac{\partial(q^{+}_{\bm{\phi}})^{2}}{\partial\bm{\phi}_{j}}+\frac{1}{q^{+}_{\bm{\phi}}(1-q^{+}_{\bm{\phi}})}\frac{\partial q^{+}_{\bm{\phi}}(1-q^{+}_{\bm{\phi}})}{\partial\bm{\phi}_{i}}\frac{\partial q^{+}_{\bm{\phi}}(1-q^{+}_{\bm{\phi}})}{\partial\bm{\phi}_{j}}$	(S16c)
	$\displaystyle=\left[\frac{(1-2q^{+}_{\bm{\phi}})^{2}}{(1-q^{+}_{\bm{\phi}})q^{+}_{\bm{\phi}}}+\frac{4(1-q^{+}_{\bm{\phi}})^{2}}{(1-q^{+}_{\bm{\phi}})^{2}}+\frac{4(q^{+}_{\bm{\phi}})^{2}}{(q^{+}_{\bm{\phi}})^{2}}+\frac{(1-2q^{+}_{\bm{\phi}})^{2}}{q^{+}_{\bm{\phi}}(1-q^{+}_{\bm{\phi}})}\right]\frac{\partial q^{+}_{\bm{\phi}}}{\partial\bm{\phi}_{i}}\frac{\partial q^{+}_{\bm{\phi}}}{\partial\bm{\phi}_{j}}$	(S16d)
	$\displaystyle=\frac{2}{q^{+}_{\bm{\phi}}\left(1-q^{+}_{\bm{\phi}}\right)}\frac{\partial q^{+}_{\bm{\phi}}}{\partial\bm{\phi}_{i}}\frac{\partial q^{+}_{\bm{\phi}}}{\partial\bm{\phi}_{j}}\>.$	(S16e)

In (S16), we write out each term from (S16b) in terms of the forward committor $q^{+}_{\bm{\phi}}$ . We use the chain rule to pull out a common factor in (S16d) and simplify in (S16e).

In information geometry Amari (2016); Nielsen (2020), Fisher information arises as a distance metric in the parameter space of a probability distribution. Here, we consider changes in conditional probability $p(\bm{s}|\bm{\phi})$ as the system evolves, where the system state parameterizes the conditional probability distribution through the committor $q^{+}_{\bm{\phi}}$ . When the system is in $A$ ( $B$ ), there is no uncertainty about trajectory outcome $S_{\rm{+}}$ and origin $S_{\rm{-}}$ , so the conditional probability $p(\bm{s}|\bm{\phi}\in A)$ ( $p(\bm{s}|\bm{\phi}\in B)$ ) is unity for $\bm{s}=(A,A)$ ( $\bm{s}=(B,B)$ ) and zero for all other subensembles. As the system evolves, the distribution $p(\bm{s}|\bm{\phi})$ corresponding to the current system state changes, and the information-geometric distance between successive distributions is quantified by the Fisher information metric, with square line element Ito (2018)

\displaystyle\mathrm{d}\ell_{\bm{\phi}\bm{\phi}^{\prime}}^{2}\equiv\tfrac{1}{2}\sum_{i,j}(\phi_{i}-\phi^{\prime}_{i})\mathcal{I}_{ij}(\bm{\phi}^{\prime})(\phi_{j}-\phi^{\prime}_{j})\>.

(S17)

Equation (S15f) is then the rate of mean square distance accumulated by the system evolving at equilibrium,

\displaystyle\frac{\langle\mathrm{d}\ell^{2}\rangle}{\mathrm{d}t}=\sum_{\bm{\phi},\bm{\phi}^{\prime}}T_{\bm{\phi}\bm{\phi}^{\prime}}\pi(\bm{\phi}^{\prime})\mathrm{d}\ell_{\bm{\phi}\bm{\phi}^{\prime}}^{2}\>.

(S18)

Multiplying by the mean round-trip time $\tau_{A}+\tau_{B}=(\nu_{\bm{s}})^{-1}$ through the subensembles (the sum of mean first-passage times from $A$ to $B$ and $B$ to $A$ Berezhkovskii and Szabo (2019)), we obtain the squared reaction-coordinate length $\mathcal{L}_{AB}^{2}$ as the mean square metric distance for a round trip $A\to B\to A$ :


$\displaystyle\mathcal{L}_{AB}^{2}$	$\displaystyle\equiv(\tau_{A}+\tau_{B})\frac{\langle\mathrm{d}\ell^{2}\rangle}{\mathrm{d}t}$	(S19a)
	$\displaystyle\approx(\tau_{A}+\tau_{B})2p_{\rm{R}}\dot{\Sigma}_{\rm{R}}$	(S19b)
	$\displaystyle=2\tau_{\rm{R}}\dot{\Sigma}_{\rm{R}}$	(S19c)
$\displaystyle\dot{\Sigma}_{\rm{R}}$	$\displaystyle\approx\frac{\mathcal{L}_{AB}^{2}}{2\tau_{\rm{R}}}\>,$	(S19d)

for mean transition-path duration $\tau_{\rm{R}}=(\tau_{A}+\tau_{B})p_{\rm{R}}$ . The reaction-coordinate length $\mathcal{L}_{AB}$ roughly quantifies the mean number of fluctuations Ruppeiner (1979); Crooks (2007) required for the system to complete a round trip.

V Computational details

The bistable energy potential is separable into terms only depending on the reaction coordinate $r$ and on the bath mode $b$ :

\displaystyle E(r,b)=-k_{\rm B}T\ln\left[e^{-\tfrac{1}{2}\beta k_{\rm{m}}(r+r_{\rm{m}})^{2}}+e^{-\tfrac{1}{2}\beta k_{\rm{m}}(r-r_{\rm{m}})^{2}}\right]+\tfrac{1}{2}kb^{2}\>,

(S20)

where $r_{\rm{m}}=1$ defines the locations of the energy minima. $k_{\rm{m}}$ is the landscape curvature near those energy minima, chosen such that the energy barrier $E(0,0)-E(r_{\rm{m}},0)$ is $4k_{\rm B}T$ .

We represent the system state with orthogonal coordinates $(x,y)$ , related to $(r,b)$ by rotational angle $\theta$ :


$\displaystyle r$	$\displaystyle=x\cos\theta+y\sin\theta$	(S21a)
$\displaystyle b$	$\displaystyle=-x\cos\theta+y\sin\theta\>.$	(S21b)

We discretize the state space $(x,y)$ so that $\mathrm{d}x=\mathrm{d}y=0.04r_{\rm{m}}$ , and dynamically evolve bipartite dynamics using the master equation. The transition rate is $T_{xx^{\prime},yy^{\prime}}=\Gamma*{\rm{min}}\left[1,e^{-\beta\Delta E}\right]$ with diffusion prefactor $\Gamma=0.1\,\mathrm{d}t^{-1}$ and energy change $\Delta E=E(x,y)-E(x^{\prime},y^{\prime})$ Metropolis et al. (1953). We solve the committor on the discrete state space using the recursion relations Metzner et al. (2009)


$\displaystyle q^{+}_{\bm{\phi}}$	$\displaystyle=\mathrm{d}t\sum_{\bm{\phi}^{\prime}}T_{\bm{\phi}^{\prime}\bm{\phi}}q^{\rm{+}}_{\bm{\phi}^{\prime}}$	(S22a)
$\displaystyle q^{-}_{\bm{\phi}}$	$\displaystyle=\mathrm{d}t\sum_{\bm{\phi}^{\prime}}T_{\bm{\phi}\bm{\phi}^{\prime}}\frac{\pi(\bm{\phi}^{\prime})}{\pi(\bm{\phi})}q^{-}_{\bm{\phi}^{\prime}}\>.$	(S22b)

Mesostates are defined by $A=\{x,y\,|\,r(x,y)\leq-r_{m}\}$ and $B=\{x,y\,|\,r(x,y)\geq r_{m}\}$ , so that the committor is independent of $b$ . We calculate the TPE entropy production from (15).