This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Optimization of nonequilibrium free energy harvesting illustrated on bacteriorhodopsin

Jordi Piñero ICREA-Complex Systems Lab, Universitat Pompeu Fabra, 08003 Barcelona, Spain    Ricard Solé ICREA-Complex Systems Lab, Universitat Pompeu Fabra, 08003 Barcelona, Spain Institut de Biologia Evolutiva (CSIC-UPF), 08003 Barcelona, Spain Santa Fe Institute, Santa Fe, New Mexico 87501, United States    Artemy Kolchinsky artemyk@gmail.com ICREA-Complex Systems Lab, Universitat Pompeu Fabra, 08003 Barcelona, Spain Universal Biology Institute, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
Abstract

Harvesting free energy from the environment is essential for the operation of many biological and artificial systems. We use techniques from stochastic thermodynamics to investigate the maximum rate of harvesting achievable by optimizing a set of reactions in a Markovian system, possibly under various kinds of topological, kinetic, and thermodynamic constraints. This question is relevant for the optimal design of new harvesting devices as well as for quantifying the efficiency of existing systems. We first demonstrate that the maximum harvesting rate can be expressed as a constrained convex optimization problem. We illustrate it on bacteriorhodopsin, a light-driven proton pump from Archaea, which we find is close to optimal under realistic conditions. In our second result, we solve the optimization problem in closed-form in three physically meaningful limiting regimes. These closed-form solutions are illustrated on two idealized models of unicyclic harvesting systems.

I Introduction

Many molecular systems, both biological and artificial, harvest free energy from their environments. Biological organisms require free energy to grow and replicate [1, 2], and many species undergo selection for increased harvesting [3, 4, 5, 6]. Artificial harvesting systems have also been constructed and optimized in the field of synthetic biology [7, 8, 9, 10, 11, 12, 13, 14]. The optimization of free energy harvesting is thus a central problem both in biology and engineering.

As an example, consider a harvesting system such as a biological metabolic network that converts glucose to ATP [15]. Suppose that the kinetic and thermodynamic parameters of one or more reactions can be optimized, either by natural selection or artificial design. What is the maximum rate of free energy harvesting that the network can achieve, and what are the kinetic and thermodynamic parameters that achieve it? These questions are relevant both for design of optimal harvesting devices and for quantifying the efficiency of existing systems.

In this paper, we use techniques from stochastic thermodynamics to derive bounds on maximum rate of free energy harvesting. We consider a harvesting system in nonequilibrium steady state which is coupled to an external source of free energy, an internal free energy reservoir, and a heat bath. The setup is well-suited for studying the kinds of small-scale systems usually considered in stochastic thermodynamics [16], where assumptions of local detailed balance and steady state are justified. The steady-state assumption is reasonable in many molecular systems, where there is a separation of timescales between internal relaxation and environmental change.

We suppose that the system’s dynamics can be separate into two kinds of processes, termed baseline and control. The baseline processes, which are held fixed, mediate the coupling to the external source of free energy. Control refers to all other processes which can be optimized for increasing the harvesting rate at which free energy flows into the internal reservoir. The particular separation of baseline/control generally depends on domain knowledge about the system and the scientific question at hand. For example, to study the efficiency of a particular reaction in a metabolic network, that reaction may be treated as control while the other reactions are baseline. The baseline/control separation is similar to the distinction in control theory between “plant” (a given system with fixed dynamical laws) and “controller” (the part that undergoes optimization) [17].

In our first set of results, we demonstrate that the optimization of the harvesting rate can be expressed as a convex optimization problem. The optimal solution of this problem determines both the maximum harvesting rate and the specific control processes that achieve that maximum. Importantly, the optimization can also account for various types of constraints on the possible network topology, kinetic timescales, and thermodynamic forces of the control processes.

Refer to caption
Figure 1: Left: Bacteriorhodopsin is a biomolecular free energy harvesting machine [18], illustrated in its trimer configuration by D. Goodsell (CC-BY-4.0) [19]. Right: during each turn of the bacteriorhodopsin photocycle, the molecule absorbs a photon and pumps a proton against the cell’s membrane potential.

We illustrate our results on bacteriorhodopsin (Fig. 1), a proton-pumping membrane protein. Bacteriorhodopsin is a photosynthetic system found in Archaea, with close relatives in bacteria [20, 21]. It is also used in many artificial light-harvesting systems [7, 9, 8, 11]. Using published thermodynamic and kinetic data, we develop a thermodynamically consistent stochastic model of bacteriorhodopsin. We demonstrate that, under normal operating conditions, the bacteriorhodopsin system is remarkably efficient.

Our main result is formulated as a convex optimization problem which must be solved numerically in the general case. In the second part of this paper, we derive closed-form solutions of this problem for three physically meaningful regimes: the weakly-driven linear response regime, the irreversible deterministic regime, and the intermediate near-deterministic regime. These solutions illustrate how optimal harvesting reflects the “alignment” between free energy input and relaxation dynamics. We illustrate these closed-form solutions on two unicyclic systems, which may be interpreted as idealized models of two types of nonequilibrium harvesting devices.

We finish our paper with a brief Discussion. There we relate our approach to previous work, including maximization of power output in steady-state engines and flux balance analysis. We also propose directions for future research.

II Setup

We consider a system with nn mesostates described by the distribution 𝒑=(p1,,pn)+n\bm{p}=(p_{1},\dots,p_{n})\in\mathbb{R}_{+}^{n}. The distribution evolves according to the master equation p˙i=jRijpj\dot{p}_{i}=\sum_{j}R_{ij}p_{j}, where RjiR_{ji} is the transition rate iji\to j (Rii=jRjiR_{ii}=-\sum_{j}R_{ji}). Usually 𝒑\bm{p} represents a probability distribution of a stochastic system with Markovian dynamics [22, 23]. However, under an appropriate choice of units, it may also represent chemical concentrations in a deterministic chemical reaction network with pseudo-first-order reactions, such as an enzymatic cycle [24, 25].

The system is coupled to a heat bath at inverse temperature β=1/kBT\beta=1/k_{B}T, an internal free energy reservoir, and a nonequilibrium environment that serves as an external source of free energy. For example, in a metabolic network, one may consider an internal reservoir of ATP and an external source of glucose. The system has nonequilibrium free energy

(𝒑)=ipifiβ1S(𝒑),\displaystyle\mathcal{F}(\bm{p})=\sum_{i}p_{i}f_{i}-{\beta}^{-1}S(\bm{p})\,, (1)

where S(𝒑):=ipilnpiS(\bm{p}):=-\sum_{i}p_{i}\ln p_{i} is the Shannon entropy and fif_{i} is the internal free energy of mesostate ii [26].

As mentioned in the Introduction, we suppose that the dynamics of the system can be separated into baseline and control processes. We make one important assumption in our analysis: the control processes are only coupled to the heat bath and internal free energy reservoir, but not directly to the external source of free energy. This means that control can only increase harvesting by interacting with the baseline, rather than directly increasing the inflow of free energy from the external source. For example, in a metabolic network, control processes cannot directly increase the import of glucose, but they can affect the rate at which glucose is converted into ATP by optimizing intermediate reactions and mechanisms. Control processes may be driven by the internal reservoir (e.g., coupled to ATP hydrolysis) or undriven (e.g., enzymes).

To formalize the baseline/control distinction, we write the rate matrix as R=Rb+RcR=R^{b}+R^{c}, where RjibR^{b}_{ji} and RjicR^{c}_{ji} represent the transition rate of iji\to j due to baseline and control. Given distribution 𝒑\bm{p}, the increase of system free energy due to baseline processes is

˙b(𝒑)=i,jpiRjib(fj+β1lnpj).\displaystyle\dot{\mathcal{F}}^{b}(\bm{p})=\sum_{i,j}p_{i}R^{b}_{ji}(f_{j}+{\beta}^{-1}\ln p_{j})\,. (2)

The increase due to control processes is defined analogously but using rate matrix RcR^{c},

˙c(𝒑)=i,jpiRjic(fj+β1lnpj).\displaystyle\dot{\mathcal{F}}^{c}(\bm{p})=\sum_{i,j}p_{i}R^{c}_{ji}(f_{j}+{\beta}^{-1}\ln p_{j})\,. (3)

The harvesting rate is the rate at which free energy flows to the internal reservoir. The harvesting rate due to baseline processes is

𝒢˙b(𝒑)\displaystyle\dot{\mathcal{G}}^{b}(\bm{p}) =i,jpiRjibgjib+ipig˙ib,\displaystyle=\sum_{i,j}p_{i}R^{b}_{ji}{g}^{b}_{ji}+\sum_{i}p_{i}\dot{g}^{b}_{i}\,, (4)

where gjib{g}^{b}_{ji} is the free energy increase in the internal reservoir due to a single baseline transition iji\to j and g˙ib\dot{g}^{b}_{i} is the rate of free energy flow to the internal reservoir due to internal transitions within ii (assuming ii is a coarse-grained mesostate). Similarly, the harvesting rate due to control processes is

𝒢˙c(𝒑)\displaystyle\dot{\mathcal{G}}^{c}(\bm{p}) =i,jpiRjicgjic,\displaystyle=\sum_{i,j}p_{i}R^{c}_{ji}{g}^{c}_{ji}\,, (5)

where gjic{g}^{c}_{ji} is the free energy increase in the internal reservoir due to control transition iji\to j. For simplicity, we assume that control cannot exchange free energy with the internal reservoir due to internal transitions within ii. Negative values of gjib,g˙ib,gjic{g}^{b}_{ji},\dot{g}^{b}_{i},{g}^{c}_{ji} indicate driving done on the system by the internal reservoir.

For a concrete example of how (Rb,Rc,𝒇,𝒈b,𝒈˙b,𝒈c)(R^{b},R^{c},\bm{f},\bm{g}^{b},\dot{\bm{g}}^{b},\bm{g}^{c}) are defined for a real biomolecular system, see the bacteriorhodopsin example below and SM-II [27].

As standard in stochastic thermodynamics, we assume that control processes obey local detailed balance (LDB) [16, 28],

ln(Rjic/Rijc)=β(fifjgjic)\displaystyle\ln(R^{c}_{ji}/R^{c}_{ij})=\beta(f_{i}-f_{j}-{g}^{c}_{ji}) for Rjic>0.\displaystyle\qquad\text{for }R^{c}_{ji}>0\,. (6)

Eq. (6) guarantees that the irreversibility of each control transition is determined by the amount of free energy dissipated by that transition [29]. Observe that the right side accounts for free energy changes of the system (fifjf_{i}-f_{j}) and the internal reservoir (gjic{g}^{c}_{ji}), but not the external source. This formalizes the assumption that control processes do not exchange free energy with the external source.

We do not require that the baseline rate matrix obeys LDB, although it will often do so for reasons of thermodynamic consistency.

III Maximum harvesting rate

Suppose that the combined rate matrix R=Rb+RcR=R^{b}+R^{c} has the steady-state distribution 𝝅\bm{\pi}, which satisfies Rb𝝅+Rc𝝅=0R^{b}\bm{\pi}+R^{c}\bm{\pi}=0. The total steady-state harvesting rate due to baseline and control is

𝒢˙tot=𝒢˙b(𝝅)+𝒢˙c(𝝅).\displaystyle\dot{\mathcal{G}}^{\mathrm{tot}}=\dot{\mathcal{G}}^{b}(\bm{\pi})+\dot{\mathcal{G}}^{c}(\bm{\pi}). (7)

We seek to maximize this harvesting rate by varying the parameters of the control processes (Rc,𝒈c)(R^{c},\bm{g}^{c}) while holding the baseline parameters (𝒇,Rb,𝒈b,𝒈˙b\bm{f},R^{b},\bm{g}^{b},\dot{\bm{g}}^{b}) fixed. Finding this maximum would allow us to determine fundamental bounds on harvesting and to evaluate the efficiency of existing harvesting systems.

However, 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} is not a concave function of the parameters (Rc,𝒈c)(R^{c},\bm{g}^{c}), therefore maximization of (7) is not a convex optimization problem and is not generally intractable. In the following, we reformulate this maximization as a convex optimization with a physically interpretable objective. This allows us to solve the optimization numerically and, at least for some special cases, also in closed form.

To begin, we rewrite (7) as

𝒢˙tot=˙b(𝝅)+𝒢˙b(𝝅)Σ˙(𝑱c)/β.\displaystyle\dot{\mathcal{G}}^{\mathrm{tot}}=\dot{\mathcal{F}}^{b}(\bm{\pi})+\dot{\mathcal{G}}^{b}(\bm{\pi})-\dot{\Sigma}({\boldsymbol{J}}^{c})/\beta. (8)

where we introduced the Schnackenberg formula for the entropy production rate (EPR) [22],

Σ˙(𝑱c)=ijJjicln(Jjic/Jijc)0,\displaystyle\dot{\Sigma}({\boldsymbol{J}}^{c})=\sum_{i\neq j}{J^{c}_{ji}}\ln({J^{c}_{ji}}/{J^{c}_{ij}})\geq 0\,, (9)

where Jjic=πiRjic0{J^{c}_{ji}}=\pi_{i}R^{c}_{ji}\geq 0 is the one-way probability flux due to control transition iji\to j.

Eq. (8) has an intuitive physical interpretation: the total steady-state harvesting rate is the rate of free energy increase in the system and internal reservoir due to baseline, minus the rate of dissipation (EPR) due to the control fluxes. The derivation of this result proceeds in two steps (see SM-I.1 [27] for details). The first step is to show that Σ˙(𝑱c)=β[˙c(𝝅)+β𝒢˙c(𝝅)]\dot{\Sigma}({\boldsymbol{J}}^{c})=-\beta[\dot{\mathcal{F}}^{c}(\bm{\pi})+\beta\dot{\mathcal{G}}^{c}(\bm{\pi})], which follows by combining (9) with (3) and (6). This states that the EPR due to control is proportional to the free energy loss in the system and internal reservoir due to control. The second step is to show that ˙b(𝝅)+˙c(𝝅)=0\dot{\mathcal{F}}^{b}(\bm{\pi})+\dot{\mathcal{F}}^{c}(\bm{\pi})=0, which follows because the left side is the overall derivative of the nonequilibrium free energy \mathcal{F}, as defined in (1), therefore it must vanish in steady state. The result (8) then follows by combining with (7) and rearranging.

Importantly, when expressed in the form (8), the harvesting rate is a concave function of the steady-state distribution 𝝅\bm{\pi} and the control fluxes 𝑱c{\boldsymbol{J}}^{c} (see SM-I.2 [27]). To find the maximum harvesting rate, we optimize (8) with respect to 𝝅\bm{\pi} and 𝑱c{\boldsymbol{J}}^{c}. Note that varying 𝝅\bm{\pi} and 𝑱c{\boldsymbol{J}}^{c} is equivalent to varying the control rate matrix via Rjic=Jjic/πiR^{c}_{ji}={J^{c}_{ji}}/\pi_{i} and control driving gjic{g}^{c}_{ji} via (6). However, when performing this optimization, we must also ensure that 𝝅\bm{\pi} is the steady-state distribution induced by the fluxes 𝑱c{\boldsymbol{J}}^{c}. This condition can be expressed as a linear constraint on 𝝅\bm{\pi} and 𝑱c{\boldsymbol{J}}^{c} via Rb𝝅+𝔹𝑱c=0R^{b}\bm{\pi}+\mathbb{B}{\boldsymbol{J}}^{c}=0. Here 𝑱c{\boldsymbol{J}}^{c} is treated as a vector in n2\mathbb{R}^{n^{2}} and 𝔹n×n2\mathbb{B}\in\mathbb{R}^{n\times n^{2}} is the incidence matrix with entries 𝔹k,ij=δkiδkj\mathbb{B}_{k,ij}=\delta_{ki}-\delta_{kj}, which guarantees Rc𝝅=𝔹𝑱cR^{c}\bm{\pi}=\mathbb{B}{\boldsymbol{J}}^{c}.

Combining, we arrive at the bound 𝒢˙tot𝒢\dot{\mathcal{G}}^{\mathrm{tot}}\leq\mathscr{G}, where

𝒢\displaystyle\mathscr{G} =sup(𝒑,𝑱)Λ:Rb𝒑+𝔹𝑱=0˙b(𝒑)+𝒢˙b(𝒑)Σ˙(𝑱)/β.\displaystyle=\sup_{(\bm{p},{\boldsymbol{J}})\in\Lambda:R^{b}\bm{p}+\mathbb{B}{\boldsymbol{J}}=0}\,\dot{\mathcal{F}}^{b}(\bm{p})+\dot{\mathcal{G}}^{b}(\bm{p})-\dot{\Sigma}({\boldsymbol{J}})/\beta. (10)

In this expression, Λ\Lambda is the feasible set of distributions 𝒑\bm{p} and control fluxes 𝑱{\boldsymbol{J}}. At a minimum, Λ\Lambda ensures the validity of the distribution 𝒑\bm{p} and the fluxes 𝑱{\boldsymbol{J}} via the linear constraints pi=1\sum p_{i}=1, pi0p_{i}\geq 0, and Jji0J_{ji}\geq 0. We write sup\sup instead of max\max because the set of allowed fluxes is potentially unbounded. Eq. (10) implies a tradeoff between the total gain of free energy in the system and internal reservoir due to baseline (which depends only on 𝒑\bm{p}) and the dissipation due to control fluxes (which depends only on 𝑱{\boldsymbol{J}}).

Importantly, the feasible set Λ\Lambda can include additional convex constraints, which may introduce topological, kinetic, thermodynamic, etc. restrictions on the control processes. Topological constraints restrict which transitions can be controlled; e.g., Jji=0J_{ji}=0 ensures that control does not use transition iji\to j). Kinetic constraints restrict timescales of control processes, as might reflect underlying physical processes like diffusion; e.g., an upper bound on control transition rate Rjic=Jji/piκR^{c}_{ji}=J_{ji}/p_{i}\leq\kappa can be enforced via JjipiκJ_{ji}\leq p_{i}\kappa. Thermodynamic constraints bound the affinity [22] of control transitions; e.g., Jjie𝒜JijJjie𝒜J_{ji}e^{-\mathcal{A}}\leq J_{ij}\leq J_{ji}e^{\mathcal{A}} ensures that |ln(Jij/Jji)|𝒜|\ln(J_{ij}/J_{ji})|\leq\mathcal{A}. The above examples all involve linear constraints. An example of a nonlinear, but still convex, constraint is an upper bound on the EPR incurred by control, Σ˙(𝑱)Σ˙maxc\dot{\Sigma}({\boldsymbol{J}})\leq\dot{\Sigma}^{c}_{\max}.

Eq. (10) is our first main result. Importantly, 𝒢\mathscr{G} is defined purely in terms of the thermodynamic and kinetic properties of the baseline process, along with desired constraints encoded in Λ\Lambda. Thus, 𝒢\mathscr{G} is the maximum steady-state harvesting rate that can be achieved by any allowed control processes, given a fixed baseline. In addition, Eq. (10) involves the maximization of a concave objective given convex constraints. This is equivalent to the minimization of a convex objective, thus Eq. (10) is a convex optimization problem that can be efficiently solved using standard numerical techniques [30]. The optimization also identifies an optimal steady-state distribution 𝒑\bm{p}^{*} and control fluxes 𝑱{{\boldsymbol{J}}}^{*} that achieve the maximum harvesting rate 𝒢\mathscr{G} (or come arbitrarily close to achieving it). These fix the optimal control rate matrix via Rjic=Jij/piR^{c*}_{ji}={J_{ij}^{*}}/p^{*}_{i}. Thus, our optimization specifies an upper bound on harvesting as well as the optimal control strategy that achieves this bound.

There is an important special case in which our optimization problem is simplified. Suppose that Λ\Lambda does not enforce additional constraints on 𝒑\bm{p} and 𝑱{\boldsymbol{J}} (more generally, we permit topological constraints if the graph of allowed transitions is connected and contains all nn states). Then, the objective is maximized in limit of fast control, 𝑱{\boldsymbol{J}}\to\infty and Σ˙(𝑱)0\dot{\Sigma}({\boldsymbol{J}})\to 0. We can then write (10) as an optimization over steady-state distributions:

𝒢:=max𝒑:pi=1,pi0˙b(𝒑)+𝒢˙b(𝒑).\displaystyle\mathscr{G}:=\max_{\bm{p}:\sum p_{i}=1,p_{i}\geq 0}\,\dot{\mathcal{F}}^{b}(\bm{p})+\dot{\mathcal{G}}^{b}(\bm{p})\,. (11)

The optimal 𝒑\bm{p}^{*} is unique as long as the baseline rate matrix is irreducible. The optimal control rate matrix is very fast (RcR^{c*}\to\infty) and obeys detailed balance for 𝒑\bm{p}^{*}, Rjicpi=RijcpjR^{c*}_{ji}{p}^{*}_{i}=R^{c*}_{ij}{p}^{*}_{j}. For details, see SM-I.3 and SM-I.4 [27].

IV Bacteriorhodopsin

We illustrate our results using bacteriorhodopsin, a light-driven proton pump from Archaea [18].

We define a thermodynamically consistent model of the bacteriorhodopsin cycle using published thermodynamic [31] and kinetic [32] data (see SM-II [27]). The system operates in a cyclical manner, absorbing a photon and pumping a proton during each turn of the cycle (Fig. 1, right). Specifically, the transition M1M2M_{1}\to M_{2} pumps a proton out of the cell. This stores free energy in the internal reservoir (the membrane electrochemical potential),

gM2M1=gM1M2=eΔψ(ln10)β1ΔpH,\displaystyle g_{M_{2}M_{1}}=-g_{M_{1}M_{2}}=e\Delta\psi-(\ln 10){\beta}^{-1}\Delta\text{pH}\,, (12)

where ee is the elementary charge constant, Δψ\Delta\psi is the membrane electrical potential, and ΔpH\Delta\text{pH} is the membrane pH difference. The other transitions in the cycle do not affect the free energy of the internal reservoir (gij=0g_{ij}=0 and g˙i=0\dot{g}_{i}=0).

During the transition bRK\mathrm{bR}\to K, the system leaves the ground state by absorbing a photon at 580nm, thereby harvesting free energy from the external source and dissipating some heat to the environment at T=293T=293^{\circ} K. This transition is much faster (picoseconds) than the other transitions in the photocycle (micro- to milliseconds). As commonly done in photochemistry [33], we coarse-grain transitions ObRO\to\mathrm{bR} and bRK\mathrm{bR}\to K into a single effective transition OKO\to K.

To explore the performance of bacteriorhodopsin under different conditions, we vary the membrane electrical potential Δψ\Delta\psi between 75-75 and 350350 mV, while using a realistic fixed ΔpH=0.6\Delta\text{pH}=-0.6 [34]. We show the actual harvesting rate (𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} in units of kBT/k_{B}T/sec) at different potentials as a black line in Fig. 2 (a). At a plausible in vivo Δψ=120\Delta\psi=120 mV [34], the model exhibits a steady-state current of 11.5 protons/sec, each proton carrying 6.1 kBTk_{B}T of free energy, corresponding to 𝒢˙tot70kBT{\dot{\mathcal{G}}^{\mathrm{tot}}}\approx 70\,k_{B}T/sec. The largest output is achieved near the in vivo potential: at lower potentials, the cycle current saturates while the free energy per proton drops, and at higher potentials the pump stalls.

Next, we quantify the maximum harvesting rate that can be achieved by optimizing the parameters of individual transitions. This analysis is relevant for understanding limits on increasing bacteriorhodopsin output, whether via natural selection or biosynthetic methods [35, 36, 37, 38]. Interestingly, such transition-level optimization may be feasible in bacteriorhodopsin, as the properties of several transitions are known to be individually controlled by particular amino acid residues in the bacteriorhodopsin protein [35, 39, 40, 41].

For each reversible transition in the cycle, for instance NON\leftrightarrow O, we define the baseline as the rest of the cycle without that transition. We then optimize control under the topological constraint that only the relevant transition (e.g., NON\leftrightarrow O) is allowed. The analysis is repeated for all transitions, except for the (coarse-grained) photon-absorbing transition OKO\leftrightarrow K, which is in accordance with our assumption that control cannot directly exchange free energy directly with the external source.

Refer to caption
Figure 2: (a) Comparison of the actual harvesting rate 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} at different electrical potentials Δψ\Delta\psi, versus maximum rate 𝒢\mathscr{G} achieved by optimizing five intermediate transitions (color scheme from Fig. 1 Right). (b) Efficiency 𝒢˙tot/𝒢\dot{\mathcal{G}}^{\mathrm{tot}}/\mathscr{G} computed while separately optimizing each transition, with colors as in (a). (c) The actual steady state 𝝅\bm{\pi} versus the optimal distribution 𝒑\bm{p}^{*} when optimizing the NON\leftrightarrow O transition (at Δψ=120\Delta\psi=120 mV).

Fig. 2 (a) shows 𝒢\mathscr{G}, the maximum 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} achievable by optimizing each reversible transition. In Fig. 2 (b), we also show the efficiency 𝒢˙tot/𝒢1\dot{\mathcal{G}}^{\mathrm{tot}}/\mathscr{G}\leq 1 for each transition, that is the ratio of the actual and maximum harvesting rate.

Several transitions, such as KLK\leftrightarrow L,LM1L\leftrightarrow M_{1},M1M2M_{1}\leftrightarrow M_{2}, are remarkably efficient (85%\geq 85\%) near in vivo membrane potentials. The reprotonation step NON\leftrightarrow O is the least efficient (40%\sim 40\%) and also has the slowest kinetics of the 5 transitions studied in Fig. 2. This suggests that NON\leftrightarrow O is a bottleneck whose optimization can have a big impact on the harvesting rate, while optimization of other non-bottleneck transitions has a more limited effect.

Observe that 𝒢\mathscr{G} for M1M2M_{1}\leftrightarrow M_{2} does not depend on Δψ\Delta\psi. This is because 𝒢\mathscr{G} is a function of baseline properties, which do not depend on the membrane potential when M1M2M_{1}\leftrightarrow M_{2} is treated as control. Conversely, M1M2M_{1}\leftrightarrow M_{2} as control transition can be optimized by varying the membrane potential and/or scaling up the forward/backward rates. Our results show that this transition is very close to optimal at in vivo membrane potentials and kinetic timescales.

Optimal distributions 𝒑\bm{p}^{*} are also obtained, with a typical one shown in Fig. 2 (c). We find a consistent shift toward state OO, which accelerates the reset of the cycle and increases the flux across the photon-absorbing transition OKO\to K.

In SM-II [27], we illustrate how the efficiency of bacteriorhodopsin transitions can be evaluated under other types of constraints, including constraints on thermodynamic affinity, dynamical activity, and kinetics.

Refer to caption
Figure 3: (a) Comparison of the actual harvesting rate 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} at different electrical potentials Δψ\Delta\psi, versus maximum rate 𝒢\mathscr{G} achieved by fixing the bacteriorhodopsin cycle as baseline and allowing any additional transitions as control. (b) Efficiency of the actual bacteriorhodopsin cycle with respect to the optimized cycle. (c) The actual steady state 𝝅\bm{\pi} and optimal distribution 𝒑\bm{p}^{*} (at Δψ=120\Delta\psi=120 mV).

As a final analysis, instead of optimizing individual existing transitions in the bacteriorhodopsin cycle, we ask to what extent the harvesting rate can be increased by any additional control processes. For example, this could involve an additional enzyme that shifts the cycle’s steady state by permitting a new transition between distant states (e.g. LNL\leftrightarrow N), possibly yielding an increase of the proton pumping rate.

In this case, we treat the entire bacteriorhodopsin system as the baseline, and we do not introduce any additional constraints on the steady-state distribution or the control fluxes. Then, as shown in Sec. SM-I.4 [27], the objective in (10) is achieved in the limit of fast control, and the maximum harvesting rate can be found by solving the simplified optimization problem (11).

For this setup, Fig. 3 (a) shows the baseline (actual) harvesting rate 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} and the maximum harvesting rate 𝒢\mathscr{G} at varying Δψ\Delta\psi. Interestingly, both peak at around the in vivo values of the membrane potential. In Fig. 3 (b), we show that the actual bacteriorhodopsin cycle harvests approximately 50% of the fundamental bound given by 𝒢\mathscr{G} (at in vivo values of Δψ\Delta\psi). This suggests that bacteriorhodopsin is remarkably close to optimal, relative to improvements that could be achieved by introducing any additional control processes.

We also show the actual steady-state distribution and the optimal distribution 𝒑\bm{p}^{*} in Fig. 3 (c). The optimal distribution increases the probability of state OO, similar to the optimal distribution found by optimizing the NON\leftrightarrow O transition, shown in Fig. 2 (c). However, unlike Fig. 2 (c), where most of the extra probability is taken from state NN, in Fig. 3 (c) the probability is drawn more uniformly from other states in the cycle, indicating the presence of distributed control.

V Limiting regimes

Our results are stated via an optimization problem that generally does not have a closed-form solution. In our second set of results, we identify closed-form expressions in three physically meaningful regimes. For simplicity, here we focus on the simplified objective (11). See SM-III [27] for detailed derivations, including analysis of the conditions under which each of these three approximation are be valid.

For convenience, we first rewrite (11) as

𝒢=max𝒑S˙b(𝒑)/β+ipiϕi,\displaystyle\mathscr{G}=\max_{\bm{p}}\,-\dot{S}^{b}(\bm{p})/\beta+\sum_{i}p_{i}\phi_{i}\,, (13)

where S˙b(𝒑)=i,jRijbpjlnpi\dot{S}^{b}(\bm{p})=-\sum_{i,j}R^{b}_{ij}p_{j}\ln p_{i} is the increase of the Shannon entropy of 𝒑\bm{p} under RbR^{b} and for convenience we defined ϕi:=g˙ib+jRjib(fjfi+gjib)\phi_{i}:=\dot{g}^{b}_{i}+\sum_{j}R^{b}_{ji}(f_{j}-f_{i}+{g}^{b}_{ji}). The objective (13) contains a nonlinear term S˙b(𝒑)/β-\dot{S}^{b}(\bm{p})/\beta quantifying the decrease of information-theoretic entropy and a linear term ipiϕi\sum_{i}p_{i}\phi_{i} quantifying the flow of thermodynamic free energy.

Next, we consider three regimes.

200200footnotetext: The dynamical activity refers to the overall rate of back-and-forth jumps across a transitions, Rijbπjb+RjibπibR^{b}_{ij}\pi^{b}_{j}+R^{b}_{ji}\pi^{b}_{i}.

Linear response (LR) applies when the optimal distribution 𝒑\bm{p}^{*} is close to the steady-state distribution of the baseline rate matrix RbR^{b}. Suppose that RbR^{b} is irreducible and has a unique steady state 𝝅b{\bm{\pi}^{b}} with full support. We introduce the “additive reversibilization” of RbR^{b},

Aij=(Rijb+Rjibπib/πjb)/2.\displaystyle A_{ij}=(R^{b}_{ij}+R^{b}_{ji}\pi^{b}_{i}/\pi^{b}_{j})/2\,. (14)

The rate matrix AA obeys detailed balance (DB) for the steady-state distribution 𝝅b{\bm{\pi}^{b}} and has the same dynamical activity [42] on all edges as RbR^{b}. AA may be considered as a DB version of RbR^{b}, and it is equal to RbR^{b} when the latter obeys DB [43, 44]. Let 𝒖α\boldsymbol{u}^{\alpha} indicate the α\alpha-th right eigenvector of AA normalized as i(uiα)2/πib=1\sum_{i}({u^{\alpha}_{i}})^{2}/\pi^{b}_{i}=1, and λα\lambda_{\alpha} the corresponding real-valued eigenvalue (λ1=0\lambda_{1}=0). The LR solution for the maximum harvesting rate and the optimal distribution is

𝒢𝒢˙b(𝝅b)+βα>1|Ωα|2λα𝒑𝝅b+βα>1Ωαλα𝒖α\displaystyle\begin{aligned} \mathscr{G}&\approx\dot{\mathcal{G}}^{b}({\bm{\pi}^{b}})+\beta\sum_{\alpha>1}\frac{\left|\Omega_{\alpha}\right|^{2}}{-\lambda_{\alpha}}\\ \bm{p}^{*}&\approx\quad{\bm{\pi}^{b}}\;\;\;+\beta\sum_{\alpha>1}\frac{\Omega_{\alpha}}{-\lambda_{\alpha}}\boldsymbol{u}^{\alpha}\end{aligned} (15)

where Ωα=(ϕ+β1RbTln𝝅b)T𝒖α/2\Omega_{\alpha}=(\boldsymbol{\phi}+{\beta}^{-1}{R^{b}}^{T}\ln{\bm{\pi}^{b}})^{T}\boldsymbol{u}^{\alpha}/2 quantifies the harvesting “amplitude” for mode α\alpha.

Eq. (15) has a simple interpretation. In addition to the baseline harvesting rate 𝒢˙b(𝝅b)\dot{\mathcal{G}}^{b}({\bm{\pi}^{b}}), 𝒢\mathscr{G} contains contributions from the relaxation modes of AA, with each mode weighed by its squared amplitude and relaxation timescale 1/λα-1/\lambda_{\alpha}. All else being equal, 𝒢\mathscr{G} is large when slow modes have large harvesting amplitudes. The optimal 𝒑\bm{p}^{*} shifts the baseline steady state 𝝅b{\bm{\pi}^{b}} toward mode α\alpha in proportion to that mode’s harvesting amplitude and relaxation timescale, thereby optimally balancing the tradeoff between harvesting and dissipation.

The Deterministic (D) regime applies when the nonlinear information-theoretic term in (13) is much smaller than the linear thermodynamic term. We can then ignore the former, turning (13) into a simple linear optimization. This gives the approximate solution

𝒢ϕipiδii\displaystyle\mathscr{G}\approx\phi_{{i}^{*}}\quad\qquad{p}^{*}_{i}\approx\delta_{{i}^{*}i}\, (16)

where i=argmaxiϕi{i}^{*}=\arg\max_{i}\phi_{i} is the optimal mesostate. This solution concentrates probability on a single mesostate, effectively ignoring the cost of maintaining this low-entropy distribution.

The Near-Deterministic (ND) regime lies between Linear Response and Deterministic ones. By perturbing 𝒑\bm{p}^{*} around δii\delta_{{i}^{*}i}, we can decouple the values of pip_{i} in the objective function (11). The maximal harvesting rate and optimal distribution in this regime are then given by

𝒢ϕi+β1iiRiib(lnpi1),pi{Riib/[β(ϕiϕi)+Riib]ii1i:iipii=i\displaystyle\begin{aligned} \mathscr{G}&\approx\phi_{{i}^{*}}+{\beta}^{-1}\sum_{i\neq{i}^{*}}R^{b}_{i{i}^{*}}(\ln{p}^{*}_{i}-1)\,,\\ {p}^{*}_{i}&\approx\begin{cases}{R^{b}_{i{i}^{*}}}/[{\beta(\phi_{{i}^{*}}-\phi_{i})+R^{b}_{{i}^{*}{i}^{*}}}]&\qquad i\neq{i}^{*}\\ 1-\sum_{i:i\neq{i}^{*}}{{p}^{*}_{i}}&\qquad i={i}^{*}\end{cases}\end{aligned} (17)

The ND solution also has a simple interpretation. It perturbs the Deterministic solution by shifting probability towards states with high transition rates away from the optimal state (large RiibR^{b}_{i{i}^{*}}) and small decreases in harvesting (ϕiϕi\phi_{{i}^{*}}-\phi_{i}). This balances the benefit of harvesting against the cost of pumping probability against RiibR^{b}_{i{i}^{*}}.

Refer to caption
Figure 4: (a) Unicyclic system where free energy Θ\Theta is harvested by a single transition. (b) Unicyclic system where free energy per unit time θ\theta is harvested when the system is in a particular mesostate.

VI Example: unicyclic systems

We illustrate our closed-form solutions using two simple models, both based on a unicyclic system with nn states. The baseline dynamics involve diffusion across a 1-dimensional ring, with left and right jump rates set to 1. The baseline steady state is a uniform distribution, πib=1/n\pi^{b}_{i}=1/n, with no cyclic current. We assume a uniform free energy function, fi=0f_{i}=0 for all ii.

We consider two different scenarios. In the first scenario, shown schematically in Fig. 4 (a), Θ\Theta of free energy is harvested each time the system carries out the transition 121\to 2, so

g21b=g12b=Θ,{g}^{b}_{21}=-{g}^{b}_{12}=\Theta\,,

and gijb=g˙ib=0{g}^{b}_{ij}=\dot{g}^{b}_{i}=0 otherwise. This scenario may be interpreted as an idealized model of a biomolecular harvesting cycle, such as a transporter. In the second scenario, shown schematically in Fig. 4 (b), free energy is harvested at a rate of θ\theta per unit time when the system is located in one particular mesostate i=1{i}^{*}=1, so

g˙1b=θ,\displaystyle\dot{g}^{b}_{1}=\theta\,, (18)

and gijb=g˙ib=0{g}^{b}_{ij}=\dot{g}^{b}_{i}=0 otherwise. This scenario may be interpreted as an idealized model of error correction or self-assembly, where free energy can only be harvested when the system is in some particular functional mesostate.

For both scenarios, we evaluate the maximum harvesting rate 𝒢\mathscr{G} and the optimal distributions achievable by adding any possible control to the system, without constraints. We report exact values found by numerical optimization of Eq. (13), as well as the LR, ND, and D approximations described above. To calculate the LR values, we exploit the fact that the baseline unicyclic rate matrix is a circulant matrix with a simple eigendecomposition [45]. Full details of the derivations for the two scenarios are provided in SM-IV [27] and SM-IV.3 [27], respectively.

Refer to caption
Figure 5: (a) Maximum harvesting rate 𝒢\mathscr{G} for the unicyclic system from Fig. 4 (a), as a function of supplied free energy Θ\Theta. Exact value is found numerically, LR, D, and ND are calculated using approximations described in the text. Exact and approximate optimal distributions 𝒑\bm{p}^{*} in ND (b) and LR (c) regimes are shown, with the optimal state i=1{i}^{*}=1 located in the middle of the histograms. (d) 𝒢\mathscr{G} and its LR approximation for different Θ\Theta and system sizes nn.
Refer to caption
Figure 6: (a) Maximum harvesting rate 𝒢\mathscr{G} for the unicyclic system from Fig. 4 (b), as a function of supplied free energy rate θ\theta. Exact value is found numerically, LR, D, and ND are calculated using approximations described in the text. Exact and approximate optimal distributions 𝒑\bm{p}^{*} in ND (b) and LR (c) regimes are shown, with the optimal state i=1{i}^{*}=1 located in the middle of the histograms. (d) 𝒢\mathscr{G} and its LR approximation for different θ\theta and system sizes nn.

We first report results for the first scenario from Fig. 4 (a), where free energy is harvested during the transition 212\to 1. Observe that baseline harvesting rate vanishes, 𝒢˙b(𝝅b)=0\dot{\mathcal{G}}^{b}({\bm{\pi}^{b}})=0, since harvesting free energy requires a cyclic current. In Fig. 5 (a), we plot the maximum harvesting rate 𝒢\mathscr{G} and its approximations as a function of the supplied free energy Θ\Theta. For small Θ\Theta, LR applies and the maximum harvesting rate is

𝒢βΘ2(n1)/4n2.\displaystyle\mathscr{G}\approx\beta\Theta^{2}{(n-1)}/{4n^{2}}. (19)

The optimal distribution in the LR regime, shown in Fig. 5 (c), builds up in equal increments starting from i=i+1i={i}^{*}+1 until the optimal state i=1{i}^{*}=1, after which it drops sharply. For large Θ\Theta, the D regime is relevant and the optimal distribution concentrates on the optimal state i=1{i}^{*}=1, so

𝒢Θ.\displaystyle\mathscr{G}\approx\Theta\,. (20)

At intermediate Θ\Theta, the ND regime applies, which gives

𝒢Θβ1{2+ln[2(βΘ1)(βΘ2)]}.\displaystyle\mathscr{G}\approx\Theta-{\beta}^{-1}\big{\{}2+\ln[2(\beta\Theta-1)(\beta\Theta-2)]\big{\}}. (21)

The optimal distribution in the ND regime, shown in Fig. 5 (b), allocates pi1=1/(βΘ2){p}^{*}_{{i}^{*}-1}=1/(\beta\Theta-2), pi+1=1/(2βΘ2){p}^{*}_{{i}^{*}+1}=1/(2\beta\Theta-2) and the rest to the optimal state pi{p}^{*}_{{i}^{*}}.

Next, we consider the second scenario from Fig. 4 (b), where free energy is harvested when the system is in the optimal mesostate i=1{i}^{*}=1. Observe that the uniform baseline steady state assigns 1/n1/n probability to the optimal state, thus in this scenario the baseline harvesting rate is 𝒢˙b(𝝅b)=θ/n\dot{\mathcal{G}}^{b}({\bm{\pi}^{b}})=\theta/n. To facilitate comparison with the first scenario, we focus on the increase of the maximum harvesting rate relative to baseline,

Δ𝒢:=𝒢𝒢˙b(𝝅b)=𝒢θ/n.\displaystyle\Delta\mathscr{G}:=\mathscr{G}-\dot{\mathcal{G}}^{b}({\bm{\pi}^{b}})=\mathscr{G}-\theta/n. (22)

In Fig. 6 (a), we show Δ𝒢\Delta\mathscr{G} and its approximations as a function of the free energy supply rate θ\theta. For small θ\theta, LR applies and the maximum harvesting rate is

Δ𝒢βθ2/48.\displaystyle\Delta\mathscr{G}\approx\beta\theta^{2}/48\,. (23)

The optimal distribution in the LR regime, shown in Fig. 6 (c), is symmetric about the optimal state i=1{i}^{*}=1. For large θ\theta, the D regime is relevant and the optimal distribution concentrates on the optimal state i=1{i}^{*}=1, so

Δ𝒢θθ/n.\displaystyle\Delta\mathscr{G}\approx\theta-\theta/n\,. (24)

At intermediate θ\theta, the ND regime applies, giving

Δ𝒢θθ/n2β1[1+ln(βθ2)].\displaystyle\Delta\mathscr{G}\approx\theta-\theta/n-2{\beta}^{-1}\left[1+\ln\left(\beta\theta-2\right)\right]. (25)

The optimal distribution in the ND regime, shown in Fig. 6 (b), allocates pi1=pi+1=1/(βθ2){p}^{*}_{{i}^{*}-1}={p}^{*}_{{i}^{*}+1}=1/(\beta\theta-2) and the rest to pi{p}^{*}_{{i}^{*}}.

There are some similarities among the two harvesting scenarios. For both scenarios, in the LR regime, the increase of the harvesting rate scales quadratically in the supplied free energy (Θ\Theta or θ\theta) and linearly in inverse temperature β\beta. This scaling reflects the fact that the optimal strategy has to balance harvesting (Θ\Theta or θ\theta contributions) with the thermodynamic cost of maintaining a low entropy 𝒑\bm{p}^{*} (β\beta contributions). In the ND and D regimes, 𝒢\mathscr{G} scales linearly in the supplied free energy and loses its linear dependence on β\beta. Thus, outside of LR, the thermodynamic cost of maintaining a low entropy distribution has a minor effect on the optimal strategy.

There are also important differences between the two scenarios. For the first scenario, the optimal strategy maintains an asymmetric 𝒑\bm{p}^{*}, thereby generating a net flux across the transition 212\to 1. In the LR regime, the cost of maintaining this asymmetric distribution grows with the system size nn, therefore the maximum harvesting rate in Eq. (19) scales as O(n1)\sim O(n^{-1}). This is shown in Fig. 5 (d), where we plot 𝒢\mathscr{G} and its LR approximation at various Θ\Theta and nn. For the second scenario, the optimal strategy maintains a peaked but symmetric 𝒑\bm{p}^{*}. Remarkably, the cost of maintaining this distribution does not depend on system size nn. This is shown in Fig. 6 (d), where we plot Δ𝒢\Delta\mathscr{G} at various θ\theta and nn.

VII Discussion

In this paper, we consider the problem of optimizing free energy harvesting in a nonequilibrium steady-state system. We demonstrate that this problem can be formulated as a constrained convex optimization problem, and we use this formulation to study optimal harvesting and efficiency in the bacteriorhodopsin proton pump. We also solve the convex optimization problem in closed-form for three limiting regimes, as illustrated on two unicyclic models discussed above.

A key step in our analysis is to separate the dynamics of the system into separate contributions from fixed baseline processes and optimizable control processes. We note that, in stochastic thermodynamics, the baseline/control separation has been previously used to study autonomous Maxwellian demons [46, 47], counterdiabatic driving [48], and the cost of maintaining a nonequilibrium steady state [49, 50].

We derive a simplified bound on the maximum harvesting rate in (13), which is achieved in the limit of fast dissipation-less control. Interestingly, this expression involves a tradeoff between two terms, one information-theoretic and one thermodynamic. At first glance, this resembles information/free-energy tradeoffs characteristic of Maxwellian demons and other “information engines” [51, 52, 53, 54, 55, 56, 57, 58]. However, there are important differences. In a typical information engine, there is no external source of driving and information serves as fuel, which can be converted into β1ln2{\beta}^{-1}\ln 2 of thermodynamic free energy per bit. In our case, there is an external source of free energy that in some cases can be harvested more effectively by reducing the system’s statistical entropy, e.g. by concentrating it on favorable states. Here, a bit of information can increase the harvesting rate by a very large amount (much larger than β1ln2/bit{\beta}^{-1}\ln 2/\text{bit}), and information acts more like a catalyst than a fuel [59, 60]. Loosely speaking, this is similar to how information encoded in the sequence of a metabolic enzyme is not itself fuel, but rather allows metabolism to harvest a large amount of fuel from elsewhere.

We finish by mentioning some connections to previous work and future directions. First, our approach may be related to prior work on optimizing power output and free energy transduction in steady-state molecular machines [61, 62, 63, 64, 65, 29, 66]. Here we consider the general problem of optimizing a set of control processes, given a fixed baseline and possible additional constraints on kinetics, topology, and thermodynamics. Previous work does not make the baseline/control distinction; instead, it is mostly concerned with the problem of optimizing system performance with respect to a small set of specific parameters or observables of interest, such as the location of free energy barriers [66, 62, 63, 65], efficiency [66], and the size of fluctuations [61, 64].

There is also an interesting relation between our work and flux balance analysis (FBA) [67, 68, 69]. The goal of FBA is to identify deterministic fluxes in biological metabolic systems that optimize biomass production, or other similar metrics of performance. This can be formulated as a linear program, which may include linear constraints that enforce thermodynamically favored reaction directions [69] (interestingly, in Ref. [70], the authors propose a version of FBA that also accounts for the entropy production rate). Our setting and optimization are different from FBA and its variants. We seek to optimize free energy harvesting at the stochastic level, and our objective involves nonlinear information-theoretic contributions to free energy. In addition, our optimization involves both the steady-state distribution 𝒑\bm{p} and fluxes 𝑱{\boldsymbol{J}}, which allows us to optimize harvesting due to to internal transitions within coarse-grained mesostates, as in Fig. 4 (b). Nonetheless, investigating the relationship between our approach and FBA is an interesting direction for future work.

Another interesting direction for future work is to consider stochastic fluctuations of free energy harvesting. In particular, the thermodynamic uncertainty relation may be used to study tradeoffs between the entropy production rate, the average harvesting rate (the quantity 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} considered here), and the fluctuations in the amount of harvested free energy [71, 64]. For biomolecular systems, large fluctuations in harvesting can lead to starvation, suggesting that minimizing fluctuations may be of significant biological importance.

Finally, an interesting direction for future work is to consider free energy harvesting in a system embedded in a fluctuating environment. For example, one may imagine a harvesting system in an environment with fluctuating sugar sources, or with a fluctuating amount of available light. In this setting, it is natural to optimize the harvesting rate under the topological constraint that control fluxes cannot directly change the state of the environment, for instance using bipartite models of Markovian dynamics [72]. It would be interesting to investigate how, under the optimal harvesting strategy, the information flow from the environment to the system varies with the abundance of free energy and complexity of the environment.

Acknowledgements.
We thank members of the Complex Systems Lab, B. Corominas-Murtra, L. Seoane, D. Wolpert, and D. Sowinski for useful discussions. A.K. also thanks Sosuke Ito for support and encouragement. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Grant Agreement No. 101068029. J.P. was supported by the María de Maeztú fellowship MDM-2014-0370-17-2 and Grant No. 62417 from the John Templeton Foundation. The opinions expressed in this publication are those of the authors and do not necessarily reflect the views of the John Templeton Foundation.

References

  • [1] T. L. Hill, Free energy transduction in biology: the steady-state kinetic and thermodynamic formalism. New York: Academic Press, 1977.
  • [2] H. Morowitz, Foundations of bioenergetics. Elsevier, 2012.
  • [3] A. J. Lotka, “Contribution to the energetics of evolution,” Proceedings of the National Academy of Sciences, vol. 8, no. 6, pp. 147–151, 1922.
  • [4] J. H. Brown, P. A. Marquet, and M. L. Taper, “Evolution of body size: consequences of an energetic definition of fitness,” The American Naturalist, vol. 142, no. 4, pp. 573–584, 1993.
  • [5] W. B. Watt, “Bioenergetics and evolutionary genetics: opportunities for new synthesis,” The American Naturalist, vol. 125, no. 1, pp. 118–143, 1985.
  • [6] O. P. Judson, “The energy expansions of evolution,” Nature ecology & evolution, vol. 1, no. 6, p. 0138, 2017.
  • [7] H.-J. Choi and C. D. Montemagno, “Artificial organelle: Atp synthesis from cellular mimetic polymersomes,” Nano letters, vol. 5, no. 12, pp. 2538–2542, 2005.
  • [8] K. Y. Lee, S.-J. Park, K. A. Lee, S.-H. Kim, H. Kim, Y. Meroz, L. Mahadevan, K.-H. Jung, T. K. Ahn, K. K. Parker, et al., “Photosynthetic artificial organelles sustain and control atp-dependent reactions in a protocellular system,” Nature biotechnology, vol. 36, no. 6, pp. 530–535, 2018.
  • [9] S. Berhanu, T. Ueda, and Y. Kuruma, “Artificial photosynthetic cell producing energy for protein synthesis,” Nature communications, vol. 10, no. 1, p. 1325, 2019.
  • [10] K. Villa and M. Pumera, “Fuel-free light-driven micro/nanomachines: Artificial active matter mimicking nature,” Chemical Society Reviews, vol. 48, no. 19, pp. 4966–4978, 2019.
  • [11] C. Kleineberg, C. Wölfer, A. Abbasnia, D. Pischel, C. Bednarz, I. Ivanov, T. Heitkamp, M. Börsch, K. Sundmacher, and T. Vidaković-Koch, “Light-driven atp regeneration in diblock/grafted hybrid vesicles,” ChemBioChem, vol. 21, no. 15, pp. 2149–2160, 2020.
  • [12] T. E. Miller, T. Beneyton, T. Schwander, C. Diehl, M. Girault, R. McLean, T. Chotel, P. Claus, N. S. Cortina, J.-C. Baret, et al., “Light-powered co2 fixation in a chloroplast mimic with natural and synthetic parts,” Science, vol. 368, no. 6491, pp. 649–654, 2020.
  • [13] C. Guindani, L. C. da Silva, S. Cao, T. Ivanov, and K. Landfester, “Synthetic cells: From simple bio-inspired modules to sophisticated integrated systems,” Angewandte Chemie, vol. 134, no. 16, p. e202110855, 2022.
  • [14] P. Albanese, F. Mavelli, and E. Altamura, “Light energy transduction in liposome-based artificial cells,” Frontiers in Bioengineering and Biotechnology, vol. 11, p. 1161730, 2023.
  • [15] N. S. Chandel, “Glycolysis,” Cold Spring Harbor Perspectives in Biology, vol. 13, no. 5, p. a040535, 2021.
  • [16] U. Seifert, “Stochastic thermodynamics, fluctuation theorems and molecular machines,” Reports on progress in physics, vol. 75, no. 12, p. 126001, 2012.
  • [17] J. C. Doyle, B. A. Francis, and A. R. Tannenbaum, Feedback control theory. Courier Corporation, 2013.
  • [18] J. K. Lanyi, “Bacteriorhodopsin,” Annu. Rev. Physiol., vol. 66, pp. 665–688, 2004.
  • [19] D. Goodsell, “Bacteriorhodopsin,” The RSCB PDB Molecule of the Month, vol. 3, 2002. doi:10.2210/rcsb_pdb/mom_2002_3.
  • [20] O. Béja, L. Aravind, E. V. Koonin, M. T. Suzuki, A. Hadd, L. P. Nguyen, S. B. Jovanovich, C. M. Gates, R. A. Feldman, J. L. Spudich, et al., “Bacterial rhodopsin: evidence for a new type of phototrophy in the sea,” Science, vol. 289, no. 5486, pp. 1902–1906, 2000.
  • [21] L. Gómez-Consarnau, J. A. Raven, N. M. Levine, L. S. Cutter, D. Wang, B. Seegers, J. Arístegui, J. A. Fuhrman, J. M. Gasol, and S. A. Sañudo-Wilhelmy, “Microbial rhodopsins are major contributors to the solar energy captured in the sea,” Science advances, vol. 5, no. 8, p. eaaw8855, 2019.
  • [22] J. Schnakenberg, “Network theory of microscopic and macroscopic behavior of master equation systems,” Reviews of Modern physics, vol. 48, no. 4, p. 571, 1976.
  • [23] M. Esposito and C. Van den Broeck, “Three faces of the second law. i. master equation formulation,” Physical Review E, vol. 82, no. 1, p. 011143, 2010.
  • [24] A. Wachtel, R. Rao, and M. Esposito, “Thermodynamically consistent coarse graining of biocatalysts beyond michaelis–menten,” New Journal of Physics, vol. 20, no. 4, p. 042002, 2018.
  • [25] A. Cornish-Bowden, Fundamentals of enzyme kinetics. John Wiley & Sons, 2013.
  • [26] J. M. Horowitz, T. Sagawa, and J. M. Parrondo, “Imitating chemical motors with optimal information motors,” Physical review letters, vol. 111, no. 1, p. 010602, 2013.
  • [27] See Supplemental Material at [URL will be inserted by publisher] for a derivation of our main results, a full description of the bacteriorhodopsin model and the details for the analyses of limiting regimes and unicyclic examples.
  • [28] C. Maes, “Local detailed balance,” SciPost Physics Lecture Notes, p. 032, 2021.
  • [29] A. I. Brown and D. A. Sivak, “Theory of nonequilibrium free energy transduction by molecular machines,” Chemical Reviews, vol. 120, pp. 434–459, Jan. 2020.
  • [30] S. P. Boyd and L. Vandenberghe, Convex optimization. Cambridge university press, 2004.
  • [31] G. Varo and J. K. Lanyi, “Thermodynamics and energy coupling in the bacteriorhodopsin photocycle,” Biochemistry, vol. 30, no. 20, pp. 5016–5022, 1991.
  • [32] V. A. Lórenz-Fonfría and H. Kandori, “Spectroscopic and kinetic evidence on how bacteriorhodopsin accomplishes vectorial proton transport under functional conditions,” Journal of the American Chemical Society, vol. 131, pp. 5891–5901, Apr. 2009.
  • [33] E. Penocchio, R. Rao, and M. Esposito, “Nonequilibrium thermodynamics of light-induced reactions,” The Journal of Chemical Physics, vol. 155, no. 11, 2021.
  • [34] J. K. Lanyi, “Light energy conversion in Halobacterium halobium,” Microbiological Reviews, vol. 42, no. 4, pp. 682–706, 1978.
  • [35] A. Miller and D. Oesterhelt, “Kinetic optimization of bacteriorhodopsin by aspartic acid 96 as an internal proton donor,” Biochimica et Biophysica Acta (BBA)-Bioenergetics, vol. 1020, no. 1, pp. 57–64, 1990.
  • [36] A. Seitz and N. Hampp, “Kinetic optimization of bacteriorhodopsin films for holographic interferometry,” The Journal of Physical Chemistry B, vol. 104, no. 30, pp. 7183–7192, 2000.
  • [37] K. J. Wise, N. B. Gillespie, J. A. Stuart, M. P. Krebs, and R. R. Birge, “Optimization of bacteriorhodopsin for bioelectronic devices,” Trends in biotechnology, vol. 20, no. 9, pp. 387–394, 2002.
  • [38] J. R. Hillebrecht, K. J. Wise, J. F. Koscielecki, and R. R. Birge, “Directed evolution of bacteriorhodopsin for device applications,” in Methods in enzymology, vol. 388, pp. 333–347, Elsevier, 2004.
  • [39] J. Tittor, M. Wahl, U. Schweiger, and D. Oesterhelt, “Specific acceleration of de-and reprotonation steps by azide in mutated bacteriorhodopsins,” Biochimica et Biophysica Acta (BBA)-Bioenergetics, vol. 1187, no. 2, pp. 191–197, 1994.
  • [40] S. P. Balashov, M. Lu, E. S. Imasheva, R. Govindjee, T. G. Ebrey, B. Othersen, Y. Chen, R. K. Crouch, and D. R. Menick, “The proton release group of bacteriorhodopsin controls the rate of the final step of its photocycle at low ph,” Biochemistry, vol. 38, no. 7, pp. 2026–2039, 1999.
  • [41] Q. Li, S. Bressler, D. Ovrutsky, M. Ottolenghi, N. Friedman, and M. Sheves, “On the protein residues that control the yield and kinetics of o630 in the photocycle of bacteriorhodopsin,” Biophysical Journal, vol. 78, no. 1, pp. 354–362, 2000.
  • [42] The dynamical activity refers to the overall rate of back-and-forth jumps across a transitions, Rijbπjb+RjibπibR^{b}_{ij}\pi^{b}_{j}+R^{b}_{ji}\pi^{b}_{i}.
  • [43] A. Kolchinsky, N. Ohga, and S. Ito, “Thermodynamic bound on spectral perturbations, with applications to oscillations and relaxation dynamics,” Physical Review Research, vol. 6, no. 1, p. 013082, 2024.
  • [44] J. A. Fill, “Eigenvalue bounds on convergence to stationarity for nonreversible Markov chains, with an application to the exclusion process,” The Annals of Applied Probability, vol. 1, no. 1, pp. 62–87, 1991.
  • [45] R. M. Gray et al., “Toeplitz and circulant matrices: A review,” Foundations and Trends® in Communications and Information Theory, vol. 2, no. 3, pp. 155–239, 2006.
  • [46] N. Shiraishi, S. Ito, K. Kawaguchi, and T. Sagawa, “Role of measurement-feedback separation in autonomous maxwell’s demons,” New Journal of Physics, vol. 17, no. 4, p. 045012, 2015.
  • [47] N. Shiraishi, T. Matsumoto, and T. Sagawa, “Measurement-feedback formalism meets information reservoirs,” New Journal of Physics, vol. 18, no. 1, p. 013044, 2016.
  • [48] K. Takahashi, K. Fujii, Y. Hino, and H. Hayakawa, “Nonadiabatic control of geometric pumping,” Physical Review Letters, vol. 124, no. 15, p. 150602, 2020.
  • [49] J. M. Horowitz and J. L. England, “Information-theoretic bound on the entropy production to maintain a classical nonequilibrium distribution using ancillary control,” Entropy, vol. 19, no. 7, p. 333, 2017.
  • [50] J. M. Horowitz, K. Zhou, and J. L. England, “Minimum energetic cost to maintain a target nonequilibrium state,” Physical Review E, vol. 95, no. 4, p. 042102, 2017.
  • [51] J. M. Parrondo, J. M. Horowitz, and T. Sagawa, “Thermodynamics of information,” Nature Physics, vol. 11, no. 2, pp. 131–139, 2015.
  • [52] T. Sagawa and M. Ueda, “Minimal energy cost for thermodynamic information processing: measurement and information erasure,” Physical review letters, vol. 102, no. 25, p. 250602, 2009.
  • [53] D. Abreu and U. Seifert, “Thermodynamics of genuine nonequilibrium states under feedback control,” Physical review letters, vol. 108, no. 3, p. 030601, 2012.
  • [54] F. J. Cao and M. Feito, “Thermodynamics of feedback controlled systems,” Physical Review E, vol. 79, no. 4, p. 041118, 2009.
  • [55] S. Ito and T. Sagawa, “Information thermodynamics on causal networks,” Physical review letters, vol. 111, no. 18, p. 180603, 2013.
  • [56] D. Hartich, A. C. Barato, and U. Seifert, “Stochastic thermodynamics of bipartite systems: transfer entropy inequalities and a Maxwell’s demon interpretation,” Journal of Statistical Mechanics: Theory and Experiment, vol. 2014, no. 2, p. P02016, 2014.
  • [57] A. C. Barato, D. Hartich, and U. Seifert, “Efficiency of cellular information processing,” New Journal of Physics, vol. 16, no. 10, p. 103024, 2014.
  • [58] P. Sartori, L. Granger, C. F. Lee, and J. M. Horowitz, “Thermodynamic costs of information processing in sensory adaptation,” PLoS Comput Biol, vol. 10, no. 12, p. e1003974, 2014.
  • [59] J. Barham, “A dynamical model of the meaning of information,” Biosystems, vol. 38, no. 2-3, pp. 235–241, 1996.
  • [60] A. Kolchinsky and D. H. Wolpert, “Semantic information, autonomous agency and non-equilibrium statistical physics,” Interface Focus, vol. 8, p. 20180041, Dec. 2018.
  • [61] P. Pietzonka, A. C. Barato, and U. Seifert, “Universal bound on the efficiency of molecular motors,” Journal of Statistical Mechanics: Theory and Experiment, vol. 2016, p. 124004, Dec. 2016.
  • [62] A. I. Brown and D. A. Sivak, “Allocating dissipation across a molecular machine cycle to maximize flux,” Proceedings of the National Academy of Sciences, vol. 114, no. 42, pp. 11057–11062, 2017.
  • [63] A. I. Brown and D. A. Sivak, “Allocating and splitting free energy to maximize molecular machine flux,” The Journal of Physical Chemistry B, vol. 122, no. 4, pp. 1387–1393, 2018.
  • [64] P. Pietzonka and U. Seifert, “Universal Trade-Off between Power, Efficiency, and Constancy in Steady-State Heat Engines,” Physical Review Letters, vol. 120, p. 190602, May 2018.
  • [65] J. A. Wagoner and K. A. Dill, “Mechanisms for achieving high speed and efficiency in biomolecular machines,” Proceedings of the National Academy of Sciences, vol. 116, no. 13, pp. 5902–5907, 2019.
  • [66] T. Schmiedl and U. Seifert, “Efficiency of molecular motors at maximum power,” EPL (Europhysics Letters), vol. 83, no. 3, p. 30005, 2008.
  • [67] D. A. Beard, S.-d. Liang, and H. Qian, “Energy balance for analysis of complex metabolic networks,” Biophysical journal, vol. 83, no. 1, pp. 79–86, 2002.
  • [68] K. J. Kauffman, P. Prakash, and J. S. Edwards, “Advances in flux balance analysis,” Current opinion in biotechnology, vol. 14, no. 5, pp. 491–496, 2003.
  • [69] M. Kschischo, “A gentle introduction to the thermodynamics of biochemical stoichiometric networks in steady state,” The European Physical Journal Special Topics, vol. 187, no. 1, pp. 255–274, 2010.
  • [70] R. M. Fleming, C. M. Maes, M. A. Saunders, Y. Ye, and B. Ø. Palsson, “A variational principle for computing nonequilibrium fluxes and potentials in genome-scale biochemical networks,” Journal of theoretical biology, vol. 292, pp. 71–77, 2012.
  • [71] T. R. Gingrich, J. M. Horowitz, N. Perunov, and J. L. England, “Dissipation bounds all steady-state current fluctuations,” Physical review letters, vol. 116, no. 12, p. 120601, 2016.
  • [72] J. M. Horowitz and M. Esposito, “Thermodynamics with continuous information flow,” Physical Review X, vol. 4, no. 3, p. 031015, 2014.
  • [73] M. Esposito, “Stochastic thermodynamics under coarse graining,” Physical Review E, vol. 85, no. 4, p. 041125, 2012.
  • [74] T. M. Cover and J. A. Thomas, Elements of information theory. John Wiley & Sons, 2006.
  • [75] E. Ilker, Ö. Güngör, B. Kuznets-Speck, J. Chiel, S. Deffner, and M. Hinczewski, “Shortcuts in stochastic systems and control of biophysical processes,” Physical Review X, vol. 12, no. 2, p. 021048, 2022.
  • [76] A. M. Ferreira and D. Bashford, “Model for proton transport coupled to protein conformational change: application to proton pumping in the bacteriorhodopsin photocycle,” Journal of the American Chemical Society, vol. 128, no. 51, pp. 16778–16790, 2006.
  • [77] J. K. Lanyi, “Halorhodopsin: a light-driven chloride ion pump,” Annual review of biophysics and biophysical chemistry, vol. 15, no. 1, pp. 11–28, 1986.
  • [78] S. Ahmed and I. R. Booth, “The use of valinomycin, nigericin and trichlorocarbanilide in control of the protonmotive force in escherichia coli cells,” Biochemical Journal, vol. 212, no. 1, pp. 105–112, 1983.
  • [79] C. Naslund, “Mathematics stackexchange. question 104967,” 2012. https://math.stackexchange.com/questions/104967/.
  • [80] Svyatoslav and C. Leibovici, “Mathematics stackexchange. question 4567421,” 2022. https://math.stackexchange.com/questions/4567421/.

Supplemental Material:

Optimization of nonequilibrium free energy harvesting illustrated on bacteriorhodopsin

Jordi Piñero, Ricard Solé, and Artemy Kolchinsky

I Derivations of main results

I.1 Derivation of (8) from LDB and steady-state assumption

Here we derive (8) in the main text, which reads as

𝒢˙tot=˙b(𝝅)+𝒢˙b(𝝅)Σ˙(𝑱c)/β.\displaystyle\dot{\mathcal{G}}^{\mathrm{tot}}=\dot{\mathcal{F}}^{b}(\bm{\pi})+\dot{\mathcal{G}}^{b}(\bm{\pi})-\dot{\Sigma}({\boldsymbol{J}}^{c})/\beta. (S1.1)

To derive this result,

˙c(𝒑)+𝒢˙c(𝒑)\displaystyle\dot{\mathcal{F}}^{c}(\bm{p})+\dot{\mathcal{G}}^{c}(\bm{p}) =i,jpiRjic(fj+β1lnpj)+i,jpiRjicgjic\displaystyle=\sum_{i,j}p_{i}R^{c}_{ji}(f_{j}+{\beta}^{-1}\ln p_{j})+\sum_{i,j}p_{i}R^{c}_{ji}{g}^{c}_{ji}
=β1i,jpiRjic[lnpj+β(fj+gjic)].\displaystyle={\beta}^{-1}\sum_{i,j}p_{i}R^{c}_{ji}[\ln p_{j}+\beta(f_{j}+{g}^{c}_{ji})]. (S1.2)

The first line follows from definitions made in the main text. Since RcR^{c} is a rate matrix, i,jpiRjicai=ipiaijRjic=0\sum_{i,j}p_{i}R^{c}_{ji}a_{i}=\sum_{i}p_{i}a_{i}\sum_{j}R^{c}_{ji}=0 for any 𝒂\bm{a}. This allows us to further rewrite (S1.2) as

˙c(𝒑)+𝒢˙c(𝒑)\displaystyle\dot{\mathcal{F}}^{c}(\bm{p})+\dot{\mathcal{G}}^{c}(\bm{p}) =β1i,jpiRjic[lnpjlnpi+β(fi+fj+gjic)]\displaystyle={\beta}^{-1}\sum_{i,j}p_{i}R^{c}_{ji}[\ln p_{j}-\ln p_{i}+\beta(-f_{i}+f_{j}+{g}^{c}_{ji})]
=β1i,jpiRjic[lnpjlnpiln(Rjic/Rijc)]\displaystyle={\beta}^{-1}\sum_{i,j}p_{i}R^{c}_{ji}[\ln p_{j}-\ln p_{i}-\ln(R^{c}_{ji}/R^{c}_{ij})] (S1.3)

In the second line, we plugged in the condition of LDB, as expressed in (6) in the main text:

ln(Rjic/Rijc)=β(fifjgjic)forRjic>0.\ln(R^{c}_{ji}/R^{c}_{ij})=\beta(f_{i}-f_{j}-{g}^{c}_{ji})\qquad\text{for}\quad R^{c}_{ji}>0. (S1.4)

Using the definition of one-way fluxes due to control given a generic distribution 𝒑\bm{p}, Jjic=piRjic{J^{c}_{ji}}=p_{i}R^{c}_{ji}, and the Schnackenberg expression of EPR Σ˙\dot{\Sigma}, we rewrite (S1.3) as

˙c(𝒑)+𝒢˙c(𝒑)=β1i,jpiRjiclnpjRjicpiRijc=β1ijJjiclnJjicJijc=β1Σ˙(𝑱c).\displaystyle\dot{\mathcal{F}}^{c}(\bm{p})+\dot{\mathcal{G}}^{c}(\bm{p})=-{\beta}^{-1}\sum_{i,j}p_{i}R^{c}_{ji}\ln\frac{p_{j}R^{c}_{ji}}{p_{i}R^{c}_{ij}}=-{\beta}^{-1}\sum_{i\neq j}{J^{c}_{ji}}\ln\frac{{J^{c}_{ji}}}{{J^{c}_{ij}}}=-{\beta}^{-1}\dot{\Sigma}({\boldsymbol{J}}^{c})\,. (S1.5)

In the main text, this equality is applied at the steady-state distribution 𝝅\bm{\pi}. However, Eq. (S1.5) is valid for all distributions 𝒑\bm{p} as long as (S1.4) holds.

To complete our derivation of (S1.1), recall that R𝝅=(Rb+Rc)𝝅=0R\bm{\pi}=(R^{b}+R^{c})\bm{\pi}=0 by definition of the steady-state 𝝅\bm{\pi}. At the same time, from the definition of ˙b(𝒑)\dot{\mathcal{F}}^{b}(\bm{p}) in (2), we have

˙b(𝝅)+˙c(𝝅)=j(fj+β1lnπj)i(Rjib+Rjic)πi=0˙b(𝝅)=˙c(𝝅).\displaystyle\dot{\mathcal{F}}^{b}(\bm{\pi})+\dot{\mathcal{F}}^{c}(\bm{\pi})=\sum_{j}(f_{j}+{\beta}^{-1}\ln\pi_{j})\sum_{i}(R^{b}_{ji}+R^{c}_{ji})\pi_{i}=0\qquad\implies\qquad\dot{\mathcal{F}}^{b}(\bm{\pi})=-\dot{\mathcal{F}}^{c}(\bm{\pi}). (S1.6)

Combining the definition of 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} with (S1.5) and (S1.6) gives Eq. (S1.1),

𝒢˙tot=𝒢˙b(𝝅)+𝒢˙c(𝝅)=𝒢˙b(𝝅)[˙c(𝝅)+β1Σ˙(𝑱c)]=˙b(𝝅)+𝒢˙b(𝝅)β1Σ˙(𝑱c).\displaystyle\dot{\mathcal{G}}^{\mathrm{tot}}=\dot{\mathcal{G}}^{b}(\bm{\pi})+\dot{\mathcal{G}}^{c}(\bm{\pi})=\dot{\mathcal{G}}^{b}(\bm{\pi})-\left[\dot{\mathcal{F}}^{c}(\bm{\pi})+{\beta}^{-1}\dot{\Sigma}({\boldsymbol{J}}^{c})\right]=\dot{\mathcal{F}}^{b}(\bm{\pi})+\dot{\mathcal{G}}^{b}(\bm{\pi})-{\beta}^{-1}\dot{\Sigma}({\boldsymbol{J}}^{c})\,. (S1.7)

Finally, note that the entropy production rate Σ˙\dot{\Sigma} is always nonnegative:

Σ˙(𝑱c)=12ij(pjRijcpiRjic)lnpjRijcpiRjic0,\displaystyle\dot{\Sigma}({\boldsymbol{J}}^{c})=\frac{1}{2}\sum_{i\neq j}(p_{j}R^{c}_{ij}-p_{i}R^{c}_{ji})\ln\frac{p_{j}R^{c}_{ij}}{p_{i}R^{c}_{ji}}\geq 0, (S1.8)

where the last inequality follows because the terms pjRijcpiRjicp_{j}R^{c}_{ij}-p_{i}R^{c}_{ji} and lnpjRijcpiRjic\ln\frac{p_{j}R^{c}_{ij}}{p_{i}R^{c}_{ji}} have the same sign. This can be seen as a statement of the Second Law.

Let us briefly comment on the physical meaning of our derivation. First, we note that LDB holds when the control dynamics exhibit a separation of timescales, with fast equilibration within each mesostate and slower transitions between mesostates [73].

Second, recall our assumption that the control processes do not interact directly with the external source of free energy, but only with the internal reservoir and heat bath. This assumption is formalized in the particular form of the LDB condition (S1.4). It means that for the control processes, the combined “system+internal reservoir” may be treated as a single system coupled only to a heat bath at inverse temperature β\beta. Hence, the rate of entropy production due to control is simply β\beta times the decrease of the combined free energy of the system and internal reservoir, as in (S1.5). Moreover, owing to Second Law, their combined free energy cannot increase under control dynamics [26, 29],

˙c(𝒑)+𝒢˙c(𝒑)0for all𝒑.\displaystyle\dot{\mathcal{F}}^{c}(\bm{p})+\dot{\mathcal{G}}^{c}(\bm{p})\leq 0\qquad\text{for all}\quad\bm{p}. (S1.9)

I.2 Concavity of constrained optimization (10)

Consider the objective ˙b(𝒑)+𝒢˙b(𝒑)Σ˙(𝑱)/β\dot{\mathcal{F}}^{b}(\bm{p})+\dot{\mathcal{G}}^{b}(\bm{p})-\dot{\Sigma}({\boldsymbol{J}})/\beta in (10) in the main text. In Sec. I.3 below, we show that the term ˙b(𝒑)+𝒢˙b(𝒑)\dot{\mathcal{F}}^{b}(\bm{p})+\dot{\mathcal{G}}^{b}(\bm{p}) is concave in 𝒑\bm{p}. Here we show that Σ˙(𝑱)\dot{\Sigma}({\boldsymbol{J}}) is convex in 𝑱{\boldsymbol{J}}. This shows that the overall objective is concave in the pair (𝒑,𝑱)(\bm{p},{\boldsymbol{J}}).

Consider any pair of feasible fluxes 𝑱𝒀{\boldsymbol{J}}\neq\boldsymbol{Y} and λ(0,1)\lambda\in(0,1) and let 𝑱(λ)=λ𝑱+(1λ)𝒀{\boldsymbol{J}}(\lambda)=\lambda{\boldsymbol{J}}+(1-\lambda)\boldsymbol{Y} indicate their convex mixture. To prove convexity, we write

λΣ˙(𝑱)+(1λ)Σ˙(𝒀)\displaystyle\lambda\dot{\Sigma}({\boldsymbol{J}})+(1-\lambda)\dot{\Sigma}(\boldsymbol{Y}) =λijJjilnJjiJij+(1λ)ijYjilnYjiYij\displaystyle=\lambda\sum_{i\neq j}J_{ji}\ln\frac{J_{ji}}{J_{ij}}+(1-\lambda)\sum_{i\neq j}{Y}_{ji}\ln\frac{{Y}_{ji}}{{Y}_{ij}}
ijJji(λ)lnJji(λ)Jij(λ)=Σ˙(𝑱(λ)).\displaystyle\geq\sum_{i\neq j}J_{ji}(\lambda)\ln\frac{J_{ji}(\lambda)}{J_{ij}(\lambda)}=\dot{\Sigma}({\boldsymbol{J}}(\lambda)). (S1.10)

Here we used the log-sum inequality [74, Thm. 2.7.1],

λalnaa+(1λ)blnbb[λa+(1λ)b]lnλa+(1λ)bλa+(1λ)bfor all a,a,b,b0.\displaystyle\lambda a\ln\frac{a}{a^{\prime}}+(1-\lambda)b\ln\frac{b}{b^{\prime}}\geq[\lambda a+(1-\lambda)b]\ln\frac{\lambda a+(1-\lambda)b}{\lambda a^{\prime}+(1-\lambda)b^{\prime}}\qquad\qquad\text{for all }a,a^{\prime},b,b^{\prime}\geq 0\,. (S1.11)

I.3 Properties of the unconstrained optimization (11)

I.3.1 Concavity and existence of maximizer

Consider the objective in (11), as equivalently written in (13):

(𝒑)=˙b(𝒑)+𝒢˙b(𝒑)=S˙b(𝒑)+ipiϕi,\displaystyle\mathscr{L}(\bm{p})=\dot{\mathcal{F}}^{b}(\bm{p})+\dot{\mathcal{G}}^{b}(\bm{p})=-\dot{S}^{b}(\bm{p})+\sum_{i}p_{i}\phi_{i}, (S1.12)

where S˙b(𝒑)=i,jpiRjiblnpj\dot{S}^{b}(\bm{p})=-\sum_{i,j}p_{i}R^{b}_{ji}\ln p_{j} is the rate of increase of the Shannon entropy of distribution 𝒑\bm{p} under the rate matrix RbR^{b}. Here we show that (𝒑)\mathscr{L}(\bm{p}) is concave, thus (11) involves the maximization of a concave function. This is equivalent to the minimization of a convex function, making it convex optimization problem which can be solved using standard numerical techniques. We also show the existence of a maximizer

𝒑argmax𝒑(𝒑).\bm{p}^{*}\in\operatorname*{arg\,max}_{\bm{p}}\mathscr{L}(\bm{p})\,.

Existence of 𝐩\bm{p}^{*}: The feasible set is compact (nn-dimensional probability simplex) and the objective is continuous, so the maximizer exists by the extreme value theorem. Also, the maximum value is finite (see (S3.86)).

Concavity: Consider any pair of distributions 𝒑𝒒\bm{p}\neq\bm{q} and λ(0,1)\lambda\in(0,1), and let 𝒑(λ)=λ𝒑+(1λ)𝒒\bm{p}(\lambda)=\lambda\bm{p}+(1-\lambda)\bm{q} indicate their convex mixture. We will show that the objective is concave:

(𝒑(λ))λ(𝒑)+(1λ)(𝒒).\mathscr{L}(\bm{p}(\lambda))\geq\lambda\mathscr{L}(\bm{p})+(1-\lambda)\mathscr{L}(\bm{q}). (S1.13)

Observe that ipiϕi\sum_{i}p_{i}\phi_{i} is linear in 𝒑\bm{p}, thus we can use (S1.12) to rearrange (S1.13) as

S˙b(𝒑(λ))λS˙b(𝒑)+(1λ)S˙b(𝒒).\dot{S}^{b}(\bm{p}(\lambda))\leq\lambda\dot{S}^{b}(\bm{p})+(1-\lambda)\dot{S}^{b}(\bm{q}). (S1.14)

To prove (S1.14), we first rewrite the rate of decrease of the Shannon entropy as

S˙b(𝒑)=i,jpiRjiblnpj=i,jpiRjiblnpjpi=ijpiRjiblnpjpi=ijpiRjiblnpipj.\dot{S}^{b}(\bm{p})=-\sum_{i,j}p_{i}R^{b}_{ji}\ln p_{j}=-\sum_{i,j}p_{i}R^{b}_{ji}\ln\frac{p_{j}}{p_{i}}=-\sum_{i\neq j}p_{i}R^{b}_{ji}\ln\frac{p_{j}}{p_{i}}=\sum_{i\neq j}p_{i}R^{b}_{ji}\ln\frac{p_{i}}{p_{j}}. (S1.15)

Here we first used that i,jpiRjibai=iaipijRjib=0\sum_{i,j}p_{i}R^{b}_{ji}a_{i}=\sum_{i}a_{i}p_{i}\sum_{j}R^{b}_{ji}=0 for any 𝒂\bm{a}, then used that the diagonal terms of the sum (i=ji=j) vanish. Next, we use (S1.15) to prove (S1.14) as

S˙b(𝒑(λ))=ijpi(λ)Rjiblnpi(λ)pj(λ)\displaystyle\dot{S}^{b}(\bm{p}(\lambda))=\sum_{i\neq j}p_{i}(\lambda)R^{b}_{ji}\ln\frac{p_{i}(\lambda)}{p_{j}(\lambda)} =ijpi(λ)Rjiblnpi(λ)Rjibpj(λ)Rjib\displaystyle=\sum_{i\neq j}p_{i}(\lambda)R^{b}_{ji}\ln\frac{p_{i}(\lambda)R^{b}_{ji}}{p_{j}(\lambda)R^{b}_{ji}}
ij[λpiRjiblnpiRjibpjRjib+(1λ)qiRjiblnqiRjibqjRjib]\displaystyle\leq\sum_{i\neq j}\left[\lambda p_{i}R^{b}_{ji}\ln\frac{p_{i}R^{b}_{ji}}{p_{j}R^{b}_{ji}}+(1-\lambda)q_{i}R^{b}_{ji}\ln\frac{q_{i}R^{b}_{ji}}{q_{j}R^{b}_{ji}}\right] (S1.16)
=λijpiRjiblnpjpi+(1λ)ijqiRjiblnqjqi\displaystyle=\lambda\sum_{i\neq j}p_{i}R^{b}_{ji}\ln\frac{p_{j}}{p_{i}}+(1-\lambda)\sum_{i\neq j}q_{i}R^{b}_{ji}\ln\frac{q_{j}}{q_{i}}
=λS˙b(𝒑)+(1λ)S˙b(𝒒).\displaystyle=\lambda\dot{S}^{b}(\bm{p})+(1-\lambda)\dot{S}^{b}(\bm{q})\,.

The second line follows from the log-sum inequality (S1.11).

I.3.2 Strict concavity and uniqueness of maximizer under irreducibility assumption

We can prove further results under the assumption that the baseline rate matrix RbR^{b} is irreducible (i.e., there is a path of non-zero transitions connecting any two states) and has a steady-state with full support (πi>0\pi_{i}>0 for all ii). In graph theoretic terms, this means that the graph of allowed transitions under RbR^{b} is strongly connected. Specifically, we prove that our objective is strictly concave, and that the optimizer 𝒑\bm{p}^{*} is unique and has full support.

Strict concavity: The log-sum inequality (S1.11) is strict when a/ab/ba/a^{\prime}\neq b/b^{\prime} [74, Thm. 2.7.1]. In our case, (S1.16) is strict for any pair of states jij\neq i that have Rjib>0R^{b}_{ji}>0 and

piRjibpjRjibqiRjibqjRjib.\displaystyle\frac{p_{i}R^{b}_{ji}}{p_{j}R^{b}_{ji}}\neq\frac{q_{i}R^{b}_{ji}}{q_{j}R^{b}_{ji}}. (S1.17)

Such a pair of states must exist, as we now prove by contradiction. Suppose to the contrary that (S1.17) is an equality for all iji\neq j where Rjib>0R^{b}_{ji}>0. Consider a walk on the graph defined by the allowed transitions in RbR^{b} — that is, a sequence of states i0,i1,i2,i_{0},i_{1},i_{2},\dots such that Rik+1ikb>0R^{b}_{i_{k+1}i_{k}}>0. We would then have that pi0/pi1=qi0/qi1,pi1/pi2=qi1/qi2,p_{i_{0}}/p_{i_{1}}=q_{i_{0}}/q_{i_{1}},p_{i_{1}}/p_{i_{2}}=q_{i_{1}}/q_{i_{2}},\dots, hence also (by multiplication) that pi0/pi2=qi0/qi2,p_{i_{0}}/p_{i_{2}}=q_{i_{0}}/q_{i_{2}},\dots. Now, since RbR^{b} is irreducible, any state can be reached from any other, implying that pi/pj=qi/qjp_{i}/p_{j}=q_{i}/q_{j} for all iji\neq j, hence 𝒑𝒒\bm{p}\propto\bm{q}. Since probabilities are nonnegative and normalized to sum to 1, this can only hold when 𝒑=𝒒\bm{p}=\bm{q}, contradicting our assumption that 𝒑𝒒\bm{p}\neq\bm{q} in (S1.16).

To summarize, there must be some iji\neq j such that the corresponding term in (S1.16) is a strict inequality, therefore

(𝒑(λ))>λ(𝒑)+(1λ)(𝒒).\mathscr{L}(\bm{p}(\lambda))>\lambda\mathscr{L}(\bm{p})+(1-\lambda)\mathscr{L}(\bm{q}). (S1.18)

Uniqueness of maximizer: Suppose that there are two different maximizers, (𝒑(1))=(𝒑(2))=𝒢\mathscr{L}(\bm{p}^{*}_{(1)})=\mathscr{L}(\bm{p}^{*}_{(2)})=\mathscr{G}. Then, (S1.18) implies that the convex mixture λ𝒑(1)+(1λ)𝒑(2)\lambda\bm{p}^{*}_{(1)}+(1-\lambda)\bm{p}^{*}_{(2)} would achieve a larger value than 𝒢\mathscr{G}, leading to a contradiction.

Full support of 𝐩\bm{p}^{*}: Suppose that the maximizer 𝒑\bm{p}^{*} did not have full support, so that some states have 0 probability. Then, there must be some pair iji\neq j such that pi>0p_{i}^{*}>0 and pj=0p_{j}^{*}=0 and Rjib>0R^{b}_{ji}>0 (otherwise the graph of allowed transitions would not be strongly connected). But for this pair, piRjibln(pj/pi)=p_{i}^{*}R^{b}_{ji}\ln({p_{j}^{*}}/{p_{i}^{*}})=-\infty. Plugging into (S1.15) implies that (𝒑)=\mathscr{L}(\bm{p}^{*})=-\infty and contradicting the fact that 𝒑\bm{p}^{*} is a maximizer. Hence, 𝒑\bm{p}^{*} must have full support.

I.4 Derivation and achievability of unconstrained optimization (11)

In this section, we derive the unconstrained optimization (11) from the constrained optimization (10).

First, observe that that the maximum in (10) is always less than the maximum in (11):

𝒢:=sup(𝒑,𝑱)Λ:𝔹𝑱=Rb𝒑˙b(𝒑)+𝒢˙b(𝒑)Σ˙(𝑱)/β𝒢:=max𝒑˙b(𝒑)+𝒢˙b(𝒑).\displaystyle\mathscr{G}:=\sup_{(\bm{p},{\boldsymbol{J}})\in\Lambda:\mathbb{B}{\boldsymbol{J}}=-R^{b}\bm{p}}\,\dot{\mathcal{F}}^{b}(\bm{p})+\dot{\mathcal{G}}^{b}(\bm{p})-\dot{\Sigma}({\boldsymbol{J}})/\beta\quad\leq\quad\mathscr{G}^{\prime}:=\max_{\bm{p}}\,\dot{\mathcal{F}}^{b}(\bm{p})+\dot{\mathcal{G}}^{b}(\bm{p})\,.

This follows from nonnegativity of Σ˙(𝑱)\dot{\Sigma}({\boldsymbol{J}}) and the fact that (11) has less constraints than (10).

We will show that 𝒢=𝒢\mathscr{G}=\mathscr{G}^{\prime} as long as the feasible set Λ\Lambda is not too constrained. In addition to the basic normalization and nonnegativity constraints on 𝒑\bm{p} and 𝑱{\boldsymbol{J}}, we allow Λ\Lambda to encode a set of topological constraints like

Jji=0if Gji=0,\displaystyle J_{ji}=0\qquad\text{if }G_{ji}=0\,, (S1.19)

for iji\neq j, where GG is a symmetric matrix that determines the topology of the control dynamics (Gji=Gij=1G_{ji}=G_{ij}=1 when the transition iji\leftrightarrow j is allowed, and otherwise Gji=Gij=0G_{ji}=G_{ij}=0). We assume that GG is connected and contains all nn states. Of course, we may have Gji=1G_{ji}=1 for all iji\neq j, in which case no topological constraints are imposed. We assume that Λ\Lambda includes no other constraints.

Given these assumptions, we construct a sequence of control process (Rc(κ),𝒈c)(R^{c}(\kappa),\bm{g}^{c}) which satisfy LDB such that:

  1. 1.

    The combined steady state 𝝅(κ)\bm{\pi}(\kappa) of Rb+Rc(κ)R^{b}+R^{c}(\kappa) and control fluxes Jijc(κ)=πj(κ)Rijc(κ){J^{c}_{ij}}(\kappa)=\pi_{j}(\kappa)R^{c}_{ij}(\kappa) belong to Λ\Lambda for all κ\kappa.

  2. 2.

    The combined steady state obeys limκ𝝅(κ)=𝒑\lim_{\kappa\to\infty}\bm{\pi}(\kappa)=\bm{p}^{*}, where 𝒑\bm{p}^{*} is the optimizer of (11).

  3. 3.

    The steady-state EPR vanishes limκ0Σ˙(κ)=0\lim_{\kappa\to 0}\dot{\Sigma}(\kappa)=0, therefore the steady-state harvesting rate obeys

    𝒢limκ𝒢˙tot(κ)=𝒢.\mathscr{G}\geq\lim_{\kappa\to\infty}\dot{\mathcal{G}}^{\mathrm{tot}}(\kappa)=\mathscr{G}^{\prime}.

Our proof technique is related to the idea sketched out in Ref. [50], which studied the minimal cost of maintaining a nonequilibrium steady state. It is also related to constructions from the literature on counterdiabatic driving [75].

I.4.1 Construction of optimal control

We define a sequence of control processes parameterized by κ>0\kappa>0. For each control process, the free energy exchanges with the internal reservoir are set to

gjic=fifj+β1(lnpilnpj).\displaystyle{g}^{c}_{ji}=f_{i}-f_{j}+{\beta}^{-1}(\ln p_{i}^{*}-\ln p_{j}^{*}). (S1.20)

The control rate matrix Rc(κ)R^{c}(\kappa) is defined by scaling a given rate matrix BB

Rjic(κ):=κBjiBji:=Gji1+eβ(fifjgjic)=κGji1+pj/pi.\displaystyle R^{c}_{ji}(\kappa):=\kappa B_{ji}\qquad\qquad B_{ji}:=\frac{G_{ji}}{1+e^{\beta(f_{i}-f_{j}-{g}^{c}_{ji})}}=\kappa\frac{G_{ji}}{1+p_{j}^{*}/p_{i}^{*}}\,. (S1.21)

Here κ0\kappa\geq 0 is an overall rate constant and GG is the matrix discussed in (S1.19). The particular choice of the topology encoded in GG is arbitrary, given our assumption that it is connected and contains all nn states which guarantees that RcR^{c} is irreducible. It can be verified that the rate matrix Rc(κ)R^{c}(\kappa) defined in (S1.21) obeys LDB (6). It also satisfies detailed balance (DB) for the distribution 𝒑\bm{p}^{*},

Rjic(κ)pi=Rijc(κ)pj,\displaystyle R^{c}_{ji}(\kappa){p}^{*}_{i}=R^{c}_{ij}(\kappa){p}^{*}_{j}, (S1.22)

This implies that 𝒑\bm{p}^{*} is the unique steady-state distribution, Rc(κ)𝒑=0R^{c}(\kappa)\bm{p}^{*}=0.

We note that there are various other types of rate matrices that can be used in the construction, as long as they satisfy LDB and DB. However, the parameterization used in (S1.21) is common in the literature [76].

I.4.2 Achievability of the steady state 𝒑\bm{p}^{*}

We show that 𝝅(κ)\bm{\pi}(\kappa), the steady-state of the combined rate matrix of R(κ):=Rb+Rc(κ)R({\kappa}):=R^{b}+R^{c}(\kappa), approaches 𝒑\bm{p}^{*} in the limit of fast control (large κ\kappa). Since [Rb+Rc(κ)]𝝅(κ)=0[R^{b}+R^{c}(\kappa)]\bm{\pi}({\kappa})=0 and Rc(κ)𝒑=0R^{c}(\kappa)\bm{p}^{*}=0, we have

1κRb𝝅(κ)=1κRc(κ)𝝅(κ)=1κRc(κ)(𝒑𝝅(κ))=B(𝒑𝝅(κ)).\displaystyle\frac{1}{\kappa}R^{b}\bm{\pi}({\kappa})=-\frac{1}{\kappa}R^{c}(\kappa)\bm{\pi}({\kappa})=\frac{1}{\kappa}R^{c}(\kappa)\left(\bm{p}^{*}-\bm{\pi}({\kappa})\right)=B(\bm{p}^{*}-\bm{\pi}({\kappa})). (S1.23)

Rearranging and applying the Moore-Penrose inverse B+B^{+} to both sides gives

1κB+Rb𝝅(κ)=B+B(𝒑𝝅(κ)).\displaystyle\frac{1}{\kappa}B^{+}R^{b}\bm{\pi}({\kappa})=B^{+}B\left(\bm{p}^{*}-\bm{\pi}({\kappa})\right)\,. (S1.24)

Since BB is irreducible by construction, B+B=I𝒑𝟏TB^{+}B=I-\bm{p}^{*}\boldsymbol{1}^{T}. Plugging into (S1.24), we obtain

1κB+Rb𝝅(κ)=(I𝒑𝟏T)(𝒑𝝅(κ))=𝒑𝝅(κ),\displaystyle\frac{1}{\kappa}B^{+}R^{b}\bm{\pi}({\kappa})=(I-\bm{p}^{*}\boldsymbol{1}^{T})(\bm{p}^{*}-\bm{\pi}({\kappa}))=\bm{p}^{*}-\bm{\pi}({\kappa}), (S1.25)

which follows from 𝟏T𝒑=𝟏T𝝅(κ)=1\boldsymbol{1}^{T}\bm{p}^{*}=\boldsymbol{1}^{T}\bm{\pi}({\kappa})=1. Taking norms gives

𝒑𝝅(κ)=1κB+Rb𝝅(κ)1κB+Rb𝝅(κ)1κB+Rb,\displaystyle\left\|\bm{p}^{*}-\bm{\pi}({\kappa})\right\|=\frac{1}{\kappa}\|B^{+}R^{b}\bm{\pi}({\kappa})\|\leq\frac{1}{\kappa}\|B^{+}R^{b}\|\|\bm{\pi}({\kappa})\|\leq\frac{1}{\kappa}\|B^{+}R^{b}\|, (S1.26)

where in the last step we used that the norm of any probability distribution is bounded by 1. This shows that limκ𝒑𝝅(κ)=0\lim_{\kappa\to\infty}\left\|\bm{p}^{*}-\bm{\pi}({\kappa})\right\|=0, meaning that the combined steady-state converges to 𝒑\bm{p}^{*}.

I.4.3 EPR vanishes

The steady-state EPR incurred by rate matrix Rc(κ)=κBR^{c}(\kappa)=\kappa B is

Σ˙(κ)=κ2ij(πj(κ)Bijπi(κ)Bji)lnπj(κ)Bijπi(κ)Bji.\displaystyle\dot{\Sigma}(\kappa)=\frac{\kappa}{2}\sum_{i\neq j}(\pi_{j}(\kappa)B_{ij}-\pi_{i}(\kappa)B_{ji})\ln\frac{\pi_{j}(\kappa)B_{ij}}{\pi_{i}(\kappa)B_{ji}}\,. (S1.27)

Next, we use (S1.25) to write 𝒑=𝝅(κ)+1κB+Rb𝝅(κ)\bm{p}^{*}=\bm{\pi}({\kappa})+\frac{1}{\kappa}B^{+}R^{b}\bm{\pi}({\kappa}). Using the DB condition for 𝒑\bm{p}^{*} with respect to Rc(κ)=κBR^{c}(\kappa)=\kappa B obtained in (S1.22), we rearrange terms to find:

πi(κ)Bjiπj(κ)Bij=1κ[Bij(B+Rb𝝅(κ))jBji(B+Rb𝝅(κ))i]\displaystyle\pi_{i}(\kappa)B_{ji}-\pi_{j}(\kappa)B_{ij}=\frac{1}{\kappa}\left[B_{ij}\left(B^{+}R^{b}\bm{\pi}({\kappa})\right)_{j}-B_{ji}\left(B^{+}R^{b}\bm{\pi}({\kappa})\right)_{i}\right] (S1.28)

Plugging into (S1.27) gives

Σ˙(κ)=12ij[Bji(B+Rb𝝅(κ))iBij(B+Rb𝝅(κ))j]lnπj(κ)Bijπi(κ)Bji.\displaystyle\dot{\Sigma}(\kappa)=\frac{1}{2}\sum_{i\neq j}\left[B_{ji}\left(B^{+}R^{b}\bm{\pi}({\kappa})\right)_{i}-B_{ij}\left(B^{+}R^{b}\bm{\pi}({\kappa})\right)_{j}\right]\ln\frac{\pi_{j}(\kappa)B_{ij}}{\pi_{i}(\kappa)B_{ji}}. (S1.29)

Taking the limit κ\kappa\to\infty shows that EPR vanishes:

limκΣ˙(κ)=12ij[Bji(B+Rb𝒑)iBij(B+Rb𝒑)j]lnpjRijcpiRjic=0,\displaystyle\lim_{\kappa\to\infty}\dot{\Sigma}(\kappa)=\frac{1}{2}\sum_{i\neq j}\left[B_{ji}\left(B^{+}R^{b}\bm{p}^{*}\right)_{i}-B_{ij}\left(B^{+}R^{b}\bm{p}^{*}\right)_{j}\right]\ln\frac{p^{*}_{j}R^{c}_{ij}}{p^{*}_{i}R^{c}_{ji}}=0,

where we used limκ𝝅(κ)=𝒑\lim_{\kappa\to\infty}\bm{\pi}({\kappa})=\bm{p}^{*} and the DB condition (S1.22) in the logarithmic factor.

II Bacteriorhodopsin model

II.1 Details of model

Here we provide thermodynamic and kinetic details of the bacteriorhodopsin model analyzed in the main text.

The thermodynamic parameters are taken from Ref. [31], which reports in vitro measurements of internal free energies at 293K=20C293^{\circ}\,\text{K}=20^{\circ}\,\text{C}. Based on Fig. 7 in that paper, we use the following internal free energies for the 6 cycle states:

𝒇\displaystyle\bm{f} \displaystyle\equiv (fK,\displaystyle(f_{K}, fL,\displaystyle f_{L}, fM1,\displaystyle f_{M_{1}}, fM2,\displaystyle f_{M_{2}}, fN,\displaystyle f_{N}, fO\displaystyle f_{O} )\displaystyle\!\!\!\!\!\!)
=\displaystyle= (34.41,\displaystyle(34.41, 27.96,\displaystyle 27.96, 29.57,\displaystyle 29.57, 13.98,\displaystyle 13.98, 13.17,\displaystyle 13.17, 14.78\displaystyle 14.78 )kJ mol1\displaystyle\!\!\!\!\!\!)\;\;\;\text{kJ mol}^{-1}
=\displaystyle= (5.71×1020,\displaystyle(5.71\times 10^{-20}, 4.64×1020,\displaystyle\!\!\!4.64\times 10^{-20}, 4.91×1020,\displaystyle\!\!\!4.91\times 10^{-20}, 2.32×1020,\displaystyle\!\!\!2.32\times 10^{-20}, 2.19×1020,\displaystyle\!\!\!2.19\times 10^{-20}, 2.45×1020\displaystyle\!\!\!2.45\times 10^{-20} )joules\displaystyle\!\!\!\!\!\!)\;\;\;\text{joules} (S2.30)

All values are referenced from the ground state, in other words the zero point refers to the internal free energy of the ground state bR\mathrm{bR} [31]. However, since we coarse-grain the transitions ObRO\to\mathrm{bR} and bRK\mathrm{bR}\to K into a single transition OKO\to K, we do not include this ground state in our model. See Fig. S7 for an illustration of the coarse-graining.

Refer to caption
Figure S7: Bacteriorhodopsin cycle. Using the irreversible character and fast speed of the bRK\mathrm{bR}\to K transition, we coarse-grain the cycle from the original seven-state system (left) to a six-state one (right). Circled region in gray shows the states and transition that is coarse-grained into a single state.

We use gjig_{ji} to indicate the free energy transferred to the internal reservoir by the jump from state i{K,L,M1,M2,N,O}i\in\{K,L,M_{1},M_{2},N,O\} to state j{K,L,M1,M2,N,O}j\in\{K,L,M_{1},M_{2},N,O\}. Only transitions between neighboring states in the cycle are permitted: KLK\leftrightarrow L, LM1L\leftrightarrow M_{1}, M1M2M_{1}\leftrightarrow M_{2}, M2NM_{2}\leftrightarrow N, NON\leftrightarrow O, and OKO\leftrightarrow K. The values of gjig_{ji} are zero for all transitions except for the transition M1M2M_{1}\leftrightarrow M_{2}, corresponding to proton transport across the membrane. This value is determined via Eq. (12) as a function of the electrochemical potential (a.k.a. proton motive force), which includes contributions from electrical potential and pH difference across the membrane. To summarize:

gLK=gKL=gM1L=gLM1=gNM2=gM2N=gON=gNO=gKO=gOK=0gM2M1=gM1M2=Δp:=eΔψ(ln10)β1ΔpHjoules\displaystyle\begin{gathered}g_{LK}=g_{KL}=g_{M_{1}L}=g_{LM_{1}}=g_{NM_{2}}=g_{M_{2}N}=g_{ON}=g_{NO}=g_{KO}=g_{OK}=0\\ g_{M_{2}M_{1}}=-g_{M_{1}M_{2}}=\Delta\mathrm{p}:=e\Delta\psi-(\ln 10)\beta^{-1}\Delta\text{pH}\qquad\text{joules}\end{gathered} (S2.33)

We emphasize that Δψ\Delta\psi is the membrane electrical potential in volts, where Δψ0\Delta\psi\geq 0 indicates that the inside is more negative. ΔpH\Delta\text{pH} is the membrane pH difference, where ΔpH0\Delta\text{pH}\leq 0 indicates that the inside is more basic. No free energy is exchanged in the processes internal to each cycle state, so

𝒈˙b=0.\displaystyle\dot{\bm{g}}^{b}=0\,. (S2.34)

The dynamics are represented by the system’s rate matrix. We parameterize the transition rate for the jump from state i{K,L,M1,M2,N,O}i\in\{K,L,M_{1},M_{2},N,O\} to neighboring state j{K,L,M1,M2,N,O}j\in\{K,L,M_{1},M_{2},N,O\} as

Rji=κji1+eΔsjitot,\displaystyle R_{ji}=\frac{\kappa_{ji}}{1+e^{-\Delta s^{\mathrm{tot}}_{ji}}}\,, (S2.35)

where κji=κij\kappa_{ji}=\kappa_{ij} is the relaxation rate for the transition iji\leftrightarrow j, and Δsjitot=Δsijtot\Delta s^{\mathrm{tot}}_{ji}=-\Delta s^{\mathrm{tot}}_{ij} is the entropy produced during the transition iji\to j. This is a common parametrization which guarantees that the rates satisfy local detailed balance (LDB) [76]. The entropy production for each jump iji\to j is

Δsjitot=β(fifjgji+mji).\displaystyle\Delta s^{\mathrm{tot}}_{ji}=\beta(f_{i}-f_{j}-g_{ji}+m_{ji})\,. (S2.36)

where fif_{i} refers to internal free energies in joules (S2.30), β=1/kBT\beta=1/k_{B}T and kB=1.38×1023joules( K)1k_{B}=1.38\times 10^{-23}\ \text{joules}\cdot\left({}^{\circ}\text{ K}\right)^{-1} is Boltzmann’s constant. As above gji=gijg_{ji}=-g_{ij} is the increase of free energy of the internal reservoir, while mji=mijm_{ji}=-m_{ij} is the decrease of free energy in the external source during the transition iji\to j. In fact, mji=0m_{ji}=0 for all i,ji,j except the transition OKO\leftrightarrow K, which corresponds to photon absorption. The energy absorbed from the photon is hc/λhc/\lambda joules, where hh is Planck’s constant, cc is speed of light, and λ\lambda is the photon wavelength. We use a physiologically plausible value of 580 nm [77]. At this wavelength and temperature of 293293^{\circ} K,

mKO=mOK=hc/λjoules84kBT.\displaystyle m_{KO}=-m_{OK}=hc/\lambda\;\;\text{joules}\quad\approx\quad 84\,k_{B}T\,. (S2.37)

Observe that the transition OKO\to K is highly irreversible (rKOrOKr_{KO}\gg r_{OK}). In fact, our results are essentially the same regardless of whether the transition rate for this step is computed using (S2.35) or made absolutely irreversible.

The relaxation kinetics that enter into (S2.35) are taken from Table 1 in Ref. [32]. We use the geometric mean of the upper and lower estimates of krelax1k_{\mathrm{relax}}^{-1} to get the following relaxation rates (in sec-1):

κLK=κKL=2.57×105κNM2=κM2N=7.22×102κM1L=κLM1=2.55×104κON=κNO=5.10×102κM2M1=κM1M2=5.42×103κKO=κOK=1.28×102\displaystyle\begin{aligned} &\kappa_{LK}&&=\kappa_{KL}&=2.57\times 10^{5}&\qquad\qquad\kappa_{NM_{2}}&&=\kappa_{M_{2}N}&=7.22\times 10^{2}\\ &\kappa_{M_{1}L}&&=\kappa_{LM_{1}}&=2.55\times 10^{4}&\qquad\qquad\kappa_{ON}&&=\kappa_{NO}&=5.10\times 10^{2}\\ &\kappa_{M_{2}M_{1}}&&=\kappa_{M_{1}M_{2}}&=5.42\times 10^{3}&\qquad\qquad\kappa_{KO}&&=\kappa_{OK}&=1.28\times 10^{2}\end{aligned} (S2.38)

For concreteness, here we provide the numerical values of entropy production for each jump iji\to j, at in vivo potential Δψ=120\Delta\psi=120 mV:

(ΔsLKtot,\displaystyle(\Delta s^{\mathrm{tot}}_{LK}, ΔsM1Ltot,\displaystyle\Delta s^{\mathrm{tot}}_{M_{1}L}, ΔsM2M1tot,\displaystyle\Delta s^{\mathrm{tot}}_{M_{2}M_{1}}, ΔsNM2tot,\displaystyle\Delta s^{\mathrm{tot}}_{N{M_{2}}}, ΔsONtot,\displaystyle\Delta s^{\mathrm{tot}}_{ON}, ΔsKOtot\displaystyle\Delta s^{\mathrm{tot}}_{KO} )\displaystyle\!\!\!\!\!\!)
=\displaystyle= (2.65,\displaystyle(2.65, 0.66,\displaystyle-0.66, 0.21,\displaystyle 0.21, 0.33,\displaystyle 0.33, 0.66,\displaystyle-0.66, 76.62\displaystyle 76.62 )\displaystyle\!\!\!\!\!\!) (S2.39)

The resulting rate matrix, again at the in vivo potential, is (in sec-1)

R=\displaystyle R=\, [2.40×1051.70×1040001.28×1022.40×1052.57×1041.68×10400008.69×1031.99×1042.35×10300003.07×1032.77×1033.01×10200004.20×1024.75×1023.37×1026.78×10320001.74×1024.65×102].\displaystyle\left[\begin{array}[]{rrrrrr}-2.40\times 10^{5}&1.70\times 10^{4}&0&0&0&1.28\times 10^{2}\\ 2.40\times 10^{5}&-2.57\times 10^{4}&1.68\times 10^{4}&0&0&0\\ 0&8.69\times 10^{3}&-1.99\times 10^{4}&2.35\times 10^{3}&0&0\\ 0&0&3.07\times 10^{3}&-2.77\times 10^{3}&3.01\times 10^{2}&0\\ 0&0&0&4.20\times 10^{2}&-4.75\times 10^{2}&3.37\times 10^{2}\\ 6.78\times 10^{-32}&0&0&0&1.74\times 10^{2}&-4.65\times 10^{2}\end{array}\right]\,. (S2.46)
KLM1M2NO\displaystyle\hskip 10.00002pt\underbrace{\hskip 55.00008pt}_{K}\underbrace{\hskip 55.00008pt}_{L}\underbrace{\hskip 55.00008pt}_{M_{1}}\underbrace{\hskip 55.00008pt}_{M_{2}}\underbrace{\hskip 55.00008pt}_{N}\underbrace{\hskip 55.00008pt}_{O}

II.2 Details of numerical analysis in Fig. 2

To calculate the curves plotted in Fig. 2, we first define the ordered set of transitions

𝒯\displaystyle\mathcal{T} :=(KL,LM1,M1M2,M2N,NO)\displaystyle:=(K\leftrightarrow L,L\leftrightarrow M_{1},M_{1}\leftrightarrow M_{2},M_{2}\leftrightarrow N,N\leftrightarrow O)

Then, for each value of the electrical membrane potential Δψ\Delta\psi we perform the following:

  1. 1.

    Use the transition rates from (S2.35) to define the ‘total’ (baseline-and-control) rate matrix RR and numerically solve for R𝝅=0R\bm{\pi}=0 to obtain 𝝅\bm{\pi}, which is unique since RR is irreducible (all states are connected, see Fig. S7).

  2. 2.

    Use the transition rates from (S2.35) and values of 𝒈\bm{g} from (S2.33) to calculate the total harvesting rate as

    𝒢˙tot=i,jπiRjigji.\displaystyle\dot{\mathcal{G}}^{\mathrm{tot}}=\sum_{i,j}\pi_{i}R_{ji}g_{ji}\,. (S2.47)

    The total harvesting rate is plotted as the thick black line in Fig. 2.

  3. 3.

    For each transition t𝒯t\in\mathcal{T} that acts as control:

    1. (a)

      Define the baseline transition matrix RbR^{b} by removing the chosen transition from RR. As standard, the diagonal entries RiibR^{b}_{ii} are determined by Riib=jRjibR^{b}_{ii}=-\sum_{j}R^{b}_{ji}.

    2. (b)

      Define optimization parameters (𝒑,𝑱)(\bm{p},{\boldsymbol{J}}) with 𝒑n\bm{p}\in\mathbb{R}^{n} and 𝑱{\boldsymbol{J}} a vector in n2\mathbb{R}^{n^{2}} constrained such that ipi=1\sum_{i}p_{i}=1, pi0p_{i}\geq 0 i\forall i, 𝔹𝑱=Rb𝒑\mathbb{B}{\boldsymbol{J}}=-R^{b}\bm{p} where 𝔹n×n2\mathbb{B}\in\mathbb{R}^{n\times n^{2}} is the incidence matrix with entries 𝔹k,ij=δkiδkj\mathbb{B}_{k,ij}=\delta_{ki}-\delta_{kj}. Finally, we make all 𝑱ji=𝑱ij=0{\boldsymbol{J}}_{ji}={\boldsymbol{J}}_{ij}=0 for {i,j}t\{i,j\}\neq t.

    3. (c)

      Use the transition rates and free energy values of 𝒇\bm{f} from (S2.30) to fix the “baseline harvesting rate” and the “increase of system free energy” (Eq. (2)) functions:

      𝒢˙b(𝒑)\displaystyle\dot{\mathcal{G}}^{b}(\bm{p}) =i,jpiRjibgji\displaystyle=\sum_{i,j}p_{i}R^{b}_{ji}g_{ji} (S2.48)
      ˙b(𝒑)\displaystyle\dot{\mathcal{F}}^{b}(\bm{p}) =i,jpiRjib(fj+β1lnpj).\displaystyle=\sum_{i,j}p_{i}R^{b}_{ji}(f_{j}+{\beta}^{-1}\ln p_{j}). (S2.49)
    4. (d)

      Plug ˙b(𝒑)\dot{\mathcal{F}}^{b}(\bm{p}), 𝒢˙b(𝒑)\dot{\mathcal{G}}^{b}(\bm{p}) and Σ˙(𝑱)\dot{\Sigma}({\boldsymbol{J}}) into Eq. (10) to solve for 𝒑\bm{p}^{*}, 𝑱{\boldsymbol{J}}^{*} and 𝒢\mathscr{G} (color-coded line in Fig. 2) using numerical optimization.

We emphasize that in the data shown in Fig. 2 we add no extra constraints. However, in the remaining of this supplementary material, we consider various additional linear constraints pertaining to activity, thermodynamic affinity and kinetic limitations. Such constraints are imposed at Step 3.b and determine the feasibility set Λ\Lambda in Eq. (10).

We make three final observations regarding our bacteriorhodopsin model. First, our analysis assumes that control processes can be added without affecting membrane parameters, such as ΔpH\Delta\mathrm{pH} and Δψ\Delta\psi. In practice, this may be justified by homeostatic mechanisms. For instance, excess proton pumping produced by the introduction of control may be balanced by up-regulation of ATPase, which consumes the protons while making ATP.

Second, in order to consistent with LDB (6), changing the rate of control transition iji\to j, while keeping the internal free energy values fixed, may require changing the free energy used by that control transition gjic{g}^{c}_{ji}. For transitions coupled to the membrane potential (such as M1M2M_{1}\leftrightarrow M_{2}), this can be achieved by manipulating Δψ\Delta\psi or ΔpH\Delta\text{pH}, which here act as external parameters (see (12)). However, for a transition that is not coupled to the membrane potential, manipulating gjic{g}^{c}_{ji} may require, for example, the consumption of a chemical fuel such as ATP. The strength of driving can be modulated by regulating the nonequilibrium concentration of the chemical fuel.

Third, our model reproduces steady-state currents and parameter dependence (such as membrane potential) that agree with reported data, at least to a first approximation. At the same time, it is worth noting that the equilibrium constants reported in our source of kinetic data [32] differ somewhat from the free energy changes reported in our source of thermodynamic data [31]. This may be due to different experimental or estimation methods, or because some reaction steps are not approximated well as elementary transitions.

II.3 Additional analysis: reprotonation step as control

In the analysis in the main text, we observe that near the in vivo membrane potential, the reprotonation reaction (NON\leftrightarrow O) is the least efficient of all five transitions in 𝒯\mathcal{T}. In this section, we study this transition as control with additional constraints.

Our analysis can be motivated in the context of synthetic or natural selection for increased output in bacteriorhodopsin [35, 36, 37, 38]. Suppose that the genotype-phenotype map of bacteriorhodopsin is sufficiently modular such that the thermodynamic and kinetic properties of individual transitions in the cycle can be separately manipulated by mutations. In fact, in case of the reprotonation transition NON\leftrightarrow O, there is evidence that a single amino acid in the bacteriorhodopsin protein specifically and effectively changes the kinetics of that transition [35]. In this setting, we ask to what extend the existing NON\leftrightarrow O transition is optimal. At the same time, we illustrate how underlying thermodynamic and kinetic constraints, e.g., as might arise from underlying diffusion timescales and biochemistry, can be incorporated when quantifying optimality.

In concrete terms, we let RNOb=RONb=0R^{b}_{NO}=R^{b}_{ON}=0 and fix the flux parameters of the optimization problem (10) as Jij=Jji=0J_{ij}=J_{ji}=0 for all {i,j}{N,O}\{i,j\}\neq\{N,O\}. Formally, we consider the following feasibility set in (10):

ΛNO:={(𝒑,𝑱):𝒑0,𝟏T𝒑=1,𝑱0,Jij=0 if (i,j){(N,O),(O,N)}}.\displaystyle\Lambda^{NO}:=\{(\bm{p},{\boldsymbol{J}}):\bm{p}\geq 0,\bm{1}^{T}\bm{p}=1,{\boldsymbol{J}}\geq 0,J_{ij}=0\text{ if }(i,j)\not\in\{{(N,O)},{(O,N)}\}\}\,. (S2.50)

along with other constraints discussed below.

To compare the result of optimization to the actual system, we also consider the actual transition as the control, defined via:

RONc=κON1+eΔsONtotRNOc=κNO1+eΔsONtot\displaystyle R^{c}_{ON}=\frac{\kappa_{ON}}{1+e^{-\Delta s^{\mathrm{tot}}_{ON}}}\qquad\qquad R^{c}_{NO}=\frac{\kappa_{NO}}{1+e^{-\Delta s^{\mathrm{tot}}_{ON}}} (S2.51)

with all other Rijc=0R^{c}_{ij}=0. The baseline-and-control dynamics correspond to the actual bacteriorhodopsin system described in Sec. II.1. See also Fig. S8 (left).

For Sections II.3 and II.4 in this SM, it is useful to define the current (i.e., net flux) as

𝒥=JNOJON\displaystyle\mathcal{J}=J_{NO}^{*}-J_{ON}^{*} (S2.52)

where JNOJ_{NO}^{*} and JONJ_{ON}^{*} are the optimal fluxes found by our optimization problem (10). In the setting of these sections, where the NON\leftrightarrow O control transition is the missing step of the unicyclic bacteriorhodopsin system, 𝒥\mathcal{J} is the cyclic current in the presence of the baseline and the optimal control transition.

Refer to caption
Figure S8: Depiction of choices for baseline and control for Sections II.3 (left) and II.4 (right) of the SM. In each case, the baseline (shaded blue) does not include the transition of reprotonation (NON\leftrightarrow O) and proton-pumping (M1M2M_{1}\leftrightarrow M_{2}), respectively; instead, these are treated as control (shaded yellow). For both scenarios, as in the analysis done in the main text, note that no cyclic current circulates under baseline dynamics but does so under baseline-and-control (i.e. the actual) dynamics.

II.3.1 Constrained activity and affinity

We optimize the maximum harvesting rate with respect to the transition NON\leftrightarrow O, while imposing an additional constraint on the “dynamical activity” of the fluxes NON\to O and ONO\to N. This defines a smaller feasibility set in (10):

ΛaNO:=ΛNO{(𝒑,𝑱):JNO+JONa},\displaystyle\Lambda_{a}^{NO}:=\Lambda^{NO}\cap\{(\bm{p},{\boldsymbol{J}}):J_{NO}+J_{ON}\leq a\}, (S2.53)

for some value of aa in units of sec1\text{sec}^{-1}. This upper-bounds the dynamical activity of our control transition such that fluxes cannot be arbitrarily large.

Refer to caption
Figure S9: Right: actual harvesting rate (in black) and optimized harvesting rate for controlled transition NON\leftrightarrow O at different values of activity constraint aa. Dashed vertical line shows in vivo membrane potential (120mV120\text{mV}). Left panel shows the respective currents (upper plot) and the distribution at in vivo value of Δψ\Delta\psi (lower plot) for the actual system versus the optimized solutions under a=101a=10^{1} and aa\to\infty, in units of sec1\text{sec}^{-1}.

The result of this analysis are shown in Fig. (S9). We begin by noting that, in some instances in the right plot in this figure, the optimized constrained control yields a lower harvesting rate than that of the actual system (black line). For example, if a=101a=10^{1} at around in vivo values, the actual harvesting rate is roughly double the one achieved by the constrained 𝒢a=101\mathscr{G}_{a=10^{1}} (blue line). This is not too surprising since, from our computation of the actual harvesting rate at in vivo values, we read off (in sec1\text{sec}^{-1}):

JONin vivo=πNRON41.6JNOin vivo=πORNO30.1J^{\text{\emph{in\,vivo}}}_{ON}=\pi_{N}R_{ON}\simeq 41.6\qquad J^{\text{\emph{in\,vivo}}}_{NO}=\pi_{O}R_{NO}\simeq 30.1

Thus, constraining the activity as in (S2.53) limits the ability of the optimized system to amplify the current above the actual in vivo current (see top left plot in Fig. S9). In contrast, increasing aa approaches the optimal solution to the one shown for the NON\leftrightarrow O transition in the main text (Fig. 2), here shown in red. Note also that for strongly constrained activity, such as a=101a=10^{1}, the optimal distribution around the in vivo value shifts in the opposite way with respect to the unconstrained case (blue against red bar-plots in the lower left of Fig. S9).

Finally, in the upper left of Fig. S9 we plot the current as a function of Δψ\Delta\psi. We observe that the actual (baseline-and-control) current effectively goes to zero (stalls) at large potentials, since there is not enough free energy in the cycle to push protons across the membrane. At low potentials, the actual current saturates at a larger value while the harvesting rate plummets. This is because at sufficiently low Δψ\Delta\psi, transporting protons from inside to outside the cell actually drains free energy stored in the membrane potential, so the cycle operates as a dud.

Thermodynamic affinity constrained: Next, we add further constraints to the optimization problem by including thermodynamic affinity limitations, defined by the feasibility set:

Λ𝒜,aNO:=ΛaNO{(𝒑,𝑱):JNOe𝒜JONJNOe𝒜}.\displaystyle\Lambda_{\mathcal{A},a}^{NO}:=\Lambda_{a}^{NO}\cap\left\{(\bm{p},{\boldsymbol{J}}):J_{NO}e^{-\mathcal{A}}\leq J_{ON}\leq J_{NO}e^{\mathcal{A}}\right\}. (S2.54)

This linear constraint enforces a bound on the thermodynamic affinity of the NON\leftrightarrow O transition,

|lnJONJNO|𝒜for some𝒜0.\displaystyle\left|\ln\frac{J_{ON}}{J_{NO}}\right|\leq\mathcal{A}\qquad\text{for some}\quad\mathcal{A}\geq 0\,. (S2.55)

Results for this constraint are shown in Fig. S10 with the fixed choice of dynamical activity a=102sec1a=10^{2}\ \text{sec}^{-1}.

Refer to caption
Figure S10: Right: actual harvesting rate (in black) and optimized harvesting rate for controlled transition NON\leftrightarrow O at different values of maximum affinity 𝒜\mathcal{A} and maximum activity a=102sec1a=10^{2}\ \text{sec}^{-1}. Dashed vertical line shows in vivo membrane potential (120mV120\text{mV}). Left panel shows the respective currents (upper plot) and the distribution at in vivo value of Δψ\Delta\psi (lower plot) for the actual system and the optimized solutions under 𝒜=0.1\mathcal{A}=0.1 and 𝒜=0.8\mathcal{A}=0.8.

The combination of the constraint on dynamical activity and thermodynamic affinity is necessary to give physically meaningful results. To see why, imagine that only the thermodynamic affinity was constrained, as in (S2.55). Now consider some pair of a distribution 𝒑\bm{p} and flux vector 𝑱{\boldsymbol{J}} in our optimization problem (10), where 𝑱{\boldsymbol{J}} is restricted to have non-zero transitions only for NON\to O and ONO\to N, and satisfy the steady-state condition 𝔹𝑱=Rb𝒑\mathbb{B}{\boldsymbol{J}}=-R^{b}\bm{p}. These fluxes can be increased as JNOJNO+α,JONJON+αJ_{NO}\mapsto J_{NO}+\alpha,J_{ON}\mapsto J_{ON}+\alpha for α0\alpha\geq 0. This transformation doesn’t change the current across the transition NON\to O (S2.52), so the steady-state constraint is still valid for the same 𝒑\bm{p} (note that 𝔹𝑱\mathbb{B}{\boldsymbol{J}} only depends on the currents, i.e., net fluxes). Finally, by choosing α\alpha large enough, we can make the EPR term in (10) vanish and satisfy the affinity constraint (S2.55), since limα|ln[(JNO+α)/(JON+α)]|=0\lim_{\alpha\to\infty}|\ln[(J_{NO}+\alpha)/(J_{ON}+\alpha)]|=0. This shows that, lacking other constraints, the affinity constraint is vacuous, since it can be always satisfied by making the one-way fluxes sufficiently large. On the other hand, the combination of the activity and affinity constraints sets a bound on the fluxes.

II.3.2 Constrained kinetics

Suppose that we optimize NON\leftrightarrow O as control under a constraint involving the respective transition rates. In particular, assume that the possible rates are bounded by some value κ\kappa such that RONcκ,RNOcκR^{c}_{ON}\leq\kappa,R^{c}_{NO}\leq\kappa (in units of sec1\text{sec}^{-1}). Then, this can be encoded in the following feasibility set:

ΛκNO:=ΛNO{(𝒑,𝑱):JNOκpO,JONκpN}.\displaystyle\Lambda_{\kappa}^{NO}:=\Lambda^{NO}\cap\{(\bm{p},{\boldsymbol{J}}):J_{NO}\leq\kappa p_{O},J_{ON}\leq\kappa p_{N}\}. (S2.56)

Note that Λκ\Lambda_{\kappa} introduces constraints that mix both 𝒑\bm{p} and 𝑱{\boldsymbol{J}}. We also note that constraints on the transition rates are perhaps the most realistic ones when considering the constraints faced by by biomolecular machines [35]. Harvesting rate curves for different κ\kappa are shown in the right plot of Fig. S11.

Refer to caption
Figure S11: Right: actual harvesting rate (in black) and optimized harvesting rates for controlled transition NON\leftrightarrow O at different values of kinetic constraint κ\kappa. Dashed vertical line shows in vivo membrane potential (120mV120\text{mV}). Left panel shows the respective currents (upper plot) and the distribution at in vivo value of Δψ\Delta\psi (lower plot) for the actual system and the optimized solutions under κ=101\kappa=10^{1} and unconstrained κ\kappa.

II.4 Proton-pumping as control

The proton-pumping steps (M1M2M_{1}\leftrightarrow M_{2}) is perhaps one of the most experimentally accessible transitions to study. This is because it is the only transition that depends explicitly on the membrane potential Δψ\Delta\psi, which can be experimentally tweaked and thus used as an external parameter. For example, the electrochemical membrane potential is frequently manipulated by introduction of ionphores, such as valinomycin [78].

In this section, we study this transition as control, possibly under constraints.

In this analysis, the baseline rate matrix RbR^{b} does not include the transition M1M2M_{1}\leftrightarrow M_{2}. In concrete terms, we let RM2M1b=RM1M2b=0R^{b}_{M_{2}M_{1}}=R^{b}_{M_{1}M_{2}}=0 and fix the flux parameters of the optimization problem (10) as Jij=Jji=0J_{ij}=J_{ji}=0 for all {i,j}{M1,M2}\{i,j\}\neq\{M_{1},M_{2}\}. Formally, we consider the following feasibility set in (10):

ΛM1M2:={(𝒑,𝑱):𝒑0,𝟏T𝒑=1,𝑱0,Jij=0 if (i,j){(M1,M2),(M2,M1)}}.\displaystyle\Lambda^{M_{1}M_{2}}:=\{(\bm{p},{\boldsymbol{J}}):\bm{p}\geq 0,\bm{1}^{T}\bm{p}=1,{\boldsymbol{J}}\geq 0,J_{ij}=0\text{ if }(i,j)\not\in\{{(M_{1},M_{2})},{(M_{2},M_{1})}\}\}\,. (S2.57)

along with other constraints discussed below.

To compare the result of optimization to the actual system, we also consider the actual transition as the control, defined via:

RM2M1c=κM2M11+eΔsM2M1totRM1M2c=κM1M21+eΔsM2M1tot\displaystyle R^{c}_{M_{2}M_{1}}=\frac{\kappa_{M_{2}M_{1}}}{1+e^{-\Delta s^{\mathrm{tot}}_{M_{2}M_{1}}}}\qquad\qquad R^{c}_{M_{1}M_{2}}=\frac{\kappa_{M_{1}M_{2}}}{1+e^{-\Delta s^{\mathrm{tot}}_{M_{2}M_{1}}}} (S2.58)

with all other Rijc=0R^{c}_{ij}=0. The baseline-and-control dynamics correspond to the actual bacteriorhodopsin system described in Sec. II.1. See also Fig. S8 (right).

Recall that the proton-pumping transition is the only one that involves the membrane potential Δψ\Delta\psi (see (S2.33)), and that it is no longer part of the baseline. For this reason, the optimization problem (10), which only involves baseline parameters, no longer depends on the choice of Δψ\Delta\psi. We now proceed to study this problem for different choices of additional constraints on (𝒑,𝑱)\left(\bm{p},{\boldsymbol{J}}\right). For this reason, in Fig. 2, the value for 𝒢\mathscr{G} when M1M2M_{1}\leftrightarrow M_{2} acts as control is a constant. In this case, it is interesting to compare the optimum harvesting rates with the values of 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} at in vivo membrane potential versus maxΔψ𝒢˙tot\max_{\Delta\psi}\dot{\mathcal{G}}^{\mathrm{tot}}, the maximum value attained by 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} across all membrane potentials in Fig. 2 (right). Results are shown in Fig. S12.

Refer to caption
Refer to caption
Refer to caption
Figure S12: Left: Log-log plot of the optimized harvesting rates as a function of dynamical activity bound aa (ΛaM1M2\Lambda_{a}^{M_{1}M_{2}}). Middle: Log-linear plot of the optimized harvesting rates under thermodynamic affinity and dynamical activity constraints (Λ𝒜,aM1M2\Lambda_{\mathcal{A},a}^{M_{1}M_{2}}) for values of 𝒜\mathcal{A} at fixed a=50sec1a=50\ \text{sec}^{-1} (green line) and a=102sec1a=10^{2}\ \text{sec}^{-1} (red line). Right: Log-log plot of the optimized harvesting rates as a function of the kinetic bound κ\kappa (ΛκM1M2\Lambda_{\kappa}^{M_{1}M_{2}}). All figures also show the value of the actual harvesting rate 𝒢˙tot\dot{\mathcal{G}}^{\mathrm{tot}} at the in vivo membrane potential value (black dashed lines), which gives 70kBT/sec\sim 70k_{B}T/\text{sec}, and the maximum achievable harvesting rate that the system can achieve for any Δψ\Delta\psi (gray line), 83.5kBT/sec\sim 83.5k_{B}T/\text{sec}.

Activity constraint: analogously to Sec. II.3, we now define:

ΛaM1M2:=ΛM1M2{(𝒑:𝑱):JM1M2+JM2M1a,}\displaystyle\Lambda_{a}^{M_{1}M_{2}}:=\Lambda^{M_{1}M_{2}}\cap\{(\bm{p}:{\boldsymbol{J}}):J_{M_{1}M_{2}}+J_{M_{2}M_{1}}\leq a,\} (S2.59)

for values of aa in units of sec1\text{sec}^{-1}. Solving (10) under ΛaM1M2\Lambda_{a}^{M_{1}M_{2}} yields the blue curve in Fig. S12 for 𝒢\mathscr{G} as a function of aa.

Thermodynamic affinity constraint: owing to the same reasoning as in Sec. II.3, this constraint proves physically meaningful if combined with another constraint on the fluxes, such as the activity constraint (S2.59). In this case, we analogously define the feasibility set:

Λ𝒜,aM1M2:=ΛaM1M2{(𝒑,𝑱):JM1M2e𝒜JM2M1JM1M2e𝒜},\displaystyle\Lambda_{\mathcal{A},a}^{M_{1}M_{2}}:=\Lambda_{a}^{M_{1}M_{2}}\cap\left\{(\bm{p},{\boldsymbol{J}}):J_{M_{1}M_{2}}e^{-\mathcal{A}}\leq J_{M_{2}M_{1}}\leq J_{M_{1}M_{2}}e^{\mathcal{A}}\right\}, (S2.60)

In the inset of Fig. S12, we show the solution to (10) under Λ𝒜,aM1M2\Lambda_{\mathcal{A},a}^{M_{1}M_{2}} for values of a range of 𝒜\mathcal{A} with fixed values of a=50sec1a=50\ \text{sec}^{-1} (green line) and a=102sec1a=10^{2}\ \text{sec}^{-1} (red line).

Kinetic constraint: analogously to Sec. II.3, we define:

ΛκM1M2:=ΛM1M2{(𝒑,𝑱):JM1M2κpM2,JM2M1κpM1},\displaystyle\Lambda_{\kappa}^{M_{1}M_{2}}:=\Lambda^{M_{1}M_{2}}\cap\{(\bm{p},{\boldsymbol{J}}):J_{M_{1}M_{2}}\leq\kappa p_{M_{2}},J_{M_{2}M_{1}}\leq\kappa p_{M_{1}}\}, (S2.61)

for values of κ\kappa in units of sec1\text{sec}^{-1}. Solving (10) under ΛκM1M2\Lambda_{\kappa}^{M_{1}M_{2}} yields the orange curve in Fig. S12 for 𝒢\mathscr{G} as a function of κ\kappa.

III Closed-form solutions of Eq. (11) in different regimes

We consider a system of nn states indexed by i{1,,n}i\in\{1,\ldots,n\} that evolves according to a Markovian master equation with rate matrix R=Rb+RcR=R^{b}+R^{c}, where RbR^{b} and RcR^{c} correspond to baseline and control processes respectively. Here we study the optimization (11) in three limiting regimes.

III.1 Linear Response (LR) regime, Eq. (15)

We first consider (11) in the linear response regime, that is under the assumption that the optimal distribution is close to the baseline steady state 𝒑𝝅b\bm{p}^{*}\approx{\bm{\pi}^{b}}, thereby arriving at (15). We assume that RbR^{b} is irreducible and has a unique steady state 𝝅b{\bm{\pi}^{b}} with full support. Note that the assumption that 𝝅b{\bm{\pi}^{b}} has full support is satisfied when RbR^{b} is “weakly reversible”, meaning that Rijb>0R^{b}_{ij}>0 whenever Rjib>0R^{b}_{ji}>0.

First, we rewrite the entropic term in (13) as

i,jRijbpjlnpi\displaystyle\sum_{i,j}R^{b}_{ij}p_{j}\ln p_{i} =i,jRijbpj(lnpilnπib)+i,jRijbpjlnπib\displaystyle=\sum_{i,j}R^{b}_{ij}p_{j}\left(\ln p_{i}-\ln\pi^{b}_{i}\right)+\sum_{i,j}R^{b}_{ij}p_{j}\ln\pi^{b}_{i} (S3.62)
=i,jRijb(pjπjb)(lnpilnπib)+i,jRijbpjlnπib,\displaystyle=\sum_{i,j}R^{b}_{ij}(p_{j}-\pi^{b}_{j})\left(\ln p_{i}-\ln\pi^{b}_{i}\right)+\sum_{i,j}R^{b}_{ij}p_{j}\ln\pi^{b}_{i}, (S3.63)

where in the last step we used that i,jRijbπjblnπib=0-\sum_{i,j}R^{b}_{ij}\pi^{b}_{j}\ln\pi^{b}_{i}=0 when 𝝅b{\bm{\pi}^{b}} is the steady-state distribution of the baseline. We focus on the first term in (S3.63), and use the expansion lnxx1\ln x\simeq x-1 for x1x\approx 1, to derive

i,jRijb(pjπjb)ln(piπib)i,jRijb(pjπjb)piπibπib=i,jπjbπibRijbzjzi,\displaystyle\sum_{i,j}R^{b}_{ij}(p_{j}-\pi^{b}_{j})\ln\left(\frac{p_{i}}{\pi^{b}_{i}}\right)\approx\sum_{i,j}R^{b}_{ij}(p_{j}-\pi^{b}_{j})\frac{p_{i}-\pi^{b}_{i}}{\pi^{b}_{i}}=\sum_{i,j}\sqrt{\frac{\pi^{b}_{j}}{\pi^{b}_{i}}}R^{b}_{ij}z_{j}z_{i}, (S3.64)

where we defined

𝒛:=D1(𝒑𝝅b) , with Dij:=δijπib.\displaystyle\boldsymbol{z}:=D^{-1}(\bm{p}-{\bm{\pi}^{b}})\text{ , with }\ D_{ij}:=\delta_{ij}\sqrt{\pi^{b}_{i}}. (S3.65)

Our variable to optimize is a probability distribution 𝒑\bm{p}, which needs to satisfy two constrains: ipi=1\sum_{i}p_{i}=1 and pi0p_{i}\geq 0. The latter constraint holds automatically, given our hypothesis that 𝒑\bm{p} is very close to 𝝅b{\bm{\pi}^{b}}. The former constraint can be expressed in terms of 𝒛\boldsymbol{z} by setting

𝒛TD1𝝅b=0i(piπib)=0ipi=iπib=1.\displaystyle\boldsymbol{z}^{T}D^{-1}{\bm{\pi}^{b}}=0\rightarrow\sum_{i}\left(p_{i}-\pi^{b}_{i}\right)=0\iff\sum_{i}p_{i}=\sum_{i}\pi^{b}_{i}=1. (S3.66)

We impose this linear constraint on 𝒛\boldsymbol{z} below, when redefining the optimization problem in terms of 𝒛\boldsymbol{z} rather than 𝒑\bm{p}.

Before we finish off with the entropic term, we introduce the following symmetric matrix:

Aij:=12(Rijb+Rjibπibπjb).\displaystyle A_{ij}:=\frac{1}{2}\left(R^{b}_{ij}+R^{b}_{ji}\frac{\pi^{b}_{i}}{\pi^{b}_{j}}\right). (S3.67)

The matrix AA is sometimes dubbed the “additive symmetrization” of RbR^{b}. It will always obey the detailed balance condition (DB) for 𝝅b{\bm{\pi}^{b}}, πjbAij=πibAji\pi^{b}_{j}A_{ij}=\pi^{b}_{i}A_{ji}. Moreover, A=RbA=R^{b} if and only if RbR^{b} obeys DB. We note that only in this latter case would 𝝅b{\bm{\pi}^{b}} be an equilibrium distribution. We also define:

M:=D1ADMij=(D1AD)ij=πjbπibAij.\displaystyle M:=D^{-1}AD\ \rightarrow\ M_{ij}=(D^{-1}AD)_{ij}=\sqrt{\frac{\pi^{b}_{j}}{\pi^{b}_{i}}}A_{ij}. (S3.68)

Observe that M=D1ADM=D^{-1}AD, which implies that MM and AA are related via a similarity transformation so any right eigenpair (λ,𝒖)(\lambda,\boldsymbol{u}) of AA, (λ,D1𝒖)\left(\lambda,D^{-1}\boldsymbol{u}\right) is an eigenpair of the symmetric matrix MM. Also, since DD and AA are both symmetric, so is MM. In particular, since AA is the sum of two irreducible rate matrices, it is then itself an irreducible rate matrix which has a unique right eigenvector. It is easy to show that, in fact, 𝝅b{\bm{\pi}^{b}} is AA’s steady-state distribution with eigenvalue 0. Then, by similarity, MM also has a unique eigenvector D1𝝅bD^{-1}{\bm{\pi}^{b}} with eigenvalue 0. Moreover, since AA is a rate matrix, it is negative semidefinite (λα0\lambda_{\alpha}\leq 0), and so MM is also negative semidefinite. We indicate the right eigenvectors of AA as 𝒖α\boldsymbol{u}^{\alpha} and the eigenvectors of the symmetric matrix MM as 𝒎α=D1𝒖α\boldsymbol{m}^{\alpha}=D^{-1}\boldsymbol{u}^{\alpha}. We assume that 𝒎α\boldsymbol{m}^{\alpha} are normalized as 𝒎α=1\|\boldsymbol{m}^{\alpha}\|=1, therefore D1𝒖α=1\|D^{-1}\boldsymbol{u}^{\alpha}\|=1.

Going back to (S3.64), we note that

ijπjbπibRijbzjzi\displaystyle\sum_{ij}\sqrt{\frac{\pi^{b}_{j}}{\pi^{b}_{i}}}R^{b}_{ij}z_{j}z_{i} =12(ijπjbπibRijbzjzi+i,jπibπjbRjibzizj)\displaystyle=\frac{1}{2}\left(\sum_{ij}\sqrt{\frac{\pi^{b}_{j}}{\pi^{b}_{i}}}R^{b}_{ij}z_{j}z_{i}+\sum_{i,j}\sqrt{\frac{\pi^{b}_{i}}{\pi^{b}_{j}}}R^{b}_{ji}z_{i}z_{j}\right)
=i,jπjbπib(Rijb+πibπjbRjib)zjzi=i,jπjbπibzjzi=i,jMi,jzjzi.\displaystyle=\sum_{i,j}\sqrt{\frac{\pi^{b}_{j}}{\pi^{b}_{i}}}\left(R^{b}_{ij}+\frac{\pi^{b}_{i}}{\pi^{b}_{j}}R^{b}_{ji}\right)z_{j}z_{i}=\sum_{i,j}\sqrt{\frac{\pi^{b}_{j}}{\pi^{b}_{i}}}z_{j}z_{i}=\sum_{i,j}M_{i,j}z_{j}z_{i}. (S3.69)

In order to keep track of the second term in (S3.63), let us first recall the free-energy term in (13) and the pay-off vector ϕ\boldsymbol{\phi} which, in terms of the baseline parameters, reads as

ϕi:=g˙ib+jRjib(fjfi+gjib),\phi_{i}:=\dot{g}^{b}_{i}+\sum_{j}R^{b}_{ji}(f_{j}-f_{i}+{g}^{b}_{ji}),

and conveniently redefine it such that:

ϕLR=ϕ+β1RbTln𝝅b.\displaystyle\boldsymbol{\phi}^{\text{LR}}=\boldsymbol{\phi}+{\beta}^{-1}{R^{b}}^{T}\ln{\bm{\pi}^{b}}. (S3.70)

For reasons that will become obvious below, we re-scale (S3.70) by setting

𝒗:=β2DϕLR.\displaystyle\boldsymbol{v}:=\frac{\beta}{2}D\boldsymbol{\phi}^{\text{LR}}. (S3.71)

We combine the above with the definition (S3.65) and the nonlinear term (S3.69). We then recast the optimization problem in terms of the variable 𝒛\boldsymbol{z}, which is constrained by (S3.66). Finally, under the LR regime, (11) is approximated by

𝒢𝒢LR:=ϕT𝝅b+β1max𝒛n:𝒛TD1𝝅b=0𝒛TM𝒛+2𝒛T𝒗.\displaystyle\mathscr{G}\simeq\mathscr{G}_{\text{LR}}:=\boldsymbol{\phi}^{T}{\bm{\pi}^{b}}+{\beta}^{-1}\max_{\boldsymbol{z}\in\mathbb{R}^{n}:\boldsymbol{z}^{T}D^{-1}{\bm{\pi}^{b}}=0}\boldsymbol{z}^{T}M\boldsymbol{z}+2\boldsymbol{z}^{T}\boldsymbol{v}. (S3.72)

Eq. (S3.72) is a quadratic optimization problem, which can be solved using standard techniques from linear algebra. For convenience we summarize these techniques in Lemma 1 in Sec. V of this SM. That theorem implies that

𝒢LR=ϕT𝝅bβ1𝒗TM+𝒗𝒑LR=𝝅bDM+𝒗\displaystyle\mathscr{G}_{\text{LR}}=\boldsymbol{\phi}^{T}{\bm{\pi}^{b}}-{\beta}^{-1}\boldsymbol{v}^{T}M^{+}\boldsymbol{v}\qquad\qquad\bm{p}^{*}_{\mathrm{LR}}={\bm{\pi}^{b}}-DM^{+}\boldsymbol{v}\, (S3.73)

where M+=α>1λα1𝒎α𝒎αTM^{+}=\sum_{\alpha>1}\lambda_{\alpha}^{-1}\boldsymbol{m}^{\alpha}{\boldsymbol{m}^{\alpha}}^{T} is the Moore-Penrose pseudo-inverse of MM. In applying Theorem 1, we used result (S5.123) and the relation 𝒑=𝝅b+D𝒛\bm{p}^{*}={\bm{\pi}^{b}}+D\boldsymbol{z}^{*}, where 𝒛\boldsymbol{z}^{*} solves (S3.72).

We can rewrite (S3.73) using the eigensystem of MM,

𝒢LR=ϕT𝝅b+β1α>1(𝒗T𝒎α)2λα𝒑LR=𝝅b+α>1𝒗T𝒎αλαD𝒎α\displaystyle\mathscr{G}_{\text{LR}}=\boldsymbol{\phi}^{T}{\bm{\pi}^{b}}+{\beta}^{-1}\sum_{\alpha>1}\frac{(\boldsymbol{v}^{T}\boldsymbol{m}^{\alpha})^{2}}{-\lambda_{\alpha}}\qquad\qquad\bm{p}^{*}_{\mathrm{LR}}={\bm{\pi}^{b}}+\sum_{\alpha>1}\frac{\boldsymbol{v}^{T}\boldsymbol{m}^{\alpha}}{-\lambda_{\alpha}}D\boldsymbol{m}^{\alpha}\, (S3.74)

Finally, as in the main text we define

Ωα=β1𝒗T𝒎α=12(DϕLR)T(D1𝒖α)=12(ϕLR)T𝒖α,\displaystyle\Omega_{\alpha}={\beta}^{-1}\boldsymbol{v}^{T}\boldsymbol{m}^{\alpha}=\frac{1}{2}\left(D\boldsymbol{\phi}^{\text{LR}}\right)^{T}\left(D^{-1}\boldsymbol{u}^{\alpha}\right)=\frac{1}{2}\left(\boldsymbol{\phi}^{\text{LR}}\right)^{T}\boldsymbol{u}^{\alpha}, (S3.75)

which can be interpreted as the harvesting amplitude of the eigenmodes of AA. Plugging into (S3.73) gives Eq. (15) in the main text,

𝒢LR=ϕT𝝅b+βα>1|Ωα|2λαand𝒑LR=𝝅b+α>1βΩαλα𝒖α\displaystyle\mathscr{G}_{\text{LR}}=\boldsymbol{\phi}^{T}{\bm{\pi}^{b}}+{\beta}\sum_{\alpha>1}\frac{|\Omega_{\alpha}|^{2}}{-\lambda_{\alpha}}\qquad\text{and}\qquad\bm{p}^{*}_{\mathrm{LR}}={\bm{\pi}^{b}}+\sum_{\alpha>1}\frac{\beta\Omega_{\alpha}}{-\lambda_{\alpha}}\boldsymbol{u}^{\alpha}\, (S3.76)

Region of validity of the LR regime

Note that the LR regime applies when 𝒑𝝅b1\|\bm{p}^{*}-{\bm{\pi}^{b}}\|\ll 1. We can write

𝒑LR𝝅b=βα>1Ωαλα𝒖αα>1β|Ωα|λα𝒖αα>1β|Ωα|λαD𝒎αβ(n1)maxα>1|Ωα/λα|maxiπib,\displaystyle\begin{aligned} \|\bm{p}^{*}_{\mathrm{LR}}-{\bm{\pi}^{b}}\|&=\beta\Big{\|}\sum_{\alpha>1}\frac{\Omega_{\alpha}}{-\lambda_{\alpha}}\boldsymbol{u}^{\alpha}\Big{\|}\leq\sum_{\alpha>1}\frac{\beta|\Omega_{\alpha}|}{-\lambda_{\alpha}}\|\boldsymbol{u}^{\alpha}\|\\ &\leq\sum_{\alpha>1}\frac{\beta|\Omega_{\alpha}|}{-\lambda_{\alpha}}\|D\|\|\boldsymbol{m}^{\alpha}\|\leq\beta(n-1)\max_{\alpha>1}|\Omega_{\alpha}/\lambda_{\alpha}|\max_{i}\sqrt{\pi^{b}_{i}}\,,\end{aligned} (S3.77)

where we used the triangle inequality and properties of the matrix norm. Therefore, the LR approximation is guaranteed to be valid when

maxα>1|Ωα/λα|1β(n1)maxiπib.\displaystyle\max_{\alpha>1}|\Omega_{\alpha}/\lambda_{\alpha}|\ll\frac{1}{\beta(n-1)\max_{i}\sqrt{\pi^{b}_{i}}}\,. (S3.78)

In other words, the harvesting amplitude on each eigenmode must be slower than relaxation modes, up to a factor that depends only on β\beta, nn, and the steady state 𝝅b{\bm{\pi}^{b}}.

III.2 Deterministic (D) regime, Eq. (16)

In the Deterministic (D) regime, we assume that the nonlinear terms in (13) are small compared to the linear terms. This allows us to approximate the optimal solution of (13) as the linear optimization

𝒢\displaystyle\mathscr{G} =max𝒑β1(Rb𝒑)Tln𝒑+ϕT𝒑\displaystyle=\max_{\bm{p}}\,{\beta}^{-1}\left(R^{b}\bm{p}\right)^{T}\ln\bm{p}+\boldsymbol{\phi}^{T}\bm{p} (S3.79)
max𝒑ϕT𝒑=:𝒢D,\displaystyle\approx\max_{\bm{p}}\boldsymbol{\phi}^{T}\bm{p}=:\mathscr{G}_{\text{D}}, (S3.80)

where (ln𝒑)T:=(lnp1,,lnpn)(\ln\bm{p})^{T}:=(\ln p_{1},\ldots,\ln p_{n}). The maximum in (S3.80) is achieved by (𝒑D)i=δii\left(\bm{p}^{*}_{\mathrm{D}}\right)_{i}=\delta_{ii^{*}} with i=argmaxiϕii^{*}=\arg\max_{i}\phi_{i}, so

𝒢D=ϕi=maxi[g˙ib+jRjib(fjfi+gjib)],\displaystyle\mathscr{G}_{\text{D}}=\phi_{i^{*}}=\max_{i}\big{[}\dot{g}^{b}_{i}+\sum_{j}R^{b}_{ji}(f_{j}-f_{i}+{g}^{b}_{ji})\big{]}\,, (S3.81)

as appears in the main text.

Region of validity of the D regime

We now discuss when the Deterministic regime approximation is valid.

We begin by making the assumption that the baseline harvesting rate is non-negative, 𝒢˙(𝝅b)0\dot{\mathcal{G}}({\bm{\pi}^{b}})\geq 0. In addition, for notational convenience, let K:=maxi(Riib)K:=\max_{i}(-R^{b}_{ii}) indicate the largest escape rate.

We express the regime of validity in terms of the following two parameters:

α\displaystyle\alpha :=Kβ𝒢D=maxi(Riib)βmaxi[g˙ib+jRjib(fjfi+gjib)]0\displaystyle:=\frac{K}{\beta\mathscr{G}_{\text{D}}}=\frac{\max_{i}(-R^{b}_{ii})}{\beta\max_{i}\big{[}\dot{g}^{b}_{i}+\sum_{j}R^{b}_{ji}(f_{j}-f_{i}+{g}^{b}_{ji})\big{]}}\geq 0 (S3.82)
γ\displaystyle\gamma :=ln(miniπib)0.\displaystyle:=-\ln\Big{(}\min_{i}\pi^{b}_{i}\Big{)}\geq 0\,. (S3.83)

The parameter α\alpha reflects the balance between diffusion out of the optimal state versus Deterministic harvesting rate. The parameter γ\gamma is the minimal steady-state probability of any state.

Assuming α<1\alpha<1, we show that

|𝒢𝒢D1|α(lnα+γ+1)\displaystyle\left|\frac{\mathscr{G}}{\mathscr{G}_{\text{D}}}-1\right|\leq\alpha\left(-\ln\alpha+\gamma+1\right)\, (S3.84)

The RHS of (S3.84) vanishes for α0\alpha\to 0, so the LHS tightens as 𝒢/𝒢D1\mathscr{G}/\mathscr{G}_{\text{D}}\to 1. We emphasize that the D approximation only becomes accurate in relative, not additive, terms (i.e., it is not necessarily true that 𝒢𝒢D0\mathscr{G}-\mathscr{G}_{\text{D}}\to 0).

To derive (S3.84), we first consider an upper bound on 𝒢\mathscr{G}. Observe that for any 𝒑\bm{p},

S˙b(𝒑)=i,jRijbpjlnpiiRiibpilnpii|Riib|(pilnpi)KS(𝒑)Klnn\displaystyle-\dot{S}^{b}(\bm{p})=\sum_{i,j}R^{b}_{ij}p_{j}\ln p_{i}\leq\sum_{i}R^{b}_{ii}p_{i}\ln p_{i}\leq\sum_{i}|R^{b}_{ii}|(-p_{i}\ln p_{i})\leq KS(\bm{p})\leq K\ln n (S3.85)

where we used Rijbpjlnpi0R^{b}_{ij}p_{j}\ln p_{i}\leq 0 for iji\neq j and S(𝒑)lnnS(\bm{p})\leq\ln n. Plugging into (S3.79) and using that 𝒢DϕT𝒑\mathscr{G}_{\text{D}}\geq\boldsymbol{\phi}^{T}\bm{p} for all 𝒑\bm{p} and γlnn\gamma\geq\ln n, we then have

𝒢K(lnn)/β+𝒢D=𝒢Dαlnn+𝒢D𝒢Dαγ+𝒢D.\displaystyle\mathscr{G}\leq K(\ln n)/\beta+\mathscr{G}_{\text{D}}=\mathscr{G}_{\text{D}}\alpha\ln n+\mathscr{G}_{\text{D}}\leq\mathscr{G}_{\text{D}}\alpha\gamma+\mathscr{G}_{\text{D}}\,. (S3.86)

To derive a lower bound on 𝒢\mathscr{G}, define the distribution pi:=(1α)δii+απibp_{i}:=\left(1-\alpha\right)\delta_{ii^{*}}+\alpha\pi^{b}_{i}, with α\alpha as in (S3.82). Plugging this distribution into the objective in (S3.79) yields

𝒢\displaystyle\mathscr{G} β1(Rb𝒑)Tln𝒑+ϕT𝒑=β1(1α)i(Riiblnpi)+ϕT𝒑,\displaystyle\geq{\beta}^{-1}\left(R^{b}\bm{p}\right)^{T}\ln\bm{p}+\boldsymbol{\phi}^{T}\bm{p}={\beta}^{-1}(1-\alpha)\sum_{i}(R^{b}_{ii^{*}}\ln p_{i})+\boldsymbol{\phi}^{T}\bm{p}, (S3.87)

where we used that jRijb((1α)δji+απjb)=Riib(1α)\sum_{j}R^{b}_{ij}\left(\left(1-\alpha\right)\delta_{ji^{*}}+\alpha\pi^{b}_{j}\right)=R^{b}_{ii^{*}}(1-\alpha) since Rb𝝅b=0R^{b}{\bm{\pi}^{b}}=0. Using Riiblnpi0R^{b}_{i^{*}i^{*}}\ln p_{i^{*}}\geq 0, we bound the first term on the right side of (S3.87) as

iRiiblnpi\displaystyle\sum_{i}R^{b}_{ii^{*}}\ln p_{i} i:iiRiiblnpi=i:iiRiibln(απib)\displaystyle\geq\sum_{i:i\neq i^{*}}R^{b}_{ii^{*}}\ln p_{i}=\sum_{i:i\neq i^{*}}R^{b}_{ii^{*}}\ln(\alpha\pi^{b}_{i})
i:iiRiib(lnαγ)\displaystyle\geq\sum_{i:i\neq i*}R^{b}_{ii^{*}}(\ln\alpha-\gamma)
=Riib(lnαγ)K(lnαγ),\displaystyle=-R^{b}_{i^{*}i^{*}}(\ln\alpha-\gamma)\geq K\left(\ln\alpha-\gamma\right)\,, (S3.88)

where we used that lnαγ<0\ln\alpha-\gamma<0 in the last inequality. For the second term on the right side of (S3.87),

ϕT𝒑=ipiϕi=(1α)iδiiϕi+αiπibϕi=(1α)𝒢D+α𝒢˙(𝝅b).\displaystyle\boldsymbol{\phi}^{T}\bm{p}=\sum_{i}p_{i}\phi_{i}=(1-\alpha)\sum_{i}\delta_{ii^{*}}\phi_{i}+\alpha\sum_{i}\pi^{b}_{i}\phi_{i}=(1-\alpha)\mathscr{G}_{\text{D}}+\alpha\dot{\mathcal{G}}({\bm{\pi}^{b}}). (S3.89)

We can plug (III.2) and (S3.89) into (S3.87) to give a lower bound on 𝒢\mathscr{G},

𝒢\displaystyle\mathscr{G} β1(1α)K(lnαγ)+(1α)𝒢D+α𝒢˙(𝝅b)\displaystyle\geq{\beta}^{-1}(1-\alpha)K(\ln\alpha-\gamma)+(1-\alpha)\mathscr{G}_{\text{D}}+\alpha\dot{\mathcal{G}}({\bm{\pi}^{b}})
=(1α)𝒢Dα(lnαγ1)+𝒢D+α𝒢˙(𝝅b)\displaystyle=(1-\alpha)\mathscr{G}_{\text{D}}\alpha(\ln\alpha-\gamma-1)+\mathscr{G}_{\text{D}}+\alpha\dot{\mathcal{G}}({\bm{\pi}^{b}})

where we used the definition of α\alpha (S3.82) and rearranged. Since lnαγ<0\ln\alpha-\gamma<0, we can further drop the α2\alpha^{2} term to give

𝒢\displaystyle\mathscr{G} 𝒢Dα(lnαγ1)+𝒢D+α𝒢˙(𝝅b),\displaystyle\geq\mathscr{G}_{\text{D}}\alpha(\ln\alpha-\gamma-1)+\mathscr{G}_{\text{D}}+\alpha\dot{\mathcal{G}}({\bm{\pi}^{b}})\,,
𝒢Dα(lnαγ1)+𝒢D,\displaystyle\geq\mathscr{G}_{\text{D}}\alpha(\ln\alpha-\gamma-1)+\mathscr{G}_{\text{D}}\,, (S3.90)

where in the second line we used the assumption 𝒢˙(𝝅b)0\dot{\mathcal{G}}({\bm{\pi}^{b}})\geq 0. Combining and using that α<1\alpha<1 yields

𝒢Dα(lnαγ1)𝒢𝒢D𝒢Dαγ𝒢Dα(lnα+γ+1),\displaystyle\mathscr{G}_{\text{D}}\alpha\left(\ln\alpha-\gamma-1\right)\leq\mathscr{G}-\mathscr{G}_{\text{D}}\leq\mathscr{G}_{\text{D}}\alpha\gamma\leq\mathscr{G}_{\text{D}}\alpha(-\ln\alpha+\gamma+1)\,, (S3.91)

which can be rearranged to give (S3.84).

Interestingly, we can also derive a relative perturbation bound on the maximum increase of the harvesting rate, above the baseline harvesting rate. Specifically, using a similar derivation as above, we can show that

|𝒢𝒢˙(𝝅b)𝒢D𝒢˙(𝝅b)1|α(lnα+γ+1),\displaystyle\left|\frac{\mathscr{G}-\dot{\mathcal{G}}({\bm{\pi}^{b}})}{\mathscr{G}_{\text{D}}-\dot{\mathcal{G}}({\bm{\pi}^{b}})}-1\right|\leq\alpha\left(-\ln\alpha+\gamma+1\right)\,, (S3.92)

where α=K/β(𝒢D𝒢˙(𝝅b))0\alpha=K/\beta(\mathscr{G}_{\text{D}}-\dot{\mathcal{G}}({\bm{\pi}^{b}}))\geq 0 and γ\gamma is defined as before. In fact, this perturbation bound on the increase holds without any additional assumptions, i.e., regardless of the sign of 𝒢˙(𝝅b)\dot{\mathcal{G}}({\bm{\pi}^{b}}).

III.3 Near-Deterministic (ND) regime, Eq. (17)

In this regime, we consider a perturbation of the Deterministic regime: we assume that the optimal 𝒑\bm{p}^{*} is close to a delta-function distribution centered at i=argmaxiϕii^{*}={\text{argmax}}_{i}\phi_{i}. If piδiip_{i}\approx\delta_{ii^{*}},

p˙i=jRijbpjRiib.\displaystyle\dot{p}_{i}=\sum_{j}R^{b}_{ij}p_{j}\approx R^{b}_{ii^{*}}\,. (S3.93)

This allows us to approximate the entropic term in (S3.79) as

(Rb𝒑)Tln𝒑\displaystyle(R^{b}\bm{p})^{T}\ln\bm{p} =i,jRijbpjlnpiiRiiblnpi\displaystyle=\sum_{i,j}R^{b}_{ij}p_{j}\ln p_{i}\approx\sum_{i}R^{b}_{ii^{*}}\ln p_{i}
=Riibln(1i:iipi)+i:iiRiiblnpi\displaystyle=R^{b}_{i^{*}i^{*}}\ln\Big{(}1-\sum_{i:i\neq i^{*}}p_{i}\Big{)}+\sum_{i:i\neq i^{*}}R^{b}_{ii^{*}}\ln p_{i}
i:ii(Riibpi+Riiblnpi)\displaystyle\approx\sum_{i:i\neq i^{*}}\left(-R^{b}_{i^{*}i^{*}}p_{i}+R^{b}_{ii^{*}}\ln p_{i}\right) (S3.94)

where we used ln(1x)x\ln(1-x)\approx-x when x0x\approx 0. Plugging into (S3.79) gives

𝒢\displaystyle\mathscr{G} max𝒑β1i:ii(Riibpi+Riiblnpi)+(1i:iipi)ϕi+i:iipiϕi\displaystyle\simeq\max_{\bm{p}}{\beta}^{-1}\sum_{i:i\neq i^{*}}\left(-R^{b}_{i^{*}i^{*}}p_{i}+R^{b}_{ii^{*}}\ln p_{i}\right)+\Big{(}1-\sum_{i:i\neq i^{*}}p_{i}\Big{)}\phi_{i^{*}}+\sum_{i:i\neq i^{*}}p_{i}\phi_{i}
=𝒢D+max𝒑i:ii[pi(ϕiϕi+β1Riib)+β1Riiblnpi]=:𝒢ND,\displaystyle=\mathscr{G}_{\text{D}}+\max_{\bm{p}}\sum_{i:i\neq i^{*}}\left[-p_{i}\left(\phi_{i^{*}}-\phi_{i}+\beta^{-1}R^{b}_{i^{*}i^{*}}\right)+\beta^{-1}R^{b}_{ii^{*}}\ln p_{i}\right]=:\mathscr{G}_{\text{ND}}\,, (S3.95)

where we recall that 𝒢D=ϕi\mathscr{G}_{\text{D}}=\phi_{i^{*}}, which does not depend on 𝒑\bm{p}. After maximizing with respect to {pi}i:ii\{p_{i}\}_{i:i\neq i^{*}} by taking derivatives and setting them to zero, one obtains

(𝒑ND)i=Riibβ(ϕiϕi)+Riib(forii)and(𝒑ND)i=1i:ii(𝒑ND)i.\displaystyle(\bm{p}^{*}_{\mathrm{ND}})_{i}=\frac{R^{b}_{ii^{*}}}{\beta(\phi_{i^{*}}-\phi_{i})+R^{b}_{i^{*}i^{*}}}\qquad(\text{for}\,i\neq i^{*})\qquad{\text{and}}\qquad\,(\bm{p}^{*}_{\mathrm{ND}})_{i^{*}}=1-\sum_{i:i\neq i^{*}}(\bm{p}^{*}_{\mathrm{ND}})_{i}\,. (S3.96)

Plugging (S3.96) into (III.3) gives

𝒢ND=𝒢D+β1i:iiRiib[ln(𝒑ND)i1].\displaystyle\mathscr{G}_{\text{ND}}=\mathscr{G}_{\text{D}}+{\beta}^{-1}\sum_{i:i\neq i^{*}}R^{b}_{ii^{*}}\left[\ln(\bm{p}^{*}_{\mathrm{ND}})_{i}-1\right]\,. (S3.97)

Region of validity of the ND regime

The ND approximation applies when (𝒑ND)i1(\bm{p}^{*}_{\mathrm{ND}})_{i^{*}}\approx 1, or equivalently when

1i:ii(𝒑ND)i=i:iiRiibβ(ϕiϕi)+Riib.1\gg\sum_{i:i\neq i^{*}}(\bm{p}^{*}_{\mathrm{ND}})_{i}=\sum_{i:i\neq i^{*}}\frac{R^{b}_{ii^{*}}}{\beta(\phi_{i^{*}}-\phi_{i})+R^{b}_{i^{*}i^{*}}}\,.

For convenience, we define the harvesting gap as the difference in the pay-off between the optimal state ii^{*} and the second optimal state,

Δϕmin:=ϕimaxi:iiϕi.\Delta{\phi_{\min}}:=\phi_{i^{*}}-\max_{i:i\neq i^{*}}\phi_{i}\,.

By combining and rearranging the above, we can simplify the validity condition for the ND approximation as

βΔϕmin2Riib,\displaystyle\beta\Delta{\phi_{\min}}\gg-2R^{b}_{i^{*}i^{*}}, (S3.98)

Note that this condition also guarantees that (𝒑ND)i0(\bm{p}^{*}_{\mathrm{ND}})_{i}\geq 0 for iii\neq i^{*}. Expression (S3.98) implies that ND approximation is valid when the harvesting gap is much larger than the rates of escape from the optimal state ii^{*}.

IV Unicyclic model

Here we analyze the unicyclic model considered in the main text. Before proceeding, we briefly introduce some useful facts about the eigendecomposition of a unicylic rate matrix.

IV.1 An Algebraic Aperitif: eigendecomposition of unicyclic rate matrix

Consider a unicylic rate matrix RbR^{b},

Rb=(2kk0kk2kk00k2k0k002k),\displaystyle R^{b}=\left(\begin{matrix}-2k&k&0&\cdots&k\\ k&-2k&k&\cdots&0\\ 0&k&-2k&\cdots&0\\ \vdots&\vdots&\vdots&\ddots&\vdots\\ k&0&0&\cdots&-2k\end{matrix}\right)\,, (S4.99)

which is a symmetric circulant matrix. Due to symmetry, its steady state is uniform: πib=1/n\pi^{b}_{i}=1/n for all ii. The eigensystem for RR is obtained from the theory of circulant matrices [45], which, for odd nn, yields:

λa\displaystyle\lambda_{a} =2k[1(ωa)]=2k[1cos(2π(a1)n)],\displaystyle=-2k\left[1-\Re\left(\omega_{a}\right)\right]=-2k\left[1-\cos\left(\frac{2\pi(a-1)}{n}\right)\right]\,, (S4.100)

where ωa:=ei2π(a1)/n\omega_{a}:=e^{\mathrm{i}2\pi(a-1)/n}. These eigenvalues are all degenerate twice (except λ(1)=0\lambda_{(1)}=0, whose eigenvector is simply (1,1,,1)(1,1,\ldots,1)). An orthonormal choice of eigenbasis is given by the set

{m(a)}={1n(1,ωa,ωa2,,ωan1)}.\displaystyle\{m_{(a)}\}=\left\{\frac{1}{\sqrt{n}}\left(1,\omega_{a},\omega_{a}^{2},\ldots,\omega_{a}^{n-1}\right)\right\}. (S4.101)

IV.2 Transition harvesting cycle

We now analyze the unicyclic model using techniques developed in Sec. III. We consider a systems with nn state arranged in a ring, where baseline transitions are symmetric with uniform kinetics: i𝑘𝑘i+1modni\overset{k}{\underset{k}{\rightleftharpoons}}i+1\mod n, i=1,,n\forall i=1,\ldots,n (see Fig. S13 here and Fig. 4 (a) in the main text). The baseline dynamics are equivalent to a random walk on a one-dimensional ring, and the baseline rate matrix is given by (S4.99).

We assume that a single transition, taken to be 121\to 2 without loss of generality, harvests g21b=g12b=Θ{g}^{b}_{21}=-{g}^{b}_{12}=\Theta joules of free energy. In addition, note that in the main text we chose to set the units of k=1k=1 without loss of generality. However, we have kept kk explicit in the rest of the SM. For this system, the baseline steady state is uniform, πib=1/n\pi^{b}_{i}=1/n, and the harvesting vector is

ϕLR=ϕ=(kΘ,kΘ,0,,0).\displaystyle\boldsymbol{\phi}^{\text{LR}}=\boldsymbol{\phi}=(k\Theta,-k\Theta,0,\ldots,0)\,. (S4.102)
Refer to caption
Figure S13: A unicyclic system across the nn states with symmetric back-and-forth rates kk, as discussed in the main text. An amount of Θ\Theta joules is harvested as free energy into the internal reservoir for each transition 121\to 2 (and vice versa for 212\to 1).

In the rest of this section, we analyze this model in the LR, D and ND regimes. We will also discuss the validity of these approximation in terms of the model parameters.

IV.2.1 LR regime

We first consider the LR solution (S3.76). Combining (S4.102) with (S3.71), πib=1/n\pi^{b}_{i}=1/n, and D=1nInD=\frac{1}{\sqrt{n}}I_{n} gives 𝒗=kβΘ2n(1,1,0,,0)\boldsymbol{v}=\frac{k\beta\Theta}{2\sqrt{n}}(1,-1,0,\ldots,0). In addition, since in this particular example RR obeys detailed balance, Rb=AR^{b}=A. Finally, it is easy to verify that M=D1AD=A=RbM=D^{-1}AD=A=R^{b}. Thus, MM has the same eigendecomposition as RR. The normalized eigenvectors of MM are given by (S4.101).

We can compute Ωa=β1𝒗T𝒎α\Omega_{a}={\beta}^{-1}\boldsymbol{v}^{T}\boldsymbol{m}^{\alpha} for a=2,,na=2,\ldots,n, using the eigenvector set given in (S4.101),which simply yields

Ωa=kΘ2n(1ωa).\displaystyle\Omega_{a}=\frac{k\Theta}{2n}\left(1-\omega_{a}\right)\,. (S4.103)

Next, we compute the maximum harvesting rate attainable using (S3.76),

𝒢LR=a=2nβ|Ωa|2λa\displaystyle\mathscr{G}_{\text{LR}}=\sum_{a=2}^{n}\frac{\beta\left|\Omega_{a}\right|^{2}}{-\lambda_{a}} =βk(Θ2n)2a=2n|1ωa|22[1cos(2π(a1)n)]\displaystyle=\beta k\left(\frac{\Theta}{2n}\right)^{2}\sum_{a=2}^{n}\frac{\left|1-\omega_{a}\right|^{2}}{2\left[1-\cos\left(\frac{2\pi(a-1)}{n}\right)\right]}
=βk(Θ2n)2a=2n1\displaystyle=\beta k\left(\frac{\Theta}{2n}\right)^{2}\sum_{a=2}^{n}1
=βk(Θ2n)2(n1).\displaystyle=\beta k\left(\frac{\Theta}{2n}\right)^{2}(n-1)\,. (S4.104)

On the other hand, continuing with odd nn, it is also possible to compute the deviation of the optimal distribution from stationary distribution using (S3.76),

Δ𝒑LR:=𝒑LR𝝅b=βa=2nΩaD𝒎αλa,\displaystyle\Delta\bm{p}^{*}_{\mathrm{LR}}:=\bm{p}^{*}_{\mathrm{LR}}-{\bm{\pi}^{b}}=\beta\sum_{a=2}^{n}\frac{\Omega_{a}^{\dagger}D\boldsymbol{m}^{\alpha}}{-\lambda_{a}}\,, (S4.105)

which is depicted in Fig. S14(a). (Note that, due to our choice of a complex-valued eigenbasis, we must be careful in adding the complex conjugate on the Ωa\Omega_{a}, following the ordering of the operator M+M^{+} in (S3.76).) Now, component by component, we can rewrite (S4.105) as

(Δ𝒑LR)i\displaystyle\left(\Delta\bm{p}^{*}_{\mathrm{LR}}\right)_{i} =βΘ4n2a=2n(1ωa)ωai11cos(2π(a1)n)\displaystyle=\frac{\beta\Theta}{4n^{2}}\sum_{a=2}^{n}\frac{(1-\omega_{a})^{\dagger}\omega_{a}^{i-1}}{1-\cos\left(\frac{2\pi(a-1)}{n}\right)}
=βΘ4n2a=2ncos(2π(a1)(i1)n)cos(2π(a1)(i2)n)1cos(2π(a1)n)\displaystyle=\frac{\beta\Theta}{4n^{2}}\sum_{a=2}^{n}\frac{\cos\left(\frac{2\pi(a-1)(i-1)}{n}\right)-\cos\left(\frac{2\pi(a-1)(i-2)}{n}\right)}{1-\cos\left(\frac{2\pi(a-1)}{n}\right)}
=βΘ4n2[a=2ncos(2π(a1)(i2)n)a=2nsin(2π(a1)(i2)n)sin(2π(a1)n)1cos(2π(a1)n)].\displaystyle=\frac{\beta\Theta}{4n^{2}}\left[-\sum_{a=2}^{n}\cos\left(\frac{2\pi(a-1)(i-2)}{n}\right)-\sum_{a=2}^{n}\frac{\sin\left(\frac{2\pi(a-1)(i-2)}{n}\right)\sin\left(\frac{2\pi(a-1)}{n}\right)}{1-\cos\left(\frac{2\pi(a-1)}{n}\right)}\right]. (S4.106)

In the first line, we used the expressions for Ωa\Omega_{a} from (S4.103), λa\lambda_{a} from (S4.100), 𝒎α\boldsymbol{m}^{\alpha} from (S4.101), and D=I/nD=I/\sqrt{n} and then simplified. In the second line, we expanded (1ωa)ωai1=ωai1ωai2(1-\omega_{a})^{\dagger}\omega_{a}^{i-1}=\omega_{a}^{i-1}-\omega_{a}^{i-2} into real and imaginary components and then used that the imaginary components cancel over the sum. In the last line, we used the identity cos(x+y)=cos(x)cos(y)sin(x)sin(y)\cos(x+y)=\cos(x)\cos(y)-\sin(x)\sin(y) for x=2π(a1)(i2)nx=\frac{2\pi(a-1)(i-2)}{n} and y=2π(a1)ny=\frac{2\pi(a-1)}{n} and then simplified. It can be verified that the first series in (S4.106) sums to 1-1. The second series is trickier but can be simplified using trigonometric methods as shown in Ref. [79]. Plugging in that solution and simplifying gives the very simple expression:

(Δ𝒑LR)i=βΘ4n2{2(i1)(n+1) for i=2,,n(n1) for i=1.\displaystyle\left(\Delta\bm{p}^{*}_{\mathrm{LR}}\right)_{i}=\frac{\beta\Theta}{4n^{2}}\begin{cases}2(i-1)-(n+1)&{\text{\qquad for }}i=2,\ldots,n\\ (n-1)&{\text{\qquad for }}i=1\end{cases}\,. (S4.107)

Thus, in the LR regime, the optimal distribution builds up in equal increments starting at i=2i=2 until the optimal state i=1i=1, after which it falls off a cliff. This is shown in Fig. S14.

Refer to caption
Figure S14: The optimal Δ𝒑LR\Delta\bm{p}^{*}_{\mathrm{LR}} (deviation of optimal distribution from the uniform baseline steady state) given (S4.107). The optimal distribution exhibits a clockwise cyclic current given by (S4.108), The magnitude of Δ𝒑LR\Delta\bm{p}^{*}_{\mathrm{LR}} decays as n1\sim n^{-1}, so larger rings will result in optimal distributions that are closer to the baseline steady state.

We can also compute the probability current across the transition 121\to 2, which we leave as an exercise for the reader:

k[(𝒑LR)1(𝒑LR)2]=kβΘ2n(n1n).\displaystyle k{\left[(\bm{p}^{*}_{\mathrm{LR}})_{1}-(\bm{p}^{*}_{\mathrm{LR}})_{2}\right]}=\frac{k\beta\Theta}{2n}\left(\frac{n-1}{n}\right)\,. (S4.108)

The expressions (S3.77) and (S3.78) suggest for which values of Θ\Theta the optimal solution will be in the LR regime. In this case, however, it is possible to work out the norm 𝒑LR𝝅b\|\bm{p}^{*}_{\mathrm{LR}}-{\bm{\pi}^{b}}\| exactly:

Δ𝒑LR=βΘ4n2[x=2n(2xn3)2+(n1)2]1/2=βΘ43n21n3\|\Delta\bm{p}^{*}_{\mathrm{LR}}\|=\frac{\beta\Theta}{4n^{2}}\left[\sum_{x=2}^{n}\left(2x-n-3\right)^{2}+(n-1)^{2}\right]^{1/2}=\frac{\beta\Theta}{4\sqrt{3}}\sqrt{\frac{n^{2}-1}{n^{3}}}

which implies that the LR regime is valid when

𝒑LR𝝅b1Θ43βn3n21\displaystyle\|\bm{p}^{*}_{\mathrm{LR}}-{\bm{\pi}^{b}}\|\ll 1\Leftrightarrow\Theta\ll\frac{4\sqrt{3}}{\beta}\sqrt{\frac{n^{3}}{n^{2}-1}} (S4.109)

In this case, the larger the number of states nn in the ring, the wider the interval of Θ\Theta values that will make the optimal solution fall into the LR regime. For large nn, we obtain

Θ43βn.\Theta\ll\frac{4\sqrt{3}}{\beta}\sqrt{n}.

IV.2.2 D and ND Regimes

Note that the optimal mesostate is i=1i^{*}=1. Hence 𝒢D=ϕ1=kΘ\mathscr{G}_{\text{D}}=\phi_{1}=k\Theta and (𝒑D)i=δi1(\bm{p}^{*}_{\mathrm{D}})_{i}=\delta_{i1}. Next, consider the ND expressions (S3.96) with the payoff vector (S4.102). Denote Δϕ2=ϕiϕ2=2kΘ\Delta\phi_{2}=\phi_{i^{*}}-\phi_{2}=2k\Theta and Δϕn=ϕiϕn=kΘ\Delta\phi_{n}=\phi_{i^{*}}-\phi_{n}=k\Theta; and observe that R21b=Rn1b=kR^{b}_{21}=R^{b}_{n1}=k, R11b=2kR^{b}_{11}=-2k and Rj1b=0R^{b}_{j1}=0 for all j2,nj\neq 2,n. Hence, for iii\neq i^{*}, we obtain

(𝒑ND)i=Ri1bβΔϕi+R11b={12(βΘ1)i=21βΘ2i=n0otherwise(𝒑ND)i=1=1i:ii(𝒑ND)i=13βΘ42(βΘ1)(βΘ2).\displaystyle\begin{aligned} (\bm{p}^{*}_{\mathrm{ND}})_{i}&=\frac{R^{b}_{i1}}{\beta\Delta\phi_{i}+R^{b}_{11}}=\begin{cases}\frac{1}{2(\beta\Theta-1)}&\qquad i=2\\ \frac{1}{\beta\Theta-2}&\qquad i=n\\ 0&\qquad{\text{otherwise}}\end{cases}\\ \left(\bm{p}^{*}_{\mathrm{ND}}\right)_{i^{*}=1}&=1-\sum_{i:i\neq i^{*}}(\bm{p}^{*}_{\mathrm{ND}})_{i}=1-\frac{3\beta\Theta-4}{2(\beta\Theta-1)(\beta\Theta-2)}\,.\end{aligned} (S4.110)

The harvesting rate attainable in this regime can be calculated using Eq. (S3.97),

𝒢ND\displaystyle\mathscr{G}_{\text{ND}} =ϕ1+β1i:iiRi1b[ln(𝒑ND)i1]\displaystyle=\phi_{1}+{\beta^{-1}}\sum_{i:i\neq i^{*}}R^{b}_{i1}\left[\ln\left(\bm{p}^{*}_{\mathrm{ND}}\right)_{i}-1\right] (S4.111)
=k(Θ2/β)kβ1ln[2(βΘ1)(βΘ2)].\displaystyle=k(\Theta-2/\beta)-k\beta^{-1}\ln\left[2(\beta\Theta-1)(\beta\Theta-2)\right]. (S4.112)

IV.3 State harvesting cycle

In this section, we consider the second unicyclic model, see Fig. S15 and Fig. 4 (b) in the main text. In this model, the baseline dynamics correspond once again to the unicycle with symmetric rates, as in Sec. IV.2. However, harvesting is not coupled to transitions, but rather to internal fluxes within a single mesostate. Without loss of generality, we choose this mesostate to be i=1i=1. In other words, we let g˙1b=θ>0\dot{g}^{b}_{1}=\theta>0 and g˙1<inb=0\dot{g}^{b}_{1<i\leq n}=0. There is now no preferential direction for the probability current. The free energy harvested per unit time when the system is in mesostate 11 is given by the parameter θ\theta, which carries the same units as 𝒢\mathscr{G}. Just as before, the baseline steady state is uniform, i.e. πib=1/n\pi^{b}_{i}=1/n. Hence, the harvesting vector is

ϕLR=ϕ=(θ,0,,0).\displaystyle\boldsymbol{\phi}^{\text{LR}}=\boldsymbol{\phi}=(\theta,0,\ldots,0)\,. (S4.113)

Below we consider the LR, D and ND regimes.

Refer to caption
Figure S15: We consider a unicyclic system with transitions across the nn states with symmetric back-and-forth rates kk. A single state (labelled i=1i=1) leads to a flux of free energy at rate θ\theta.

IV.3.1 LR Regime

In analogy with the previous example, the expression for the upper bound in free energy harvesting rate under the LR regime here is obtained as

𝒢LR=βa=2nΩa2λa,\displaystyle\mathscr{G}_{\text{LR}}=\beta\sum_{a=2}^{n}\frac{\Omega_{a}^{2}}{-\lambda_{a}}\,, (S4.114)

with Ωa=β1𝒗T𝒎α\Omega_{a}={\beta}^{-1}\boldsymbol{v}^{T}\boldsymbol{m}^{\alpha}, where the vectors 𝒎α\boldsymbol{m}^{\alpha} correspond to the eigensystem discussed in discussed in Sec. IV.1, i.e. in (S4.101). On the other hand, here 𝒗=βθ2n(1,0,,0)\boldsymbol{v}=\frac{\beta\theta}{2\sqrt{n}}(1,0,\ldots,0). The latter is obtained by construction using (S3.65), (S3.70) and (S3.71). Thus,

Ωa=θ2n.\Omega_{a}=\frac{\theta}{2n}\,.

For large even nn, it is possible to show that [80]

a=2n/211cos(2π(a1)n)n212+𝒪(1),\displaystyle\sum_{a=2}^{n/2}\frac{1}{1-\cos\left(\frac{2\pi(a-1)}{n}\right)}\simeq\frac{n^{2}}{12}+\mathcal{O}(1)\,, (S4.115)

which allows us to write

𝒢LRβ12k(θ2)2.\displaystyle\mathscr{G}_{\text{LR}}\simeq\frac{\beta}{12k}\left(\frac{\theta}{2}\right)^{2}\,. (S4.116)

If we wanted to approximate the result above for a large odd nn, we would need to add a term proportional to 1/4n21/4n^{2}, which is of second order, hence the leading order is still captured by expression (S4.116).

We can also study the optimal distribution using (S3.76),

Δ𝒑LR=𝒑LR𝝅b=βa=2nΩaD𝒎αλa=βθ4kn2a=2n𝒖α1cos(2π(a1)n)\displaystyle\Delta\bm{p}^{*}_{\mathrm{LR}}=\bm{p}^{*}_{\mathrm{LR}}-{\bm{\pi}^{b}}={\beta}\sum_{a=2}^{n}\frac{\Omega_{a}^{\dagger}D\boldsymbol{m}^{\alpha}}{-\lambda_{a}}=\frac{\beta\theta}{4kn^{2}}\sum_{a=2}^{n}\frac{\boldsymbol{u}^{\alpha}}{1-\cos\left(\frac{2\pi(a-1)}{n}\right)} (S4.117)

with u(a):=(1,wa,wa2,,wan1)u_{(a)}:=\left(1,w_{a},w_{a}^{2},\ldots,w_{a}^{n-1}\right). This is computed numerically and its behavior is shown in Fig. S14(b). We leave this computation as an exercise to the reader.

Refer to caption
Figure S16: The optimal Δ𝒑LR\Delta\bm{p}^{*}_{\mathrm{LR}} (the deviation of the optimal distribution from the uniform baseline steady state) from (S4.117). Note that there optimal distribution does not lead to a cyclic current.

Expression (S3.78) suggests sufficiency conditions for when the optimum will be in the LR regime. Note that, in this case, maxa>1Ωa=θ/2n\max_{a>1}\Omega_{a}={\theta}/{2n} while

mina>1λa=4kmina>1sin2(π(a1)n)=4ksin2(πn).\min_{a>1}-\lambda_{a}=4k\min_{a>1}\sin^{2}\left(\frac{\pi(a-1)}{n}\right)=4k\sin^{2}\left(\frac{\pi}{n}\right)\,.

Then, using D=1/n\|D\|=1/\sqrt{n}, condition (S3.78) will read as:

θ2n4knβ(n1)sin2(πn)θ8nkβ(nn1)sin2(πn).\displaystyle\frac{\theta}{2n}\ll\frac{4k\sqrt{n}}{\beta(n-1)}\sin^{2}\left(\frac{\pi}{n}\right)\Leftrightarrow\theta\ll\frac{8\sqrt{n}k}{\beta}\left(\frac{n}{n-1}\right)\sin^{2}\left(\frac{\pi}{n}\right)\,. (S4.118)

For sufficiently large nn, we can approximate sin2(πn)(πn)2\sin^{2}(\frac{\pi}{n})\approx(\frac{\pi}{n})^{2}, so the LR regime is valid when

θ8π2n3/2kβ.\displaystyle\theta\ll\frac{8\pi^{2}}{n^{3/2}}\frac{k}{\beta}\,. (S4.119)

Expression (S3.78), however, is not necessarily a tight bound. That is, the result hereby obtained is a sufficient condition for the LR regime to be valid, but not a necessary one.

IV.3.2 D and ND Regimes

The optimal mesostate is i=1i^{*}=1. Hence

𝒢D=ϕ1=θ(𝒑D)i=δi1.\mathscr{G}_{\text{D}}=\phi_{1}=\theta\qquad\qquad(\bm{p}^{*}_{\mathrm{D}})_{i}=\delta_{i1}\,.

Next, by following a similar procedure as above, we denote Δϕ2=Δϕn=θ\Delta\phi_{2}=\Delta\phi_{n}=\theta, R21b=Rn1b=kR^{b}_{21}=R^{b}_{n1}=k, R11b=2kR^{b}_{11}=-2k and Rj1b=0R^{b}_{j1}=0 for all j2,nj\neq 2,n. Thus, for iii\neq i^{*}, we obtain:

(𝒑ND)i=Ri1bΔϕi+R11b=1βθ/k2(δ2i+δni)(𝒑ND)i=1=1i:ii(𝒑ND)i=βθ/2k2βθ/2k1.\displaystyle\begin{aligned} \left(\bm{p}^{*}_{\mathrm{ND}}\right)_{i}&=\frac{R^{b}_{i1}}{\Delta\phi_{i}+R^{b}_{11}}=\frac{1}{\beta\theta/k-2}\left(\delta_{2i}+\delta_{ni}\right)\\ (\bm{p}^{*}_{\mathrm{ND}})_{i^{*}=1}&=1-\sum_{i:i\neq i^{*}}(\bm{p}^{*}_{\mathrm{ND}})_{i}=\frac{\beta\theta/2k-2}{\beta\theta/2k-1}\,.\end{aligned} (S4.120)

The harvesting rate attainable in this regime can be calculated using Eq. (S3.97),

𝒢ND\displaystyle\mathscr{G}_{\text{ND}} =ϕ1+β1i:iiRi1b[ln(𝒑ND)i1]\displaystyle=\phi_{1}+\beta^{-1}\sum_{i:i\neq i^{*}}R^{b}_{i1}\left[\ln(\bm{p}^{*}_{\mathrm{ND}})_{i}-1\right]
=θ2kβ1[1+ln(βθk12)].\displaystyle=\theta-2k{\beta}^{-1}\left[1+\ln\left(\beta\theta k^{-1}-2\right)\right]\,. (S4.121)

V Quadratic optimization lemma

In this section, we provide a useful theorem for solving the quadratic optimization problems that occurs in our analysis of the linear response (LR) regime. It uses standard techniques from linear algebra.

Lemma 1.

Consider the maximization problem

=max𝒛n:𝒖T𝒛=0𝒛TM𝒛+2𝒛T𝒗,\mathscr{L}^{*}=\max_{\boldsymbol{z}\in\mathbb{R}^{n}:\boldsymbol{u}^{T}\boldsymbol{z}=0}\boldsymbol{z}^{T}M\boldsymbol{z}+2\boldsymbol{z}^{T}\boldsymbol{v}, (S5.122)

where Mn×nM\in\mathbb{R}^{n\times n} is a negative semidefinite symmetric matrix with a single null eigenvector 𝐮n\boldsymbol{u}\in\mathbb{R}^{n} and 𝐯n\boldsymbol{v}\in\mathbb{R}^{n} is any vector. The solution is given by:

=𝒗TM+𝒗and𝒛=M+𝒗,\displaystyle\mathscr{L}^{*}=-\boldsymbol{v}^{T}M^{+}\boldsymbol{v}\ \quad\text{and}\quad\boldsymbol{z}^{*}=-M^{+}\boldsymbol{v}, (S5.123)

where M+M^{+} is the Moore-Penrose pseudo-inverse of MM.

Proof.

Since MM is symmetric, its real valued eigendecomposition can be written as:

M=α=1nλα𝒖α𝒖αTM=\sum_{\alpha=1}^{n}\lambda_{\alpha}\boldsymbol{u}^{\alpha}{\boldsymbol{u}^{\alpha}}^{T}

where we will choose 𝒖=𝒖1\boldsymbol{u}=\boldsymbol{u}^{1}, without loss of generality, i.e. λ1=0\lambda_{1}=0 is the only zero eigenvalue. The Moore-Penrose pseudo-inverse of MM can be written as:

M+=α=2n1λα𝒖α𝒖αT.M^{+}=\sum_{\alpha=2}^{n}\frac{1}{\lambda_{\alpha}}\boldsymbol{u}^{\alpha}{\boldsymbol{u}^{\alpha}}^{T}.

The constraint on the optimization problem reads as 𝒛T𝒖=0\boldsymbol{z}^{T}\boldsymbol{u}=0, which implies that

M+M𝒛=(I𝒖𝒖T)𝒛=𝒛.M^{+}M\boldsymbol{z}=(I-\boldsymbol{u}\boldsymbol{u}^{T})\boldsymbol{z}=\boldsymbol{z}.

Now, consider the objective function,

𝒛TM𝒛+2𝒛T𝒗=𝒛TM𝒛+𝒗TM+M𝒛+𝒛TMM+𝒗=(𝒛+M+𝒗)TM(𝒛+M+𝒗)𝒗TM+𝒗\displaystyle\boldsymbol{z}^{T}M\boldsymbol{z}+2\boldsymbol{z}^{T}\boldsymbol{v}=\boldsymbol{z}^{T}M\boldsymbol{z}+\boldsymbol{v}^{T}M^{+}M\boldsymbol{z}+\boldsymbol{z}^{T}MM^{+}\boldsymbol{v}=\left(\boldsymbol{z}+M^{+}\boldsymbol{v}\right)^{T}M\left(\boldsymbol{z}+M^{+}\boldsymbol{v}\right)-\boldsymbol{v}^{T}M^{+}\boldsymbol{v} (S5.124)

where we used properties of the transverse operation, symmetry of MM and M+M^{+}, that M+M𝒛=𝒛𝒛T=𝒛TMM+M^{+}M\boldsymbol{z}=\boldsymbol{z}\Leftrightarrow\boldsymbol{z}^{T}=\boldsymbol{z}^{T}MM^{+}, 𝒗T𝒛=𝒛T𝒗\boldsymbol{v}^{T}\boldsymbol{z}=\boldsymbol{z}^{T}\boldsymbol{v} and that M+MM+=M+M^{+}MM^{+}=M^{+}. Note that the last term on the right hand side of the previous expression does not depend on 𝒛\boldsymbol{z}, and that the first term is always non-positive because MM is negative semidefinite. Thus, the objective is maximized only by setting the first term to zero, which is achieved by 𝒛=M+𝒗\boldsymbol{z}=-M^{+}\boldsymbol{v}. Upon substitution, this yields =𝒗M+𝒗\mathscr{L}^{*}=-\boldsymbol{v}M^{+}\boldsymbol{v} .