Exponential convergence to equilibrium for a two-speed model with variant drift fields via the resolvent estimate
Abstract
We study a two-speed model with variant drift fields, which generalizes the Goldstein-Taylor model but the two advection fields are not necessarily in proportion. Due to the lack of a local equilibrium structure of the steady state, prevailing hypocoercivity methods could not be directly applied to such a system. To prove the exponential convergence to equilibrium for this model, we use a recent generalization of the Gearhart-Prüss theorem [Sci. China Math. 64 (2021), no. 3, 507–518] to gain semigroup bounds, where the relative entropy estimate plays a central role in gaining desired resolvent estimates via a compactness argument.
Key words: two-speed model, resolvent estimate, exponential convergence, non-separable velocity field
AMS subject classifications : 35B40, 35Q92, 47A10, 82C05.
1 Introduction
We consider the following two-speed model with variant advection fields
(1.1) |
Here is regarded as the population density of particles of type at position and time , in a kinetic description. Each particle is characterized by its spatial coordinate , and its type . And particles of different types are driven by different velocity fields, namely and . The reaction terms on the right hand side indicate that two types of particles can change to each other with a constant transition rate. The boundary condition means that the mass fluxes going out of the domain from one side will go back to the domain from the other side, therefore the total mass
is conserved in time.
We assume that two velocity fields and are both non-degenerate and different from each other in the following sense
Assumption 1.
We assume satisfying
(1.2) |
and there exists some such that
(1.3) |
We aim to characterize the long time behavior of system (1.1), which is the exponential convergence to the unique steady state. The case , widely referred to as the Goldstein-Taylor model, has been studied from various aspects (e.g. [4, 16, 3] and references therein). In this work, however, we are interested in the general scenario including (but not limited to) the cases that is not a constant.
Motivating Examples
Our primary motivation is a Fokker-Planck equation from neuroscience proposed in [12, 11], which describes the collective behavior of a large number of biological neurons. Each neuron within the target system is characterized by two variables: its voltage (membrane potential) and conductance . In the simplest form, the Fokker-Planck equation is linear, and is given by
(1.4) |
where is a constant and the velocity field is given by
(1.5) |
which is a combination of two effects. The first term , with a constant , drives the voltage to its resting potential (which is set to be zero here). Physically it models the leaky effect. The second term , with , stimulates the voltage of a neuron to increase, so that it may reach the firing threshold . If the voltage of a neuron arrives at the firing threshold , its value is reset to immediately, which accounts for the firing-resetting mechanism of neurons.
Notice that, a firing event could happen only when , which means the conductance has to exceed a certain value, i.e., . In this case we assign the following boundary condition
(1.6) |
On the other hand, if , then , no neurons at this conductance can reach the firing threshold , and therefore no neurons would be reset at . In this case the following boundary condition is imposed instead
(1.7) |
Note that in both cases the fluxes at and are equal, i.e.,
Finally, the problem (1.4) is completed by a simple no-flux boundary condition at
System (1.4) is a kinetic description of neuron activities and the diffusion is only in the direction, which is analogous to the velocity variable in typical kinetic equations, and the voltage variable is reminiscent of the position variable.
First rigorous analysis on this kinetic neuron system (1.4) is given in [23], simplified models are subsequently analyzed in [24, 19], and extensive numerical exploration has been carried out in [10]. However, due to the difficulties which we shall discuss later in this introduction, no result on the exponential convergence to the steady state has been obtained yet. Clearly, the two-speed model (1.1) can be viewed as a simplification of the Fokker-Planck equation (1.4) where the diffusion in the conductance variable is replaced by a reversible process between two types particles convecting in different velocity fields.
A second motivation is one preliminary attempt to extend the hypoelliptic advection-diffusion equation in [7] from shear flows to more general flows. We consider the following advection-diffusion equation for with ,
(1.8) |
In the shear flow case , such a system is proposed and analyzed in [7] to study the enhanced dissipation effects. This shear flow case has also been analyzed in [2] recently using Hörmander’s hypoellipicity theory, whose method applies to more general diffusion term like or instead of . For an incompressible flow, such as the shear flow, in recent years much understanding has been obtained on its role in such models. But on compressible flows, the related literature seems to be relatively sparse [14].
Difficulties
The above three systems (1.1),(1.4) and (1.8) can be written into a general form of kinetic equations
(1.9) |
where denotes a transport operator, denotes a collision operator, and certain proper boundary conditions are prescribed.
In typical kinetic equations such as the Boltzmann equation, the kernel of the collision operator is called local equilibriums, and the global equilibrium is in the intersection of the kernels of and . A common difficulty in analyzing the long time behavior of (1.9) is its hypocoercivity structure [27]. Typically the collision operator only gives dissipation in partial directions (like the direction in (1.1), the direction in (1.4) and the direction in (1.8)), while the transport operator itself does not give dissipation. Therefore the operator is not coercive with respect to the common norm. And the exponential convergence to the global equilibrium can (only) be obtained via exploring the interplay between and . Such convergence can be obtained via compactness arguments in a non-constructive way in some cases (e.g. [26, 8]). In the last two decades, various hypocoervity methods have been developed to obtain a constructive convergence rate, therefore giving a quantitative characterization of the long time behavior of kinetic equations [5, 16, 27, 1, 6].
In contrast to other kinetic models, a prominent feature shared by the systems (1.1), (1.4) or (1.8) is that the velocity field is not in a “separable form”. For example, in (1.4) the velocity field can not be written as , while in most familiar kinetic models one simply has , which is naturally in a separable form. For the two-speed model (1.1), the velocity field is in a separable form only when , which is equivalent to is simply a constant, and such examples include the classical Goldstein-Taylor model when . However, in the paper, we aim to study the general two-speed model, where the two velocity fields only need to satisfy the non-degenerate and distinguishable conditions given in Assumption 1.
The non-separable velocity field brings new difficulties for understanding the long time asymptotic behavior of the two-speed model in various and yet related aspects. First and most directly, one can not represent the solution by the method of separation of variables via the Fourier transform in , or more generally, using the eigenfunctions of the operator .
Secondly, due to the non-separable velocity fields, the steady states of the systems (1.1),(1.4) or (1.8) are much more complicated. In particular, the local equilibrium structure does not exist: the global steady state is no longer in the kernel of both the collision operator and the transport operator . In fact, one can check that in the two-speed model (1.1), the intersection of the kernels of and contains only trivial solutions (zero) unless is a constant. Therefore, the global steady state is essentially determined by the integrated combination of the transport operator , the collision operator and (last but not least) the boundary conditions. Thus, due to the lack of a local equilibrium structure and additional challenges that the boundary conditions bring in, it is not clear yet how the prevailing hypocoercivity methods could be applied to this two-speed model (1.1), although it appears in such a simple form.
It is worth emphasizing that the challenges in analyzing the two-speed model (1.1) already capture the essential difficulties of the kinetic model (1.9) with a non-separable velocity field, although more complicated models may face additional difficulties such as the lack of regularity properties of the steady states. However, the studies on other kinetic models, such as (1.4) and (1.8), are beyond the scope of the current paper. See also the discussions in Section 4.
Main results and methods
In this work, we prove the exponential convergence to the unique global steady state for the two-speed model (1.1). The main results are summarized in the following theorem.
Theorem 1 (Main result).
Remark 1.
For heuristic purposes, we layout the proof strategy as follows. First, with characterization of the steady states , we can carry out the standard relative entropy estimate ([20, 22]), given by
(1.11) |
where . Such relative entropy estimate (1.11) gives crucial information on the dynamics of (1.1). In fact, from (1.1) one can derive convergence to the steady state in a weak sense by adapting a classical compactness argument (see [20], and also Chapter 3.6 of [22]). However, an exponential convergence does not follow from (1.11) due to the lack of coercivity in the dissipation term: it only involves information in the direction but the dissipation in the direction is not involved.
Second, to prove the exponential convergence in spite of the degeneracy in the dissipation, we leverage the generalization of the Gearhart-Prüss theorem [18, 17] recently shown by Wei [28],
Theorem 2 (Wei [28]).
Let be an m-dissipative operator on a Hilbert space , then the corresponding semigroup satisfies the following bound,
(1.12) |
where is defined as
(1.13) |
Theorem 2 gives the semigroup bound (1.12) via the resolvent estimate (1.13), and has been used in the analysis of hydrodynamic stability (e.g. [13]). We defer giving the definition of an m-dissipative operator [21] to Section 2. We shall show that, the relative entropy estimate (1.11) naturally implies a semigroup formulation of (1.1), where the generator is indeed m-dissipative. Hence, proving the exponential convergence boils down to showing that the corresponding , which can be achieved by a compactness argument.
To sum up, we prove the exponential convergence of the two-speed model with variant drift fields (1.1), by combining the relative entropy structure with a generalized Gearhart-Prüss theorem, to obtain exponentially contracting property on the corresponding time-evolution semigroup. Such strategy has potential to be used in understanding the long time behavior of other (kinetic) models, in particular those with “non-separable” advection fields, such as (1.4) and (1.8).
Throughout this paper, we take Assumption 1 unless otherwise specified. The rest of this paper is arranged as follows: With preparations in Section 2: steady state, and the semigroup formulation, we prove Theorem 1 in Section 3. Finally a discussion is given in Section 4.
2 Preliminary: steady state and generator of the semigroup
In this section, we aim to construct a proper semigroup formulation for (1.1) such that the time evolution semigroup can be shown to be exponentially contacting. However, to this end, we need to first show some preliminary properties of the steady state.
First, we prove the existence and uniqueness of a positive steady state.
Proposition 1.
There exists a steady state of (1.1), which is unique up to a constant multiplier. Moreover, we can choose a normalization constant such that the steady state is positive and satisfies
(2.1) |
Furthermore, the following estimate holds
(2.2) |
Proof.
The steady state equation is given by
(2.3) |
It is convenient to introduce the fluxes , and thus
We can equivalently rewrite the system (2.3) as
(2.4) | ||||
And the properties of the steady states can be obtained by analyzing the ODE system (2.4).
Let be the fundamental matrix of the linear system, i.e.,
(2.5) |
Then to find a steady state is equivalent to find a right-eigenvector of with eigenvalue . The existence follows by observing that is a left-eigenvector of for all with eigenvalue , which can be checked directly by computing .
For positivity, we note that is a constant for all . Therefore the ODE system (2.4) can be decoupled into
(2.6) |
with a similar equation for . Note that
and does not change sign for , we get that does not change sign from . Similarly does not change sign. Then we deduce that the corresponding does not change sign either. Finally, by integrating the first equation of (2.4) on , we get
and we conclude that are of the same sign for .
Therefore we can get a non-negative steady state , by multiplying with a proper constant. To show are indeed positive, we follow a contradiction argument to show . WLOG suppose for some . If , then is also zero since does not change sign. Hence, by the ODE (2.4) we deduce . Since satisfies the linear ODE (2.4), it follows that , which is a contradiction. If or , thanks to the periodic boundary condition for (2.4) we can derive a contradiction similarly.
Integrating the first equation in (2.3) we get
thanks to the boundary condition. By multiplying a normalization constant we can get a positive steady state satisfying (2.1).
For uniqueness, it is equivalent to show that the kernel of is exactly one-dimensional, where denotes the identity matrix. Actually, otherwise we would have , which means every solution of the ODE in (2.4) will satisfy the condition automatically. By considering initial data , we get from (2.6) which is a contradiction.
∎
With the steady state given in Proposition 1, we define the following complex Hilbert space,
(2.7) |
whose inner product and norm are given by
(2.8) |
Since is continuous and positive by Proposition 1, the norm in is equivalent to the usual norm.
We define the unbounded operator
(2.9) | ||||
where we use the mod-2 convention for the subscripts: . We shall interpret the solution of (1.1) as defined by the semigroup generated by , i.e., mild solution [21].
Next, we translate the relative entropy estimate (1.11) to this semigroup formulation, which shows that is an m-dissipative operator, which implies that generates a semigroup of contractions. We recall an operator on a Hilbert space is called a dissipative operator if for all . An m-dissipative operator is a dissipative operator with the range for all [21].
Proposition 2.
The operator defined in (2.9) is an m-dissipative operator in , with
(2.10) |
We introduce the following notation:
(2.11) |
Then
(2.12) |
And from the boundary condition of and we have
(2.13) |
Proof of Proposition 2.
We first check (2.10) which shows that is dissipative. Substitute the steady state system (2.3) in (2.12), we get
(2.14) |
Therefore
Taking real part of the first term and integrating by parts we get
Therefore we continue the calculation to conclude that is dissipative
To show that is m-dissipative, it suffices to show that its adjoint is also dissipative ([21] Chapter 1.4 Corollary 4.4). With the boundary condition of , it is standard and straightforward to check that , therefore
∎
Note that we can not find an such that , which reflects the lack of coercivity in (1.1).
As a final preparation, we show that there is no eigenvalue other than has a non-negative real part.
Proposition 3.
If is an eigenvalue of the operator with a non-negative real part and is an associated eigenfunction. Then and is multiplied by a constant.
Proof.
By the entropy estimate (2.10) in Proposition 2, we deduce
Therefore , from (2.14) we obtain
Since is an eigenfunction, , we get
(2.15) |
From Assumption 1 at some , therefore if . In this case, by a basic property of linear ODE, we conclude that .
If , from (2.15) we deduce which implies that is a constant, i.e., is multiplied by a constant. ∎
Actually, the last argument in the proof above serves as an alternative proof of the uniqueness of the steady state which is based on the entropy estimate, rather than the ODE argument in Proposition 1.
3 Proof of the main theorem
To obtain the exponentially contracting property of the semigroup , in the view of Theorem 2, we shall work in the orthogonal complement of in , denoted as :
(3.1) |
By definition of the inner product in (2.8), we have , therefore is in if and only if
(3.2) |
Clearly, is a Hilbert space. Note that as an operator on , actually maps to , which allows us to restrict it as an unbounded operator on . The following Lemma translates the result on in the previous section to its restriction on .
Lemma 1.
The restriction of on : , is m-dissipative and has no eigenvalue with a non-negative real part.
Proof.
Since is a subspace of , it is easy to see is still dissipative on . To show is m-dissipative on , it remains to check that for all , it holds that for all in , there exists in such that . Since is m-dissipative in by Proposition 2, there exists some satisfying this. By a straightforward integration of the equation , we get
This shows that is actually in , which finishes the proof. ∎
Now we begin the proof of Theorem 1.
Proof of Theorem 1.
Notice that the first part of the theorem has already been proved in Proposition 1.
For the second part, by Theorem 2, it suffices to show that defined as follows is strictly positive,
(3.3) |
We argue by contradiction. The idea is to use a compactness argument to derive a contradiction with Assumption 1.
Suppose , then there exists a real sequence and a sequence such that , and we have
(3.4) |
Recall in Lemma 1, the operator can be restricted as a m-dissipative operator on . And the norm of , inherited from (2.8), is equivalent to the usual norm.
We divide the discussion into two cases.
Case 1: is bounded. Up to a subsequence, we might as well take convergence to . In this case
(3.5) |
is also bounded in . Apply the compact embedding from to on , we can take converges strongly in up to extraction of subsequences. Therefore by Assumption 1, we get converges to some strongly in . Then using (3.5) again we obtain the strong convergence of in . Now taking the limit in (3.5), we deduce that and
Note that from the strong convergence of . However the above equation shows that is an eigenfunction of with the eigenvalue , which contradicts with Lemma 1.
Case 2: When is not bounded, it suffices to consider the case without loss of generality. By the relative entropy estimate (2.10),
as goes to infinity. Therefore we have
(3.6) |
Then we write out (3.4) in terms of ,
combined which with (3.6) we deduce
which is equivalent to
(3.7) |
Using (3.7) and the compact embedding from to on , we deduce that there exists some constant such that
(3.8) |
Similarly we deduce that for , there exists some constant such that
(3.9) |
By the upper bound on in Proposition 1, we get for some and therefore (otherwise strongly converges to zero in ).
Now we calculate the standard norm
By the calculation above together with the limit (3.10), we deduce that and
(3.11) |
as goes to infinity.
Lemma 2.
For a nonzero, real-valued function and a real sequence , as goes to infinity, we have
(3.12) |
Proof of Lemma 2.
We can find an interval where does not vanish. Since , it suffices to show that
(3.13) |
We assume on without loss of generality. Consider a change of variable , , then (3.13) is equivalent to
where and stands for the inverse change of variable from . Then the result follows from the Riemann-Lebesgue lemma since is a continuous function on . ∎
4 Discussion
In this article, we prove the exponential convergence to equilibrium of (1.1). We use a non-constructive compactness argument, which does not give an explicit decay rate. As explained in the introduction, it seems difficult to adapt constructive hypocoercivity methods in literature to (1.1). We may study more explicit estimates on the decay rate in the future.
The assumption (Assumption 1) on the difference between is fairly weak. In fact, this condition is sharp: if otherwise then would satisfy a pure transport equation
Therefore no convergence to equilibrium can be expected in general. Actually, in this case two kinds of particles are indistinguishable in the sense that they are driven by a same velocity field. In view of the general formulation (1.9), when , the transport operator and the collision operator commute, therefore no interplay can be used to compensate the lack of coercivity.
Extension to “continuous” systems (1.4),(1.8)
Although our approach for the two-speed model (1.1) serves as a preliminary study in treating a non-separable velocity field, extending the result to (1.4) or (1.8) is far from straightforward. Our proof of Theorem 1 benefits from the fact that the collision operator in (1.1) is a bounded operator, which is not the case in (1.4) or (1.8). The essential difficulty may be that we have to deal with a hypoelliptic boundary value problem, either in the steady state equation or in the resolvent estimate . However, for a hypoelliptic boundary value problem, a general well-posedness and regularity theory is lacking [1], in contrast to their elliptic counterparts.
Extension with a non-homogeneous cross-section
Another interesting extension is to consider a non-homogeneous cross-section on the collision term in (1.1)
(4.1) |
For the Goldstein-Taylor model, i.e., , such an extension has been studied in [15, 9, 4, 25]. For non-homogeneous cases, the approach for (1.1) may directly extend to (4.1) provided the following assumption on holds,
Assumption 2.
We assume satisfying
(4.2) |
and there exists such that
(4.3) |
In Assumption 2 we allow degeneracy of and , but they need to be simultaneously non-zero at one point . Such an assumption can not be relaxed to that there exists two (possibly different) points such at at and at . Otherwise, one can construct examples such that the operator has a pure imaginary eigenvalue, following the proof of Proposition 3.
Acknowledgement
Z. Zhou is supported by the National Key R&D Program of China, Project Number 2021YFA1001200, 2020YFA0712000 and NSFC grant Number 12031013, 12171013. X.Dou is partially supported by The Elite Program of Computational and Applied Mathematics for PhD Candidates in Peking University. We also thank José Carrillo, Benoît Perthame, Xiaoqian Xu and Zhifei Zhang for helpful discussions.
References
- [1] D. Albritton, S. Armstrong, J. C. Mourrat, and M. Novack. Variational methods for the kinetic fokker-planck equation. arXiv preprint arXiv:1902.04037, 2021.
- [2] Dallas Albritton, Rajendra Beekie, and Matthew Novack. Enhanced dissipation and hörmander’s hypoellipticity. arXiv preprint arXiv:2105.12308, 2021.
- [3] Anton Arnold, Jean Dolbeault, Christian Schmeiser, and Tobias Wöhrer. Sharpening of decay rates in fourier based hypocoercivity methods. arXiv preprint arXiv:2012.09103, 2020.
- [4] Anton Arnold, Amit Einav, Beatrice Signorello, and Tobias Wöhrer. Large time convergence of the non-homogeneous goldstein-taylor equation. Journal of Statistical Physics, 182(2):1–35, 2021.
- [5] Anton Arnold and Jan Erb. Sharp entropy decay for hypocoercive and non-symmetric fokker-planck equations with linear drift. arXiv: Analysis of PDEs, 2014.
- [6] Fabrice Baudoin. Bakry–émery meet villani. Journal of Functional Analysis, 273(7):2275–2291, 2017.
- [7] Jacob Bedrossian and Michele Coti Zelati. Enhanced dissipation, hypoellipticity, and anomalous small noise inviscid limits in shear flows. Archive for Rational Mechanics and Analysis, 224(3):1161–1204, 2017.
- [8] Etienne Bernard and Francesco Salvarani. On the exponential decay to equilibrium of the degenerate linear boltzmann equation. Journal of Functional Analysis, 265(9):1934–1954, 2013.
- [9] Etienne Bernard and Francesco Salvarani. Optimal estimate of the spectral gap for the degenerate goldstein-taylor model. Journal of Statistical Physics, 153(2):363–375, 2013.
- [10] María J Cáceres, José A Carrillo, and Louis Tao. A numerical solver for a nonlinear fokker–planck equation representation of neuronal network dynamics. Journal of Computational Physics, 230(4):1084–1099, 2011.
- [11] David Cai, Louis Tao, Aaditya V Rangan, and David W McLaughlin. Kinetic theory for neuronal network dynamics. Communications in Mathematical Sciences, 4(1):97–127, 2006.
- [12] David Cai, Louis Tao, Michael Shelley, and David W McLaughlin. An effective kinetic representation of fluctuation-driven neuronal networks with application to simple and complex cells in visual cortex. Proceedings of the National Academy of Sciences, 101(20):7757–7762, 2004.
- [13] Qi Chen, Te Li, Dongyi Wei, and Zhifei Zhang. Transition threshold for the 2-d couette flow in a finite channel. Archive for rational mechanics and analysis, 238(1):125–183, 2020.
- [14] Michele Coti Zelati and Grigorios A Pavliotis. Homogenization and hypocoercivity for fokker–planck equations driven by weakly compressible shear flows. IMA Journal of Applied Mathematics, 85(6):951–979, 2020.
- [15] Helge Dietert and Josephine Evans. Finding the jump rate for fastest decay in the goldstein-taylor model. arXiv preprint arXiv:2103.10064, 2021.
- [16] Jean Dolbeault, Clement Mouhot, and Christian Schmeiser. Hypocoercivity for linear kinetic equations conserving mass. Transactions of the American Mathematical Society, 367(6):3807–3828, 2015.
- [17] Bernard Helffer. Spectral theory and its applications. Number 139. Cambridge University Press, 2013.
- [18] Bernard Helffer and Johannes Sjöstrand. From resolvent bounds to semigroup bounds. arXiv preprint arXiv:1001.4171, 2010.
- [19] Jeongho Kim, Benoît Perthame, and Delphine Salort. Fast voltage dynamics of voltage–conductance models for neural networks. Bulletin of the Brazilian Mathematical Society, New Series, 52(1):101–134, 2021.
- [20] Philippe Michel, Stéphane Mischler, and Benoît Perthame. General relative entropy inequality: an illustration on growth models. Journal de Mathématiques Pures et Appliquées, 84(9):1235–1260, 2005.
- [21] Amnon Pazy. Semigroups of linear operators and applications to partial differential equations, volume 44. Springer Science & Business Media, 2012.
- [22] Benoît Perthame. Transport equations in biology. Springer Science & Business Media, 2006.
- [23] Benoît Perthame and Delphine Salort. On a voltage-conductance kinetic system for integrate and fire neural networks. Kinetic and Related Models, 6(4):841–864, 2013.
- [24] Benoît Perthame and Delphine Salort. Derivation of an integrate&fire equation for neural networks from a voltage-conductance kinetic model. Communications in Mathematical Sciences, 17(5):1193–1211, 2019.
- [25] Minh-Binh Tran. Convergence to equilibrium of some kinetic models. Journal of Differential Equations, 255(3):405–440, 2013.
- [26] Seiji Ukai. On the existence of global solutions of mixed problem for non-linear boltzmann equation. Proceedings of the Japan Academy, 50(3):179–184, 1974.
- [27] C Villani. Hypocoercivity. 949-951. American Mathematical Soc, 2009.
- [28] Dongyi Wei. Diffusion and mixing in fluid flow via the resolvent estimate. Science China Mathematics, 64(3):507–518, 2021.