

Under-Approximate Reachability Analysis for a Class of Linear Systems with Inputs

Mohamed Serry and Jun Liu. Mohamed Serry is with the Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario, Canada (e-mail: mserry@uwaterloo.ca). Jun Liu is with the Department of Applied Mathematics, University of Waterloo, Waterloo, Ontario, Canada (e-mail: j.liu@uwaterloo.ca). This work was funded by NSERC DG, CRC, and ERA programs.
Abstract

Under-approximations of reachable sets and tubes have been receiving growing research attention due to their important roles in control synthesis and verification. Available under-approximation methods applicable to continuous-time linear systems typically assume the ability to compute transition matrices and their integrals exactly, which is not feasible in general, and/or suffer from high computational costs. In this note, we attempt to overcome these drawbacks for a class of linear time-invariant (LTI) systems, where we propose a novel method to under-approximate finite-time forward reachable sets and tubes, utilizing approximations of the matrix exponential and its integral. In particular, we consider the class of continuous-time LTI systems with an identity input matrix and initial and input values belonging to full dimensional sets that are affine transformations of closed unit balls. The proposed method yields computationally efficient under-approximations of reachable sets and tubes, when implemented using zonotopes, with first-order convergence guarantees in the sense of the Hausdorff distance. To illustrate its performance, we implement our approach in three numerical examples, where linear systems of dimensions ranging between 2 and 200 are considered.

Index Terms:
Under-approximations, linear uncertain systems, matrix lower bounds, Hausdorff distance.

I Introduction

Reachable sets and tubes of dynamical systems are central in control synthesis and verification applications, especially in the presence of uncertainties and constraints [1, 2]. Mere approximations of reachable sets and tubes are not sufficient in such frameworks. Instead, conservative estimations, i.e., over (outer)-approximations, are typically utilized to ensure all possible behaviors of a given control system are accounted for, which explains the sheer number of over-approximation methods in the literature [3, 4].

In the last few years, there has been growing interest in additionally under-approximating reachable sets and tubes for synthesis and verification (see, e.g., [5, 6, 7, 8]), because under-approximations can be used to estimate subsets of the states that are attainable under given control constraints [9], to obtain subsets of the initial states from which all trajectories fulfill safety and reachability specifications [10], to solve falsification problems, i.e., to verify whether reachable sets/tubes intersect unsafe sets [11], and to measure the accuracy of computed over-approximations. Motivated by the aforementioned applications, in this paper we investigate under-approximations of forward reachable sets and tubes for continuous-time LTI systems with uncertainties or constraints on the initial and input values, where we attempt to overcome some of the limitations of available methods in the literature.

Optimal control-based polytopic approaches [12, 13] were proposed for linear systems with uncertain inputs and initial conditions. These methods rely on obtaining boundary points of reachable sets, associated with specified direction vectors, and then computing the convex hull of the obtained boundary points. Given a reachable set, the convergence of the polytopic approaches requires computing an increasing number of boundary points until the whole reachable set boundary is obtained, which is computationally expensive, especially if the dimension of the reachable set is high. An ellipsoidal method was proposed in [14] for controllable linear systems with ellipsoidal initial and input sets. The method relies on solving initial value problems, derived from maximal principles similar to those presented in [13], to obtain ellipsoidal subsets that touch a given reachable set at some boundary points, depending on specified direction vectors. The accuracy of the ellipsoidal method of [14] in under-approximating a given reachable set is increased by evaluating an increasing number of ellipsoids, which necessitates considering an increasing number of direction vectors, and then taking their union. The aforementioned optimal control-based approaches [12, 13, 14] assume the ability to compute transition matrices and their integrals exactly, and that is not feasible in general. In addition, when under-approximating a reachable tube, the mentioned approaches use non-convex representations for the under-approximations, which are challenging to analyze in the contexts of verification and synthesis (see the introduction in [15]).

In [16], a set-propagation technique was proposed that yields convergent under-approximations of forward reachable sets and tubes for a general class of linear systems with uncertainties or constraints on the initial and input values, where the under-approximations of reachable sets and tubes are given as convex sets and finite unions of convex sets, respectively, which can be analyzed with relative ease. However, the method in [16], like the approaches in [17, 12, 13, 14], assumes the ability to compute transition matrices and their integrals exactly. A similar set-propagation approach was proposed in [18] for LTI systems, which relies on computing Minkowski differences to under-approximate reachable tubes. The method in [18] suffers from the same issue as the method in [16], in addition to the computational hurdle of evaluating Minkowski differences (see, e.g., [19]).

The issue of evaluating transition matrices and their integrals exactly can be solved by adopting formally correct under-approximation methods that are designed for general nonlinear systems. For example, a formally correct interval arithmetic under-approximation approach was proposed in [8]; however, such a method lacks convergence guarantees and may produce empty under-approximations. In [20], a novel method was proposed for nonlinear systems with uncertain initial conditions that depends on computing over-approximations and then scaling them down to obtain under-approximations. A drawback of the approach in [20] is that the scaling necessitates solving (sub-optimal) optimization problems that involve enclosures of boundaries of reachable sets, which can be computationally expensive (see the comparison in Section VII-C1). Finally, a recent approach has been proposed in [21] to under-approximate reachable sets when the system parameters (e.g., system matrices) are not known exactly, where collected trajectories (i.e., data) are utilized to estimate the system dynamics. Such an approach is highly valuable in applications where system identification cannot be attained, due, e.g., to failure or damage mid-operation. However, this approach is conservative (i.e., convergence cannot be attained in general), as it considers the set of all systems that can generate the collected trajectories while fulfilling some specified assumptions.

In this work, we present a novel efficient approach that results in under-approximations of forward finite-time reachable sets and tubes for a class of LTI systems with inputs, where approximations of the matrix exponential and its integral are used, first-order convergence guarantees are provided, and approximations of reachable sets and tubes are given as convex sets and finite unions of convex sets, respectively. Our approach is fundamentally based on set-based recursive relations (see, e.g., [22, 16]), where truncation errors are accounted for in an under-approximating manner, utilizing matrix lower bounds [23].

II Preliminaries

Let $\mathbb{R}$, $\mathbb{R}_{+}$, $\mathbb{Z}$, $\mathbb{Z}_{+}$, and $\mathbb{C}$ denote the sets of real numbers, non-negative real numbers, integers, non-negative integers, and complex numbers, respectively, and $\mathbb{N}:=\mathbb{Z}_{+}\setminus\{0\}$. Let $[a,b]$, $]a,b[$, $[a,b[$, and $]a,b]$ denote closed, open, and half-open intervals, respectively, with end points $a$ and $b$, and let $[a;b]$, $]a;b[$, $[a;b[$, and $]a;b]$ stand for their discrete counterparts, e.g., $[a;b]=[a,b]\cap\mathbb{Z}$, and $[1;4[\,=\{1,2,3\}$. Given any map $f\colon A\to B$, the image of a subset $C\subseteq A$ under $f$ is given by $f(C)=\{f(c)\mid c\in C\}$. The identity map $X\to X\colon x\mapsto x$ is denoted by $\operatorname{id}$, where the domain of definition $X$ will always be clear from the context. The Minkowski sum of $M,N\subseteq\mathbb{R}^{n}$ is defined as $M+N:=\{y+z\mid y\in M,\ z\in N\}$. By $\|\cdot\|$ we denote any norm on $\mathbb{R}^{n}$; the norm of a non-empty subset $M\subseteq\mathbb{R}^{n}$ is defined by $\|M\|:=\sup_{x\in M}\|x\|$; $\mathbf{B}_{n}\subseteq\mathbb{R}^{n}$ is the closed unit ball w.r.t. $\|\cdot\|$; and the maximum norm on $\mathbb{R}^{n}$ is denoted by $\|\cdot\|_{\infty}$ ($\|x\|_{\infty}=\max\{|x_{i}|\mid i\in[1;n]\}$, $x\in\mathbb{R}^{n}$). Given norms on $\mathbb{R}^{n}$ and $\mathbb{R}^{m}$, $\mathbb{R}^{n\times m}$ is endowed with the usual induced matrix norm, $\|A\|=\sup_{\|x\|\leq 1}\|Ax\|$ for $A\in\mathbb{R}^{n\times m}$; e.g., the matrix norm of $A$ induced by the maximum norm is $\|A\|_{\infty}=\max_{i\in[1;n]}\sum_{j=1}^{m}|A_{i,j}|$. Given a square matrix $A\in\mathbb{R}^{n\times n}$, $\rho(A)$ denotes the spectral radius of $A$, i.e., $\rho(A):=\max\{|\lambda| : \lambda\textrm{ is an eigenvalue of }A\}$. The spectral radius satisfies the property below, which follows from [24, proof of Lemma 5.6.10, p. 348].
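For instance, the matrix norm induced by the maximum norm is simply the largest absolute row sum. The short sketch below (Python/NumPy, used purely for illustration throughout this note) checks this identity numerically on a random matrix.

```python
import numpy as np

def induced_inf_norm(A: np.ndarray) -> float:
    # ||A||_inf = max_i sum_j |A_ij|  (largest absolute row sum)
    return np.abs(A).sum(axis=1).max()

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 6))

# The closed-form row-sum expression agrees with NumPy's built-in induced norm.
assert np.isclose(induced_inf_norm(A), np.linalg.norm(A, ord=np.inf))
```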

Lemma 1.

Let $A\in\mathbb{R}^{n\times n}$. For each $\epsilon>0$, there exists an induced matrix norm $\|\cdot\|_{\epsilon}$ such that $\|A\|_{\epsilon}\leq\rho(A)+\epsilon$.

Given $A\in\mathbb{R}^{n\times m}$, $\mathrm{rank}(A)$, $\mathrm{col}(A)$, and $A^{\dagger}\in\mathbb{R}^{m\times n}$ denote the rank, the column space, and the Moore–Penrose inverse of $A$, respectively ($A^{\dagger}=A^{-1}$ if $A$ is invertible). The following lemma provides a sufficient condition to check invertibility.

Lemma 2.

Let $P,\tilde{P}\in\mathbb{R}^{n\times n}$, and let $P$ be invertible. If $\|P-\tilde{P}\|\|P^{-1}\|<1$, then $\tilde{P}$ is invertible.

Proof.

See, e.g., [25, proof of Theorem 5.7, p. 111]. ∎

Given $A\in\mathbb{R}^{n\times m}\setminus\{0\}$, $\|A\|_{l}$ denotes the matrix lower bound of $A$ w.r.t. $\|\cdot\|$, which is defined as $\|A\|_{l}:=\max\{m\in\mathbb{R} \mid \forall y\in\mathrm{col}(A),\ \exists x\in\mathbb{R}^{m}\ \mathrm{s.t.}\ Ax=y\ \mathrm{and}\ m\|x\|\leq\|y\|\}$; see [23]. Matrix lower bounds satisfy the following properties, which are essential in our derivation of the proposed method.

Lemma 3.

Let $A\in\mathbb{R}^{n\times m}$ and $B\in\mathbb{R}^{m\times p}$, where $\mathrm{rank}(A)=n$ and $\mathrm{rank}(B)=m$ (full row rank). Then:

  1. (a)

    $\|A\|_{l}\mathbf{B}_{n}\subseteq A\mathbf{B}_{m}$ (follows from [23, Lemma 2.3]).

  2. (b)

    $1/\|A^{\dagger}\|\leq\|A\|_{l}$ (follows from [23, Lemma 2.2]).

  3. (c)

    $\|A\|_{l}\|B\|_{l}\leq\|AB\|_{l}$ (follows from [23, Lemma 4.4]).
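In practice, the exact matrix lower bound need not be evaluated: the proposed method only relies on the computable bound of Lemma 3(b). The following sketch (Python/NumPy, illustration only, w.r.t. the maximum norm) evaluates $1/\|A^{\dagger}\|$ and empirically checks the containment of Lemma 3(a) on sampled points, using the minimum-norm preimage $x=A^{\dagger}y$.

```python
import numpy as np

def lower_bound_estimate(A: np.ndarray) -> float:
    # Computable estimate of the matrix lower bound via Lemma 3(b):
    # 1/||A^dagger|| <= ||A||_l  (here w.r.t. the maximum norm).
    return 1.0 / np.linalg.norm(np.linalg.pinv(A), ord=np.inf)

rng = np.random.default_rng(1)
A = rng.standard_normal((2, 4))          # full row rank (generically)
ell = lower_bound_estimate(A)

# Lemma 3(a): ell * B_2 is contained in A * B_4. For any y with ||y||_inf <= ell,
# the minimum-norm preimage x = A^dagger y satisfies Ax = y and ||x||_inf <= 1.
A_pinv = np.linalg.pinv(A)
for _ in range(1000):
    y = ell * rng.uniform(-1.0, 1.0, size=2)
    x = A_pinv @ y
    assert np.allclose(A @ x, y)
    assert np.linalg.norm(x, ord=np.inf) <= 1.0 + 1e-12
```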

The collection of full-dimensional subsets of $\mathbb{R}^{n}$ that are affine transformations of closed unit balls is denoted by $\mathbb{A}_{n}$, where, in this work, saying $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{n}$ implies that $c\in\mathbb{R}^{n}$, $G\in\mathbb{R}^{n\times p}$, and $\mathrm{rank}(G)=n$. Integration of single-valued functions presented herein is always understood in the sense of Bochner. Given a non-empty subset $X\subseteq\mathbb{R}^{n}$ and a measurable subset $S\subseteq\mathbb{R}$, $X^{S}$ denotes the set of Lebesgue measurable maps with domain $S$ and values in $X$. Given a non-empty compact subset $W\subseteq\mathbb{R}^{m}$, a compact interval $[a,b]\subset\mathbb{R}$, and an integrable matrix-valued function $F:[a,b]\to\mathbb{R}^{n\times m}$, we define the set-valued integral $\int_{a}^{b}F(t)W\,\mathrm{d}t:=\bigcup_{w\in W^{[a,b]}}\int_{a}^{b}F(t)w(t)\,\mathrm{d}t$. The Hausdorff distance $\mathfrak{d}(\Omega,\Gamma)$ of two non-empty bounded subsets $\Omega,\Gamma\subseteq\mathbb{R}^{n}$ w.r.t. $\|\cdot\|$ is defined as $\mathfrak{d}(\Omega,\Gamma):=\inf\{\varepsilon>0 \mid \Omega\subseteq\Gamma+\varepsilon\mathbf{B}_{n},\ \Gamma\subseteq\Omega+\varepsilon\mathbf{B}_{n}\}$. The Hausdorff distance satisfies the triangle inequality, in addition to the following set of properties.
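For intuition, the Hausdorff distance between two compact sets can be approximated by sampling them. The sketch below (Python/NumPy, illustration only, using the maximum norm and finite point clouds as stand-ins for the sets) implements the equivalent two-sided sup-inf formula of the definition above.

```python
import numpy as np

def hausdorff_distance(Omega: np.ndarray, Gamma: np.ndarray) -> float:
    """Hausdorff distance between two finite point clouds (rows are points),
    w.r.t. the maximum norm:
    d(Omega, Gamma) = max( sup_{x in Omega} inf_{y in Gamma} ||x - y||,
                           sup_{y in Gamma} inf_{x in Omega} ||x - y|| )."""
    diff = Omega[:, None, :] - Gamma[None, :, :]   # pairwise differences
    dist = np.abs(diff).max(axis=2)                # pairwise max-norm distances
    return max(dist.min(axis=1).max(), dist.min(axis=0).max())

# Example: the square [-1, 1]^2 versus its deflated copy [-0.5, 0.5]^2 (sampled).
grid = np.linspace(-1.0, 1.0, 21)
square = np.array([[x, y] for x in grid for y in grid])
print(hausdorff_distance(square, 0.5 * square))    # approximately 0.5
```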

Lemma 4 (Hausdorff distance).

Let $\Omega,\Omega^{\prime},\Gamma,\Gamma^{\prime}\subseteq\mathbb{R}^{n}$ be non-empty and bounded, and let $A,B\in\mathbb{R}^{m\times n}$. Then, the following hold (see [15, Lemma A.2]):

  1. (a)

    $\mathfrak{d}(\Omega+\Gamma,\Omega^{\prime}+\Gamma^{\prime})\leq\mathfrak{d}(\Omega,\Omega^{\prime})+\mathfrak{d}(\Gamma,\Gamma^{\prime})$.

  2. (b)

    $\mathfrak{d}(A\Omega,A\Gamma)\leq\|A\|\,\mathfrak{d}(\Omega,\Gamma)$.

  3. (c)

    $\mathfrak{d}(A\Omega,B\Omega)\leq\|A-B\|\,\|\Omega\|$ (implying $\mathfrak{d}(\Omega,0)\leq\|\Omega\|$).

  4. (d)

    Let $(\Omega_{i})_{i\in I}$ and $(\Gamma_{i})_{i\in I}$ be families of non-empty subsets of $\mathbb{R}^{n}$. Then, $\mathfrak{d}\left(\cup_{i\in I}\Omega_{i},\cup_{i\in I}\Gamma_{i}\right)\leq\sup_{i\in I}\mathfrak{d}(\Omega_{i},\Gamma_{i})$.

III System Description and Problem Formulation

In this paper, we consider the LTI system

$\dot{x}(t)=Ax(t)+u(t)$ (1)

over the time interval $[0,T]$, where $x(t)\in\mathbb{R}^{n}$ is the system state, $u(t)\in\mathbb{R}^{n}$ is the input, and $A\in\mathbb{R}^{n\times n}$ is the system matrix. The initial value $x(0)$ and the input $u(t)$, $t\in[0,T]$, belong to known sets $X_{0}$ and $U$, respectively. Let $T$, $A$, $X_{0}$, and $U$ be fixed and assume that:

  1. 1.

    The time interval $[0,T]$ is compact and $T>0$.

  2. 2.

    $X_{0}=c_{x}+G_{x}\mathbf{B}_{p_{x}}\in\mathbb{A}_{n}$ and $U=c_{u}+G_{u}\mathbf{B}_{p_{u}}\in\mathbb{A}_{n}$, where $c_{x}$, $c_{u}$, $G_{x}$, and $G_{u}$ are known.

Given an initial value $x(0)=x_{0}$ and an integrable input signal $v:[0,T]\rightarrow\mathbb{R}^{n}$, the unique solution, $\varphi(\cdot,x_{0},v)$, to system (1), generated by $x_{0}$ and $v(\cdot)$, on $[0,T]$ is given by [26, Theorem 6.5.1, p. 114]

$\varphi(t,x_{0},v)=\mathrm{e}^{tA}x_{0}+\int_{0}^{t}\mathrm{e}^{(t-s)A}v(s)\,\mathrm{d}s,~t\in[0,T].$

Herein, $\mathrm{e}^{(\cdot)A}$ (or $\exp((\cdot)A)$) is the matrix exponential function, which has the Taylor series expansion $\exp(tA)=\sum_{j=0}^{\infty}{(tA)^{j}}/{j!}$. Define

$\mathcal{L}(t,k):=\sum_{j=0}^{k-1}\frac{(tA)^{j}}{j!},\quad\mathcal{T}(t,k):=\int_{0}^{t}\mathcal{L}(s,k)\,\mathrm{d}s=\sum_{j=0}^{k-1}\frac{t^{j+1}A^{j}}{(j+1)!},$

where $\mathcal{L}(t,k)$ is the truncated $(k-1)$th-order Taylor expansion of $\exp(tA)$ and $\mathcal{T}(t,k)$ is its definite integral. It is easy to verify that, for all $t\in\mathbb{R}_{+}$ and $k\in\mathbb{N}$,

$\|\mathrm{e}^{tA}\|\leq\mathrm{e}^{t\|A\|},\qquad\|\mathcal{L}(t,k)\|\leq\mathrm{e}^{t\|A\|},$ (2)
$\|\mathrm{e}^{tA}-\mathcal{L}(t,k)\|\leq\theta(t\|A\|,k)\leq\frac{(t\|A\|)^{k}}{k!}\mathrm{e}^{t\|A\|},$ (3)

where $\theta\colon\mathbb{R}_{+}\times\mathbb{N}\rightarrow\mathbb{R}_{+}$ is defined as

$\theta(r,p):=\mathrm{e}^{r}-\sum_{j=0}^{p-1}\frac{r^{j}}{j!}=\sum_{j=p}^{\infty}\frac{r^{j}}{j!},~r\in\mathbb{R}_{+},~p\in\mathbb{N}.$ (4)

The function $\theta$ is infinitely differentiable and monotonically increasing in its first argument, and monotonically decreasing in its second argument, with a greatest lower bound of zero.
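The quantities $\mathcal{L}$, $\mathcal{T}$, and $\theta$ are straightforward to evaluate numerically. A minimal sketch (Python/NumPy, illustration only; the function names are ours and not from the paper) is given below and is reused in later sketches.

```python
import numpy as np
from math import factorial

def taylor_L(A: np.ndarray, t: float, k: int) -> np.ndarray:
    # L(t, k): truncated Taylor expansion of exp(tA) with k terms (orders 0..k-1).
    n = A.shape[0]
    term, total = np.eye(n), np.zeros((n, n))
    for j in range(k):
        total += term
        term = term @ (t * A) / (j + 1)
    return total

def taylor_T(A: np.ndarray, t: float, k: int) -> np.ndarray:
    # T(t, k) = integral_0^t L(s, k) ds = sum_{j=0}^{k-1} t^{j+1} A^j / (j+1)!.
    n = A.shape[0]
    term, total = t * np.eye(n), np.zeros((n, n))
    for j in range(k):
        total += term
        term = term @ (t * A) / (j + 2)
    return total

def theta(r: float, p: int) -> float:
    # theta(r, p) = e^r - sum_{j=0}^{p-1} r^j / j!  (remainder of the exponential series),
    # transcribed directly from Eq. (4).
    return np.exp(r) - sum(r**j / factorial(j) for j in range(p))
```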

Let $\mathcal{R}(t)$ denote the forward reachable set of system (1) at time $t\in[0,T]$, with initial values in $X_{0}$ and input signals with values in $U$. In other words,

$\mathcal{R}(t):=\mathrm{e}^{tA}X_{0}+\int_{0}^{t}\mathrm{e}^{sA}U\,\mathrm{d}s,~t\in[0,T].$ (5)

The set $\exp(tA)X_{0}$ is referred to as the homogeneous reachable set at time $t$ and is denoted by $\mathcal{R}_{h}(t)$, and the set $\int_{0}^{t}\exp(sA)U\,\mathrm{d}s$ is referred to as the input reachable set at time $t$ and is denoted by $\mathcal{R}_{u}(t)$. Furthermore, let $[a,b]\subseteq[0,T]$. Then, $\mathcal{R}([a,b])=\bigcup_{t\in[a,b]}\mathcal{R}(t)$ is the reachable tube over the time interval $[a,b]$. In this paper, we aim to compute arbitrarily precise under-approximations of $\mathcal{R}_{h}(T)$, $\mathcal{R}_{u}(T)$, $\mathcal{R}(T)$, and $\mathcal{R}([0,T])$, utilizing the approximations $\mathcal{L}$ and $\mathcal{T}$.

Remark 1 (Applications of under-approximations).

In this work, we focus on under-approximating forward finite-time reachable sets for linear systems with uncertainties or constraints on the initial values and inputs. Under-approximations can be beneficial in control synthesis and verification applications. For example, consider the case when the input set $U$ corresponds to a disturbance set, and let $X_{\mathrm{US}}\subset\mathbb{R}^{n}$ be an unsafe set. If $\mathcal{R}([0,T])$, or an under-approximation of it, intersects $X_{\mathrm{US}}$, then the initial set $X_{0}$, for sure, does not satisfy the safety specifications (see the framework of falsification, e.g., in [11]).

Moreover, let $X_{\mathrm{target}}\subseteq\mathbb{R}^{n}$ be a target set, and let the set $U$ correspond to a control input set. Define the backward reachable set (see, e.g., [27])

$\mathcal{R}_{\mathrm{bw}}(X_{\mathrm{target}},T):=\mathrm{e}^{T(-A)}X_{\mathrm{target}}+\mathrm{e}^{T(-A)}(-\mathcal{R}_{u}(T)).$

If an initial value of interest $x_{0}\in\mathbb{R}^{n}$ belongs to $\mathcal{R}_{\mathrm{bw}}(X_{\mathrm{target}},T)$, or an under-approximation of it, then the existence of a control signal with values in $U$, driving $x_{0}$ to $X_{\mathrm{target}}$ in time $T$, is guaranteed, and such a control signal can be obtained by, e.g., solving an associated constrained optimal control problem. Note that under-approximating $\mathcal{R}_{\mathrm{bw}}(X_{\mathrm{target}},T)$ requires under-approximating $\mathcal{R}_{u}(T)$, $\exp(T(-A))(-\mathcal{R}_{u}(T))$, and $\exp(T(-A))X_{\mathrm{target}}$. In this work, we address directly how to under-approximate $\mathcal{R}_{u}(T)$. Moreover, the tools in this work, in particular Lemma 6, can be easily applied to under-approximate $\exp(T(-A))(-\mathcal{R}_{u}(T))$ and $\exp(T(-A))X_{\mathrm{target}}$.

IV Proposed method

In this section, we thoroughly derive the proposed method. The convergence guarantees of the method are discussed in Section VI.

We start with the following theoretical recursive relation, which is algorithmically similar to efficient over-approximation methods in the literature [9], and is the basis of the proposed method of this work.

Lemma 5.

Given $N\in\mathbb{N}$, define $\tau=T/N$,

$S_{0}^{N}=X_{0}, \qquad S_{i}^{N}=\mathrm{e}^{\tau A}S_{i-1}^{N},~i\in[1;N],$ (6a)
$V_{0}^{N}=\mathcal{R}_{u}(\tau), \qquad V_{i}^{N}=\mathrm{e}^{\tau A}V_{i-1}^{N},~i\in[1;N],$ (6b)
$W_{0}^{N}=\{0\}, \qquad W_{i}^{N}=W_{i-1}^{N}+V_{i-1}^{N},~i\in[1;N],$ (6c)
$\Lambda$-free notation aside, $\Gamma_{i}^{N}=S_{i}^{N}+W_{i}^{N},~i\in[0;N].$ (6d)

Then, for all $i\in[0;N]$, $S_{i}^{N}=\mathcal{R}_{h}(i\tau)$, $W_{i}^{N}=\mathcal{R}_{u}(i\tau)$, and $\Gamma_{i}^{N}=\mathcal{R}(i\tau)$. Moreover, $\bigcup_{i=0}^{N}\Gamma_{i}^{N}\subseteq\mathcal{R}([0,T])$.

Proof.

By induction, $S_{i}^{N}=\exp(i\tau A)X_{0}=\mathcal{R}_{h}(i\tau)$, $i\in[0;N]$. According to [28, Corollary 3.6.2, p. 118], $W_{i}^{N}=\mathcal{R}_{u}(i\tau)$, $i\in[0;N]$. Therefore, $\Gamma_{i}^{N}=S_{i}^{N}+W_{i}^{N}=\mathcal{R}_{h}(i\tau)+\mathcal{R}_{u}(i\tau)=\mathcal{R}(i\tau)$, $i\in[0;N]$. The last claim follows from $\bigcup_{i\in[0;N]}\Gamma_{i}^{N}=\bigcup_{i\in[0;N]}\mathcal{R}(i\tau)\subseteq\bigcup_{t\in[0,T]}\mathcal{R}(t)=\mathcal{R}([0,T])$. ∎

The algorithm in Lemma 5, in theory, addresses the under-approximation problem of this work; however, this algorithm cannot be implemented exactly in general. In the next sections, we address the challenges in implementing this theoretical algorithm and propose a practically implementable method, which is the main contribution of this work.

IV-A Under-approximating the image of the matrix exponential

The first obstacle in implementing the algorithm in Lemma 5 is that recursive computations of the images of the matrix exponential are required (see the definitions of $S_{i}^{N}$ and $V_{i}^{N}$ in Equations (6a) and (6b), respectively), and exact computations of such images are not feasible in general as the exact value of the matrix exponential is generally not known. The following technical lemma provides an insight into how to replace the aforementioned images with under-approximations, where approximations of the matrix exponential can be utilized.

Lemma 6.

Let $P,\tilde{P}\in\mathbb{R}^{n\times m}$ and $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{m}$. Assume that $P$ is of full row rank and that $\|(\tilde{P}-P)c\|\leq\|PG\|_{l}$. Then, $\tilde{P}(c+\alpha G\mathbf{B}_{p})\subseteq P\Omega$ for any $\alpha\in[0,\alpha_{m}(\Omega,P,\tilde{P})]$, where

$\alpha_{m}(\Omega,P,\tilde{P}):=\frac{\|PG\|_{l}-\|(\tilde{P}-P)c\|}{\|PG\|_{l}+\|(\tilde{P}-P)G\|}.$ (7)

Proof.

Fix $\alpha\in[0,\alpha_{m}(\Omega,P,\tilde{P})]$. Note that $\alpha$ satisfies $\|(\tilde{P}-P)c\|+\alpha\|(\tilde{P}-P)G\|\leq(1-\alpha)\|PG\|_{l}$. Define $F:=\tilde{P}(c+\alpha G\mathbf{B}_{p})$. Then, using Lemma 3(a),

$F\subseteq Pc+(\tilde{P}-P)c+\alpha(\tilde{P}-P)G\mathbf{B}_{p}+\alpha PG\mathbf{B}_{p}$
$\subseteq Pc+\|(\tilde{P}-P)c\|\mathbf{B}_{n}+\alpha\|(\tilde{P}-P)G\|\mathbf{B}_{n}+\alpha PG\mathbf{B}_{p}$
$=Pc+\left(\|(\tilde{P}-P)c\|+\alpha\|(\tilde{P}-P)G\|\right)\mathbf{B}_{n}+\alpha PG\mathbf{B}_{p}$
$\subseteq Pc+(1-\alpha)\|PG\|_{l}\mathbf{B}_{n}+\alpha PG\mathbf{B}_{p}$
$\subseteq Pc+(1-\alpha)PG\mathbf{B}_{p}+\alpha PG\mathbf{B}_{p}=P(c+G\mathbf{B}_{p})=P\Omega.$ ∎

Lemma 6 can be explained intuitively as follows. The set $P(c+G\mathbf{B}_{p})$ needs to be under-approximated utilizing an approximation of $P$ (our approximation is $\tilde{P}$ in this case). The set $\tilde{P}(c+G\mathbf{B}_{p})$ resembles an approximation of $P(c+G\mathbf{B}_{p})$; however, it is not an under-approximation. By utilizing estimates of the approximation errors, the set $\tilde{P}(c+G\mathbf{B}_{p})$ is deflated through the parameter $\alpha\in[0,\alpha_{m}(c+G\mathbf{B}_{p},P,\tilde{P})]$, where $\alpha_{m}$ is defined by (7), making the set $\tilde{P}(c+\alpha G\mathbf{B}_{p})$ the desired under-approximation (see Figure 1).

Figure 1: Schematic representation of the result of Lemma 6: the set $P(c+G\mathbf{B}_{p})$ (black), its approximation $\tilde{P}(c+G\mathbf{B}_{p})$ (red), and the under-approximation $\tilde{P}(c+\alpha G\mathbf{B}_{p})$ (blue), where unit balls are w.r.t. the maximum norm, $\alpha\in[0,\alpha_{m}(c+G\mathbf{B}_{p},P,\tilde{P})]$, and $\alpha_{m}$ is defined by (7).
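A numerically computable version of (7) can be obtained by replacing the exact matrix lower bound $\|PG\|_{l}$ with the bound $1/\|(PG)^{\dagger}\|$ from Lemma 3(b); since $\alpha_{m}$ is nondecreasing in $\|PG\|_{l}$, the resulting coefficient never exceeds $\alpha_{m}$ and hence remains valid. The sketch below (Python/NumPy, maximum norm, illustration only) implements this conservative variant.

```python
import numpy as np

def alpha_deflation(c, G, P, P_tilde) -> float:
    """Conservative deflation coefficient in the spirit of Eq. (7), w.r.t. the
    maximum norm, with ||PG||_l replaced by the computable bound 1/||(PG)^dagger||
    (Lemma 3(b)). The returned value is a valid (possibly smaller) choice of alpha."""
    op = lambda M: np.linalg.norm(M, ord=np.inf)           # induced max-norm
    PG_lower = 1.0 / op(np.linalg.pinv(P @ G))
    err_c = np.max(np.abs((P_tilde - P) @ c))              # ||(P_tilde - P) c||_inf
    err_G = op((P_tilde - P) @ G)
    if err_c > PG_lower:
        raise ValueError("cannot certify the assumption of Lemma 6 with this bound")
    return (PG_lower - err_c) / (PG_lower + err_G)

# Usage sketch: with alpha = alpha_deflation(c, G, P, P_tilde), the deflated set
# P_tilde @ (c + alpha * G * B_p) is contained in P @ (c + G * B_p).
```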

Utilizing Lemma 6, we can obtain under-approximations of the images of the matrix exponential using truncated Taylor expansions, as shown in Corollary 1 below. Before doing so, we first introduce the deflation function $\lambda$, which plays a major role in determining the extent to which the sets considered in our analysis need to be "shrunk" in order to obtain under-approximations.

Definition 1 (Deflation coefficient).

Given $t\in\mathbb{R}_{+}$, $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{n}$, and $k\in\mathbb{N}$, define the deflation coefficient

$\lambda(t,\Omega,k):=\frac{1-\mathrm{e}^{t\|A\|}\theta(t\|A\|,k)\|G^{\dagger}\|\|c\|}{1+\mathrm{e}^{t\|A\|}\theta(t\|A\|,k)\|G^{\dagger}\|\|G\|}.$ (8)
Corollary 1.

Given $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{n}$ and $t\in\mathbb{R}_{+}$, for all $k\in\mathbb{N}$ such that $\lambda(t,\Omega,k)\geq 0$, we have $\mathcal{L}(t,k)[c+\lambda(t,\Omega,k)G\mathbf{B}_{p}]\subseteq\exp(tA)\Omega$.

Proof.

The corollary follows from Lemma 6 (with $\tilde{P}=\mathcal{L}(t,k)$ and $P=\exp(tA)$) by verifying that

$\alpha_{m}(\Omega,\mathrm{e}^{tA},\mathcal{L}(t,k))=\frac{\|\mathrm{e}^{tA}G\|_{l}-\|(\mathcal{L}(t,k)-\mathrm{e}^{tA})c\|}{\|\mathrm{e}^{tA}G\|_{l}+\|(\mathcal{L}(t,k)-\mathrm{e}^{tA})G\|}$
$\geq\frac{(\|G^{\dagger}\|\|\mathrm{e}^{-tA}\|)^{-1}-\|(\mathcal{L}(t,k)-\mathrm{e}^{tA})c\|}{(\|G^{\dagger}\|\|\mathrm{e}^{-tA}\|)^{-1}+\|(\mathcal{L}(t,k)-\mathrm{e}^{tA})G\|}$
$\geq\frac{1-\|G^{\dagger}\|\mathrm{e}^{t\|A\|}\|(\mathcal{L}(t,k)-\mathrm{e}^{tA})c\|}{1+\|G^{\dagger}\|\mathrm{e}^{t\|A\|}\|(\mathcal{L}(t,k)-\mathrm{e}^{tA})G\|}$
$\geq\frac{1-\mathrm{e}^{t\|A\|}\|G^{\dagger}\|\theta(t\|A\|,k)\|c\|}{1+\mathrm{e}^{t\|A\|}\|G^{\dagger}\|\theta(t\|A\|,k)\|G\|}=\lambda(t,\Omega,k)\geq 0,$

where we have used Lemma 3(b),(c). ∎

IV-B Invertible truncated Taylor series of the matrix exponential

In the algorithm presented in Lemma 5, images of the matrix exponential (the sets $S_{i}^{N}$ and $V_{i}^{N}$) are computed recursively. As we aim to adopt Corollary 1 to replace these exact images with under-approximations, it is important that each under-approximation is in the class $\mathbb{A}_{n}$ (cf. $\Omega$ in Corollary 1). This necessitates that:

  1. 1.

    the set $X_{0}$ is in $\mathbb{A}_{n}$, which holds by assumption;

  2. 2.

    the under-approximation of $\mathcal{R}_{u}(\tau)$, which we derive and discuss in Sections IV-C and IV-D, is in $\mathbb{A}_{n}$; and

  3. 3.

    in each iteration of the method, when under-approximating $S_{i}^{N}$ and $V_{i}^{N}$ using Corollary 1, the values of the function $\lambda$ are positive, and the values of $\mathcal{L}$ are invertible.

The positivity of $\lambda$ can be imposed by setting the number of Taylor terms in $\mathcal{L}$ to be sufficiently large (see Section IV-E for details).

Unfortunately, the invertibility of truncated Taylor expansions of the matrix exponential is not guaranteed in general. For example, if $A=-\operatorname{id}$, then $\mathcal{L}(1,2)=0$ (not invertible). The following lemma provides an efficient way to check the invertibility of $\mathcal{L}(t,k)$.

Lemma 7.

Let $t\in\mathbb{R}_{+}$ and define

$k_{\min}(t):=\min\left\{k\in\mathbb{N} \,\middle|\, \theta(t\rho(A),k)\,\mathrm{e}^{t\rho(A)}<1\right\}.$ (9)

Then, $\mathcal{L}(t,k)$ is invertible for all $k\in[k_{\min}(t);\infty[$.

Proof.

First, we note that $k_{\min}(t)$ is well-defined (finite) as $\lim_{k\rightarrow\infty}\theta(r,k)=0$ for any $r\in\mathbb{R}_{+}$, and that the inequality $\theta(t\rho(A),k)\exp(t\rho(A))<1$ holds for any $k\in[k_{\min}(t);\infty[$ as $\theta$ is monotonically decreasing with respect to its second argument. Fix $k\in[k_{\min}(t);\infty[$. The continuity of $\theta(t(\cdot),k)\exp(t(\cdot))$ implies the existence of some $\varepsilon>0$ such that $\theta(t(\rho(A)+\varepsilon),k)\exp(t(\rho(A)+\varepsilon))<1$. Using Lemma 1, there exists an induced matrix norm $\|\cdot\|_{\varepsilon}$ such that $\|A\|_{\varepsilon}\leq\rho(A)+\varepsilon$. Hence, using estimates (2) and (3) and the increasing monotonicity of $\theta(t(\cdot),k)\exp(t(\cdot))$,

$\|\mathrm{e}^{tA}-\mathcal{L}(t,k)\|_{\varepsilon}\|(\mathrm{e}^{tA})^{-1}\|_{\varepsilon}=\|\mathrm{e}^{tA}-\mathcal{L}(t,k)\|_{\varepsilon}\|\mathrm{e}^{-tA}\|_{\varepsilon}\leq\theta(t\|A\|_{\varepsilon},k)\,\mathrm{e}^{t\|A\|_{\varepsilon}}\leq\theta(t(\rho(A)+\varepsilon),k)\,\mathrm{e}^{t(\rho(A)+\varepsilon)}<1.$

Finally, using Lemma 2, $\mathcal{L}(t,k)$ is invertible. ∎

IV-C Under-approximating input reachable sets

The second main issue with implementing the algorithm in Lemma 5 is the requirement to evaluate the input reachable set $\mathcal{R}_{u}(\tau)$ exactly, which is generally not possible. The following lemma is the starting point to address this issue.

Lemma 8.

Let $I=[a,b]$ be a compact interval, $P,\tilde{P}:I\rightarrow\mathbb{R}^{n\times m}$ be continuous matrix-valued functions, and $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{m}$. Assume that $\mathrm{rank}(P(s))=n$ for all $s\in I$. Moreover, assume $\sup_{s\in I}\|(\tilde{P}(s)-P(s))c\|\leq\inf_{s\in I}\|P(s)G\|_{l}$. Then, $\int_{I}\tilde{P}(s)\,\mathrm{d}s\,(c+\alpha G\mathbf{B}_{p})\subseteq\int_{I}P(s)\Omega\,\mathrm{d}s$ for any $\alpha\in[0,\gamma_{m}(\Omega,I,P,\tilde{P})]$, where

$\gamma_{m}(\Omega,I,P,\tilde{P}):=\frac{\inf_{s\in I}\|P(s)G\|_{l}-\sup_{s\in I}\|P_{d}(s)c\|}{\inf_{s\in I}\|P(s)G\|_{l}+\sup_{s\in I}\|P_{d}(s)G\|},$ (10)

and $P_{d}(s):=\tilde{P}(s)-P(s)$, $s\in I$.

Proof.

Fix $\alpha\in[0,\gamma_{m}(\Omega,I,P,\tilde{P})]$ and define

$\mu=\sup_{s\in I}\|(\tilde{P}(s)-P(s))c\|+\alpha\sup_{s\in I}\|(\tilde{P}(s)-P(s))G\|.$

Then, $\alpha$ satisfies $\mu\leq(1-\alpha)\inf_{s\in I}\|P(s)G\|_{l}$. Let $y\in\int_{I}\tilde{P}(s)\,\mathrm{d}s\,(c+\alpha G\mathbf{B}_{p})$. Then, there exists $x\in\mathbf{B}_{p}$ such that $y=\int_{I}\tilde{P}(s)\,\mathrm{d}s\,(c+\alpha Gx)=\int_{I}f(s)\,\mathrm{d}s$, where $f(s)=\tilde{P}(s)(c+\alpha Gx)$. Therefore, using Lemma 3(a), we have, for all $s\in I$,

$f(s)=P(s)c+(\tilde{P}(s)-P(s))c+\alpha(\tilde{P}(s)-P(s))Gx+\alpha P(s)Gx$
$\subseteq P(s)c+\mu\mathbf{B}_{n}+\alpha P(s)G\mathbf{B}_{p}$
$\subseteq P(s)c+(1-\alpha)\inf_{z\in I}\|P(z)G\|_{l}\mathbf{B}_{n}+\alpha P(s)G\mathbf{B}_{p}$
$\subseteq P(s)c+(1-\alpha)\|P(s)G\|_{l}\mathbf{B}_{n}+\alpha P(s)G\mathbf{B}_{p}$
$\subseteq P(s)c+(1-\alpha)P(s)G\mathbf{B}_{p}+\alpha P(s)G\mathbf{B}_{p}=P(s)\Omega.$

Using [29, Theorem 8.2.10, p. 316], there exists $g\in\Omega^{I}$ such that $f(s)=P(s)g(s)$ for almost all $s\in I$. Hence, $y=\int_{I}P(s)g(s)\,\mathrm{d}s\in\int_{I}P(s)\Omega\,\mathrm{d}s$, which completes the proof. ∎

The logic of Lemma 8 is similar to that of Lemma 6, where the set-valued integral $\int_{I}P(s)(c+G\mathbf{B}_{p})\,\mathrm{d}s$ is under-approximated by a deflated version of the set $\int_{I}\tilde{P}(s)\,\mathrm{d}s\,(c+G\mathbf{B}_{p})$, utilizing the approximating matrix function $\tilde{P}$. Note that the set representation of $\int_{I}\tilde{P}(s)\,\mathrm{d}s\,(c+G\mathbf{B}_{p})$ (a linear transformation of $c+G\mathbf{B}_{p}$ under the constant matrix $\int_{I}\tilde{P}(s)\,\mathrm{d}s$) is simpler than that of the original set-valued integral, making it more appealing in set-valued computations. The deflation herein is obtained based on estimates of the approximation errors, as seen in the definition of $\gamma_{m}$ given by Equation (10).

Lemma 8 provides a sufficient tool to under-approximate the input reachable set as seen in the following corollary.

Corollary 2.

Given $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{n}$ and $t\in\mathbb{R}_{+}$, for all $k\in\mathbb{N}$ such that $\lambda(t,\Omega,k)\geq 0$, we have $\mathcal{T}(t,k)(c+\lambda(t,\Omega,k)G\mathbf{B}_{p})\subseteq\int_{0}^{t}\exp(sA)\Omega\,\mathrm{d}s$.

Proof.

This follows from Lemma 8 (with $I=[0,t]$, $P=\mathrm{e}^{(\cdot)A}$, and $\tilde{P}=\mathcal{L}(\cdot,k)$) by verifying that $\gamma_{m}(\Omega,I,\mathrm{e}^{(\cdot)A},\mathcal{L}(\cdot,k))\geq\lambda(t,\Omega,k)\geq 0$, where the detailed estimates, which are similar to those presented in the proof of Corollary 1, are omitted for brevity. ∎

IV-D Invertibility of the integral of the matrix exponential

Corollary 2 establishes how the input reachable set $\mathcal{R}_{u}(\tau)$ required in the algorithm in Lemma 5 can be replaced by an under-approximation that is an affine transformation of a unit ball (not necessarily full-dimensional), as $U\in\mathbb{A}_{n}$ by assumption. A blind implementation of the corollary may, however, lead to another issue if the under-approximation of $\mathcal{R}_{u}(\tau)$ is not a full-dimensional set (i.e., not in $\mathbb{A}_{n}$). As mentioned in Section IV-B, we need the under-approximation of $\mathcal{R}_{u}(\tau)$ to be in $\mathbb{A}_{n}$. This, as can be seen from Corollary 2, is attained if the utilized value of $\lambda$ is positive and the value of the approximating function $\mathcal{T}$ is invertible. The first requirement can be fulfilled by setting the number of Taylor series terms of the approximation $\mathcal{T}$ to be large; we elaborate on that in Section IV-E. What is left is to ensure the invertibility of the value of $\mathcal{T}$, which we can guarantee if the integral of the matrix exponential itself is invertible.

To further investigate the invertibility requirement, let us first introduce the set $\mathbb{I}$ of positive values of $t$ for which $\int_{0}^{t}\exp(sA)\,\mathrm{d}s$ is invertible, i.e.,

$\mathbb{I}:=\left\{t\in\mathbb{R}_{+} \,\middle|\, \int_{0}^{t}\exp(sA)\,\mathrm{d}s~\textrm{is invertible}\right\}.$

This set can be deduced exactly from the eigenvalues of $A$, as shown in the lemma below (see, e.g., [28, Lemma 3.4.1, p. 100]).

Lemma 9.

Let $t\in\mathbb{R}_{+}\setminus\{0\}$. Then $\int_{0}^{t}\exp(sA)\,\mathrm{d}s$ is invertible if and only if $2\pi z\mathbf{i}/t$ is not an eigenvalue of $A$ for any $z\in\mathbb{Z}\setminus\{0\}$, where $\mathbf{i}=\sqrt{-1}$.

We can see from Lemma 9 that the invertibility of $\int_{0}^{t}\exp(sA)\,\mathrm{d}s$ fails at only a countable number of values of $t$. Hence, in practice, the invertibility is rarely an issue. Furthermore, we can use Lemma 9 to show that $\int_{0}^{t}\exp(sA)\,\mathrm{d}s$ is always invertible for $t$ sufficiently small but nonzero (which is the case as we use it with $t=\tau$). For a given system matrix $A$, the following lemma derives a fixed open interval on which the integral of the matrix exponential is guaranteed to be invertible.

Lemma 10.

Let $E=\{m\in\mathbb{R}_{+} \mid m=|\lambda|,~Av=\lambda v,~\lambda\in\mathbb{C},~v\in\mathbb{C}^{n},~\Re(\lambda)=0\}\setminus\{0\}$ be the set of absolute values of the purely imaginary (nonzero) eigenvalues of $A$. If $E$ is non-empty, set $t_{\max}=2\pi/\max E$; otherwise, $t_{\max}=\infty$. Then, $\int_{0}^{t}\exp(sA)\,\mathrm{d}s$ is invertible for all $t\in\,]0,t_{\max}[$.

As can be seen from Lemma 10, based on the eigenvalues of the system matrix $A$, which is fixed and known for a given system, we can always find a non-empty open interval, with zero as a left limit point, on which the integral of the matrix exponential is guaranteed to be invertible. In our proposed method, we are interested in an under-approximation of $\mathcal{R}_{u}(\tau)$, where $\tau$ is typically small as it corresponds to the time step size of the method ($\tau=T/N$). Therefore, the invertibility of $\int_{0}^{\tau}\exp(sA)\,\mathrm{d}s$ can always be fulfilled by setting the time discretization parameter $N$ to be sufficiently large (but still finite), depending on the eigenvalues of $A$.
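A sketch of the computation of $t_{\max}$ from the spectrum of $A$ is given below (Python/NumPy, illustration only; the tolerance `tol` used to decide when a real part counts as zero is an assumption of the sketch, not part of the paper).

```python
import numpy as np

def t_max(A: np.ndarray, tol: float = 1e-12) -> float:
    """Right end point of ]0, t_max[ from Lemma 10: t_max = 2*pi / max(E), where E
    collects the magnitudes of the purely imaginary, nonzero eigenvalues of A
    (t_max is infinite if E is empty)."""
    eig = np.linalg.eigvals(A)
    E = [abs(lam) for lam in eig if abs(lam.real) <= tol and abs(lam) > tol]
    return 2.0 * np.pi / max(E) if E else np.inf

# Example: a harmonic oscillator with eigenvalues +/- 2i gives t_max = pi.
A_osc = np.array([[0.0, 2.0], [-2.0, 0.0]])
print(t_max(A_osc))   # approximately 3.1416
```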

If the invertibility of the integral of the matrix exponential is fulfilled ($t\in\mathbb{I}$), we can obtain a finite $k\in\mathbb{N}$ such that the truncated Taylor series of $\int_{0}^{t}\exp(sA)\,\mathrm{d}s$, namely $\mathcal{T}(t,k)$, is invertible. The existence of such a finite $k$ follows from the invertibility of $\int_{0}^{t}\exp(sA)\,\mathrm{d}s$ ($t\in\mathbb{I}$), the fact that $\lim_{k\rightarrow\infty}\mathcal{T}(t,k)=\int_{0}^{t}\exp(sA)\,\mathrm{d}s$, and the continuity of the matrix inverse (Lemma 2).

IV-E Determining the number of Taylor series terms

Next, we aim to determine the number of Taylor series terms used in the approximations $\mathcal{L}$ and $\mathcal{T}$. First, note that the under-approximations introduced in Corollaries 1 and 2 depend on the deflation function $\lambda$. In this work, we propose simple criteria for determining the number of Taylor series terms, which aim to maximize the values of $\lambda$ while incorporating the full-dimensionality considerations discussed in Sections IV-B and IV-D. Notice that for any given $t\in\mathbb{R}_{+}$ and $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{n}$, the function $\lambda(t,\Omega,\cdot)$ is bounded above with a least upper bound of $1$. Our goal is to choose the minimum value of $k$ such that the deflation coefficient is larger than some design parameter $\epsilon\in[0,1[$, while ensuring the invertibility of the approximations of the matrix exponential and its integral. Therefore, we introduce the following parameters associated with the number of Taylor series terms used in under-approximating homogeneous and input reachable sets.

Definition 2.

Given $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{n}$, $t\in\mathbb{R}_{+}$, $\bar{t}\in\mathbb{I}$, and $\epsilon\in[0,1[$, $\kappa(t,\Omega,\epsilon)$ is defined as

$\kappa(t,\Omega,\epsilon):=\min\left(\left\{k\in[k_{\min}(t);\infty[ \,\middle|\, \lambda(t,\Omega,k)>\epsilon\right\}\cap[2;\infty[\right).$ (11)

Moreover, $\eta(\bar{t},\Omega,\epsilon)$ is defined as

$\eta(\bar{t},\Omega,\epsilon):=\min\{k\in\mathbb{N} \mid \lambda(\bar{t},\Omega,k)>\epsilon,~\mathcal{T}(\bar{t},k)~\textrm{is invertible}\}.$ (12)

As shown in the proof of Lemma 7, $k_{\min}(t)$ is well-defined. Since $\lambda(t,\Omega,k)\rightarrow 1$ as $k\rightarrow\infty$, $\kappa(t,\Omega,\epsilon)$ is also well-defined for any $\epsilon\in[0,1[$. Note that the lower bound of $2$ used in the definition of $\kappa(t,\Omega,\epsilon)$ is of importance only when deducing the convergence guarantees in Section VI. The well-definedness of $\eta(\bar{t},\Omega,\epsilon)$ follows from the fact that $\lim_{k\rightarrow\infty}\lambda(\bar{t},\Omega,k)=1$ and the invertibility argument at the end of Section IV-D.
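A sketch of how $\lambda$, $\kappa$, and $\eta$ may be evaluated numerically is given below (Python/NumPy, maximum norm, illustration only). It reuses the `theta`, `k_min`, and `taylor_T` helpers sketched earlier; the invertibility test via `matrix_rank` and the cap `k_cap` are practical stand-ins assumed for the sketch, not formally verified checks.

```python
import numpy as np

def deflation_lambda(A, t, c, G, k) -> float:
    # Deflation coefficient (8) w.r.t. the maximum norm.
    op = lambda M: np.linalg.norm(M, ord=np.inf)
    e = np.exp(t * op(A)) * theta(t * op(A), k) * op(np.linalg.pinv(G))
    return (1.0 - e * np.max(np.abs(c))) / (1.0 + e * op(G))

def kappa(A, t, c, G, eps, k_cap=200) -> int:
    # Smallest k >= max(k_min(t), 2) with lambda(t, Omega, k) > eps, cf. (11)
    # (lambda is nondecreasing in k, so the first hit is the minimum).
    k = max(k_min(A, t), 2)
    while deflation_lambda(A, t, c, G, k) <= eps:
        k += 1
        if k > k_cap:
            raise RuntimeError("increase k_cap or decrease t")
    return k

def eta(A, t_bar, c, G, eps, k_cap=200) -> int:
    # Smallest k with lambda(t_bar, Omega, k) > eps and T(t_bar, k) invertible, cf. (12).
    for k in range(1, k_cap + 1):
        T = taylor_T(A, t_bar, k)
        if deflation_lambda(A, t_bar, c, G, k) > eps and np.linalg.matrix_rank(T) == T.shape[0]:
            return k
    raise RuntimeError("increase k_cap or decrease t_bar")
```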

IV-F Under-approximations of reachable sets and tubes

The previous sections have established the tools necessary for the proposed method. Next, we introduce the operators $\mathcal{H}$ and $\mathcal{I}$, which are designed based on Corollaries 1 and 2, in addition to the criteria introduced in (11) and (12), to obtain full-dimensional under-approximations of homogeneous and input reachable sets using approximations of the matrix exponential and its integral.

Definition 3.

We define the homogeneous and input under-approximation operators $\mathcal{H}\colon\mathbb{R}_{+}\times\mathbb{A}_{n}\times[0,1[\,\rightarrow\mathbb{A}_{n}$ and $\mathcal{I}\colon\mathbb{I}\times\mathbb{A}_{n}\times[0,1[\,\rightarrow\mathbb{A}_{n}$ as follows:

$\mathcal{H}(t,\Omega,\epsilon):=\mathcal{L}(t,\kappa(t,\Omega,\epsilon))\left[c+\lambda(t,\Omega,\kappa(t,\Omega,\epsilon))G\mathbf{B}_{p}\right],$ (13)
$\mathcal{I}(\bar{t},\Omega,\epsilon):=\mathcal{T}(\bar{t},\eta(\bar{t},\Omega,\epsilon))\left[c+\lambda(\bar{t},\Omega,\eta(\bar{t},\Omega,\epsilon))G\mathbf{B}_{p}\right],$ (14)

where $t\in\mathbb{R}_{+}$, $\bar{t}\in\mathbb{I}$, $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{n}$, and $\epsilon\in[0,1[$.

Now, we are ready to introduce the proposed method.

Theorem 11.

Given $N\in\mathbb{N}$ and $\epsilon_{h},\epsilon_{u}\in[0,1[$, define $\tau=T/N$ and assume $\tau\in\mathbb{I}$. Moreover, define

$\mathcal{S}_{0}^{N}=X_{0}, \qquad \mathcal{S}_{i}^{N}=\mathcal{H}(\tau,\mathcal{S}_{i-1}^{N},\epsilon_{h}),~i\in[1;N],$ (15a)
$\mathcal{V}_{0}^{N}=\mathcal{I}(\tau,U,\epsilon_{u}), \qquad \mathcal{V}_{i}^{N}=\mathcal{H}(\tau,\mathcal{V}_{i-1}^{N},\epsilon_{h}),~i\in[1;N],$ (15b)
$\mathcal{W}_{0}^{N}=\{0\}, \qquad \mathcal{W}_{i}^{N}=\mathcal{W}_{i-1}^{N}+\mathcal{V}_{i-1}^{N},~i\in[1;N],$ (15c)
$\Lambda_{i}^{N}=\mathcal{S}_{i}^{N}+\mathcal{W}_{i}^{N},~i\in[0;N].$ (15d)

Recall the definitions of $\{S_{i}^{N}\}_{i=0}^{N}$, $\{V_{i}^{N}\}_{i=0}^{N}$, $\{W_{i}^{N}\}_{i=0}^{N}$, and $\{\Gamma_{i}^{N}\}_{i=0}^{N}$ given in Lemma 5. Then, $\mathcal{S}_{i}^{N}\subseteq S_{i}^{N}=\mathcal{R}_{h}(i\tau)$, $\mathcal{V}_{i}^{N}\subseteq V_{i}^{N}$, $\mathcal{W}_{i}^{N}\subseteq W_{i}^{N}=\mathcal{R}_{u}(i\tau)$, and $\Lambda_{i}^{N}\subseteq\Gamma_{i}^{N}=\mathcal{R}(i\tau)$ for all $i\in[0;N]$. Furthermore, $\bigcup_{i=0}^{N}\Lambda_{i}^{N}\subseteq\mathcal{R}([0,T])$.

Proof.

We have $S_{0}^{N}=\mathcal{S}_{0}^{N}=X_{0}$. Assume that $\mathcal{S}_{i}^{N}\subseteq S_{i}^{N}$ for some $i\in[0;N-1]$; then, using Corollary 1, $\mathcal{S}_{i+1}^{N}=\mathcal{H}(\tau,\mathcal{S}_{i}^{N},\epsilon_{h})\subseteq\exp(\tau A)\mathcal{S}_{i}^{N}\subseteq\exp(\tau A)S_{i}^{N}=S_{i+1}^{N}$. Therefore, by induction, $\mathcal{S}_{i}^{N}\subseteq S_{i}^{N}$ for all $i\in[0;N]$. Similarly, using Corollary 2 for the base case and Corollary 1 for the recursion, we have $\mathcal{V}_{i}^{N}\subseteq V_{i}^{N}$ for all $i\in[0;N]$. Hence, for all $i\in[0;N]$, $\mathcal{W}_{i}^{N}\subseteq W_{i}^{N}$. Moreover, for all $i\in[0;N]$, $\Lambda_{i}^{N}=\mathcal{S}_{i}^{N}+\mathcal{W}_{i}^{N}\subseteq S_{i}^{N}+W_{i}^{N}=\Gamma_{i}^{N}=\mathcal{R}(i\tau)$. Finally, $\bigcup_{i=0}^{N}\Lambda_{i}^{N}\subseteq\bigcup_{i\in[0;N]}\mathcal{R}(i\tau)\subseteq\bigcup_{t\in[0,T]}\mathcal{R}(t)=\mathcal{R}([0,T])$. ∎
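To make the recursion (15) concrete, the sketch below (Python/NumPy, illustration only) propagates sets represented as center/generator pairs (anticipating the zonotopic implementation of Section V). It reuses the `taylor_L`, `taylor_T`, `deflation_lambda`, `kappa`, and `eta` helpers sketched earlier; it is a sketch under those assumptions, not the authors' reference implementation.

```python
import numpy as np

def H_op(A, t, c, G, eps):
    # Homogeneous under-approximation operator (13): deflate, then map by L(t, kappa).
    k = kappa(A, t, c, G, eps)
    lam = deflation_lambda(A, t, c, G, k)
    L = taylor_L(A, t, k)
    return L @ c, lam * (L @ G)

def I_op(A, t_bar, c, G, eps):
    # Input under-approximation operator (14): deflate, then map by T(t_bar, eta).
    k = eta(A, t_bar, c, G, eps)
    lam = deflation_lambda(A, t_bar, c, G, k)
    T = taylor_T(A, t_bar, k)
    return T @ c, lam * (T @ G)

def under_approximate(A, T_final, N, cx, Gx, cu, Gu, eps_h, eps_u):
    """Recursion (15): returns the sets Lambda_i^N as (center, generators),
    i = 0..N, under-approximating R(i*tau). Assumes tau = T_final/N lies in the
    invertibility interval of Lemma 10 and eps_h, eps_u are in [0, 1)."""
    tau = T_final / N
    n = A.shape[0]
    S_c, S_G = cx.copy(), Gx.copy()              # S_0 = X_0
    V_c, V_G = I_op(A, tau, cu, Gu, eps_u)       # V_0 = I(tau, U, eps_u)
    W_c, W_G = np.zeros(n), np.zeros((n, 0))     # W_0 = {0}
    Lambdas = [(S_c + W_c, np.hstack([S_G, W_G]))]
    for _ in range(N):
        W_c, W_G = W_c + V_c, np.hstack([W_G, V_G])   # W_i = W_{i-1} + V_{i-1}
        S_c, S_G = H_op(A, tau, S_c, S_G, eps_h)      # S_i = H(tau, S_{i-1}, eps_h)
        V_c, V_G = H_op(A, tau, V_c, V_G, eps_h)      # V_i = H(tau, V_{i-1}, eps_h)
        Lambdas.append((S_c + W_c, np.hstack([S_G, W_G])))
    return Lambdas
```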

Remark 2 (Assumptions on initial and input sets).

In Section III, it was assumed that both the initial and input sets are in $\mathbb{A}_{n}$. This assumption can be slightly relaxed if one of these sets is exactly $\{0\}$. For example, if $X_{0}=\{0\}$, then $\mathcal{R}(t)=\mathcal{R}_{u}(t)$, $t\in[0,T]$. The proposed method can be implemented in this case by computing the sets $\mathcal{W}_{i}^{N}$, $i\in[0;N]$, which are independent of $X_{0}$, and omitting the computations of $\mathcal{S}_{i}^{N}$, $i\in[0;N]$. Similarly, if $U=\{0\}$, we have $\mathcal{R}(t)=\mathcal{R}_{h}(t)$, $t\in[0,T]$, and the proposed method can be implemented by computing the sets $\mathcal{S}_{i}^{N}$, $i\in[0;N]$, only, which are independent of $U$.

V Implementation using zonotopes and memory complexity

Fix $N\in\mathbb{N}$ and recall the definitions of $\{\mathcal{S}_{i}^{N}\}_{i=0}^{N}$, $\{\mathcal{V}_{i}^{N}\}_{i=0}^{N}$, $\{\mathcal{W}_{i}^{N}\}_{i=0}^{N}$, and $\{\Lambda_{i}^{N}\}_{i=0}^{N}$ in Theorem 11. The computations of $\mathcal{S}_{i}^{N}$, $\mathcal{V}_{i}^{N}$, $i\in[0;N]$, are straightforward for any norm on $\mathbb{R}^{n}$ since these sets are simply full-dimensional affine transformations of unit balls. However, the computations of $\{\mathcal{W}_{i}^{N}\}_{i=0}^{N}$ and $\{\Lambda_{i}^{N}\}_{i=0}^{N}$ involve Minkowski sums, whose explicit expressions are generally unknown. If the embedded norm is the maximum norm, then Minkowski sums can be computed explicitly. Hence, we implement the proposed method using zonotopes, i.e., affine transformations of closed unit balls w.r.t. the maximum norm.

Given $c\in\mathbb{R}^{n}$ and $G\in\mathbb{R}^{n\times p}$, a zonotope $\mathcal{Z}(c,G)\subseteq\mathbb{R}^{n}$ is defined by $c+G\mathbf{B}_{p}^{\infty}$, where $\mathbf{B}_{p}^{\infty}$ denotes the $p$-dimensional closed unit ball w.r.t. the maximum norm. The columns of $G$ are referred to as the generators of $\mathcal{Z}(c,G)$, and the ratio $p/n$ is referred to as the order of $\mathcal{Z}(c,G)$ and is denoted by $o(\mathcal{Z}(c,G))$ (e.g., the order of $\mathbf{B}_{n}^{\infty}$ is one). Herein, the number of generators of $\mathcal{Z}(c,G)$ is denoted by $\mathrm{Gen}(\mathcal{Z}(c,G))$. For any two zonotopes $\mathcal{Z}(c,G),\mathcal{Z}(\tilde{c},\tilde{G})\subseteq\mathbb{R}^{n}$ and any linear transformation $L\in\mathbb{R}^{m\times n}$, $\mathcal{Z}(c,G)+\mathcal{Z}(\tilde{c},\tilde{G})=\mathcal{Z}(c+\tilde{c},[G,\tilde{G}])$ and $L\mathcal{Z}(c,G)=\mathcal{Z}(Lc,LG)$.
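These two identities are the only set operations the method needs; with zonotopes stored as center/generator pairs, they reduce to concatenating generator matrices and applying matrix products, as in the sketch below (Python/NumPy, illustration only).

```python
import numpy as np

def minkowski_sum(Z1, Z2):
    # Z(c, G) + Z(c~, G~) = Z(c + c~, [G, G~]): add centers, concatenate generators.
    (c1, G1), (c2, G2) = Z1, Z2
    return c1 + c2, np.hstack([G1, G2])

def linear_map(L, Z):
    # L * Z(c, G) = Z(L c, L G).
    c, G = Z
    return L @ c, L @ G

# Example: the unit square plus a segment, then scaled by a diagonal matrix.
square = (np.zeros(2), np.eye(2))
segment = (np.array([1.0, 0.0]), np.array([[0.5], [0.5]]))
Z = minkowski_sum(square, segment)
Z = linear_map(np.diag([2.0, 1.0]), Z)
```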

Let us analyze the memory complexity of the proposed method implemented with zonotopes. We have $\mathrm{Gen}(X_{0})=o(X_{0})n$ and $\mathrm{Gen}(U)=o(U)n$. As affinely transforming zonotopes preserves their orders, we have $\mathrm{Gen}(\mathcal{S}_{i}^{N})=o(X_{0})n$, $i\in[0;N]$, and $\mathrm{Gen}(\mathcal{V}_{i}^{N})=o(U)n$, $i\in[0;N]$. The sequence $\{\mathcal{W}_{i}^{N}\}_{i=0}^{N}$ is computed as $\mathcal{W}_{i}^{N}=\sum_{j=0}^{i-1}\mathcal{V}_{j}^{N}$, $i\in[0;N]$. Hence, $\mathrm{Gen}(\mathcal{W}_{i}^{N})=i\,o(U)n$, $i\in[0;N]$. Consequently, as $\Lambda_{i}^{N}=\mathcal{S}_{i}^{N}+\mathcal{W}_{i}^{N}$, $i\in[0;N]$, we have $\mathrm{Gen}(\Lambda_{i}^{N})=o(X_{0})n+i\,o(U)n$, $i\in[0;N]$. Finally, the total number of generators from the sequence $\{\Lambda_{i}^{N}\}_{i=0}^{N}$ is $\sum_{i=0}^{N}\mathrm{Gen}(\Lambda_{i}^{N})=(N+1)o(X_{0})n+N(N+1)o(U)n/2$. This shows that the total number of generators stored is of order $N^{2}n$, which gives a space complexity of order $N^{2}n^{2}$ (second order in $N$), identical, e.g., to the space complexity of the over-approximation method in [15]. However, if we store only the sets $\mathcal{S}_{i}^{N}$, $\mathcal{V}_{i}^{N}$, $i\in[0;N]$ (the sets prior to Minkowski sum computations), the space complexity is reduced to order $Nn^{2}$ (first order in $N$), where the sets $\mathcal{W}_{i}^{N}$, $\Lambda_{i}^{N}$, $i\in[0;N]$, can be computed afterwards when needed by means of Minkowski sums, which are computationally inexpensive [30]. This approach was proposed in [9] in order to lower memory complexity, also resulting in first-order memory complexity with respect to $N$. In Section VII-C, we explore empirically the time efficiency of the proposed method by means of numerical simulations.

As shown above, Minkowski sums of zonotopes increase the number of generators to be stored. This may limit the applicability of our zonotopic implementation to cases where the discretization parameter $N$ is not significantly large, as the proposed method incorporates iterative Minkowski sums. This issue can be overcome by implementing zonotopic order reductions that replace a given zonotope with a zonotopic under-approximation with fewer generators, which lessens the memory requirement at the price of reduced accuracy. For example, a given zonotope can first be enclosed by another zonotope with fewer generators using one of the standard over-approximating order reduction techniques (see, e.g., [31]), and then the over-approximating zonotope can be scaled down to be contained in the original zonotope by utilizing linear programming (see [32]). Furthermore, a zonotope can be under-approximated by choosing a subset of its generators, based on some specified criteria, and replacing the chosen generators by their sum (or difference) [7, 33]. In fact, these two approaches have been implemented in the 2021 version of the reachability software CORA [34]. In Section VII, we will explore the influence of order reduction techniques on the under-approximations obtained using our proposed method.

VI Convergence

In Section IV, we introduced the proposed method and proved its under-approximation property. The proposed method requires assigning values to the parameters $\epsilon_{h}$ and $\epsilon_{u}$ in the interval $[0,1[$. The closer the values of $\epsilon_{h}$ and $\epsilon_{u}$ are to one, the higher the approximation accuracy. Herein, we show that if we choose $\epsilon_{h}=1-1/N^{2}$ and $\epsilon_{u}=1-1/N$, then the proposed method generates convergent under-approximations as $N$ approaches $\infty$, with first-order convergence guarantees in the sense of the Hausdorff distance; this is the second main result of this work.

Theorem 12.

Let $N\in\mathbb{N}$, $\tau=T/N$, $\epsilon_{h}=1-1/N^{2}$, and $\epsilon_{u}=1-1/N$. Assume $\tau\in\mathbb{I}$ and that $\tau\|A\|\leq 1$. Define $\{\mathcal{S}_{i}^{N}\}_{i=0}^{N}$, $\{\mathcal{V}_{i}^{N}\}_{i=0}^{N}$, $\{\mathcal{W}_{i}^{N}\}_{i=0}^{N}$, and $\{\Lambda_{i}^{N}\}_{i=0}^{N}$ as in Theorem 11. Then, there exist constants $D_{1},D_{2},D_{3},D_{4}\in\mathbb{R}_{+}$, independent of $N$, such that, for all $i\in[0;N]$, $\mathfrak{d}(\mathcal{S}_{i}^{N},\mathcal{R}_{h}(i\tau))\leq D_{1}\tau$, $\mathfrak{d}(\mathcal{W}_{i}^{N},\mathcal{R}_{u}(i\tau))\leq D_{2}\tau$, $\mathfrak{d}(\Lambda_{i}^{N},\mathcal{R}(i\tau))\leq D_{3}\tau$, and $\mathfrak{d}(\bigcup_{i=0}^{N}\Lambda_{i}^{N},\mathcal{R}([0,T]))\leq D_{4}\tau$.

Next, we state some technical results that are necessary in the proof of Theorem 12. The proofs of Lemmas 14, 15, and 16 are given in the Appendix.

Lemma 13 (Semigroup property of reachable sets [35]).

Given $0\leq a\leq b\leq T$, $\mathcal{R}(b)=\exp((b-a)A)\mathcal{R}(a)+\int_{0}^{b-a}\exp(sA)U\,\mathrm{d}s$.

Lemma 14.

Let $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{n}$, $\epsilon\in[0,1[$, and $t\in[0,T]$. Assume $t\|A\|\leq 1$. Then,

$\mathfrak{d}(\mathcal{H}(t,\Omega,\epsilon),\mathrm{e}^{tA}\Omega)\leq\left(2(1-\epsilon)+(t\|A\|)^{2}\right)\mathrm{e}^{T\|A\|}\|\Omega\|.$
Lemma 15.

Let $\Omega=c+G\mathbf{B}_{p}\in\mathbb{A}_{n}$, $\epsilon\in[0,1[$, and $t\in\mathbb{I}\cap[0,T]$. Assume $t\|A\|\leq 1$. Then,

$\mathfrak{d}\left(\mathcal{I}(t,\Omega,\epsilon),\int_{0}^{t}\mathrm{e}^{sA}\Omega\,\mathrm{d}s\right)\leq 2\left((1-\epsilon)t+t^{2}\|A\|\right)\mathrm{e}^{T\|A\|}\|\Omega\|.$
Lemma 16.

Let $N\in\mathbb{N}$, $\tau=T/N$, and let $\{S_{i}^{N}\}_{i=0}^{N}$ and $\{V_{i}^{N}\}_{i=0}^{N}$ be defined as in Equations (6a) and (6b), respectively. Then, $\|S_{i}^{N}\|\leq\exp(T\|A\|)\|X_{0}\|$ and $\|V_{i}^{N}\|\leq\tau\exp(2T\|A\|)\|U\|$ for all $i\in[0;N]$.

Now, we are ready to prove Theorem 12.

Proof of Theorem 12.

Recall the definitions of $\{S_{i}^{N}\}_{i=0}^{N}$, $\{V_{i}^{N}\}_{i=0}^{N}$, $\{W_{i}^{N}\}_{i=0}^{N}$, and $\{\Gamma_{i}^{N}\}_{i=0}^{N}$ in Lemma 5. Note that, according to Lemma 5, $S_{i}^{N}=\mathcal{R}_{h}(i\tau)$, $W_{i}^{N}=\mathcal{R}_{u}(i\tau)$, and $\Gamma_{i}^{N}=\mathcal{R}(i\tau)$ for all $i\in[0;N]$. Assume without loss of generality that $A\neq 0$ (the case $A=0$ is trivial). Let $p_{i}=\mathfrak{d}(S_{i}^{N},\mathcal{S}_{i}^{N})$, $i\in[0;N]$. We have $p_{0}=0$ as $S_{0}^{N}=\mathcal{S}_{0}^{N}=X_{0}$. For $i\in[1;N]$, we have, using the definitions of $S_{i}^{N}$ and $\mathcal{S}_{i}^{N}$ in Equations (6a) and (15a), respectively, the triangle inequality, and Lemma 4(b),

$p_{i}\leq\mathfrak{d}(\mathrm{e}^{\tau A}S_{i-1}^{N},\mathrm{e}^{\tau A}\mathcal{S}_{i-1}^{N})+\mathfrak{d}(\mathrm{e}^{\tau A}\mathcal{S}_{i-1}^{N},\mathcal{H}(\tau,\mathcal{S}_{i-1}^{N},\epsilon_{h}))$
$\leq\mathrm{e}^{\tau\|A\|}p_{i-1}+\mathfrak{d}(\mathrm{e}^{\tau A}\mathcal{S}_{i-1}^{N},\mathcal{H}(\tau,\mathcal{S}_{i-1}^{N},\epsilon_{h})).$

Using Lemma 14 and the fact that $\epsilon_{h}=1-(\tau/T)^{2}$, the term $\mathfrak{d}(\exp(\tau A)\mathcal{S}_{i-1}^{N},\mathcal{H}(\tau,\mathcal{S}_{i-1}^{N},\epsilon_{h}))$ is bounded above by $C_{1}\|\mathcal{S}_{i-1}^{N}\|\tau^{2}$, where $C_{1}=(2/T^{2}+\|A\|^{2})\exp(T\|A\|)$. Moreover, using the fact that $\mathcal{S}_{i-1}^{N}\subseteq S_{i-1}^{N}$, $i\in[1;N]$, as shown in Theorem 11, and Lemma 16, we have $\|\mathcal{S}_{i-1}^{N}\|\leq\|S_{i-1}^{N}\|\leq M=\exp(T\|A\|)\|X_{0}\|$, $i\in[1;N+1]$. Therefore, $p_{i}\leq\exp(\tau\|A\|)p_{i-1}+C_{1}M\tau^{2}$, $i\in[1;N]$. Using induction, we have, for all $i\in[0;N]$,

$p_{i}\leq\frac{\mathrm{e}^{i\tau\|A\|}-1}{\mathrm{e}^{\tau\|A\|}-1}C_{1}M\tau^{2}\leq\frac{\mathrm{e}^{T\|A\|}-1}{\tau\|A\|}C_{1}M\tau^{2}=D_{1}\tau,$ (16)

where $D_{1}=(\exp(T\|A\|)-1)C_{1}M/\|A\|$ (the second inequality in (16) uses $i\tau\leq T$ and $\mathrm{e}^{\tau\|A\|}-1\geq\tau\|A\|$). Similarly, let $q_{i}=\mathfrak{d}(V_{i}^{N},\mathcal{V}_{i}^{N})$, $i\in[0;N]$. We have, using Lemma 15 and the fact that $\epsilon_{u}=1-\tau/T$, $q_{0}=\mathfrak{d}(\int_{0}^{\tau}\exp(sA)U\,\mathrm{d}s,\mathcal{I}(\tau,U,\epsilon_{u}))\leq C_{2}\tau^{2}$, where $C_{2}=2(1/T+\|A\|)\exp(T\|A\|)\|U\|$. The remaining terms, $q_{i}$, $i\in[1;N]$, can be bounded using the triangle inequality and Lemma 4(b), where we deduce the recursive inequality $q_{i}\leq\mathrm{e}^{\tau\|A\|}q_{i-1}+\mathfrak{d}(\exp(\tau A)\mathcal{V}_{i-1}^{N},\mathcal{H}(\tau,\mathcal{V}_{i-1}^{N},\epsilon_{h}))$, $i\in[1;N]$. Now, for the term $\mathfrak{d}(\exp(\tau A)\mathcal{V}_{i-1}^{N},\mathcal{H}(\tau,\mathcal{V}_{i-1}^{N},\epsilon_{h}))$, we use Lemma 14, which results in the inequality $\mathfrak{d}(\exp(\tau A)\mathcal{V}_{i-1}^{N},\mathcal{H}(\tau,\mathcal{V}_{i-1}^{N},\epsilon_{h}))\leq C_{1}\|\mathcal{V}_{i-1}^{N}\|\tau^{2}$. Using $\mathcal{V}_{i-1}^{N}\subseteq V_{i-1}^{N}$, $i\in[1;N]$, from Theorem 11, and Lemma 16, the sequence $\{\|\mathcal{V}_{i-1}^{N}\|\}_{i=1}^{N+1}$ is bounded above by $\tau\tilde{M}$, where $\tilde{M}=\exp(2T\|A\|)\|U\|$. Hence, $q_{i}\leq\exp(\tau\|A\|)q_{i-1}+C_{1}\tilde{M}\tau^{3}$, $i\in[1;N]$, and by induction, we obtain, for all $i\in[0;N]$,

$$\begin{split}q_{i}&\leq\mathrm{e}^{i\tau\|A\|}C_{2}\tau^{2}+\frac{\mathrm{e}^{i\tau\|A\|}-1}{\mathrm{e}^{\tau\|A\|}-1}C_{1}\tilde{M}\tau^{3}\\&\leq\mathrm{e}^{i\tau\|A\|}C_{2}\tau^{2}+\frac{\mathrm{e}^{i\tau\|A\|}}{\tau\|A\|}C_{1}\tilde{M}\tau^{3}=\mathrm{e}^{i\tau\|A\|}C_{3}\tau^{2},\end{split} \qquad (17)$$

where $C_{3}=C_{2}+C_{1}\tilde{M}/\|A\|$. Next, define $r_{i}=\mathfrak{d}(W_{i}^{N},\mathcal{W}_{i}^{N})$, $i\in[0;N]$. Then, $r_{0}=0$ as $W_{0}^{N}=\mathcal{W}_{0}^{N}=\{0\}$ and, for $i\in[1;N]$, $r_{i}\leq r_{i-1}+q_{i-1}$, where we have utilized Lemma 4(a). Using induction, the sequence $\{r_{i}\}_{i=0}^{N}$ is bounded above as follows: $r_{i}\leq\sum_{j=0}^{i-1}q_{j}$, $i\in[0;N]$, where $\sum_{j=0}^{-1}(\cdot)=0$. Hence, using estimate (17),

$$\begin{split}r_{i}&\leq\sum_{j=0}^{i-1}\mathrm{e}^{j\tau\|A\|}C_{3}\tau^{2}\leq\frac{\mathrm{e}^{i\tau\|A\|}-1}{\mathrm{e}^{\tau\|A\|}-1}C_{3}\tau^{2}\\&\leq\frac{\mathrm{e}^{T\|A\|}-1}{\tau\|A\|}C_{3}\tau^{2}=D_{2}\tau,\end{split} \qquad (18)$$

where $D_{2}=(\exp(T\|A\|)-1)C_{3}/\|A\|$. Let $s_{i}=\mathfrak{d}(\Gamma_{i}^{N},\Lambda_{i}^{N})$, $i\in[0;N]$. Using Lemma 4(a), we have, for all $i\in[0;N]$, $s_{i}\leq p_{i}+r_{i}$. By incorporating the bounds (16) and (18), we get, for $i\in[0;N]$,

$$s_{i}\leq D_{1}\tau+D_{2}\tau=D_{3}\tau, \qquad (19)$$

where $D_{3}=D_{1}+D_{2}$.

Now, we prove the last claim of the theorem. Note that, using the definition of reachable sets given in Equation (5),

$$\|\mathcal{R}(t)\|\leq K:=\mathrm{e}^{T\|A\|}\left(\|X_{0}\|+T\|U\|\right),~t\in[0,T]. \qquad (20)$$
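For completeness, the omitted step behind (20) is the following worked bound, assuming Equation (5) takes the standard form $\mathcal{R}(t)=\mathrm{e}^{tA}X_{0}+\int_{0}^{t}\mathrm{e}^{sA}U\,\mathrm{d}s$ (a sketch consistent with the expressions used in the proof of Lemma 16):

$$\|\mathcal{R}(t)\|\leq\|\mathrm{e}^{tA}X_{0}\|+\Big\|\int_{0}^{t}\mathrm{e}^{sA}U\,\mathrm{d}s\Big\|\leq\mathrm{e}^{t\|A\|}\|X_{0}\|+\|U\|\int_{0}^{t}\mathrm{e}^{s\|A\|}\mathrm{d}s\leq\mathrm{e}^{T\|A\|}\left(\|X_{0}\|+T\|U\|\right)=K.$$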

Define $\tilde{\mathcal{R}}_{N}(t)=\Lambda_{\tilde{i}(t)}^{N}$, where $\tilde{i}(t)=\lfloor t/\tau\rfloor$ (the floor of $t/\tau$). Note that $\bigcup_{i=0}^{N}\Lambda_{i}^{N}=\bigcup_{t\in[0,T]}\tilde{\mathcal{R}}_{N}(t)$ and that, using Theorem 11, $\tilde{\mathcal{R}}_{N}(t)\subseteq\mathcal{R}(t)$, $t\in[0,T]$. Using Lemma 4(d), the Hausdorff distance between $\bigcup_{i=0}^{N}\Lambda_{i}^{N}$ and $\mathcal{R}([0,T])$ satisfies the inequality

$$\mathfrak{d}\Big(\bigcup_{i=0}^{N}\Lambda_{i}^{N},\mathcal{R}([0,T])\Big)\leq\sup_{t\in[0,T]}\mathfrak{d}(\tilde{\mathcal{R}}_{N}(t),\mathcal{R}(t)). \qquad (21)$$

Let $t\in[0,T]$ and set $i=\lfloor t/\tau\rfloor$. Then, $\tilde{\mathcal{R}}_{N}(t)=\Lambda_{i}^{N}$. Using the triangle inequality,

$$\mathfrak{d}(\mathcal{R}(t),\tilde{\mathcal{R}}_{N}(t))\leq\mathfrak{d}(\mathcal{R}(t),\mathcal{R}(i\tau))+\mathfrak{d}(\mathcal{R}(i\tau),\Lambda_{i}^{N}).$$

Let us estimate $\mathfrak{d}(\mathcal{R}(t),\mathcal{R}(i\tau))$. Note that $0\leq t-i\tau\leq\tau\leq T$. Using Lemma 13, $\mathcal{R}(t)$ can be written as $\mathcal{R}(t)=\exp((t-i\tau)A)\mathcal{R}(i\tau)+\int_{0}^{t-i\tau}\exp(sA)U\,\mathrm{d}s$. Hence, using Lemma 4(a),(c) and estimates (2) and (20) (below, $\Delta$ denotes $t-i\tau$),

$$\begin{aligned}\mathfrak{d}(\mathcal{R}(t),\mathcal{R}(i\tau))&\leq\mathfrak{d}(\mathrm{e}^{\Delta A}\mathcal{R}(i\tau),\mathcal{R}(i\tau))+\mathfrak{d}\Big(\int_{0}^{\Delta}\mathrm{e}^{sA}U\,\mathrm{d}s,0\Big)\\&\leq\|\mathrm{e}^{\Delta A}-\operatorname{id}\|\|\mathcal{R}(i\tau)\|+\|U\|\int_{0}^{\Delta}\mathrm{e}^{s\|A\|}\mathrm{d}s\\&\leq K\Delta\|A\|\mathrm{e}^{\Delta\|A\|}+\|U\|\Delta\,\mathrm{e}^{T\|A\|}\leq C_{4}\tau,\end{aligned}$$

where $C_{4}=(K\|A\|+\|U\|)\exp(T\|A\|)$. Moreover, using (19), we have $\mathfrak{d}(\mathcal{R}(i\tau),\Lambda_{i}^{N})=s_{i}\leq D_{3}\tau$. Therefore, $\mathfrak{d}(\mathcal{R}(t),\tilde{\mathcal{R}}_{N}(t))\leq D_{4}\tau$, where $D_{4}=C_{4}+D_{3}$. As the choice of $t\in[0,T]$ is arbitrary, and in view of (21), the proof is complete. ∎

VII Numerical Examples

In this section, we illustrate the proposed method through three numerical examples. The proposed method is implemented using zonotopes in MATLAB (2019a) and run on an AMD Ryzen 5 2500U/2GHz processor. Plots of zonotopes, scaling-based under-approximations [20], and reduced-order zonotopic under-approximations are produced using the software CORA (2021 version) [34]. For all the considered linear systems and values of the time discretization parameter $N$, $\int_{0}^{T/N}\exp(sA)\,\mathrm{d}s$ is invertible. The optimization problems associated with evaluating the parameters $k_{\min}$, $\kappa$, and $\eta$, given in Equations (9), (11), and (12), respectively, are solved via brute force. The invertibility condition used in the definition of $\eta$ in (12) is checked using the $\mathtt{rank}$ function in MATLAB.
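For readers reproducing this setup outside MATLAB, the following minimal Python sketch illustrates one standard way to evaluate $\int_{0}^{\tau}\exp(sA)\,\mathrm{d}s$ (via the exponential of an augmented block matrix) and to check its invertibility with a rank test, mirroring the $\mathtt{rank}$-based check above; it is an illustrative sketch, not the implementation used in this paper, and the tolerance is our own choice.

import numpy as np
from scipy.linalg import expm

def exp_integral(A, tau):
    # Compute int_0^tau exp(s*A) ds via the block-matrix identity:
    # expm(tau*[[A, I], [0, 0]]) = [[exp(tau*A), int_0^tau exp(s*A) ds], [0, I]].
    n = A.shape[0]
    M = np.zeros((2 * n, 2 * n))
    M[:n, :n] = A
    M[:n, n:] = np.eye(n)
    return expm(tau * M)[:n, n:]

def is_invertible(B, tol=1e-10):
    # Rank-based invertibility check (analogous to the MATLAB rank check described above).
    return np.linalg.matrix_rank(B, tol=tol) == B.shape[0]

# Example: a random 5-D system with T = 1 and N = 100 time steps.
rng = np.random.default_rng(0)
A = rng.random((5, 5))
tau = 1.0 / 100
print("integral invertible:", is_invertible(exp_integral(A, tau)))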

VII-A 2-D system with a closed-form reachable set

Consider an instance of system (1) (a perturbed double integrator system), with $A=\big(\begin{smallmatrix}0&0\\1&0\end{smallmatrix}\big)$, $U=[0,1]\times[0,1]$, $X_{0}=\{(0,0)^{\intercal}\}$, and $T=1$. Note that $\mathcal{R}(T)=\mathcal{R}_{u}(T)=\int_{0}^{T}\exp(sA)U_{1}\,\mathrm{d}s+\int_{0}^{T}U_{2}\,\mathrm{d}s$, where $U_{1}=[0,1]\times\{0\}$ and $U_{2}=\{0\}\times[0,1]$. The set $\int_{0}^{T}\exp(sA)U_{1}\,\mathrm{d}s$ is given explicitly as $\int_{0}^{T}\exp(sA)U_{1}\,\mathrm{d}s=\{(x,y)^{\intercal}\in\mathbb{R}^{2}:~x^{2}/2\leq y\leq x-x^{2}/2,~x\in[0,1]\}$ (see [36, the formula of $M_{2}$, p. 363]), whereas, using [37, Theorem 3, p. 21], $\int_{0}^{T}U_{2}\,\mathrm{d}s=\{0\}\times[0,1]$. Hence, $\mathcal{R}(T)=\{(x,y)^{\intercal}\in\mathbb{R}^{2}:~x^{2}/2\leq y\leq x-x^{2}/2+1,~x\in[0,1]\}$. In view of Remark 2, we aim to compute under-approximations of $\mathcal{R}(T)$ using the proposed method. Herein, we consider different values of the discretization parameter $N$ and set $\epsilon_{h}=1-1/N^{2}$ and $\epsilon_{u}=1-1/N$.
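As a sanity check on this closed-form expression for $\mathcal{R}(T)$ (illustrative only, not part of the proposed method), the following Python sketch integrates randomly sampled piecewise-constant inputs with a fine Euler discretization and verifies that every resulting endpoint $(x,y)$ satisfies $x^{2}/2\leq y\leq x-x^{2}/2+1$ up to a small numerical tolerance.

import numpy as np

A = np.array([[0.0, 0.0],
              [1.0, 0.0]])
T, n_steps, n_samples = 1.0, 2000, 200
dt = T / n_steps
rng = np.random.default_rng(1)

def endpoint(u_seq):
    # Forward-Euler integration of xdot = A x + u from x(0) = (0, 0)
    # for a piecewise-constant input sequence aligned with the time grid.
    x = np.zeros(2)
    for u in u_seq:
        x = x + dt * (A @ x + u)
    return x

violations = 0
for _ in range(n_samples):
    # Piecewise-constant input: 10 random constant pieces, each in U = [0,1] x [0,1].
    pieces = rng.random((10, 2))
    u_seq = np.repeat(pieces, n_steps // 10, axis=0)
    x, y = endpoint(u_seq)
    if not (x**2 / 2 - 1e-2 <= y <= x - x**2 / 2 + 1 + 1e-2):
        violations += 1
print("membership violations:", violations)  # expected: 0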

Figure 2: Under-approximations of the reachable set of the double integrator system in Section VII-A with different values of the discretization parameter $N$.

Figure 2 displays several under-approximations of $\mathcal{R}(T)$ with $N\in\{1,3,5,20\}$. The obtained approximations are indeed enclosed by the exact reachable set, in agreement with Theorem 11. Moreover, the figure shows that the approximation accuracy of the proposed method increases as $N$ increases, which further supports the convergence result in Theorem 12. The computational time associated with evaluating the under-approximation with $N=20$ is less than 0.003 seconds.

VII-B 5-D system

Herein, we adopt a five-dimensional instance of system (1) from the literature, where the matrix $A$ and the sets $X_{0}$ and $U$ are given in [2, Equation 3.11, p. 39], and we set $T=1$. We aim to under-approximate the reachable tube $\mathcal{R}([0,T])$ using the proposed method with $N\in\{10,100\}$, where we set $\epsilon_{h}=1-1/N^{2}$ and $\epsilon_{u}=1-1/N$, and compute the sets $\Lambda_{i}^{N}$, $i\in[1;N]$ ($\cup_{i=0}^{N}\Lambda_{i}^{N}$ is the desired under-approximation herein). As the exact reachable tube is not known, we use the convergent over-approximation method of Serry and Reissig [15], with a refined time discretization (200 steps) and an accurate approximation of the matrix exponential, $\mathcal{L}(\cdot,10)$, to produce an accurate representation of the reachable tube, which serves as the basis of comparison. Furthermore, for the case $N=10$, we additionally compute a modified version of the under-approximation from the proposed method, where each set $\Lambda_{i}^{N}$ is under-approximated using order reduction based on summing generators (method sum in CORA). The zonotopes resulting from order reduction are of order at most 2.
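Order reduction by summing generators preserves the under-approximation property because $[-1,1](g_{1}+\dots+g_{k})\subseteq\sum_{j=1}^{k}[-1,1]g_{j}$, so replacing a group of generators by their sum yields a zonotope contained in the original one. The following Python sketch illustrates this idea only; it is not the CORA routine used above, and the grouping heuristic (summing the generators with the smallest norms) is our own assumption.

import numpy as np

def reduce_sum(c, G, order=2):
    # Under-approximate the zonotope {c + G b : ||b||_inf <= 1} by generator summing:
    # keep the (order*n - 1) generators with largest norm and replace the rest by
    # their sum, which yields a zonotope contained in the original one.
    n, p = G.shape
    keep = order * n - 1
    if p <= keep + 1:
        return c, G  # already at or below the requested order
    idx = np.argsort(np.linalg.norm(G, axis=0))  # generators sorted by norm (ascending)
    small, large = idx[: p - keep], idx[p - keep:]
    g_sum = G[:, small].sum(axis=1, keepdims=True)
    return c, np.hstack((G[:, large], g_sum))

# Example: reduce a random 5-D zonotope with 40 generators to order 2 (10 generators).
rng = np.random.default_rng(2)
c, G = np.zeros(5), rng.standard_normal((5, 40))
_, G_red = reduce_sum(c, G)
print(G_red.shape)  # (5, 10)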

Figure 3: $x_{2}$-$x_{3}$ (top) and $x_{4}$-$x_{5}$ (bottom) projections of the over-approximation (grey) of the reachable tube $\mathcal{R}([0,T])$ of the 5-D system in Section VII-B using the method in [15], the under-approximations from the proposed method given by the sets $\Lambda_{i}^{N}$, $i\in[0;N]$ (black), with $N=10$ (left) and $N=100$ (right), and reduced-order under-approximations (yellow) of the sets $\Lambda_{i}^{N}$, $i\in[0;N]$, in the case $N=10$.

Figure 3 displays two projections of the over-approximation and the under-approximations. As seen in the figure, the over-approximation (grey area) encloses the under-approximations (black areas) from the proposed method. Moreover, the reduced-order zonotopes (yellow areas) are enclosed by the sets $\Lambda_{i}^{N}$, $i\in[0;N]$, from the proposed method (without order reduction). For the case $N=10$, the sets $\Lambda_{i}^{N}$, $i\in[0;N]$, and their corresponding reduced-order under-approximations nearly overlap for the first few iterations (small $i$); however, the accuracy of the reduced-order under-approximations decays with each iteration due to their constrained order (2 in this case). This highlights the trade-off between accuracy and memory reduction when incorporating order reduction techniques in under-approximation computations. We also observe that the under-approximation with $N=100$ matches the over-approximation more closely than the one with $N=10$. This implies that the under-approximation with $N=100$ also approximates the actual reachable tube more accurately, which is a consequence of the convergence guarantees of Theorem 12. The computational times associated with evaluating the under-approximations from the proposed method (without order reduction) with $N=10$ and $N=100$ are 0.0199 and 0.0279 seconds, respectively.

VII-C Randomly generated systems

In this section, we study the performance of the proposed method on randomly generated linear systems, where the matrix exponentials associated with the generated systems are not known exactly.

VII-C1 Homogeneous linear systems

To compare the performance of the proposed method with the scaling method [20], we consider homogeneous linear systems ($\dot{x}=Ax$) with randomly generated instances of the matrix $A$, using the MATLAB command rand, where $n\in\{2,4,6,8,10\}$, $X_{0}=\mathbf{B}_{n}^{\infty}$, and $T=1$. For each instance of $A$, we compute under-approximations of $\mathcal{R}(T)=\mathcal{R}_{h}(T)$ using both the proposed method (see Remark 2) and the scaling method. The scaling method is implemented in the 2021 version of CORA, with settings tuned and approved by the first author of [20]. For our proposed method, we use $N=100$ and set $\epsilon_{h}=1-1/N^{2}$. The computational times and the volumes of the under-approximations from the two methods are listed in Table I.

TABLE I: Computational times and volumes of under-approximations from the proposed (subscript $p$) and scaling [20] (subscript $s$) methods for randomly generated homogeneous linear systems.
$n$                      2         4         6          8            10
$t_{\mathrm{c},p}$ [s]   0.0150    0.0156    0.0138     0.0120       0.0179
$t_{\mathrm{c},s}$ [s]   1.5294    5.3289    16.7495    43.5934      96.3664
$\mathrm{vol}_{p}$       13.7889   94.3212   748.5102   5.5201e+03   1.0776e+05
$\mathrm{vol}_{s}$       13.5197   91.1578   715.6002   5.1814e+03   9.9622e+04

The table exhibits that the proposed method performs marginally better than the scaling method in terms of accuracy, with slightly larger volumes for the computed under-approximations. Most importantly, Table I displays how the proposed method outperforms the scaling method in terms of computational time while having better/comparable accuracy. This may be due to the fact that the proposed method obtains under-approximations by scaling intermediate sets, using simple optimization problems (Equations (11) and (12)), without the need to solve optimization problems that utilize enclosures of boundaries of reachable sets as in the scaling method.
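The volume comparison in Table I requires evaluating volumes of zonotopic under-approximations. One exact (though combinatorial) formula sums $|\det|$ over all $n$-element subsets of the generator matrix, scaled by $2^{n}$. The Python sketch below is illustrative only; the paper does not specify the volume routine actually used, and this exhaustive formula is practical only for the low dimensions and generator counts of Table I.

import numpy as np
from itertools import combinations

def zonotope_volume(G):
    # Exact volume of the zonotope {G b : ||b||_inf <= 1} in R^n:
    # vol = 2^n * sum over all n-element subsets S of the generators of |det(G[:, S])|.
    # The combinatorial cost restricts this to small n and few generators.
    n, p = G.shape
    if p < n:
        return 0.0  # degenerate (lower-dimensional) zonotope
    total = sum(abs(np.linalg.det(G[:, list(S)])) for S in combinations(range(p), n))
    return (2.0 ** n) * total

# Example: the box [-1, 1]^2 has volume 4.
print(zonotope_volume(np.eye(2)))  # 4.0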

VII-C2 Linear systems with input

Herein, we study empirically the time efficiency associated with obtaining the under-approximations $\Lambda_{i}^{N}$, $i\in[1;N]$, from the proposed method. We consider instances of system (1), with $n$ ranging between 10 and 200, where $X_{0}=U=\mathbf{B}_{n}^{\infty}$ and $T=1$. For each instance of $n$, we randomly generate a corresponding matrix $A$ using the MATLAB command rand. The eigenvalues of each generated $A$ are checked not to be purely imaginary to ensure the invertibility of $\int_{0}^{T/N}\exp(sA)\,\mathrm{d}s$ (see Lemma 9). For every $n$, we estimate the computational time associated with implementing the proposed method, where we set $N=100$ and $\epsilon_{h}=\epsilon_{u}=0.8$.

Through our empirical exploration of the method's performance, we noticed that, for the randomly generated systems and the chosen time interval, the reachable sets can be “narrow,” especially when the dimension is high. Consequently, the under-approximations obtained from the proposed method are nearly degenerate (ill-conditioned). This negatively influences the computation of the deflation parameter, as the term $\|G\|\|G^{\dagger}\|$ (condition number) in Equation (8) becomes significantly large. As a result, the order of the approximation $\mathcal{L}$, which is determined using the definition of $\kappa$ in Equation (11), may have to be substantially large in order for the deflation parameter values to exceed the specified design parameter $\epsilon_{h}$. To avoid these degenerate cases, we restrict our investigation herein to normalized system matrices, where each generated matrix $A$ is divided by its maximum norm. We note that normalized system matrices were considered in previous studies investigating the performance of over-approximation methods (see, e.g., [38]).
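A minimal Python sketch of the test-case generation described above is given below. It follows the text (rejecting matrices with purely imaginary eigenvalues and normalizing by the maximum norm), while the numerical tolerance and the reading of “maximum norm” as the largest absolute entry are our own assumptions.

import numpy as np

def generate_normalized_system(n, rng, tol=1e-9):
    # Draw a random n x n matrix (entries in [0, 1), mimicking MATLAB's rand),
    # reject it if any eigenvalue is numerically purely imaginary, and normalize
    # by its maximum norm (interpreted here as the largest absolute entry).
    while True:
        A = rng.random((n, n))
        eigs = np.linalg.eigvals(A)
        purely_imaginary = (np.abs(eigs.real) <= tol) & (np.abs(eigs.imag) > tol)
        if not purely_imaginary.any():
            return A / np.abs(A).max()

rng = np.random.default_rng(3)
for n in (10, 50, 100, 200):
    A = generate_normalized_system(n, rng)
    print(n, np.abs(A).max())  # normalized matrices have maximum norm 1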

Figure 4: Computational time associated with evaluating $\{\Lambda_{i}\}_{i=1}^{N}$ as a function of $n$ for (normalized) randomly generated instances of system (1).

Figure 4 plots the recorded computational time associated with the proposed method as a function of the dimension $n$. The figure exhibits the efficiency of the computations for the considered randomly generated systems: the computational time is less than 0.25 seconds for $n\leq 50$, less than 1.25 seconds for $50\leq n\leq 100$, and less than 7 seconds for $n=200$. This numerical example highlights the potential role of the proposed method in real-time computations in applications such as falsification and control synthesis (see Remark 1), owing to the relatively fast computations, especially for moderate dimensions ($n\leq 50$).

VIII Conclusion

In this paper, we proposed a novel convergent method to under-approximate finite-time forward reachable sets and tubes of a class of continuous-time linear uncertain systems, where approximations of the matrix exponential and its integral are utilized. In future work, we aim to explore extensions and modifications of the proposed method to cover wider classes of systems, reduce computational cost, and increase accuracy. Furthermore, we seek to address how to obtain under-approximations in cases when the reachable sets are “almost” degenerate.

Acknowledgement

The authors thank Niklas Kochdumper (Stony Brook University, USA) for his useful guidance and for tuning the settings used for the scaling method implementation in CORA.

Appendix

We will need the following lemma in the proofs of Lemmas 14 and 15.

Lemma 17.

Given $\Omega=c+G\mathbf{B}_{p}\subseteq\mathbb{R}^{n}$, where $c\in\mathbb{R}^{n}$ and $G\in\mathbb{R}^{n\times p}$, we have $\|c\|\leq\|\Omega\|$ and $\|G\|=\|G\mathbf{B}_{p}\|\leq 2\|\Omega\|$.

Proof.

The first inequality follows from the fact that $c\in\Omega$. The second inequality is deduced as follows: $\|G\mathbf{B}_{p}\|=\sup_{b\in\mathbf{B}_{p}}\|Gb\|\leq\sup_{b\in\mathbf{B}_{p}}\|Gb+c\|+\|c\|=\|\Omega\|+\|c\|\leq 2\|\Omega\|$. ∎

Proof of Lemma 14.

For convenience, define $H_{1}=\mathcal{H}(t,\Omega,\epsilon)$, $H_{2}=\mathcal{L}(t,\kappa(t,\Omega,\epsilon))\Omega$, and $H_{3}=\exp(tA)\Omega$. Using the definition of $\mathcal{H}$ in Equation (13) and the triangle inequality,

$$\mathfrak{d}(H_{1},H_{3})\leq\mathfrak{d}(H_{1},H_{2})+\mathfrak{d}(H_{2},H_{3}). \qquad (22)$$

Using Lemma 4(a),(c), we have

$$\mathfrak{d}(H_{1},H_{2})\leq\left|1-\lambda(t,\Omega,\kappa(t,\Omega,\epsilon))\right|\|\mathcal{L}(t,\kappa(t,\Omega,\epsilon))\|\|G\mathbf{B}_{p}\|.$$

Note that, by the definition of $\kappa$ given in Equation (11), $\epsilon<\lambda(t,\Omega,\kappa(t,\Omega,\epsilon))<1$, which indicates that $\left|1-\lambda(t,\Omega,\kappa(t,\Omega,\epsilon))\right|\leq 1-\epsilon$. Furthermore, using (2), $\|\mathcal{L}(t,\kappa(t,\Omega,\epsilon))\|\leq\exp(t\|A\|)$. Moreover, using Lemma 17, $\|G\mathbf{B}_{p}\|\leq 2\|\Omega\|$. Hence,

$$\mathfrak{d}(H_{1},H_{2})\leq 2(1-\epsilon)\mathrm{e}^{t\|A\|}\|\Omega\|. \qquad (23)$$

Next, we estimate $\mathfrak{d}(H_{2},H_{3})$, which, using Lemma 4(c), satisfies the inequality

$$\mathfrak{d}(H_{2},H_{3})\leq\|\mathrm{e}^{tA}-\mathcal{L}(t,\kappa(t,\Omega,\epsilon))\|\|\Omega\|.$$

Since $\mathcal{L}(t,\kappa(t,\Omega,\epsilon))$ is a Taylor approximation of $\exp(tA)$ of at least first order ($\kappa(t,\Omega,\epsilon)\geq 2$) and $t\|A\|\leq 1$, we have, using the bound (3) (the arguments of $\kappa$ are dropped for convenience),

$$\|\mathrm{e}^{tA}-\mathcal{L}(t,\kappa)\|\leq(t\|A\|)^{\kappa}\frac{\mathrm{e}^{t\|A\|}}{\kappa!}\leq(t\|A\|)^{2}\frac{\mathrm{e}^{t\|A\|}}{2!}\leq(t\|A\|)^{2}\mathrm{e}^{t\|A\|}.$$

Consequently,

$$\mathfrak{d}(H_{2},H_{3})\leq(t\|A\|)^{2}\mathrm{e}^{t\|A\|}\|\Omega\|. \qquad (24)$$

Combining estimates (22), (23), and (24) yields

$$\begin{aligned}\mathfrak{d}(\mathcal{H}(t,\Omega,\epsilon),\mathrm{e}^{tA}\Omega)&\leq(2(1-\epsilon)+(t\|A\|)^{2})\mathrm{e}^{t\|A\|}\|\Omega\|\\&\leq(2(1-\epsilon)+(t\|A\|)^{2})\mathrm{e}^{T\|A\|}\|\Omega\|.\end{aligned}$$
∎

Proof of Lemma 15.

For convenience, define $I_{1}=\mathcal{I}(t,\Omega,\epsilon)$, $I_{2}=\big(\int_{0}^{t}\exp(sA)\,\mathrm{d}s\big)\Omega$, and $I_{3}=\int_{0}^{t}\exp(sA)\Omega\,\mathrm{d}s$. Using the triangle inequality,

$$\mathfrak{d}(I_{1},I_{3})\leq\mathfrak{d}(I_{1},I_{2})+\mathfrak{d}(I_{2},I_{3}). \qquad (25)$$

The term $\mathfrak{d}(I_{1},I_{2})$ can be bounded above using the triangle inequality, the definition of $\mathcal{I}$ given in Equation (14), and Lemma 4(a),(c), as follows:

$$\begin{aligned}\mathfrak{d}(I_{1},I_{2})\leq{}&\mathfrak{d}(I_{1},\mathcal{T}(t,\eta(t,\Omega,\epsilon))\Omega)+\mathfrak{d}(\mathcal{T}(t,\eta(t,\Omega,\epsilon))\Omega,I_{2})\\\leq{}&\left|1-\lambda(t,\Omega,\eta(t,\Omega,\epsilon))\right|\|\mathcal{T}(t,\eta(t,\Omega,\epsilon))\|\|G\mathbf{B}_{p}\|\\&+\Big\|\int_{0}^{t}\mathrm{e}^{sA}\mathrm{d}s-\mathcal{T}(t,\eta(t,\Omega,\epsilon))\Big\|\|\Omega\|.\end{aligned}$$

Using estimate (2), we obtain the bound

$$\|\mathcal{T}(t,\eta(t,\Omega,\epsilon))\|\leq\int_{0}^{t}\|\mathcal{L}(s,\eta(t,\Omega,\epsilon))\|\mathrm{d}s\leq\int_{0}^{t}\mathrm{e}^{s\|A\|}\mathrm{d}s\leq t\,\mathrm{e}^{t\|A\|}.$$

Moreover, using the definition of $\eta$ in Equation (12), we have $\left|1-\lambda(t,\Omega,\eta(t,\Omega,\epsilon))\right|\leq 1-\epsilon$. Also, using Lemma 17, we have $\|G\mathbf{B}_{p}\|\leq 2\|\Omega\|$. In addition, using estimate (3) and the definition of $\eta$, we have (the arguments of $\eta$ are dropped for convenience)

$$\begin{aligned}\Big\|\int_{0}^{t}\mathrm{e}^{sA}\mathrm{d}s-\mathcal{T}(t,\eta)\Big\|&\leq\int_{0}^{t}\|\mathrm{e}^{sA}-\mathcal{L}(s,\eta)\|\mathrm{d}s\\&\leq\int_{0}^{t}\frac{(s\|A\|)^{\eta}}{\eta!}\mathrm{e}^{s\|A\|}\mathrm{d}s\\&\leq\frac{t\|A\|}{\eta!}\mathrm{e}^{t\|A\|}\int_{0}^{t}\mathrm{d}s\leq t^{2}\|A\|\mathrm{e}^{t\|A\|},\end{aligned}$$

where we have used the facts that $\eta\geq 1$ and $s\|A\|\leq t\|A\|\leq 1$, $s\in[0,t]$. Therefore,

$$\begin{split}\mathfrak{d}(I_{1},I_{2})&\leq 2(1-\epsilon)t\,\mathrm{e}^{t\|A\|}\|\Omega\|+t^{2}\|A\|\mathrm{e}^{t\|A\|}\|\Omega\|\\&=(2(1-\epsilon)t+t^{2}\|A\|)\mathrm{e}^{t\|A\|}\|\Omega\|.\end{split} \qquad (26)$$

Now, we estimate $\mathfrak{d}(I_{2},I_{3})$. Using [37, Theorem 3, p. 21], $I_{2}$ can be rewritten as

$$I_{2}=\Big(\frac{1}{t}\int_{0}^{t}\mathrm{e}^{sA}\mathrm{d}s\Big)(t\Omega)=\Big(\frac{1}{t}\int_{0}^{t}\mathrm{e}^{sA}\mathrm{d}s\Big)\int_{0}^{t}\Omega\,\mathrm{d}s=\int_{0}^{t}B\Omega\,\mathrm{d}s,$$

where $B=(1/t)\int_{0}^{t}\exp(sA)\,\mathrm{d}s$. Then, the Hausdorff distance between $I_{2}$ and $I_{3}$ can be estimated, using Lemma 4(c), as

$$\mathfrak{d}(I_{2},I_{3})\leq\|\Omega\|\int_{0}^{t}\|B-\mathrm{e}^{sA}\|\mathrm{d}s.$$

Moreover, using the continuous differentiability of $\exp((\cdot)A)$, the Ostrowski inequality in [39, Theorem 1], and estimate (2), we have

$$\|B-\mathrm{e}^{sA}\|\leq t\|A\|\mathrm{e}^{t\|A\|},~s\in[0,t].$$

Hence,

$$\mathfrak{d}(I_{2},I_{3})\leq\|\Omega\|\int_{0}^{t}t\|A\|\mathrm{e}^{t\|A\|}\mathrm{d}s\leq t^{2}\|\Omega\|\|A\|\mathrm{e}^{t\|A\|}. \qquad (27)$$

By combining the bounds (25), (26), and (27), we get

$$\begin{aligned}\mathfrak{d}(I_{1},I_{3})&\leq 2((1-\epsilon)t+t^{2}\|A\|)\mathrm{e}^{t\|A\|}\|\Omega\|\\&\leq 2((1-\epsilon)t+t^{2}\|A\|)\mathrm{e}^{T\|A\|}\|\Omega\|.\end{aligned}$$
∎

Proof of Lemma 16.

It can be shown using induction that $S_{i}^{N}=\exp(i\tau A)X_{0}$ and $V_{i}^{N}=\exp(i\tau A)\int_{0}^{\tau}\exp(sA)U\,\mathrm{d}s$ for all $i\in[0;N]$. Hence,

$$\|S_{i}^{N}\|\leq\mathrm{e}^{i\tau\|A\|}\|X_{0}\|\leq\mathrm{e}^{T\|A\|}\|X_{0}\|$$

and

$$\|V_{i}^{N}\|\leq\mathrm{e}^{i\tau\|A\|}\int_{0}^{\tau}\mathrm{e}^{s\|A\|}\mathrm{d}s\,\|U\|\leq\tau\,\mathrm{e}^{2T\|A\|}\|U\|$$

for all $i\in[0;N]$. ∎

References

  • [1] G. Reissig, A. Weber, and M. Rungger, “Feedback refinement relations for the synthesis of symbolic controllers,” IEEE Trans. Automat. Control, vol. 62, no. 4, pp. 1781–1796, 2016.
  • [2] M. Althoff, “Reachability analysis and its application to the safety assessment of autonomous cars,” Ph.D. dissertation, Technische Universität München, 7 Jul. 2010.
  • [3] E. Asarin, T. Dang, G. Frehse, A. Girard, C. Le Guernic, and O. Maler, “Recent progress in continuous and hybrid reachability analysis,” in Proc. of CACSD, CCA, and ISIC, 2006, pp. 1582–1587.
  • [4] M. Althoff, G. Frehse, and A. Girard, “Set propagation techniques for reachability analysis,” Annual Review of Control, Robotics, and Autonomous Syst., vol. 4, pp. 369–395, 2021.
  • [5] Z. She and M. Li, “Over- and under-approximations of reachable sets with series representations of evolution functions,” IEEE Trans. Automat. Control, vol. 66, no. 3, pp. 1414–1421, 2020.
  • [6] H. Yin, M. Arcak, A. K. Packard, and P. Seiler, “Backward reachability for polynomial systems on a finite horizon,” IEEE Trans. Automat. Control, 2021.
  • [7] L. Yang and N. Ozay, “Scalable zonotopic under-approximation of backward reachable sets for uncertain linear systems,” arXiv preprint arXiv:2107.01724, 2021.
  • [8] E. Goubault and S. Putot, “Robust under-approximations and application to reachability of non-linear control systems with disturbances,” IEEE Control Syst. Lett., 2020.
  • [9] A. Girard, C. L. Guernic, and O. Maler, “Efficient computation of reachable sets of linear time-invariant systems with inputs,” in Proc. of HSCC.   Springer, 2006, pp. 257–271.
  • [10] B. Xue, M. Fränzle, and N. Zhan, “Inner-approximating reachable sets for polynomial systems with time-varying uncertainties,” IEEE Trans. Automat. Control, 2019.
  • [11] A. Bhatia and E. Frazzoli, “Incremental search methods for reachability analysis of continuous and hybrid systems,” in Proc. of HSCC.   Springer, 2004, pp. 142–156.
  • [12] T. Pecsvaradi and K. S. Narendra, “Reachable sets for linear dynamical systems,” Information and Control, vol. 19, pp. 319–344, 1971.
  • [13] P. Varaiya, “Reach set computation using optimal control,” in Verification of Digital and Hybrid Syst.   Springer, 2000, pp. 323–331.
  • [14] A. B. Kurzhanski and P. Varaiya, “Ellipsoidal techniques for reachability analysis: internal approximation,” Syst. Control Lett., vol. 41, no. 3, pp. 201–211, 2000.
  • [15] M. Serry and G. Reissig, “Over-approximating reachable tubes of linear time-varying systems,” IEEE Trans. Automat. Control, 2021.
  • [16] M. Serry, “Convergent under-approximations of reachable sets and tubes: A piecewise constant approach,” J. Franklin Institute, vol. 358, no. 6, pp. 3215–3231, 2021.
  • [17] A. Hamadeh and J. Goncalves, “Reachability analysis of continuous-time piecewise affine systems,” Automatica, vol. 44, no. 12, pp. 3189–3194, 2008.
  • [18] M. Fauré, J. Cieslak, D. Henry, A. Verhaegen, and F. Ankersen, “Reachable tube computation of uncertain LTI systems using support functions,” in Proc. of ECC.   IEEE, 2021, pp. 2670–2675.
  • [19] L. Yang, H. Zhang, J.-B. Jeannin, and N. Ozay, “Efficient backward reachability using the Minkowski difference of constrained zonotopes,” IEEE Trans. Computer-Aided Design of Integrated Circ. and Syst., 2022.
  • [20] N. Kochdumper and M. Althoff, “Computing non-convex inner-approximations of reachable sets for nonlinear continuous systems,” in Proc. of CDC.   IEEE, 2020, pp. 2130–2137.
  • [21] T. Shafa and M. Ornik, “Reachability of nonlinear systems with unknown dynamics,” IEEE Trans. Automat. Control, 2022.
  • [22] V. Veliov, “Second-order discrete approximation to linear differential inclusions,” SIAM J. Numerical Anal., vol. 29, no. 2, pp. 439–451, 1992.
  • [23] J. F. Grcar, “A matrix lower bound,” Linear Algebra and its Applications, vol. 433, no. 1, pp. 203–220, 2010.
  • [24] R. A. Horn and C. R. Johnson, Matrix analysis, 2nd ed.   Cambridge university press, 2013.
  • [25] B. D. MacCluer, Elementary Functional Analysis.   Springer, 2009.
  • [26] D. L. Lukes, Differential Equations, ser. Mathematics in Science and Engineering.   London: Academic Press Inc., 1982, vol. 162.
  • [27] I. M. Mitchell, “Comparing forward and backward reachability as tools for safety analysis,” in International Workshop on Hybrid Syst.: Computation and Control.   Springer, 2007, pp. 428–443.
  • [28] E. D. Sontag, Mathematical Control Theory, 2nd ed., ser. Texts in Applied Mathematics.   New York: Springer-Verlag, 1998, vol. 6.
  • [29] J.-P. Aubin and H. Frankowska, Set-valued Anal. in Control Theory.   Springer, 2000.
  • [30] M. Althoff and G. Frehse, “Combining zonotopes and support functions for efficient reachability analysis of linear systems,” in Proc. of CDC.   IEEE, 2016, pp. 7439–7446.
  • [31] A.-K. Kopetzki, B. Schürmann, and M. Althoff, “Methods for order reduction of zonotopes,” in Proc. of CDC, 2017, pp. 5626–5633.
  • [32] S. Sadraddini and R. Tedrake, “Linear encodings for polytope containment problems,” in Proc. of CDC.   IEEE, 2019, pp. 4367–4372.
  • [33] V. Raghuraman and J. P. Koeln, “Set operations and order reductions for constrained zonotopes,” Automatica, vol. 139, p. 110204, 2022.
  • [34] M. Althoff, “An introduction to CORA 2015,” in Proc. of the ARCH Workshop, vol. 34, 2015, pp. 120–151.
  • [35] F. L. Chernousko, State Estimation of Dynamic Systems.   CRC Press, 1994.
  • [36] R. Ferretti, “High-order approximations of linear control systems via Runge-Kutta schemes,” Computing, vol. 58, no. 4, pp. 351–364, 1997.
  • [37] J.-P. Aubin and A. Cellina, Differential Inclusions.   Springer, 1984.
  • [38] A. Girard, “Reachability of uncertain linear systems using zonotopes,” in Proc. of HSCC, vol. 3414.   Springer, 2005, pp. 291–305.
  • [39] N. S. Barnett, C. Buşe, P. Cerone, and S. S. Dragomir, “Ostrowski’s inequality for vector-valued functions and applications,” Computers & Mathematics with Applications, vol. 44, no. 5-6, pp. 559–572, 2002.