\newtheoremrep

mytheoremTheorem[section] \newtheoremrepmyproposition[mytheorem]Proposition \newtheoremrepmylemma[mytheorem]Lemma \newtheoremrepmycorollary[mytheorem]Corollary \newtheoremrepmyexample[mytheorem]Example \newtheoremrepmyremark[mytheorem]Remark \newtheoremrepmydefinition[mytheorem]Definition

Lexicographic Ranking Supermartingales with
Lazy Lower Bounds

Toru Takisaka¹ Libo Zhang² Changjiang Wang¹ Jiamou Liu² ¹ University of Electronic Science and Technology of China
takisaka@uestc.edu.cn; 202222080938@std.uestc.edu.cn² The University of Auckland
lzha797@aucklanduni.ac.nz; jiamou.liu@auckland.ac.nz¹ University of Electronic Science and Technology of China
takisaka@uestc.edu.cn; 202222080938@std.uestc.edu.cn² The University of Auckland
lzha797@aucklanduni.ac.nz; jiamou.liu@auckland.ac.nz

Abstract

Lexicographic Ranking SuperMartingale (LexRSM) is a probabilistic extension of Lexicographic Ranking Function (LexRF), which is a widely accepted technique for verifying program termination. In this paper, we are the first to propose sound probabilistic extensions of LexRF with a weaker non-negativity condition, called single-component (SC) non-negativity. It is known that such an extension, if it exists, will be nontrivial due to the intricacies of the probabilistic circumstances.

Toward the goal, we first devise the notion of fixability, which offers a systematic approach for analyzing the soundness of possibly negative LexRSM. This notion yields a desired extension of LexRF that is sound for general stochastic processes. We next propose another extension, called Lazy LexRSM, toward the application to automated verification; it is sound over probabilistic programs with linear arithmetics, while its subclass is amenable to automated synthesis via linear programming. We finally propose a LexRSM synthesis algorithm for this subclass, and perform experiments.

1 Introduction

Background 1: Lexicographic RFs with different non-negativity conditions. Ranking function (RF) is one of the most well-studied tools for verifying program termination. An RF is typically a real-valued function over program states that satisfies: (a) the ranking condition, which requires an RF to decrease its value by a constant through each transition; and (b) the non-negativity condition, which imposes a lower bound on the value of the RF so that its infinite descent through transitions is prohibited. The existence of such a function implies termination of the underlying program, and therefore, one can automate verification of program termination by RF synthesis algorithms.

Improving the applicability of RF synthesis algorithms, i.e., making them able to prove termination of a wider variety of programs, is one of the core interests in the study of RF. A lexicographic extension of RF (LexRF) [BradleyMS05, Ben-AmramG15] is known as a simple but effective approach to the problem. Here, a LexRF is a function to real-valued vectors instead of the reals, and its ranking condition is imposed with respect to the lexicographic order. For example, the value of a LexRF may change from $(1,1,1)$ to $(1,0,2)$ through a state transition; here, the value “lexicographically decreases by 1” through the transition, that is, it decreases by 1 in some dimension while it is non-increasing on the left to that dimension. LexRF is particularly good at handling nested structures of programs, as vectors can measure the progress of different “phases” of programs separately. LexRF is also used in top-performing termination provers (e.g., [ultimateAutomizer]).

There are several known ways to impose non-negativity on LexRFs (see also Fig. 1): (a) Strong non-negativity, which requires non-negativity in every dimension of the LexRF; (b) leftward non-negativity, which requires non-negativity on the left of the ranking dimension of each transition, i.e., the dimension where the value of the LexRF should strictly decrease through the transition; and (c) single-component non-negativity, which requires non-negativity only in the ranking dimensions. It is known that any of these non-negativity conditions makes the resulting LexRF sound [BradleyMS05, Ben-AmramG15], i.e., a program indeed terminates whenever it admits a LexRF with either of these non-negativity conditions. For better applicability, single-component non-negativity is the most preferred, as it is the weakest constraint among the three.

⬇ $\ell_{1}:$ $\ell_{2}:$ ⬇ skip; $x:=1$ ; … ⬇ // $\boldsymbol{\eta}=(a_{1},\underline{b_{1}},c_{1})$ // $\boldsymbol{\eta}=(\underline{a_{2}},b_{2},c_{2})$

Non-negativity condition	$\boldsymbol{\eta}$ should be non-neg. at
Strong (ST) non-neg.	$a_{1},b_{1},c_{1},a_{2},b_{2},c_{2}$
Leftward (LW) non-neg.	$a_{1},b_{1},a_{2}$
Single-component (SC) non-neg.	$b_{1},a_{2}$

Figure 1: A demo of different non-negativity conditions for LexRFs. There, the ranking dimensions of the LexRF

\boldsymbol{\eta}

are indicated by underlines, and the last column of the table shows where each condition requires

\boldsymbol{\eta}

to be non-negative.

Background 2: Probabilistic programs and lexicographic RSMs. One can naturally think of a probabilistic counterpart of the above argument. One can consider probabilistic programs that admit randomization in conditional branching and variable updates. The notion of RF is then generalized to Ranking SuperMartingale (RSM), a function similar to RFs except that the ranking condition requires an RSM to decrease its value in expectation. The existence of an RSM typically implies almost-sure termination of the underlying program, i.e., termination of the program with probability 1.

Such a probabilistic extension has been actively studied, in fact: probabilistic programs are used in e.g., stochastic network protocols [parker2013verification], randomized algorithms [dubhashi2009concentration, karp1991introduction], security [barthe2016proving, lobo2021programming, barthe2016programming], and planning [canal2019probabilistic]; and there is a rich body of studies in RSM as a tool for automated verification of probabilistic programs (see §8). Similar to the RF case, a lexicographic extension of RSM (LexRSM, [AgrawalCP18, ChatterjeeGNZZ21]) is an effective approach to improve its applicability. In addition to its advantages over nested structures, LexRSM can also witness almost-sure termination of certain probabilistic programs with infinite expected runtime [AgrawalCP18, Fig. 2]; certifying such programs is known as a major challenge for RSMs.

Problem: Sound probabilistic extension of LexRF with weaker non-negativity. strongly non-negative LexRF soundly extends to LexRSM in a canonical way [AgrawalCP18], i.e., basically by changing the clause “decrease by a constant” in the ranking condition of LexRF to “decrease by a constant in expectation”. In contrast, the similar extension of leftward or single-component non-negative LexRF yields an unsound LexRSM notion [ferrer2015probabilistic, ChatterjeeGNZZ21]. To date, a sound LexRSM with the weakest non-negativity in the literature is Generalized LexRSM (GLexRSM) [ChatterjeeGNZZ21], which demands leftward non-negativity and an additional one, so-called expected leftward non-negativity. Roughly speaking, the latter requires LexRSMs to be non-negative in each dimension (in expectation) upon “exiting” the left of the ranking dimension. For example, in Fig. 1, it requires $b_{2}$ to be non-negative, as the second dimension of $\boldsymbol{\eta}$ “exits” the left of the ranking dimension upon the transition $\ell_{1}\rightarrow\ell_{2}$ . GLexRSM does not generalize either leftward or single-component non-negative LexRF, in the sense that the former is strictly more restrictive than the latter two when it is considered over non-probabilistic programs.

These results do not mean that leftward or single-component non-negative LexRF can never be extended to LexRSM, however. More concretely, the following problem is valid (see the last paragraph of $\S$ 3 for a formal argument):

KEY PROBLEM: Find a sound LexRSM notion that instantiates¹¹1 We use the term “instantiate” to emphasize that we compare LexRSM and LexRF. single-component non-negative LexRF, i.e., a LexRSM notion whose condition is no stronger than that of single-component non-negative LexRF in non-probabilistic settings.

We are motivated to study this problem for a couple of reasons. First, it is a paraphrase of the following fundamental question: when do negative values of (Lex)RSM cause trouble, say, to its soundness? This question is a typical example in the study of RSM where the question becomes challenging due to its probabilistic nature. The question also appears in other topics in RSM; for example, it is known that the classical variant rule of Floyd-Hoare logic does not extend to almost-sure termination of probabilistic programs in a canonical way [Huang0CG19], due to the complicated treatment of negativity in RSMs. To our knowledge, this question has only been considered in an ad-hoc manner through counterexamples (e.g., [ferrer2015probabilistic, Huang0CG19, ChatterjeeGNZZ21]), and we do not yet have a systematic approach to answering it.

⬇ $\ell_{1}:$ $\ell_{2}:$ $\ell_{3}:$ $\ell_{4}:$ $\ell_{5}:$ ⬇ $x:=0$ ; while $x<5$ do if $y<10$ then $y:=y+\mathit{Unif}[1,2]$ else $x:=x+\mathit{Unif}[1,2]$ fi od ⬇ $\boldsymbol{\eta}=(15-2x,12-y,1)$ $\boldsymbol{\eta}=(15-2x,12-y,0)$ $\boldsymbol{\eta}=(15-2x,11-y,2)$ $\boldsymbol{\eta}=(14-2x,\;\;\;\,\;\;\,\;\;0,1)$ $\boldsymbol{\eta}=(\;\;\;\;\;\;\;\;\;\;0,\;\;\,\;\;\,\;\;\;0,0)$ ⬇ $[x<7]$ $[x<5]$ $[y<10,x<5]$ $[y\geq 10,x<5]$ $[x\geq 5]$

Figure 2: A probabilistic modification of speedDis1 [alias2010multi], where

\mathit{Unif}[a,b]

is a uniform sampling from the (continuous) interval

[a,b]

. Inequalities on the right represent invariants. While

\boldsymbol{\eta}

is not a GLexRSM, it is an LLexRSM we propose; thus it witnesses almost-sure termination of the program.

Second, relaxing the non-negativity condition of LexRSM is highly desirable if we wish to fully unlock the benefit of the lexicographic extension in automated verification. A motivating example is given in Fig. 2. The probabilistic program in Fig. 2 terminates almost-surely, but it does not admit any linear GLexRSM (and hence, the GLexRSM synthesis algorithms in [ChatterjeeGNZZ21] cannot witness its almost-sure termination); for example, the function $\boldsymbol{\eta}$ ranks every transition of the program, but violates both leftward and expected leftward non-negativity at the transition $\ell_{1}\rightarrow\ell_{2}$ (note $\boldsymbol{\eta}$ ranks this transition in the third dimension; to check the violation of expected leftward non-negativity, also note $\boldsymbol{\eta}$ ranks $\ell_{2}\rightarrow\ell_{4}$ in the first dimension). Here, the source of the problem is that the program has two variables whose progress must be measured (i.e., increment $y$ to 10 in $\ell_{3}$ ; and increment $x$ to 5 in $\ell_{4}$ ), but one of their progress measures can be arbitrarily small during the program execution ( $y$ can be initialized with any value). Not only that this structure is rather fundamental, it is also expected that our desired LexRSM could handle it, if it exists. Indeed, modify the probabilistic program in Fig. 2 into a non-probabilistic one by changing “ $\mathit{Unif}[1,2]$ ” to “ $1$ ”; then the program admits $\boldsymbol{\eta}$ as a single-component non-negative LexRF.

Contributions. In this paper, we are the first to introduce sound LexRSM notions that instantiate single-component non-negative LexRF. Our contributions are threefold, as we state below.

•

First, in response to the first motivation we stated above, we devise a novel notion of fixability as a theoretical tool to analyze if negative values of a LexRSM “cause trouble”. Roughly speaking, we identify the source of the trouble as “ill” exploitation of unbounded negativity of LexRSM; our $\varepsilon$ -fixing operation prohibits such exploitation by basically setting all the negative values of a LexRSM into the same negative value $-\varepsilon$ , and we say a LexRSM is $\varepsilon$ -fixable if it retains the ranking condition through such a transformation. We give more details about its concept and key ideas in §2.

The soundness of $\varepsilon$ -fixable LexRSM immediately follows from that of strongly non-negative one [AgrawalCP18] because any LexRSM becomes strongly non-negative through the $\varepsilon$ -fixing operation (after globally adding $\varepsilon$ ). Fixable LexRSM instantiates single-component non-negative LexRF for general stochastic processes (Thm. 4), while also serving as a technical basis for proving the soundness of other LexRSMs. Meanwhile, fixable LexRSM cannot be directly applied to automated verification algorithms due to the inherent non-linearity of $\varepsilon$ -fixing; this observation leads us to our second contribution.
•

Second, in response to the second motivation we stated above, we introduce Lazy LexRSM (LLexRSM) as another LexRSM notion that instantiates single-component non-negative LexRF. LLexRSM does not involve the $\varepsilon$ -fixing operation in its definition; thanks to this property, we have a subclass of LLexRSM that is amenable to automated synthesis via linear programming (see §6). The LLexRSM condition consists of the single-component non-negative LexRSM condition and stability at negativity we propose (Def. 5), which roughly requires the following: Once the value of a LexRSM gets negative in some dimension, it must stay negative until that dimension exits the left of the ranking one. For example, $\boldsymbol{\eta}$ in Fig. 2 is an LLexRSM; indeed, $\ell_{2}\rightarrow\ell_{4}$ and $\ell_{1}\rightarrow\ell_{5}$ are the only transitions where $\boldsymbol{\eta}$ possibly changes its value from negative to non-negative in some dimension (namely, the second one), which is although the right to the ranking dimension (the first one).

We prove linear LLexRSM is sound for probabilistic programs over linear arithmetics (see Thm. 5 for the exact assumption). The proof is highly nontrivial, which is realized by subtle use of a refined variant of fixability; we explain its core idea in §2. Furthermore, Thm. 5 shows that expected leftward non-negativity in GLexRSM [ChatterjeeGNZZ21] is actually redundant under the assumption in Thm. 5. This is surprising, as expected leftward non-negativity has been invented to restore the soundness of leftward non-negative LexRSM, which is generally unsound.
•

Third, we present a synthesis algorithm for the subclass of LLexRSM we mentioned above, and do experiments; there, our algorithms verified almost-sure termination of various programs that could not be handled by (a better proxy of) the GLexRSM-based one. The details can be found in §7.

2 Key Observations with Examples

Here we demonstrate by examples how intricate the treatment of negative values of LexRSM is, and how we handle it by our proposed notion of fixability.

Blocking “ill” exploitation of unbounded negativity. Fig. 3 is a counterexample that shows leftward non-negative LexRSM is generally unsound (conceptually the same as [ChatterjeeGNZZ21, Ex. 1]). The probabilistic program in Fig. 3 does not terminate almost-surely because the chance of entering $\ell_{4}$ from $\ell_{3}$ quickly decreases as $t$ increases. Meanwhile, $\boldsymbol{\eta}=(\eta_{1},\eta_{2},\eta_{3})$ in Fig. 3 is a leftward non-negative LexRSM over a global invariant $[0\leq x\leq 1]$ ; in particular, observe $\eta_{2}$ decreases by $1$ in expectation from $\ell_{3}$ , whose successor location is either $\ell_{4}$ or $\ell_{1}$ .

⬇ $\ell_{1}:$ $\ell_{2}:$ $\ell_{3}:$ $\ell_{4}:$ $\ell_{5}:$ ⬇ $x:=0$ ; $t:=1$ ; while $x=0$ do $t:=t+1$ ; if $\mathbf{prob}(2^{-t})$ then $x:=1$ fi od ⬇ $\boldsymbol{\eta}=(2-x,\;\;\;\;0,2)$ $\boldsymbol{\eta}=(\;\;\;\;\;\;\,2,\;\;\;\;0,1)$ $\boldsymbol{\eta}=(\;\;\;\;\;\;\,2,\;\;\;\;0,0)$ $\boldsymbol{\eta}=(\;\;\;\;\;\;\,2,-2^{t},0)$ $\boldsymbol{\eta}=(\;\;\;\;\;\;\,0,\;\;\;\;0,0)$

Figure 3: An example of “ill” exploitation.

This example reveals an inconsistency between the ways how the single-component non-negativity and ranking condition evaluate the value of a LexRSM, say $\boldsymbol{\eta}=(\eta_{1},\ldots,\eta_{n})$ . The single-component non-negativity claims $\boldsymbol{\eta}$ cannot rank a transition in a given dimension $k$ whenever $\eta_{k}$ is negative; intuitively, this means that any negative value in the ranking domain $\mathbb{R}$ should be understood as the same state, namely the “bottom” of the domain. Meanwhile, the ranking condition evaluates different negative values differently; a smaller negative value of $\eta_{k}$ can contribute more to satisfy the ranking condition, as one can see from the behavior of $\eta_{2}$ in Fig. 3 at $\ell_{3}$ . The function $\boldsymbol{\eta}$ in Fig. 3 satisfies the ranking condition over a possibly non-terminating program through “ill” exploitation of this inconsistency; as $t$ becomes larger, the value of $\eta_{2}$ potentially drops more significantly through the transition from $\ell_{3}$ , but with a smaller probability.

The first variant of our fixability notion, called $\varepsilon$ -fixability, enables us to ensure that such exploitation is not happening. We simply set every negative value in a LexRSM $\boldsymbol{\eta}$ to a negative constant $-\varepsilon$ , and say $\boldsymbol{\eta}$ is $\varepsilon$ -fixable if it retains the ranking condition through the modification²²2 To give the key ideas in a simpler way, the description here slightly differs from the actual definition in §4; referred results in §2 are derived from the latter. See Rem. 4. . For example, the $\varepsilon$ -fixing operation changes the value of $\eta_{2}$ in Fig. 3 at $\ell_{4}$ from $-2^{t}$ to $-\varepsilon$ , and $\boldsymbol{\eta}$ does not satisfy the ranking condition after that. Therefore, $\boldsymbol{\eta}$ in Fig. 3 is not $\varepsilon$ -fixable for any $\varepsilon>0$ (i.e., we successfully reject this $\boldsymbol{\eta}$ through the fixability check). Meanwhile, an $\varepsilon$ -fixable LexRSM witnesses almost-sure termination of the underlying program; indeed, the fixed LexRSM is a strongly non-negative LexRSM (by globally adding $\varepsilon$ to the fixed $\boldsymbol{\eta}$ ), which is known to be sound [AgrawalCP18].

The notion of $\varepsilon$ -fixability is operationally so simple that one might even feel it is a boring idea; nevertheless, its contribution to revealing the nature of possibly negative LexRSM is already significant in our paper. Indeed, (a) $\varepsilon$ -fixable LexRSM instantiates single-component non-negative LexRF with an appropriate $\varepsilon$ (Thm. 4); (b) $\varepsilon$ -fixable LexRSM generalizes GLexRSM [ChatterjeeGNZZ21], and the proof offers an alternative proof of soundness of GLexRSM that is significantly simpler than the original one (Thm. 4); and (c) its refined variant takes the crucial role in proving soundness of our second LexRSM variant, lazy LexRSM.

Allowing “harmless” unbounded negativity. While $\varepsilon$ -fixable LexRSM already instantiates single-component non-negative LexRF, we go one step further to obtain a LexRSM notion that is amenable to automated synthesis, in particular via Linear Programming (LP). The major obstacle to this end is the case distinction introduced by $\varepsilon$ -fixability, which makes the fixed LexRSM nonlinear. Lazy LexRSM (LLexRSM), our second proposed LexRSM, resolves this problem while it also instantiates single-component non-negative LexRF.

Linear LLexRSM is sound over probabilistic programs with linear arithmetics (Thm. 5). The key to the proof is, informally, the following observation: Restrict our attention to probabilistic programs and functions $\boldsymbol{\eta}$ that are allowed in the LP-based synthesis. Then “ill” exploitation in Fig. 3 never occurs, and therefore, a weaker condition than $\varepsilon$ -fixability (namely, the LLexRSM one) suffices for witnessing program termination. In fact, Fig. 3 involves (a) non-linear arithmetics in the program, (b) parametrized if-branch in the program (i.e., the grammar “ if prob $(p)$ then $P$ else $Q$ fi ” with $p$ being a variable), and (c) non-linearity of $\boldsymbol{\eta}$ . None of them are allowed in the LP-based synthesis (at least, in the standard LP-based synthesis via Farkas’ Lemma [chakarov2013probabilistic, AgrawalCP18, ChatterjeeGNZZ21]). Our informal statement above is formalized as Thm. 5, which roughly says: Under such a restriction to probabilistic programs and $\boldsymbol{\eta}$ , any LLexRSM is $(\varepsilon,\gamma)$ -fixable. Here, $(\varepsilon,\gamma)$ -fixability is a refined version of $\varepsilon$ -fixability; while it also ensures that “ill” exploitation is not happening in $\boldsymbol{\eta}$ , it is less restrictive than $\varepsilon$ -fixability by allowing “harmless” unbounded negative values of $\boldsymbol{\eta}$ .

⬇ $\ell_{1}:$ $\ell_{2}:$ $\ell_{3}:$ $\ell_{4}:$ $\ell_{5}:$ ⬇ $x:=0$ ; $t:=1$ ; while $x=0$ do if $\mathbf{prob}(0.5)$ then $t:=4t$ else $x:=1$ fi od ⬇ $\boldsymbol{\eta}=(2-x,\;\;\;\;\;t+1)$ $\boldsymbol{\eta}=(\;\;\;\;\;\;\,2,\;\;\;\;\;\;\;\;\;\;\;t)$ $\boldsymbol{\eta}=(\;\;\;\;\;\;\,2,\;\;\;4t+2)$ $\boldsymbol{\eta}=(\;\;\;\;\;\;\,2,-2t-4)$ $\boldsymbol{\eta}=(\;\;\;\;\;\;\,0,\;\;\;\;\;\;\;\;\;\;\;0)$

Figure 4: An example of “harmless” unbounded negativity.

Fig. 4 gives an example of such a harmless behavior of $\boldsymbol{\eta}$ rejected by $\varepsilon$ -fixability. It also shows why we cannot simply use $\varepsilon$ -fixability to check an LLexRSM does not do “ill” exploitation. The function $\boldsymbol{\eta}=(\eta_{1},\eta_{2})$ in Fig. 4 is leftward non-negative over the global invariant $[0\leq x\leq 1\land t\geq 1]$ , so it is an LLexRSM for the probabilistic program there; the program and $\boldsymbol{\eta}$ are also in the scope of LP-based synthesis; but $\boldsymbol{\eta}$ is not $\varepsilon$ -fixable for any $\varepsilon>0$ . Indeed, the $\varepsilon$ -fixing operation changes the value of $\eta_{2}$ at $\ell_{4}$ from $-2t-4$ to $-\varepsilon$ , and $\boldsymbol{\eta}$ does not satisfy the ranking condition at $\ell_{2}$ after the change. Here we notice that, however, the unbounded negative values of $\eta_{2}$ are “harmless”; that is, the “ill-gotten gains” by the unbounded negative values of $\eta_{2}$ at $\ell_{4}$ are only “wasted” to unnecessarily increase $\eta_{2}$ at $\ell_{3}$ . In fact, $\boldsymbol{\eta}$ still satisfies the ranking condition if we change the value of $\eta_{2}$ at $\ell_{1},\ell_{2},\ell_{3}$ to $2,1$ , and $0$ , respectively.

We resolve this issue by partially waiving the ranking condition of $\boldsymbol{\eta}$ after the $\varepsilon$ -fixing operation. It is intuitively clear that the program in Fig. 4 almost-surely terminates, and the intuition here is that the program essentially repeats an unbiased coin tossing until the tail is observed (here, “observe the tail” corresponds to “observe $\mbox{\bf prob}(0.5)=\mbox{\bf true}$ at $\ell_{2}$ ”). This example tells us that, to witness the almost-sure termination of this program, we only need to guarantee the program (almost-surely) visits either the terminal location $\ell_{5}$ or the “coin-tossing location” $\ell_{2}$ from anywhere else. The $\varepsilon$ -fixed $\boldsymbol{\eta}$ in Fig. 4 does witness such a property of the program, as it ranks every transition except those that are from a coin-tossing location, namely $\ell_{2}$ .

We generalize this idea as follows: Fix $\gamma\in(0,1)$ , and say a program state is a “coin-tossing state” for $\boldsymbol{\eta}=(\eta_{1},\ldots,\eta_{n})$ in the $k$ -th dimension if $\eta_{k}$ drops from non-negative to negative (i.e., the ranking is “done” in the $k$ -th dimension) with the probability $\gamma$ or higher. Then we say $\boldsymbol{\eta}$ is $(\varepsilon,\gamma)$ -fixable (Def. 4) if the $\varepsilon$ -fixed $\boldsymbol{\eta}$ is a strongly non-negative LexRSM (after adding $\varepsilon$ ) except that, at each coin-tossing state, we waive the ranking condition of $\boldsymbol{\eta}$ in the corresponding dimension. For example, $\boldsymbol{\eta}$ in Fig. 4 is $(\varepsilon,\gamma)$ -fixable for any $\gamma\in(0,0.5]$ . As expected, $(\varepsilon,\gamma)$ -fixable LexRSM is sound for any $\varepsilon>0$ and $\gamma\in(0,1)$ (Cor. 4).

3 Preliminaries

We recall the technical preliminaries. Omitted details are in Appendix A.

Notations. We assume the readers are familiar with the basic notions of measure theory, see e.g. [Ash:book, BertsekasS07]. The sets of non-negative integers and reals are denoted by $\mathbb{N}$ and $\mathbb{R}$ , respectively. The collection of all Borel sets of a topological space $\mathcal{X}$ is denoted by $\mathcal{B}(\mathcal{X})$ . The set of all probability distributions over the measurable space $(\Omega,\mathcal{B}(\Omega))$ is denoted by $\mathcal{D}(\Omega)$ . The value of a vector $\boldsymbol{x}$ at the $i$ -th index is denoted by $\boldsymbol{x}[i]$ or $x_{i}$ . A subset $D\subseteq\mathbb{R}$ of the reals is bounded if $D\subseteq[-x,x]$ for some $x>0$ .

For a finite variable set $V$ and the set $val^{V}$ of its valuations, we form predicates as first-order formulas with atomic predicates of the form $f\leq g$ , where $f,g\colon val^{V}\to R$ and $R$ is linearly ordered. Often, we are only interested in the value of a predicate $\varphi$ over a certain subset $\mathcal{X}\subseteq val^{V}$ , in which case, we call $\varphi$ a predicate over $\mathcal{X}$ . We identify a predicate $\varphi$ over $\mathcal{X}$ with a function $\tilde{\varphi}\colon\mathcal{X}\to\{0,1\}$ such that $\tilde{\varphi}(x)=1$ if and only if $\varphi(x)$ is true. The semantics of $\varphi$ , i.e., the set $\{x\in\mathcal{X}\mid\varphi(x)\mbox{ is true}\}$ , is denoted by $\llbracket\varphi\rrbracket$ . The characteristic function ${\bf 1}_{A}:\mathcal{X}\to\{0,1\}$ of a subset $A$ of $\mathcal{X}$ is a function such that $\llbracket{\bf 1}_{A}=1\rrbracket=A$ . For a probability space $(\Omega,\mathcal{F},\mathbb{P})$ , we say $\varphi$ over $\Omega$ is ( $\mathcal{F}$ -)measurable when $\llbracket\varphi\rrbracket\in\mathcal{F}$ . For such a $\varphi$ , the satisfaction probability of $\varphi$ w.r.t. $\mathbb{P}$ , i.e., the value $\mathbb{P}(\llbracket\varphi\rrbracket)$ , is also denoted by $\mathbb{P}(\varphi)$ ; we say $\varphi$ holds $\mathbb{P}$ -almost surely ( $\mathbb{P}$ -a.s.) if $\mathbb{P}(\varphi)=1$ .

3.1 Syntax and Semantics of Probabilistic Programs

Syntax. We define the syntax of Probabilistic Programs (PPs) similarly to e.g., [AgrawalCP18, TakisakaOUH21]. More concretely, PPs have the standard control structure in imperative languages such as if-branches and while-loops, while the if-branching and variable assignments can also be done in either nondeterministic or probabilistic ways. Namely, ‘if $\star$ ’ describes a nondeterministic branching; ‘ndet $(D)$ ’ describes a nondeterministic assignment chosen from a bounded³³3 This is also assumed in [ChatterjeeGNZZ21] to avoid a complication in possibly negative LexRSMs. domain $D\subseteq\mathcal{B}(\mathbb{R})$ ; ‘if prob $(p)$ ’ with a constant $p\in[0,1]$ describes a probabilistic branching that executes the ‘then’ branch with probability $p$ , or the ‘else’ branch with probability $1-p$ ; and ‘sample $(d)$ ’ describes a probabilistic assignment sampled from a distribution $d\in\mathcal{D}(\mathbb{R})$ . We consider PPs without conditioning, which are also called randomized programs [TakisakaOUH21]; PPs with conditioning are considered in e.g. [OlmedoGJKKM18]. The exact grammar is given in Appendix A.

In this paper, we focus our attention on PPs with linear arithmetics; we say a PP is linear if each arithmetic expression in it is linear, i.e., of the form $b+\sum_{i=1}^{n}a_{i}\cdot v_{i}$ for constants $a_{1},\ldots,a_{n},b$ and program variables $v_{1},\ldots,v_{n}$ .

Semantics. We adopt probabilistic control flow graph (pCFG) as the semantics of PPs, which is standard in existing RSM works (e.g., [chakarov2013probabilistic, TakisakaOUH21, ChatterjeeGNZZ21]). Informally, it is a labeled directed graph whose vertices are program locations, and whose edges represent possible one-step executions in the program. Edges are labeled with the necessary information so that one can reconstruct the PP represented by the pCFG; for example, an edge $e$ can be labeled with the assignment commands executed through $e$ (e.g., ‘ $x:=x+1$ ’), the probability $p\in[0,1]$ that $e$ will be chosen (through ‘if prob $(p)$ ’), the guard condition, and so on. Below we give its formal definition for completeness; see Appendix A for how to translate PPs into pCFGs.

{mydefinition}

[pCFG] A pCFG is a tuple $(L,V,\Delta,\mathit{Up},G)$ , where

1.

$L$ is a finite set of locations.
2.

$V=\{x_{1},\ldots,x_{|V|}\}$ is a finite set of program variables.
3.

$\Delta$ is a finite set of (generalized) transitions⁴⁴4Defining these as edges might be more typical, as in our informal explanation. We adopt the style of [AgrawalCP18, ChatterjeeGNZZ21] for convenience; it can handle ‘if prob $(p)$ ’ by a single $\tau$ . , i.e., tuples $\tau=(\ell,\delta)$ of a location $\ell\in L$ and a distribution $\delta\in\mathcal{D}(L)$ over successor locations.
4.

$\mathit{Up}$ is a function that receives a transition $\tau\in\Delta$ and returns a tuple $(i,u)$ of a target variable index $i\in\{1,\ldots,|V|\}$ and an update element $u$ . Here, $u$ is either (a) a Borel measurable function $u:\mathbb{R}^{|V|}\to\mathbb{R}$ , (b) a distribution $d\in\mathcal{D}(\mathbb{R})$ , or (c) a bounded measurable set $R\in\mathcal{B}(\mathbb{R})$ . In each case, we say $\tau$ is deterministic, probabilistic, and non-deterministic, respectively; the collections of these transitions are denoted by $\Delta_{d}$ , $\Delta_{p}$ , and $\Delta_{n}$ , respectively.
5.

$G$ is a guard function that assigns a $G(\tau):\mathbb{R}^{|V|}\to\{0,1\}$ to each $\tau\in\Delta$ .

Below we fix a pCFG $\mathcal{C}=(L,V,\Delta,\mathit{Up},G)$ . A state of $\mathcal{C}$ is a tuple $s=(\ell,\boldsymbol{x})$ of location $\ell\in L$ and variable assignment vector $\boldsymbol{x}\in\mathbb{R}^{|V|}$ . We write $\mathcal{S}$ to denote the state set $L\times\mathbb{R}^{|V|}$ . Slightly abusing the notation, for $\tau=(\ell,\delta)$ , we identify the set $\llbracket G(\tau)\rrbracket\subseteq\mathbb{R}^{|V|}$ and the set $\{\ell\}\times\llbracket G(\tau)\rrbracket\subseteq\mathcal{S}$ ; in particular, we write $s\in\llbracket G(\tau)\rrbracket$ when $\tau$ is enabled at $s$ , i.e., $s=(\ell,\boldsymbol{x})$ , $\tau=(\ell,\delta)$ and $\boldsymbol{x}\in\llbracket G(\tau)\rrbracket$ .

A pCFG $\mathcal{C}$ with its state set $\mathcal{S}$ can be understood as a transition system over $\mathcal{S}$ with probabilistic transitions and nondeterminism (or, more specifically, a Markov decision process with its states $\mathcal{S}$ ). Standard notions such as successors of a state $s\in\mathcal{S}$ , finite paths, and (infinite) runs of $\mathcal{C}$ are defined as the ones over such a transition system. The set of all successors of $s\in\llbracket G(\tau)\rrbracket$ via $\tau$ is denoted by $\mathrm{succ}_{\tau}(s)$ . The set of runs of $\mathcal{C}$ is denoted by $\Pi_{\mathcal{C}}$ .

Schedulers resolve nondeterminism in pCFGs. Observe there are two types of nondeterminism: (a) nondeterministic choice of $\tau\in\Delta$ at a given state (corresponds to ‘if $\star$ ’), and (b) nondeterministic variable update in a nondeterministic transition $\tau\in\Delta_{n}$ (corresponds to ‘ $x_{i}:=$ ndet $(D)$ ’). We say a scheduler is $\Delta$ -deterministic if its choice is non-probabilistic in Case (a).

We assume pCFGs are deadlock-free; we also assume that there are designated locations $\ell_{\mathrm{in}}$ and $\ell_{\mathrm{out}}$ that represent program initiation and termination, respectively. An initial state is a state of the form $(\ell_{\mathrm{in}},\boldsymbol{x})$ . We assume a transition from $\ell_{\mathrm{out}}$ is unique, denoted by $\tau_{\mathrm{out}}$ ; this transition does not update anything.

By fixing a scheduler $\sigma$ and an initial state $s_{I}$ , the infinite-horizon behavior of $\mathcal{C}$ is determined as a distribution $\mathbb{P}_{s_{I}}^{\sigma}$ over $\Pi_{\mathcal{C}}$ ; that is, for a measurable $A\subseteq\Pi_{\mathcal{C}}$ , the value $\mathbb{P}_{s_{I}}^{\sigma}(A)$ is the probability that a run of $\mathcal{C}$ from $s_{I}$ is in $A$ under $\sigma$ . We call the probability space $(\Pi_{\mathcal{C}},\mathcal{B}(\Pi_{\mathcal{C}}),\mathbb{P}_{s_{I}}^{\sigma})$ the dynamics of $\mathcal{C}$ under $\sigma$ and $s_{I}$ . See [BertsekasS07] for the formal construction; a brief explanation is in Appendix A.

We define the termination time of a pCFG $\mathcal{C}$ as the function $T_{\mathrm{term}}^{\mathcal{C}}:\Pi_{\mathcal{C}}\to\mathbb{N}\cup\{+\infty\}$ such that $T_{\mathrm{term}}^{\mathcal{C}}(s_{0}s_{1}\ldots)=\inf\{t\in\mathbb{N}\mid\exists\boldsymbol{x}.s_{t}=(\ell_{\mathrm{out}},\boldsymbol{x})\}$ . Now we formalize our objective, i.e., almost-sure termination of pCFG, as follows.

{mydefinition}

[AST of pCFG] A run $\omega\in\Pi_{\mathcal{C}}$ terminates if $T_{\mathrm{term}}^{\mathcal{C}}(\omega)<\infty$ . A pCFG $\mathcal{C}$ is a.s. terminating (AST) under a scheduler $\sigma$ and an initial state $s_{I}$ if a run of $\mathcal{C}$ terminates $\mathbb{P}_{s_{I}}^{\sigma}$ -a.s. We say $\mathcal{C}$ is AST if it is AST for any $\sigma$ and $s_{I}$ .

3.2 Lexicographic Ranking Supermartingales

Here we recall mathematical preliminaries of the LexRSM theory. A (Lex)RSM typically comes in two different forms: one is a vector-valued function $\boldsymbol{\eta}:\mathcal{S}\to\mathbb{R}^{n}$ over states $\mathcal{S}$ of a pCFG $\mathcal{C}$ , and another is a stochastic process over the runs $\Pi_{\mathcal{C}}$ of $\mathcal{C}$ . We recall relevant notions in these formulations, which are frequently used in existing RSM works [chakarov2013probabilistic, ChatterjeeGNZZ21]. We also recall the formal definition of LexRSMs with three different non-negativity conditions in Fig. 1.

LexRSM as a quantitative predicate. Fix a pCFG $\mathcal{C}$ . An ( $n$ -dimensional) measurable map (MM) is a Borel measurable function $\boldsymbol{\eta}:\mathcal{S}\to\mathbb{R}^{n}$ . For a given 1-dimensional MM $\eta$ and a transition $\tau$ , The (maximal) pre-expectation of $\eta$ under $\tau$ is a function that formalizes “the value of $\eta$ after the transition $\tau$ ”. More concretely, it is a function $\overline{\mathbb{X}}_{\tau}\eta:\llbracket G(\tau)\rrbracket\to\mathbb{R}$ that returns, for a given state $s$ , the maximal expected value of $\eta$ at the successor state of $s$ via $\tau$ . Here, the maximality refers to the set of all possible nondeterministic choices at $s$ .

A level map $\mathsf{Lv}:\Delta\to\{0,\ldots,n\}$ designates the ranking dimension of the associated LexRSM $\boldsymbol{\eta}:\mathcal{S}\to\mathbb{R}^{n}$ . We require $\mathsf{Lv}(\tau)=0$ if and only if $\tau=\tau_{\mathrm{out}}$ . We say an MM $\boldsymbol{\eta}$ ranks a transition $\tau$ in the dimension $k$ (under $\mathsf{Lv}$ ) when $k=\mathsf{Lv}(\tau)$ . An invariant is a measurable predicate $I:\mathcal{S}\to\{0,1\}$ such that $\llbracket I\rrbracket$ is closed under transitions and $\ell_{\mathrm{in}}\times\mathbb{R}^{|V|}\subseteq\llbracket I\rrbracket$ . The set $\llbracket I\rrbracket$ over-approximates the reachable states in $\mathcal{C}$ .

Suppose an $n$ -dimensional MM $\boldsymbol{\eta}$ and an associated level map $\mathsf{Lv}$ are given. We say $\boldsymbol{\eta}$ satisfies the ranking condition (under $\mathsf{Lv}$ and $I$ ) if the following holds for each $\tau\neq\tau_{\mathrm{out}}$ , $s\in\llbracket I\land G(\tau)\rrbracket$ , and $k\in\{1,\ldots,\mathsf{Lv}(\tau)\}$ :

\displaystyle\overline{\mathbb{X}}_{\tau}\boldsymbol{\eta}[k](s)\leq\begin{cases}\boldsymbol{\eta}[k](s)&\text{if }k<\mathsf{Lv}(\tau),\\ \boldsymbol{\eta}[k](s)-1&\text{if }k=\mathsf{Lv}(\tau).\end{cases}

We also define the three different non-negativity conditions in Fig. 1, i.e., STrong (ST), LeftWard (LW), and Single-Component (SC) non-negativity, as follows:

(ST non-neg.)	$\displaystyle\forall s\in\llbracket I\rrbracket.\forall k\in\{1,\ldots,n\}.$	$\displaystyle\boldsymbol{\eta}[k](s)\geq 0,$
(LW non-neg.)	$\displaystyle\forall\tau\neq\tau_{\mathrm{out}}.\forall s\in\llbracket I\land G(\tau)\rrbracket.\forall k\in\{1,\ldots,\mathsf{Lv}(\tau)\}.$	$\displaystyle\boldsymbol{\eta}[k](s)\geq 0,$
(SC non-neg.)	$\displaystyle\forall\tau\neq\tau_{\mathrm{out}}.\forall s\in\llbracket I\land G(\tau)\rrbracket.$	$\displaystyle\boldsymbol{\eta}[\mathsf{Lv}(\tau)](s)\geq 0.$

All the materials above are wrapped up in the following definition.

{mydefinition}

[(ST/LW/SC)-LexRSM map] Fix a pCFG $\mathcal{C}$ with an invariant $I$ . Let $\boldsymbol{\eta}$ be an MM associated with a level map $\mathsf{Lv}$ . The MM $\boldsymbol{\eta}$ is called a STrongly non-negative LexRSM map (ST-LexRSM map) over $\mathcal{C}$ supported by $I$ if it satisfies the ranking condition and the strong non-negativity under $\mathsf{Lv}$ and $I$ . If it satisfies the leftward or single-component non-negativity instead of the strong one, then we call it LW-LexRSM map or SC-LexRSM map, respectively. LexRSM as a stochastic process. When it comes to automated synthesis, a (Lex)RSM is usually a function $\boldsymbol{\eta}$ over program states, as defined in Def. 3.2. Meanwhile, when we prove the properties of (Lex)RSMs themselves (e.g., soundness), it is often necessary to inspect the behavior of $\boldsymbol{\eta}$ upon the program execution under given scheduler $\sigma$ and initial state $s_{I}$ . Such a behavior of $\boldsymbol{\eta}$ is formalized as a sequence $(\mathbf{X}_{t})_{t=0}^{\infty}$ of random variables over the dynamics of the underlying pCFG, which forms a stochastic process.

A (discrete-time) stochastic process in a probability space $(\Omega,\mathcal{F},\mathbb{P})$ is a sequence $(\mathbf{X}_{t})_{t=0}^{\infty}$ of $\mathcal{F}$ -measurable random variables $\mathbf{X}_{t}:\Omega\to\mathbb{R}^{n}$ for $t\in\mathbb{N}$ . In our context, it is typically associated with another random variable $T:\Omega\to\mathbb{N}\cup\{+\infty\}$ that describes the termination time of $\omega\in\Omega$ . We say $T$ is AST (w.r.t. $\mathbb{P}$ ) if $\mathbb{P}(T<\infty)=1$ ; observe that, if $(\Omega,\mathcal{F},\mathbb{P})$ is the dynamics of a pCFG $\mathcal{C}$ under $\sigma$ and $s_{I}$ , then $\mathcal{C}$ is AST under $\sigma$ and $s_{I}$ if and only if $T_{\mathrm{term}}^{\mathcal{C}}$ is AST w.r.t. $\mathbb{P}$ . As standard technical requirements, we assume there is a filtration $(\mathcal{F}_{t})_{t=0}^{\infty}$ in $(\Omega,\mathcal{F},\mathbb{P})$ such that $(\mathbf{X}_{t})_{t=0}^{\infty}$ is adapted to $(\mathcal{F}_{t})_{t=0}^{\infty}$ , $T$ is a stopping time w.r.t. $(\mathcal{F}_{t})_{t=0}^{\infty}$ , and $(\mathbf{X}_{t})_{t=0}^{\infty}$ is stopped at $T$ ; see Appendix A for their definitions.

For a stopping time $T$ w.r.t. $(\mathcal{F}_{t})_{t=0}^{\infty}$ , we define a level map $(\mathsf{Lv}_{t})_{t=0}^{\infty}$ as a sequence of $\mathcal{F}_{t}$ -measurable functions $\mathsf{Lv}_{t}:\Omega\to\{0,\ldots n\}$ such that $\llbracket\mathsf{Lv}_{t}=0\rrbracket=\llbracket T\leq t\rrbracket$ for each $t$ . We call a pair of a stochastic process and a level map an instance for $T$ ; just like we construct an MM $\boldsymbol{\eta}$ and a level map $\mathsf{Lv}$ as an AST certificate of a pCFG $\mathcal{C}$ , we construct an instance for a stopping time $T$ as its AST certificate. We say an instance $((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ for $T$ ranks $\omega\in\Omega$ in the dimension $k$ at time $t$ when $T(\omega)>t$ and $k=\mathsf{Lv}_{t}(\omega)$ .

For $c>0$ , we say an instance $((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ satisfies the $c$ -ranking condition if, for each $t\in\mathbb{N}$ , $\omega\in\llbracket\mathsf{Lv}_{t}\neq 0\rrbracket$ , and $k\in\{1,\ldots,\mathsf{Lv}_{t}(\omega)\}$ , we have:

\displaystyle\mathbb{E}[\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)\leq\mathbf{X}_{t}[k](\omega)-c\cdot{\bf 1}_{\llbracket k=\mathsf{Lv}_{t}\rrbracket}(\omega)\quad(\mathbb{P}\mbox{-a.s.})

(1)

Here, the function $\mathbb{E}[\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}]$ denotes the conditional expectation of $\mathbf{X}_{t+1}[k]$ given $\mathcal{F}_{t}$ , which takes the role of pre-expectation. We mostly let $c=1$ and simply call it the ranking condition; the only result sensitive to $c$ is Thm. 4.

We also define the three different non-negativity conditions for an instance as follows. Here we adopt a slightly general (but essentially the same) variant of strong non-negativity instead, calling it uniform well-foundedness; we simply allow the uniform lower bound to be any constant $\bot\in\mathbb{R}$ instead of fixing it to be zero. This makes the later argument simpler.

(UN well-fnd.)	$\displaystyle\exists\bot\in\mathbb{R}.\forall t\in\mathbb{N}.\forall\omega\in\Omega.\forall k\in\{1,\ldots,n\}.$	$\displaystyle\mathbf{X}_{t}[k](\omega)\geq\bot,$
(LW non-neg.)	$\displaystyle\forall t\in\mathbb{N}.\forall\omega\in\llbracket\mathsf{Lv}_{t}\neq 0\rrbracket.\forall k\in\{1,\ldots,\mathsf{Lv}_{t}(\omega)\}.$	$\displaystyle\mathbf{X}_{t}[k](\omega)\geq 0,$
(SC non-neg.)	$\displaystyle\forall t\in\mathbb{N}.\forall\omega\in\llbracket\mathsf{Lv}_{t}\neq 0\rrbracket.$	$\displaystyle\mathbf{X}_{t}[\mathsf{Lv}_{t}(\omega)](\omega)\geq 0.$

{mydefinition}

[(UN/LW/SC)-LexRSM] Suppose the following are given: a probability space $(\Omega,\mathcal{F},\mathbb{P})$ ; a filtration $(\mathcal{F}_{t})_{t=0}^{\infty}$ on $\mathcal{F}$ ; and a stopping time $T$ w.r.t. $(\mathcal{F}_{t})_{t=0}^{\infty}$ . An instance $\mathcal{I}=((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ is called a UNiformly well-founded LexRSM (UN-LexRSM) for $T$ with the bottom $\bot\in\mathbb{R}$ and a constant $c\in\mathbb{R}$ if (a) $(\mathbf{X}_{t})_{t=0}^{\infty}$ is adapted to $(\mathcal{F}_{t})_{t=0}^{\infty}$ ; (b) for each $t\in\mathbb{N}$ and $1\leq k\leq n$ , the expectation of $\mathbf{X}_{t}[k]$ exists; (c) $\mathcal{I}$ satisfies the $c$ -ranking condition; and (d) $\mathcal{I}$ is uniformly well-founded with the bottom $\bot$ . We define LW-LexRSM and SC-LexRSM by changing (d) with LW and SC non-negativity, respectively.

We mostly assume $c=1$ and omit to mention the constant. UN-LexRSM is known to be sound [AgrawalCP18]; meanwhile, LW and SC-LexRSM are generally unsound [ChatterjeeGNZZ21, ferrer2015probabilistic]. We still mention the latter two as parts of sound LexRSMs.

From RSM maps to RSMs. Let $\boldsymbol{\eta}$ be an MM over a pCFG $\mathcal{C}$ with a level map $\mathsf{Lv}$ . Together with a $\Delta$ -deterministic scheduler $\sigma$ and initial state $s_{I}$ , it induces an instance $((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ over the dynamics of $\mathcal{C}$ , by letting $\mathbf{X}_{t}(s_{0}s_{1}\ldots)=\boldsymbol{\eta}(s_{t})$ ; it describes the behavior of $\boldsymbol{\eta}$ and $\mathsf{Lv}$ through executing $\mathcal{C}$ from $s_{I}$ under $\sigma$ . Properties of $\boldsymbol{\eta}$ such as ranking condition or non-negativity are inherited to the induced instance (if the expectation of $\mathbf{X}_{t}[k]$ exists for each $t$ , $k$ ). For example, an instance induced by an ST-LexRSM map is an UN-LexRSM with $\bot=0$ .

Non-probabilistic settings, and instantiation of SC-LexRF. The key question in this paper is to find a LexRSM notion that instantiates SC non-negative LexRF (or SC-LexRF for short); that is, we would like to find a LexRSM notion whose conditions are satisfied by SC-LexRSM⁵⁵5 One would perhaps expect to see “SC-LexRF” here; such a change does not make a difference under a canonical definition of SC-LexRF, so we define the notion of instantiation in this way to save space. See also Appendix A. in the non-probabilistic setting, which we formalize as follows. We say a pCFG is a (non-probabilistic) CFG if (a) $\delta$ is Dirac for each $(\ell,\delta)\in\Delta$ , and (b) $\Delta_{p}=\emptyset$ ; this roughly means that a CFG is a model of a PP without ‘if prob $(p)$ ’ and ‘sample $(d)$ ’. We say a probability space $(\Omega,\mathcal{F},\mathbb{P})$ is trivial if $\Omega$ is a singleton, say $\{\omega\}$ .

4 Fixable LexRSMs

In §4-6 we give our novel technical notions and results. In this section, we will introduce the notion of fixability and related results. Here we focus on technical rigorousness and conciseness, see §2 for the underlying intuition. Proofs are given in appendices. We begin with the formal definition of $\varepsilon$ -fixability.

{myremark}

As in Footnote 2, our formal definitions of fixability in this section slightly differ from an informal explanation in §2. One difference is that the $\varepsilon$ -fixing in Def. 4 changes the value of a LexRSM at dimension $k$ whenever it is negative or $k$ is strictly on the right to the ranking dimension. This modification is necessary to prove Thm. 4. Another is that we define fixability as the notion for an instance $\mathcal{I}$ , rather than for an MM $\boldsymbol{\eta}$ . While the latter can be also done in an obvious way (as informally done in §2), we do not formally do that because it is not necessary for our technical development. One can “fix” the argument in §2 into the one over instances by translating “fixability of $\boldsymbol{\eta}$ ” to “fixability of an instance induced by $\boldsymbol{\eta}$ ”.

{mydefinition}

[ $\varepsilon$ -fixing of an instance] Let $\mathcal{I}=((\mathbf{X}_{t})_{t=0}^{\infty}),(\mathsf{Lv}_{t})_{t=0}^{\infty})$ be an instance for a stopping time $T$ , and let $\varepsilon>0$ . The $\varepsilon$ -fixing of $\mathcal{I}$ is another instance $\tilde{\mathcal{I}}=((\tilde{\mathbf{X}}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ for $T$ , where

\displaystyle\tilde{\mathbf{X}}_{t}[k](\omega)=\begin{cases}-\varepsilon&\text{if }\mathbf{X}_{t}[k](\omega)<0\text{ or }k>\mathsf{Lv}_{t}(\omega),\\ \mathbf{X}_{t}[k](\omega)&\text{otherwise}.\end{cases}

We say an SC-LexRSM $\mathcal{I}$ is $\varepsilon$ -fixable, or call it an $\varepsilon$ -fixable LexRSM, if its $\varepsilon$ -fixing $\tilde{\mathcal{I}}$ is an UN-LexRSM with the bottom $\bot=-\varepsilon$ .

Observe that the $\varepsilon$ -fixing of any instance is uniformly well-founded with the bottom $\bot=-\varepsilon$ , so the $\varepsilon$ -fixability only asks if the ranking condition is preserved through $\varepsilon$ -fixing. Also, observe that the soundness of $\varepsilon$ -fixable LexRSM immediately follows from that of UN-LexRSM [AgrawalCP18].

While we do not directly use $\varepsilon$ -fixability as a technical tool, the two theorems below show its conceptual value. The first one answers our key problem: $\varepsilon$ -fixable LexRSM instantiates SC-LexRF with sufficiently large $\varepsilon$ .

{mytheorem}

[fixable LexRSM instantiates SC-LexRF] Suppose $\mathcal{I}=((\boldsymbol{x}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ is an SC-LexRSM for a stopping time $T$ over the trivial probability space with a constant $c$ , and let $\varepsilon\geq c$ . Then $\mathcal{I}$ is $\varepsilon$ -fixable. ∎

The second theorem offers a formal comparison between $\varepsilon$ -fixable LexRSM and the state-of-the-art LexRSM variant in the literature, namely GLexRSM [ChatterjeeGNZZ21]. We show the former subsumes the latter. In our terminology, GLexRSM is LW-LexRSM that also satisfies the following expected leftward non-negativity:

\displaystyle\forall t\in\mathbb{N}.\forall\omega\in\llbracket\mathsf{Lv}_{t}\neq 0\rrbracket.\forall k\in\{1,\ldots,\mathsf{Lv}_{t}(\omega)\}.\ \mathbb{E}[{\bf 1}_{\llbracket k>\mathsf{Lv}_{t+1}\rrbracket}\cdot\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)\geq 0.

We note that our result can be also seen as an alternative proof of the soundness of GLexRSM [ChatterjeeGNZZ21, Thm. 1]. Our proof is also significantly simpler than the original one, as the former utilizes the soundness of UN-LexRSM as a lemma, while the latter does the proof “from scratch”.

{mytheorem}

[fixable LexRSM generalizes GLexRSM] Suppose $\mathcal{I}$ is a GLexRSM for a stopping time $T$ . Then $\mathcal{I}$ is $\varepsilon$ -fixable for any $\varepsilon>0$ . ∎

Now we move on to a refined variant, $(\varepsilon,\gamma)$ -fixability. Before its formal definition, we give a theorem that justifies the partial waiving of the ranking condition described in §2. Below, $\overset{\infty}{\exists}t.\varphi_{t}$ stands for $\forall k\in\mathbb{N}.\exists t\in\mathbb{N}.[t>k\land\varphi_{t}]$ .

{mytheorem}

[relaxation of the UN-LexRSM condition] Suppose the following are given: a probability space $(\Omega,\mathcal{F},\mathbb{P})$ ; a filtration $(\mathcal{F}_{t})_{t=0}^{\infty}$ on $\mathcal{F}$ ; and a stopping time $T$ w.r.t. $(\mathcal{F}_{t})_{t=0}^{\infty}$ . Let $\mathcal{I}=((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ be an instance for $T$ , and let $\bot\in\mathbb{R}$ . For each $k\in\{1,\ldots,n\}$ , let $(\varphi_{t,k})_{t=0}^{\infty}$ be a sequence of predicates over $\Omega$ such that

\displaystyle\overset{\infty}{\exists}t.\varphi_{t,k}(\omega)\Rightarrow\overset{\infty}{\exists}t.[\mathbf{X}_{t}[k](\omega)=\bot\lor k>\mathsf{Lv}_{t}(\omega)]\quad\mbox{($\mathbb{P}$-a.s.)}

(2)

Suppose $\mathcal{I}$ is an UN-LexRSM with the bottom $\bot$ except that, instead of the ranking condition, $\mathcal{I}$ satisfies the inequality (1) only for $t\in\mathbb{N}$ , $k\in\{1,\ldots,n\}$ , and $\omega\in\llbracket k\leq\mathsf{Lv}_{t}\land\lnot(\mathbf{X}_{t}[k]>\bot\land\varphi_{t,k})\rrbracket$ (with $c=1$ ). Then $T$ is AST w.r.t. $\mathbb{P}$ . ∎ The correspondence between the argument in §2 and Thm. 4 is as follows. The predicate $\varphi_{t,k}$ is an abstraction of the situation “we are at a coin-tossing state at time $t$ in the $k$ -th dimension”; and the condition (2) corresponds to the infinite coin-tossing argument (for a given $k$ , if $\varphi_{t,k}$ is satisfied at infinitely many $t$ , then the ranking in the $k$ -th dimension is “done” infinitely often, with probability 1). Given these, Thm. 4 says that the ranking condition of UN-LexRSM can be waived over $\llbracket\mathbf{X}_{t}[k]>\bot\land\varphi_{t,k}\rrbracket$ . In particular, the theorem amounts to the soundness of UN-LexRSM when $\varphi_{t,k}\equiv\mathit{false}$ for each $t$ and $k$ .

Based on Theorem 4, we introduce $(\varepsilon,\gamma)$ -fixability as follows. There, $\mathbb{P}[\varphi\mid\mathcal{F}^{\prime}]:=\mathbb{E}[{\bf 1}_{\llbracket\varphi\rrbracket}\mid\mathcal{F}^{\prime}]$ is the conditional probability of satisfying $\varphi$ given $\mathcal{F}^{\prime}$ .

{mydefinition}

[ $(\varepsilon,\gamma)$ -fixability] Let $\mathcal{I}=((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ be an instance for $T$ , and let $\gamma\in(0,1)$ . We call $\mathcal{I}$ a $\gamma$ -relaxed UN-LexRSM for $T$ if $\mathcal{I}$ satisfies the properties in Thm. 4, where $\varphi_{t,k}$ is as follows:

\displaystyle\varphi_{t,k}(\omega)\equiv\mathbb{P}[\mathbf{X}_{t+1}[k]=\bot\mid\mathcal{F}_{t}](\omega)\geq\gamma.

(3)

We say $\mathcal{I}$ is $(\varepsilon,\gamma)$ -fixable if its $\varepsilon$ -fixing $\tilde{\mathcal{I}}$ is a $\gamma$ -relaxed UN-LexRSM. The predicate $\varphi_{t,k}(\omega)$ in (3) is roughly read “the ranking by $(\mathbf{X}_{t})_{t=0}^{\infty}$ is done at time $t+1$ in dimension $k$ with probability $\gamma$ or higher, given the information about $\omega$ at $t$ ”. This predicate satisfies Condition (2); hence we have the following corollary, which is the key to the soundness of lazy LexRSM in §5. {mycorollary}[soundness of $(\varepsilon,\gamma)$ -fixable instances] Suppose there exists an instance $\mathcal{I}$ over $(\Omega,\mathcal{F},\mathbb{P})$ for a stopping time $T$ that is $(\varepsilon,\gamma)$ -fixable for any $\varepsilon>0$ and $\gamma\in(0,1)$ . Then $T$ is AST w.r.t. $\mathbb{P}$ . ∎

5 Lazy LexRSM and Its Soundness

Here we introduce another LexRSM variant, Lazy LexRSM (LLexRSM). We need this variant for our LexRSM synthesis algorithm; while $\varepsilon$ -fixable LexRSM theoretically answers our key question, it is not amenable to LP-based synthesis algorithms because its case distinction makes the resulting constraint nonlinear.

We define LLexRSM map as follows; see Contributions in §1 for its intuitive meaning with an example. The definition for an instance is in Appendix C.

{mydefinition}

[LLexRSM map] Fix a pCFG $\mathcal{C}$ with an invariant $I$ . Let $\boldsymbol{\eta}$ be an MM associated with a level map $\mathsf{Lv}$ . The MM $\boldsymbol{\eta}$ is called a Lazy LexRSM map (LLexRSM map) over $\mathcal{C}$ supported by $I$ if it is an SC-LexRSM map over $\mathcal{C}$ supported by $I$ , and satisfies stability at negativity defined as follows:

	$\displaystyle\forall\tau\neq\tau_{\mathrm{out}}.\forall s\in\llbracket I\land G(\tau)\rrbracket.\forall k\in\{1,\ldots,\mathsf{Lv}(\tau)-1\}.$
	$\displaystyle\quad\quad\quad\quad\boldsymbol{\eta}[k](s)<0\Rightarrow\forall s^{\prime}\in\mathrm{succ}_{\tau}(s).\biggl{[}\boldsymbol{\eta}[k](s^{\prime})<0\lor k>\max_{\tau^{\prime}:s^{\prime}\in\llbracket G(\tau^{\prime})\rrbracket}\mathsf{Lv}(\tau^{\prime})\biggr{]}.$

We first observe LLexRSM also answers our key question.

{mytheorem}

[LLexRSM instantiates SC-LexRF] Suppose $\boldsymbol{\eta}$ is an SC-LexRSM over a non-probabilistic CFG $\mathcal{C}$ supported by an invariant $I$ , with a level map $\mathsf{Lv}$ . Then $\boldsymbol{\eta}$ is stable at negativity under $I$ and $\mathsf{Lv}$ , and hence, $\boldsymbol{\eta}$ is an LLexRSM map over $\mathcal{C}$ supported by $I$ , with $\mathsf{Lv}$ . ∎

Below we give the soundness result of LLexRSM map. We first give the necessary assumptions on pCFGs and MMs, namely linearity and well-behavedness. we say a pCFG is linear if the update element of each $\tau\in\Delta_{d}$ is a linear function (this corresponds to the restriction on PPs to the linear ones); and an MM $\boldsymbol{\eta}$ is linear if $\lambda\boldsymbol{x}.\boldsymbol{\eta}(\ell,\boldsymbol{x})$ is linear for each $\ell\in L$ . We say a pCFG is well-behaved if its variable samplings are done via well-behaved distributions, which roughly means that their tail probabilities vanish to zero toward infinity quickly enough. We give its formal definition in the appendix (Def. C.1.2), which is somewhat complex; an important fact from the application perspective is that the class of such distributions covers all distributions with bounded supports and some distributions with unbounded supports such as the normal distributions (Prop. C.1.2). Possibly negative (Lex)RSM typically requires some restriction on variable samplings of pCFG (e.g., the integrability in [ChatterjeeGNZZ21]) so that the pre-expectation is well-defined.

The crucial part of the soundness proof is the following theorem, where $(\varepsilon,\gamma)$ -fixability takes the key role. Its full proof is given in Appendix C.

{mytheorem}

Let $\boldsymbol{\eta}:\mathcal{S}\to\mathbb{R}^{n}$ be a linear LLexRSM map for a linear, well-behaved pCFG $\mathcal{C}$ . Then for any $\Delta$ -deterministic scheduler $\sigma$ and initial state $s_{I}$ of $\mathcal{C}$ , the induced instance is $(\varepsilon,\gamma)$ -fixable for some $\varepsilon>0$ and $\gamma\in(0,1)$ .

Proof (sketch). We can show that the $\varepsilon$ -fixing $\tilde{\mathcal{I}}=((\tilde{\mathbf{X}}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ of an induced instance ${\mathcal{I}}=(({\mathbf{X}}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ almost-surely satisfies the inequality (1) of the ranking condition for each $t$ , $\omega$ , and $k$ such that $\tilde{\mathbf{X}}_{t}[k](\omega)=-\varepsilon$ and $1\leq k\leq\mathsf{Lv}_{t}(\omega)$ [TakisakaZWL24arXiv, Prop. C.2]. Thus it suffices to show, for each $\omega$ , $k$ , and $t$ such that $\tilde{\mathbf{X}}_{t}[k](\omega)\geq 0$ and $1\leq k\leq\mathsf{Lv}_{t}(\omega)$ , either $\tilde{\mathcal{I}}$ satisfies the inequality (1) or (1) as a requirement on $\tilde{\mathcal{I}}$ is waived due to the $\gamma$ -relaxation.

Now take any such $t,\omega$ , and $k$ , and suppose the run $\omega$ reads the program line prog at time $t$ . Then we can show the desired property by a case distinction over prog as follows. Here, recall $\omega$ is a sequence $s_{0}s_{1}\ldots s_{t}s_{t+1}\ldots$ of program states; we defined ${\mathbf{X}}_{t}$ by ${\mathbf{X}}_{t}[k](\omega)=\boldsymbol{\eta}[k](s_{t})$ ; and $\mathbb{E}[{\mathbf{X}}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)$ is the expectation of $\boldsymbol{\eta}[k](s^{\prime})$ , where $s^{\prime}$ is the successor state of $s_{0}\ldots s_{t}$ under $\sigma$ (which is not necessarily $s_{t+1}$ ). Also observe the requirement (1) on $\tilde{\mathcal{I}}$ is waived for given $t,\omega$ , and $k$ when the value of $\boldsymbol{\eta}[k](s^{\prime})$ is negative with the probability $\gamma$ or higher.

1.

Suppose prog is a non-probabilistic program line, e.g., ‘ $x_{i}:=f(\boldsymbol{x})$ ’ or ‘while $\varphi$ do’. Then the successor state $s^{\prime}$ of $s_{t}$ is unique. If $\boldsymbol{\eta}[k](s^{\prime})$ is non-negative, then we have $\mathbb{E}[\tilde{\mathbf{X}}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)=\mathbb{E}[{\mathbf{X}}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)$ , so the inequality (1) is inherited from $\mathcal{I}$ to $\tilde{\mathcal{I}}$ ; if negative, then the requirement (1) on $\tilde{\mathcal{I}}$ is waived. The same argument applies to ‘if $\star$ then’ (recall ${\mathcal{I}}$ is induced from a $\Delta$ -deterministic scheduler).
2.

Suppose $\mbox{\it prog}\equiv\mbox{`{if} {prob$(p)$} {then}'}$ . By letting $\gamma$ strictly smaller than $p$ , we see either $\boldsymbol{\eta}[k](s^{\prime})$ is never negative, or it is negative with a probability more than $\gamma$ . Thus we have the desired property for a similar reason to Case 1 (we note this argument requires $p$ to be a constant).
3.

Suppose $\mbox{\it prog}\equiv`x_{i}:=\mbox{\bf sample}(d)$ ’. We can show the desired property by taking a sufficiently small $\gamma$ ; roughly speaking, the requirement (1) on $\tilde{\mathcal{I}}$ is waived unless the chance of $\boldsymbol{\eta}[k](s^{\prime})$ being negative is very small, in which case the room for “ill” exploitation is so small that the inequality (1) is inherited from $\mathcal{I}$ to $\tilde{\mathcal{I}}$ . Almost the same argument applies to ‘ $x_{i}:=\mbox{\bf ndet}(D)$ ’.

We note, by the finiteness of program locations $L$ and transitions $\Delta$ , we can take $\gamma\in(0,1)$ that satisfies all requirements above simultaneously. ∎

Now we have soundness of LLexRSM as the following theorem, which is almost an immediate consequence of Thm. 5 and Cor. 4.

{mytheorem}

[soundness of linear LLexRSM map over linear, well-behaved pCFG] Let $\mathcal{C}$ be a linear, well-behaved pCFG, and suppose there is a linear LLexRSM map over $\mathcal{C}$ (supported by any invariant). Then $\mathcal{C}$ is AST. ∎

6 Automated Synthesis Algorithm of LexRSM

In this section, we introduce a synthesis algorithm of LLexRSM for automated AST verification of linear PPs. It synthesizes a linear MM in a certain subclass of LLexRSMs. We first define the subclass, and then introduce our algorithm.

Our algorithm is a variant of linear template-based synthesis. There, we fix a linear MM $\boldsymbol{\eta}$ with unknown coefficients (i.e., the linear template), and consider an assertion “ $\boldsymbol{\eta}$ is a certificate of AST”; for example, in the standard 1-dimensional RSM synthesis, the assertion is “ $\eta$ is an RSM map”. We then reduce this assertion into a set of linear constraints via Farkas’ Lemma [schrijver1998theory]. These constraints constitute an LP problem with an appropriate objective function. A certificate is synthesized, if feasible, by solving this LP problem. The reduction is standard, so we omit the details; see e.g. [TakisakaOUH21].

Subclass of LLexRSM for automated synthesis. While LLexRSM resolves the major issue that fixable LexRSM confronts toward its automated synthesis, we still need to tweak the notion a bit more, as the stability at negativity condition involves the value of an MM $\boldsymbol{\eta}$ in its antecedent part (i.e., it says “whenever $\boldsymbol{\eta}[k]$ is negative for some $k$ …”); this makes the reduced constraints via Farkas’ Lemma nonlinear. Therefore, we augment the condition as follows.

{mydefinition}

[MCLC] Let $\boldsymbol{\eta}:\mathcal{S}\to\mathbb{R}^{n}$ be an MM supported by an invariant $I$ , with a level map $\mathsf{Lv}$ . We say $\boldsymbol{\eta}$ satisfies the multiple-choice leftward condition (MCLC) if, for each $k\in\{1,\ldots,n\}$ , it satisfies either (4) or (5) below:

	$\displaystyle\forall\tau\in\llbracket k<\mathsf{Lv}\rrbracket.\forall s\in\llbracket I\land G(\tau)\rrbracket.$	$\displaystyle\quad\boldsymbol{\eta}[k](s)\geq 0,$		(4)
	$\displaystyle\forall\tau\in\llbracket k<\mathsf{Lv}\rrbracket.\forall s\in\llbracket I\land G(\tau)\rrbracket.\forall s^{\prime}\in\mathrm{succ}_{\tau}(s).$	$\displaystyle\quad\boldsymbol{\eta}[k](s^{\prime})\leq\boldsymbol{\eta}[k](s).$		(5)

Condition (4) is nothing but the non-negativity condition in dimension $k$ . Condition (5) augments the ranking condition in the strict leftward of the ranking dimension (a.k.a. the unaffecting condition) so that the value of $\boldsymbol{\eta}[k]$ is non-increasing in the worst-case. MCLC implies stability at negativity; hence, by Thm. 5, linear SC-LexRSM maps with MCLC certify AST of linear, well-behaved pCFGs. They also instantiate SC-LexRFs as follows. {mytheorem}[SC-LexRSM maps with MCLC instantiate SC-LexRFs] Suppose $\boldsymbol{\eta}$ is an SC-LexRSM map over a non-probabilistic CFG $\mathcal{C}$ supported by $I$ , with $\mathsf{Lv}$ . Then $\boldsymbol{\eta}$ satisfies MCLC under $I$ and $\mathsf{Lv}$ . ∎

The algorithm. Our LexRSM synthesis algorithm mostly resembles the existing ones [ChatterjeeGNZZ21, AgrawalCP18], so we are brief here; a line-to-line explanation with a pseudocode is in Appendix D. The algorithm receives a pCFG $\mathcal{C}$ and an invariant $I$ , and attempts to construct a SC-LexRSM with MCLC over $\mathcal{C}$ supported by $I$ . The construction is iterative; at the $k$ -th iteration, the algorithm attempts to construct a one-dimensional MM $\eta_{k}$ that ranks transitions of $\mathcal{C}$ that are not ranked by the current construction $\boldsymbol{\eta}=(\eta_{1},\ldots,\eta_{k-1})$ , while respecting MCLC. If the algorithm finds $\eta_{k}$ that ranks at least one new transition, then it appends $\eta_{k}$ to $\boldsymbol{\eta}$ and goes to the next iteration; otherwise, it reports a failure. Once $\boldsymbol{\eta}$ ranks all transitions, the algorithm reports a success, returning $\boldsymbol{\eta}$ as an AST certificate of $\mathcal{C}$ .

Our algorithm attempts to construct $\eta_{k}$ in two ways, by adopting either (4) or (5) as the leftward condition at the dimension $k$ . The attempt with the condition (4) is done in the same manner as existing algorithms [ChatterjeeGNZZ21, AgrawalCP18]; we require $\eta_{k}$ to rank the unranked transitions as many as possible. The attempt with the condition (5) is slightly nontrivial; the algorithm demands a user-defined parameter $\mbox{Class}(U)\subseteq 2^{U}$ for each $U\subseteq\Delta\setminus\{\tau_{\mathrm{out}}\}$ . The parameter $\mbox{Class}(U)$ specifies which set of transitions the algorithm should try to rank, given the set of current unranked transitions $U$ ; that is, for each $\mathcal{T}\in\mbox{Class}(U)$ , the algorithm attempts to find $\eta_{k}$ that exactly ranks transitions in $\mathcal{T}$ .

There are two canonical choices of $\mbox{Class}(U)$ . One is $2^{U}\setminus\{\emptyset\}$ , the brute-force trial; the resulting algorithm does not terminate in polynomial time, but ranks the maximal number of transitions (by trying each $\mathcal{T}$ in the descending order w.r.t. $|\mathcal{T}|$ ). This property makes the algorithm complete. Another choice is the singletons of $U$ , i.e., $\{\{\tau\}\mid\tau\in U\}$ ; while the resulting algorithm terminates in polynomial time, it lacks the maximality property. It is our future work to verify if there is a polynomial complete instance of our proposed algorithm. Still, any instance of it is complete over yet another class of LLexRSMs, namely linear LW-LexRSMs. The formal statement (Thm. D) with proof is in Appendix D.

7 Experiments

We performed experiments to evaluate the performance of our proposed algorithm. The implementation is publicly available⁶⁶6https://doi.org/10.5281/zenodo.10937558.

Our evaluation criteria are twofold: one is how the relaxed non-negativity condition of our LexRSM—SC non-negativity and MCLC—improves the applicability of the algorithm, compared to other existing non-negativity conditions. To this end, we consider two baseline algorithms.

(a)

The algorithm STR: This is the one proposed in [AgrawalCP18], which synthesizes an ST-LexRSM. We use the implementation provided by the authors [artifactgit].
(b)

The algorithm LWN: This synthesizes an LW-LexRSM. LWN is realized as an instance of our algorithm with $\mbox{\rm Class}(U)=\emptyset$ . We use LWN as a proxy of the synthesis algorithm of GLexRSM [ChatterjeeGNZZ21arxiv, Alg. 2], whose implementation does not seem to exist. We note [ChatterjeeGNZZ21arxiv, Alg. 2] synthesizes an LW-LexRSM with some additional conditions; therefore, it is no less restrictive than LWN.

Another criterion is how the choice of $\mbox{\rm Class}(U)$ affects the performance of our algorithm. To this end, we consider two instances of it: (a) Singleton Multiple Choice(SMC), given by $\mbox{\rm Class}(U)=\{\{\tau\}\mid\tau\in U\}$ ; and (b) Exhaustive Multiple Choice(EMC), given by $\mbox{\rm Class}(U)=2^{U}\setminus\emptyset$ . SMC runs in PTIME, but we do not know if it is complete; EMC does not run in PTIME, but is complete.

We use benchmarks from [AgrawalCP18], which consist of non-probabilistic programs collected in [alias2010multi] and their probabilistic modifications. The modification is done in two different ways: (a) while loops “ $\textbf{while }\varphi\textbf{ do }P\textbf{ od}$ ” are replaced with probabilistic ones “while $\varphi$ do (if prob $(0.5)$ then $P$ else skip fi) od”; (b) in addition to (a), variable assignments “ $x:=f(\boldsymbol{x})+a$ ” are replaced with “ $x:=f(\boldsymbol{x})+\mathit{Unif}[a-1,a+1]$ ”. We include non-probabilistic programs in our benchmark set because the “problematic program structure” that hinders automated LexRSM synthesis already exists in non-probabilistic programs (cf. our explanation to Fig. 2). We also tried two PPs from [ChatterjeeGNZZ21, Fig. 1], which we call counterexStr1 and counterexStr2.

We implemented our algorithm upon [AgrawalCP18], which is available at [artifactgit]. Similar to [AgrawalCP18], our implementation works as follows: (1) it receives a linear PP as an input, and translates it into a pCFG $\mathcal{C}$ ; (2) it generates an invariant for $\mathcal{C}$ ; (3) via our algorithm, it synthesizes an SC-LexRSM map with MCLC. Invariants are generated by ASPIC [feautrier2010accelerated], and all LP problems are solved by CPLEX [CPLEX].

Benchmark spec.			Synthesis result				Benchmark spec.			Synthesis result
Benchmark spec.			Baselines		Our algs.		Benchmark spec.			Baselines		Our algs.
Model	p.l.	p.a.	STR	LWN	SMC	EMC	Model	p.l.	p.a.	STR	LWN	SMC	EMC
complex	-	-	$\times$	$\times$	7	5	serpent	-	-	$\times$	$\times$	3	3
complex	$\surd$	-	$\times$	$\times$	7	5	speedDis1	-	-	$\times$	$\times$	4	4
complex	$\surd$	$\surd$	$\times$	$\times$	3	3	speedDis2	-	-	$\times$	$\times$	4	4
cousot9	-	-	$\times$	3	3	3	spdSimMul	-	-	$\times$	$\times$	4	4
cousot9	$\surd$	-	$\times$	$\times$	4	4	spdSimMulDep	-	-	$\times$	$\times$	4	4
loops	-	-	$\times$	$\times$	4	3	spdSglSgl2	$\surd$	$\surd$	$\times$	$\times$	5	5
nestedLoop	$\surd$	$\surd$	$\times$	$\times$	4	3	speedpldi3	-	-	$\times$	3	3	3
realheapsort	-	-	$\times$	3	3	3	speedpldi3	$\surd$	-	$\times$	$\times$	4	4
RHS_step1	-	-	$\times$	3	3	3	counterexStr1	-	$\surd$	N/A	3	3	3
RHS_step1	$\surd$	$\surd$	$\times$	3	3	3	counterexStr2	-	$\surd$	$\times$	$\times$	4	4
realshellsort	$\surd$	$\surd$	$\times$	2	2	2

Table 1: The list of benchmarks in which a feasibility difference is observed between baselines and proposed algorithms. Ticks in “p.l.” and “p.a.” indicate the benchmark has a probabilistic loop and assignment, respectively. Numbers in the result indicate that the algorithm found a LexRSM with that dimension; the crosses indicate failures; “N/A” means we did not run the experiment.

Results. In 135 benchmarks from 55 models, STR succeeds in 98 cases, LWN succeeds in 105 cases while SMC and EMC succeed in 119 cases (we did not run STR for counterexStr1 because it involves a sampling from an unbounded support distribution, which is not supported by STR). Table 1 summarizes the cases where we observe differences in the feasibility of algorithms. As theoretically anticipated, LWN always succeeds in finding a LexRSM whenever STR does; the same relation is observed between SMC vs. LWN and EMC vs. SMC. In most cases, STR, LWN, and SMC return an output within a second⁷⁷7 There was a single example for which more time was spent, due to a larger size. , while EMC suffers from an exponential blowup when it attempts to rank transitions with Condition (5) in Def. 6. The full results are in Appendix E.

On the first evaluation criterion, the advantage of the relaxed non-negativity is evident: SMC/EMC have unique successes vs. STR on 21 programs (21/135 = 15.6% higher success rate) from 16 different models; SMC/EMC also have unique successes vs. LWN in 14 programs (14/135 = 10.4% higher success rate) from 12 models. This result shows that the program structure we observed in Fig. 2 appears in various programs in the real world.

On the second criterion, EMC does not have any unique success compared to SMC. This result suggests that SMC can be the first choice as a concrete instance of our proposed algorithm. Indeed, we suspect that SMC is actually complete—verifying its (in)completeness is a future work. For some programs, EMC found a LexRSM with a smaller dimension than SMC.

Interestingly, LWN fails to find a LexRSM for counterexStr2, despite it being given in [ChatterjeeGNZZ21] as a PP for which a GLexRSM (and hence, an LW non-negative LexRSM) exists. This happens because the implementation in [artifactgit] translates the PP into a pCFG with a different shape than the one in [ChatterjeeGNZZ21] (for the latter, a GLexRSM indeed exists); the former possesses a similar structure as in Fig. 2 because different locations are assigned for the while loop and if branch. This demonstrates the advantage of our algorithm from another point of view, i.e., robustness against different translations of PPs.

8 Related Work

There is a rich body of studies in 1-dimensional RSM [chakarov2013probabilistic, chatterjee2016termination, chatterjee2016algorithmic, chatterjee2017stochastic, ferrer2015probabilistic, mciver2016new, mciver2017new, huang2018new, fu2019termination, moosbrugger2021automated, giesl2019computing], while lexicographic RSM is relatively new [AgrawalCP18, ChatterjeeGNZZ21]. Our paper generalizes the latest work [ChatterjeeGNZZ21] on LexRSM as follows: (a) Soundness of LexRSM as a stochastic process:soundness of $\varepsilon$ -fixable LexRSMs (Def. 4) generalizes [ChatterjeeGNZZ21, Thm. 1] in the sense that every GLexRSM is $\varepsilon$ -fixable for any $\varepsilon>0$ (Thm. 4); (b) Soundness of LexRSM as a function on program states:our result (Thm. 5) generalizes [ChatterjeeGNZZ21, Thm. 2] under the linearity and well-behavedness assumptions; (c) Soundness and completeness of LexRSM synthesis algorithms:our result generalizes the results for one of two algorithms in [ChatterjeeGNZZ21] that assumes boundedness assumption on assignment distribution [ChatterjeeGNZZ21, Thm. 3].

The work [Huang0CG19] also considers a relaxed non-negativity of RSMs. Their descent supermartingale, which acts on while loops, requires well-foundedness only at every entry into the loop body. A major difference from our LexRSM is that they only consider 1-dimensional RSMs; therefore, the problem of relaxing the LW non-negativity does not appear in their setting. Compared with their RSM, our LexRSM has an advantage in verifying PPs with a structure shown in Fig. 2, where the value of our LexRSM can be arbitrarily small upon the loop entrance (at some dimension; see $\eta_{2}$ at $\ell_{1}$ in Fig. 2).

The work [mciver2017new] extends the applicability of standard RSM on a different aspect from LexRSM. The main feature of their RSM is that it can verify AST of the symmetric random walk. While our LexRSM cannot verify AST of this process, the RSM by [mciver2017new] is a 1-dimensional one, which typically struggles on PPs with nested structures. Such a difference can be observed from the experiment result in [MoosbruggerBKK21] (compare [MoosbruggerBKK21, Table 2] and nested_loops, sequential_loops in [MoosbruggerBKK21, Table 1]).

9 Conclusion

We proposed the first variants of LexRSM that instantiate SC-LexRF. An algorithm was proposed to synthesize such a LexRSM, and experiments have shown that the relaxation of non-negativity contributes applicability of the resulting LexRSM. We have two open problems: one is if the class of well-behaved distributions matches with the one of integrable ones; and another is if the SMC variant of our algorithm (see §7) is complete.

Acknowledgment

We thank anonymous reviewers for their constructive comments on the previous versions of the paper. The term “ill exploitation” is taken from one of the reviews that we found very helpful. We also thank Shin-ya Katsumata, Takeshi Tsukada, and Hiroshi Unno for their comments on the paper.

This work is partially supported by National Natural Science Foundation of China No. 62172077 and 62350710215.

References

[1] Ultimate automizer, https://www.ultimate-pa.org/?ui=tool&tool=automizer
[2] Abramowitz, M., Stegun., I.A.: Handbook of Mathematical Functions: with Formulas, Graphs, and Mathematical Tables. Dover Publications (2012)
[3] Agrawal, S., Chatterjee, K., Novotný, P.: Lexicographic ranking supermartingales: an efficient approach to termination of probabilistic programs. Proc. ACM Program. Lang. 2(POPL), 34:1–34:32 (2018), https://doi.org/10.1145/3158122
[4] Agrawal, S., Chatterjee, K., Novotný, P.: Lexicographic ranking supermartingales: an efficient approach to termination of probabilistic programs: Implementation (2018), https://github.com/Sheshansh/prob_termination
[5] Alias, C., Darte, A., Feautrier, P., Gonnord, L.: Multi-dimensional rankings, program termination, and complexity bounds of flowchart programs. In: Static Analysis: 17th International Symposium, SAS 2010, Perpignan, France, September 14-16, 2010. Proceedings 17. pp. 117–133. Springer (2010)
[6] Ash, R., Doléans-Dade, C.: Probability and Measure Theory. Harcourt/Academic Press (2000)
[7] Barthe, G., Gaboardi, M., Grégoire, B., Hsu, J., Strub, P.Y.: Proving differential privacy via probabilistic couplings. In: Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science. pp. 749–758 (2016)
[8] Barthe, G., Gaboardi, M., Hsu, J., Pierce, B.: Programming language techniques for differential privacy. ACM SIGLOG News 3(1), 34–53 (2016)
[9] Ben-Amram, A.M., Genaim, S.: Complexity of bradley-manna-sipma lexicographic ranking functions. In: Kroening, D., Pasareanu, C.S. (eds.) Computer Aided Verification - 27th International Conference, CAV 2015, San Francisco, CA, USA, July 18-24, 2015, Proceedings, Part II. Lecture Notes in Computer Science, vol. 9207, pp. 304–321. Springer (2015), https://doi.org/10.1007/978-3-319-21668-3_18
[10] Bertsekas, D.P., Shreve, S.E.: Stochastic Optimal Control: The Discrete-Time Case. Athena Scientific (2007)
[11] Bradley, A.R., Manna, Z., Sipma, H.B.: Linear ranking with reachability. In: Etessami, K., Rajamani, S.K. (eds.) Computer Aided Verification, 17th International Conference, CAV 2005, Edinburgh, Scotland, UK, July 6-10, 2005, Proceedings. Lecture Notes in Computer Science, vol. 3576, pp. 491–504. Springer (2005), https://doi.org/10.1007/11513988_48
[12] Canal, G., Cashmore, M., Krivić, S., Alenyà, G., Magazzeni, D., Torras, C.: Probabilistic planning for robotics with rosplan. In: Towards Autonomous Robotic Systems: 20th Annual Conference, TAROS 2019, London, UK, July 3–5, 2019, Proceedings, Part I 20. pp. 236–250. Springer (2019)
[13] Chakarov, A., Sankaranarayanan, S.: Probabilistic program analysis with martingales. In: Computer Aided Verification: 25th International Conference, CAV 2013, Saint Petersburg, Russia, July 13-19, 2013. Proceedings 25. pp. 511–526. Springer (2013)
[14] Chatterjee, K., Fu, H., Goharshady, A.K.: Termination analysis of probabilistic programs through positivstellensatz’s. In: Computer Aided Verification: 28th International Conference, CAV 2016, Toronto, ON, Canada, July 17-23, 2016, Proceedings, Part I 28. pp. 3–22. Springer (2016)
[15] Chatterjee, K., Fu, H., Novotnỳ, P., Hasheminezhad, R.: Algorithmic analysis of qualitative and quantitative termination problems for affine probabilistic programs. In: Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages. pp. 327–342 (2016)
[16] Chatterjee, K., Goharshady, E.K., Novotný, P., Zárevúcky, J., Zikelic, D.: On lexicographic proof rules for probabilistic termination. In: Huisman, M., Pasareanu, C.S., Zhan, N. (eds.) Formal Methods - 24th International Symposium, FM 2021, Virtual Event, November 20-26, 2021, Proceedings. Lecture Notes in Computer Science, vol. 13047, pp. 619–639. Springer (2021), https://doi.org/10.1007/978-3-030-90870-6_33
[17] Chatterjee, K., Goharshady, E.K., Novotný, P., Zárevúcky, J., Zikelic, D.: On lexicographic proof rules for probabilistic termination. CoRR abs/2108.02188 (2021), https://arxiv.org/abs/2108.02188
[18] Chatterjee, K., Novotnỳ, P., Zikelic, D.: Stochastic invariants for probabilistic termination. In: Proceedings of the 44th ACM SIGPLAN Symposium on Principles of Programming Languages. pp. 145–160 (2017)
[19] Dubhashi, D.P., Panconesi, A.: Concentration of measure for the analysis of randomized algorithms. Cambridge University Press (2009)
[20] Feautrier, P., Gonnord, L.: Accelerated invariant generation for c programs with aspic and c2fsm. Electronic Notes in Theoretical Computer Science 267(2), 3–13 (2010)
[21] Ferrer Fioriti, L.M., Hermanns, H.: Probabilistic termination: Soundness, completeness, and compositionality. In: Proceedings of the 42nd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages. pp. 489–501 (2015)
[22] Fu, H., Chatterjee, K.: Termination of nondeterministic probabilistic programs. In: Verification, Model Checking, and Abstract Interpretation: 20th International Conference, VMCAI 2019, Cascais, Portugal, January 13–15, 2019, Proceedings 20. pp. 468–490. Springer (2019)
[23] Giesl, J., Giesl, P., Hark, M.: Computing expected runtimes for constant probability programs. In: Automated Deduction–CADE 27: 27th International Conference on Automated Deduction, Natal, Brazil, August 27–30, 2019, Proceedings 27. pp. 269–286. Springer (2019)
[24] Huang, M., Fu, H., Chatterjee, K.: New approaches for almost-sure termination of probabilistic programs. In: Programming Languages and Systems: 16th Asian Symposium, APLAS 2018, Wellington, New Zealand, December 2–6, 2018, Proceedings 16. pp. 181–201. Springer (2018)
[25] Huang, M., Fu, H., Chatterjee, K., Goharshady, A.K.: Modular verification for almost-sure termination of probabilistic programs. Proc. ACM Program. Lang. 3(OOPSLA), 129:1–129:29 (2019), https://doi.org/10.1145/3360555
[26] IBM: Ibm ilog cplex 12.7 user’s manual (ibm ilog cplex division, incline village, nv) (2017)
[27] Karp, R.M.: An introduction to randomized algorithms. Discrete Applied Mathematics 34(1-3), 165–201 (1991)
[28] Lobo-Vesga, E., Russo, A., Gaboardi, M.: A programming language for data privacy with accuracy estimations. ACM Transactions on Programming Languages and Systems (TOPLAS) 43(2), 1–42 (2021)
[29] McIver, A., Morgan, C.: A new rule for almost-certain termination of probabilistic-and demonic programs. arXiv preprint arXiv:1612.01091 (2016)
[30] McIver, A., Morgan, C., Kaminski, B.L., Katoen, J.P.: A new proof rule for almost-sure termination. Proceedings of the ACM on Programming Languages 2(POPL), 1–28 (2017)
[31] Moosbrugger, M., Bartocci, E., Katoen, J.P., Kovács, L.: Automated termination analysis of polynomial probabilistic programs. In: Programming Languages and Systems: 30th European Symposium on Programming, ESOP 2021, Held as Part of the European Joint Conferences on Theory and Practice of Software, ETAPS 2021, Luxembourg City, Luxembourg, March 27–April 1, 2021, Proceedings 30. pp. 491–518. Springer International Publishing (2021)
[32] Moosbrugger, M., Bartocci, E., Katoen, J., Kovács, L.: The probabilistic termination tool amber. In: Huisman, M., Pasareanu, C.S., Zhan, N. (eds.) Formal Methods - 24th International Symposium, FM 2021, Virtual Event, November 20-26, 2021, Proceedings. Lecture Notes in Computer Science, vol. 13047, pp. 667–675. Springer (2021), https://doi.org/10.1007/978-3-030-90870-6_36
[33] Olmedo, F., Gretz, F., Jansen, N., Kaminski, B.L., Katoen, J., McIver, A.: Conditioning in probabilistic programming. ACM Trans. Program. Lang. Syst. 40(1), 4:1–4:50 (2018), https://doi.org/10.1145/3156018
[34] Parker, D.: Verification of probabilistic real-time systems. Proc. 2013 Real-time Systems Summer School (ETR’13) (2013)
[35] Schrijver, A.: Theory of linear and integer programming. John Wiley & Sons (1998)
[36] Takisaka, T., Oyabu, Y., Urabe, N., Hasuo, I.: Ranking and repulsing supermartingales for reachability in randomized programs. ACM Trans. Program. Lang. Syst. 43(2), 5:1–5:46 (2021), https://doi.org/10.1145/3450967
[37] Takisaka, T., Zhang, L., Wang, C., Liu, J.: Lexicographic ranking supermartingales with lazy lower bounds. CoRR abs/2304.11363 (2024), https://doi.org/10.48550/arXiv.2304.11363

Appendix

\begin{array}[]{rrl}\langle\mathit{prog}\rangle&::=&\mbox{`{skip}'}\mid\langle\mathit{pvar}\rangle\,\mbox{`$:=$'}\,\langle\mathit{assgn}\rangle\mid\langle\mathit{prog}\rangle\,\text{`;'}\langle\mathit{prog}\rangle\\ &&\mid\mbox{`{if}'}\,\langle\mathit{bexpr}\rangle\,\mbox{`{then}'}\,\langle\mathit{prog}\rangle\,\mbox{`{else}'}\,\langle\mathit{prog}\rangle\,\mbox{`{fi}'}\\ &&\mid\mbox{`{if}'}\,\mbox{`$\star$'}\,\mbox{`{then}'}\,\langle\mathit{prog}\rangle\,\mbox{`{else}'}\,\langle\mathit{prog}\rangle\,\mbox{`{fi}'}\\ &&\mid\mbox{`{if}'}\,\mbox{`{prob$(p)$}'}\,\mbox{`{then}'}\,\langle\mathit{prog}\rangle\,\mbox{`{else}'}\,\langle\mathit{prog}\rangle\,\mbox{`{fi}'}\\ &&\mid\mbox{`{while}'}\,\langle\mathit{bexpr}\rangle\,\text{`{do}'}\,\langle\mathit{prog}\rangle\,\text{`{od}'}\\ \vspace{\baselineskip}\hfil\\ \langle\mathit{assgn}\rangle&::=&\langle\mathit{expr}\rangle\mid\mbox{`{sample$(d)$}'}\mid\mbox{`{ndet$(D)$}'}\\ \langle\mathit{literal}\rangle&::=&\langle\mathit{expr}\rangle\,\mbox{`$\leq$'}\,\langle\mathit{expr}\rangle\mid\langle\mathit{expr}\rangle\,\mbox{`$\geq$'}\,\langle\mathit{expr}\rangle\\ \langle\mathit{bexpr}\rangle&::=&\langle\mathit{literal}\rangle\mid\neg\langle\mathit{bexpr}\rangle\mid\langle\mathit{bexpr}\rangle\,\mbox{`{or}'}\,\langle\mathit{bexpr}\rangle\\ &&\mid\langle\mathit{bexpr}\rangle\,\mbox{`{and}'}\,\langle\mathit{bexpr}\rangle\end{array}

Figure 5: The syntax of probabilistic programs

Appendix A Omitted Details of Section 3

Notations. The set of finite, nonempty finite, and infinite sequences of elements in a set $\mathcal{X}$ are denoted by $\mathcal{X}^{*}$ , $\mathcal{X}^{+}$ , and $\mathcal{X}^{\mathbb{N}}$ , respectively. For a measurable space $(\Omega,\mathcal{F})$ , the Dirac measure $\delta_{\omega}$ at $\omega\in\Omega$ is the distribution over $(\Omega,\mathcal{F})$ such that $\delta_{\omega}(A)=1$ if $\omega\in A$ , and $\delta_{\omega}(A)=0$ otherwise, for each $A\in\mathcal{F}$ . The support $\mathrm{supp}(\mu)$ of $\mu\in\mathcal{D}(\Omega)$ is the set of samples $\omega\in\Omega$ such that $\mu(A)>0$ for each open set $A$ that contains $\omega$ . For a random variable $X$ over a probability space $(\Omega,\mathcal{F},\mathbb{P})$ , the expectation $\mathbb{E}_{\mathbb{P}}[X]$ of $X$ is the value $\int_{\Omega}Xd\mathbb{P}$ , provided it exists (which is possibly infinite). We write $(x_{i},\boldsymbol{x}_{-i})$ to denote $\boldsymbol{x}=(x_{1},\ldots,x_{n})$ with an emphasis on a particular variable $x_{i}$ .

For a set $\mathcal{X}$ , a function $p\colon\mathcal{X}\to\mathcal{D}(\Omega)$ is called a stochastic kernel on $\Omega$ given $\mathcal{X}$ ; it is used to describe the local random behavior of transition systems (e.g. $p(x)$ may represent the successor distribution given the transition history $x\in\mathcal{X}$ ). We write $p(A\mid x)$ to denote the value $p(x)(A)$ . We say $p$ is deterministic if $p(x)$ is Dirac for $x\in\mathcal{X}$ . In this case, we canonically identify the range of $p$ with $\Omega$ .

For given probability space $(\Omega,\mathcal{F},\mathbb{P})$ , random variable $X:\Omega\to\mathbb{R}$ and sub- $\sigma$ -algebra $\mathcal{F}^{\prime}\subseteq\mathcal{F}$ , the conditional expectation of $X$ given $\mathcal{F}^{\prime}$ is an $\mathcal{F}^{\prime}$ -measurable function $\mathbb{E}[X\mid\mathcal{F}^{\prime}]:\Omega\to\mathbb{R}\cup\{+\infty,-\infty\}$ such that $\int_{A}\mathbb{E}[X\mid\mathcal{F}^{\prime}]d\mathbb{P}=\int_{A}Xd\mathbb{P}$ for each $A\in\mathcal{F}^{\prime}$ . The conditional probability $\mathbb{P}[\varphi\mid\mathcal{F}^{\prime}]$ of a predicate $\varphi$ is defined as $\mathbb{P}[\varphi\mid\mathcal{F}^{\prime}]=\mathbb{E}[{\bf 1}_{\llbracket\varphi\rrbracket}\mid\mathcal{F}^{\prime}]$ . It is known that $\mathbb{E}[X\mid\mathcal{F}^{\prime}]$ exists whenever $\mathbb{E}_{\mathbb{P}}[X]$ does, and is $\mathbb{P}$ -a.s. unique [Ash:book]⁸⁸8 In the book, $X$ is either non-negative or integrable; the fact extends to the general case by evaluating the positive and negative parts of $X$ . Also, if $\mathbb{E}[X\mid\mathcal{F}^{\prime}]$ exists, then $\mathbb{E}[{\bf 1}_{A}\cdot X\mid\mathcal{F}^{\prime}]$ also exists for any $A\in\mathcal{F}$ . .

The formula $\overset{\infty}{\forall}t.\varphi_{t}$ stands for $\exists k.\forall t.t\geq k\Rightarrow\varphi_{t}$ ; the one $\overset{\infty}{\exists}t.\varphi_{t}$ is defined as the dual.

The syntax of probabilistic programs. See Fig. 5⁹⁹9 In our syntax, the command “ $x:=x+\mathit{Unif}[1,2]$ ” in Fig. 2 is understood as a syntax sugar for “ $z:=\textbf{sample$(\mathit{Unif}[1,2])$};x:=x+z$ ”, where $z$ is an auxiliary variable that stores the sampling result. . There, $\langle\mathit{pvar}\rangle$ ranges over program variables $\mathcal{V}$ , a fixed countable set; and $\langle\mathit{expr}\rangle$ ranges over arithmetic expressions over $\mathcal{V}$ , constructed from program variables, real-valued constants and arithmetic operations such as addition and multiplication.

A translation from PPs to pCFGs. See Fig. 6.

(a) skip

(b)

x_{i}:=\langle\mathit{assgn}\rangle

(d) if

\varphi

then A else B

(e) if

prob(p)

then A else B

(f) if * then A else B

(g) while

\varphi

do A od

Figure 6: A translation of PPs into pCFGs. Circles

\ell_{i}

and

\ell_{o}

represent the initial and final locations of the translated pCFG, respectively. The description “

\tau:\varphi,p,(i,u)

” on an arrow from

\ell

\ell^{\prime}

shows that the pCFG has a transition

\tau=(\ell,\delta)

such that

G(\tau)\equiv\varphi

\delta(\ell^{\prime})=p

and

\mathit{Up}(\tau)=(i,u)

, where

u

corresponds to the content of

\langle\mathit{assgn}\rangle

in the case (b); and

\mathit{id}\equiv(1,\lambda\boldsymbol{x}.x_{1})

represents “no update”. Circles

\ell_{i}^{A}

and

\ell_{o}^{A}

represent the initial and final locations of the pCFG for program fragment

A

, whose transitions are abstractly shown by a dotted arrow. the pCFG for

B

is described similarly.

Deadlock-freeness. We assume pCFGs are deadlock-free, i.e., for any state $s\in\mathcal{S}$ there exists $\tau\in\Delta$ enabled at $s$ .

Successors. A state $(\ell^{\prime},\boldsymbol{y})$ is a successor of $(\ell,\boldsymbol{x})$ via $\tau\in\Delta$ if and only if $\tau=(\ell,\delta)$ satisfies $\boldsymbol{x}\in\llbracket G(\tau)\rrbracket$ , $\delta(\ell^{\prime})>0$ , $\mathit{Up}(\tau)=(i,u)$ , and $\boldsymbol{y}[i]$ is: (a) equal to $u(\boldsymbol{x})$ if $\tau\in\Delta_{d}$ ; (b) in $\mathrm{supp}(u)$ if $\tau\in\Delta_{p}$ ; and (c) in $u$ if $\tau\in\Delta_{n}$ .

Finite paths and runs. A finite path of $\mathcal{C}$ is a nonempty finite sequence $s_{0}s_{1}\ldots s_{t}\in\mathcal{S}^{+}$ of states such that $s_{0}=(\ell_{\mathrm{in}},\boldsymbol{x})$ for some $\boldsymbol{x}\in\mathbb{R}^{|V|}$ and $s_{t^{\prime}+1}\in\mathrm{succ}_{\tau}(s_{t^{\prime}})$ with $\tau$ enabled at $s_{t^{\prime}}$ , for each $t^{\prime}\in\{0,\ldots,t-1\}$ . A run of $\mathcal{C}$ is an infinite sequence $\omega\in\mathcal{S}^{\mathbb{N}}$ of states any of whose prefix is a finite path. The set of all finite paths, finite paths with length $t$ , and runs of $\mathcal{C}$ are denoted by $\Pi_{\mathcal{C}}^{f}$ , $\Pi_{\mathcal{C}}^{t}$ , and $\Pi_{\mathcal{C}}$ , respectively.

Schedulers. Schedulers resolve nondeterminism. Recall there are two types of nondeterminism in pCFGs: (a) nondeterministic choice of $\tau\in\Delta$ at a given state (corresponds to ‘if $\star$ ’), and (b) nondeterministic variable update in a nondeterministic transition $\tau\in\Delta_{n}$ (corresponds to ‘ $x_{i}:=$ ndet $(D)$ ’). Therefore, we define a scheduler as a pair of functions $\sigma=(\sigma_{\Delta},\sigma_{V})$ , where $\sigma_{\Delta}$ and $\sigma_{V}$ handle the cases (a) and (b), respectively. Schedulers can make a choice in a probabilistic way, and can be history dependent; that is, $\sigma_{\Delta}$ and $\sigma_{V}$ are (partial) stochastic kernels of the form $\sigma_{\Delta}:\Pi_{\mathcal{C}}^{f}\to\mathcal{D}(\Delta)$ and $\sigma_{V}:\Pi_{\mathcal{C}}^{f}\times\Delta_{n}\times L\to\mathcal{D}(\mathbb{R})$ , respectively.

Formally, a scheduler is a pair of functions $\sigma=(\sigma_{\Delta},\sigma_{V})$ , where, for each $s\in\mathcal{S}$ and $w\in\mathcal{S}^{*}$ such that $ws\in\Pi_{\mathcal{C}}^{f}$ ,

•

$\sigma_{\Delta}:\Pi_{\mathcal{C}}^{f}\to\mathcal{D}(\Delta)$ satisfies $\sigma_{\Delta}(\tau\mid ws)>0$ only if $\tau$ is enabled at $s$ ; and
•

$\sigma_{V}:\Pi_{\mathcal{C}}^{f}\times\Delta_{n}\times L\to\mathcal{D}(\mathbb{R})$ is a partial function defined on $\{(ws,\tau,\ell^{\prime})\mid s\in\llbracket G(\tau)\rrbracket,\tau=(\ell,\delta),\delta(\ell^{\prime})>0\}$ such that $\mathrm{supp}(\sigma_{V}(ws,\tau,\ell^{\prime}))\subseteq u$ , where $\mathit{Up}(\tau)=(i,u)$ .

We say $\sigma=(\sigma_{\Delta},\sigma_{V})$ is $\Delta$ -deterministic if $\sigma_{\Delta}$ is deterministic, i.e., $\sigma_{\Delta}(w)$ is Dirac for any $w\in\Pi_{\mathcal{C}}^{f}$ .

The dynamics of $\mathcal{C}$ . By fixing a scheduler $\sigma$ , the local behavior of a pCFG $\mathcal{C}$ (“what will be the next state given the current transition history?”) is determined as a function $p^{\sigma}:\Pi_{\mathcal{C}}^{f}\to\mathcal{D}(\mathcal{S})$ , where $p^{\sigma}(s_{0}\ldots s_{t})\in\mathcal{D}(\mathcal{S})$ represents the distribution of the successor of $s_{t}$ under $\sigma$ given the transition history $s_{0}\ldots s_{t}\in\Pi_{\mathcal{C}}^{f}$ . By additionally fixing an initial state $s_{I}$ , the infinite-horizon behavior of $\mathcal{C}$ is also determined as a distribution $\mathbb{P}_{s_{I}}^{\sigma}\in\mathcal{D}(\Pi_{\mathcal{C}})$ , where $\mathbb{P}_{s_{I}}^{\sigma}$ represents the distribution of runs generated by $\mathcal{C}$ under $\sigma$ , starting from $s_{I}$ . We call the probability space $(\Pi_{\mathcal{C}},\mathcal{B}(\Pi_{\mathcal{C}}),\mathbb{P}_{s_{I}}^{\sigma})$ the dynamics of $\mathcal{C}$ under $\sigma$ and $s_{I}$ . The distribution $\mathbb{P}_{s_{I}}^{\sigma}$ is realized as the “limit” of the distribution $\mathbb{P}_{t}$ of the $t$ -length finite path under $\sigma$ and $s_{I}$ , which is inductively constructed by $p^{\sigma}$ (see e.g. [BertsekasS07, Prop. 7.28] for the precise argument):

\displaystyle\mathbb{P}_{1}=\delta_{s_{I}}\quad,\quad\mathbb{P}_{t+1}(A\times B)=\int_{w\in A}p^{\sigma}(B\mid w)d\mathbb{P}_{t}\quad(A\in\Pi_{\mathcal{C}}^{t},B\in\mathcal{B}(\mathcal{S})).

Cylinder sets. A cylinder set of $\Omega=\Pi_{\mathcal{C}}$ is a subset of $\Omega$ of the following form for some $t\in\mathbb{N}$ :

[A_{0}A_{1}\ldots A_{t}]=\{s_{0}s_{1}\ldots\in\Omega\mid\forall t^{\prime}\in\{0,\ldots,t\}.s_{t^{\prime}}\in A_{t^{\prime}}\},

where $A_{t^{\prime}}\subseteq\mathcal{S}$ is Borel measurable.

Pre-expectation. The pre-expectation of a (1-dimensional) MM $\eta$ formalizes the notion of “the value of $\eta$ after a transition”, which comes in two different forms. One is the pre-expectation under a scheduler $\sigma$ , which is defined as the expected value of $\eta$ at the successor state under $\sigma$ , given a transition history. This is formalized as a function $\mathbb{X}_{\sigma}\eta:\Pi_{\mathcal{C}}^{f}\to\mathbb{R}$ such that $\mathbb{X}_{\sigma}\eta(w)=\mathbb{E}_{p^{\sigma}(w)}[\eta]$ ; recall $p^{\sigma}(w)\in\mathcal{D}(\mathcal{S})$ is the successor distribution under $\sigma$ given the transition history $w$ . Another variant is the maximal pre-expectation under a transition $\tau$ , which is defined as the maximal expected value of $\eta$ at the successor state via $\tau$ from a given state (not a history, as such a maximal value is history independent). This is formalized as a function $\overline{\mathbb{X}}_{\tau}\eta:\llbracket G(\tau)\rrbracket\to\mathbb{R}$ such that $\overline{\mathbb{X}}_{\tau}\eta(s)=\sup\{\mathbb{X}_{\sigma}\eta(s)\mid\sigma_{\Delta}(s)=\tau\}$ . Equivalently, we can define the value of $\overline{\mathbb{X}}_{\tau}\eta(s)$ as follows, by explicitly writing $\mathbb{X}_{\sigma}\eta$ down; for $\tau=(\ell,\delta)$ with $\mathit{Up}(\tau)=(i,u)$ and $s\in\llbracket G(\tau)\rrbracket$ , we let $\overline{\mathbb{X}}_{\tau}\eta(s)=\sum_{\ell^{\prime}\in L}\delta(\ell^{\prime})\cdot\overline{\mathbb{X}}_{\tau,\ell^{\prime}}\eta(s)$ , where

\displaystyle\overline{\mathbb{X}}_{\tau,\ell^{\prime}}\eta(\ell,\boldsymbol{x})=\begin{cases}\eta(\ell^{\prime},u(\boldsymbol{x}),\boldsymbol{x}_{-i})&\text{if }\tau\in\Delta_{d},\\ \mathbb{E}_{u}[\lambda x_{i}^{\prime}.\eta(\ell^{\prime},x_{i}^{\prime},\boldsymbol{x}_{-i})]&\text{if }\tau\in\Delta_{p},\\ \sup_{x_{i}^{\prime}\in u}\eta(\ell^{\prime},x_{i}^{\prime},\boldsymbol{x}_{-i})&\text{if }\tau\in\Delta_{n}.\end{cases}

(6)

Basic notions about stochastic process. A (discrete-time) stochastic process in a probability space $(\Omega,\mathcal{F},\mathbb{P})$ is a sequence $(\mathbf{X}_{t})_{t=0}^{\infty}$ of $n$ -dimensional, $\mathcal{F}$ -measurable random variables $\mathbf{X}_{t}:\Omega\to\mathbb{R}^{n}$ for $t\in\mathbb{N}$ . In our context, one can suppose the probability space is the dynamics $(\Pi_{\mathcal{C}},\mathcal{B}(\Pi_{\mathcal{C}}),\mathbb{P}_{s_{I}}^{\sigma})$ of a pCFG $\mathcal{C}$ under some $\sigma$ and $s_{I}$ ; there, a typical example of a stochastic process is the value of an MM $\boldsymbol{\eta}$ over $\mathcal{C}$ at time $t$ :

\displaystyle\mathbf{X}_{t}(s_{0}s_{1}\ldots)=\boldsymbol{\eta}(s_{t}).

(7)

In this case, the process $(\mathbf{X}_{t})_{t=0}^{\infty}$ represents the behavior of $\boldsymbol{\eta}$ along the run of $\mathcal{C}$ under $\sigma$ and $s_{I}$ .

At time $t\in\mathbb{N}$ , we usually only have a limited knowledge about a sample $\omega\in\Omega$ of the probability space; for example, when we observe the behavior of a pCFG $\mathcal{C}$ , we only know the finite history $s_{0}\ldots s_{t}$ of a run $s_{0}s_{1}\ldots\in\Pi_{\mathcal{C}}$ at a given time $t$ . Such a limitation of knowledge is formalized as a filtration in $(\Omega,\mathcal{F},\mathbb{P})$ , which is a sequence $(\mathcal{F}_{t})_{t=0}^{\infty}$ of sub- $\sigma$ -algebras of $\mathcal{F}$ such that $\mathcal{F}_{t}\subseteq\mathcal{F}_{t+1}\subseteq\mathcal{F}$ for each $t\in\mathbb{N}$ . Intuitively, two elements $\omega,\omega^{\prime}\in\Omega$ can be distinguished according to $\mathcal{F}_{t}$ if and only if there exists $A\in\mathcal{F}_{t}$ such that $\omega\in A$ and $\omega^{\prime}\not\in A$ . In our context, $\mathcal{F}_{t}$ is typically the $\sigma$ -algebra generated by cylinder sets of length $t+1$ , which can distinguish two runs if and only if they branch off by time $t$ ; we call such a filtration the canonical filtration in the dynamics of pCFG.

Given a stochastic process $(\mathbf{X}_{t})_{t=0}^{\infty}$ and a filtration $(\mathcal{F}_{t})_{t=0}^{\infty}$ in $(\Omega,\mathcal{F},\mathbb{P})$ , it is natural to require that the value of $\mathbf{X}_{t}$ should be determined by the knowledge $\mathcal{F}_{t}$ , i.e., $\mathbf{X}_{t}$ is $\mathcal{F}_{t}$ -measurable. In such a case, $(\mathbf{X}_{t})_{t=0}^{\infty}$ is said to be adapted to $(\mathcal{F}_{t})_{t=0}^{\infty}$ . The stochastic process $(\mathbf{X}_{t})_{t=0}^{\infty}$ defined by Eq. (7) is adapted to the canonical filtration; indeed, the value of $\mathbf{X}_{t}$ only depends on the $t$ -th state $s_{t}$ of the run, which we know according to the $t$ -th element of the canonical filtration.

The termination of the process is formalized by a stopping time with respect to a filtration $(\mathcal{F}_{t})_{t=0}^{\infty}$ , which is a random variable $T:\Omega\to\mathbb{N}\cup\{+\infty\}$ such that the set $\llbracket T\leq t\rrbracket\subseteq\Omega$ is $\mathcal{F}_{t}$ -measurable for each $t\in\mathbb{N}$ . For such a $T$ , we naturally expect the value of a stochastic process $(\mathbf{X}_{t})_{t=0}^{\infty}$ does not change after $T$ , i.e., $\mathbf{X}_{t}(\omega)=\mathbf{X}_{t+1}(\omega)$ holds for each $\omega\in\Omega$ and $t\in\mathbb{N}$ such that $t\geq T(\omega)$ . In such a case, $(\mathbf{X}_{t})_{t=0}^{\infty}$ is said to be stopped at $T$ . In our context, a typical instance of stopping time is the termination time $T_{\mathrm{term}}^{\mathcal{C}}$ of $\mathcal{C}$ ; by the absorbing assumption of $\mathcal{C}$ at $(\ell_{\mathrm{out}},\boldsymbol{x})$ , the stochastic process (7) is stopped at the termination time.

Let $\boldsymbol{\eta}:S\to\mathbb{R}^{n}$ be an MM with $\mathsf{Lv}:\Delta\to\{0,\ldots,n\}$ . For $\Delta$ -deterministic¹⁰¹⁰10 This is relevant for defining $\mathsf{Lv}_{t}$ . Similar RSM construction for general $\sigma$ is also possible by defining runs of pCFGs as alternating sequences of states and transitions. scheduler $\sigma$ and initial state $s_{I}$ , one can construct $(\mathbf{X}_{t})_{t=0}^{\infty}$ , $(\mathcal{F}_{t})_{t=0}^{\infty}$ and $T$ as above, and also the level map $(\mathsf{Lv}_{t})_{t=0}^{\infty}$ by $\mathsf{Lv}_{t}(c_{0}c_{1}\ldots)=\mathsf{Lv}(\sigma_{\Delta}(c_{0}\ldots c_{t}))$ . We say the resulting instance $((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ is induced by $\boldsymbol{\eta}$ and $\mathsf{Lv}$ under $\sigma$ and $s_{I}$ .

Conditional expectation takes the role of pre-expectation in stochastic processes. Informally speaking, the random variable $\mathbb{E}[\mathbf{X}_{t+1}\mid\mathcal{F}_{t}]$ represents the “pre-expectation of $\mathbf{X}_{t}$ ”, i.e., $\mathbb{E}[\mathbf{X}_{t+1}\mid\mathcal{F}_{t}](\omega)$ is the expected value of $\mathbf{X}_{t+1}$ given the knowledge about $\omega$ available from $\mathcal{F}_{t}$ ; for a stochastic process $(\mathbf{X}_{t})_{t=0}^{\infty}$ induced by an MM as Eq. (7), this informal explanation is justified by the following proposition.

{myproposition}

Fix a pCFG $\mathcal{C}$ , its strategy $\sigma$ and initial state $s_{I}$ ; let $(\Pi_{\mathcal{C}},\mathcal{B}(\Pi_{\mathcal{C}}),\mathbb{P}_{s_{I}}^{\sigma})$ be the dynamics of $\mathcal{C}$ under $\sigma$ and $s_{I}$ ; and let an MM $\boldsymbol{\eta}$ over $\mathcal{C}$ be given. Define a stochastic process $(\mathbf{X}_{t})_{t=0}^{\infty}$ in $(\Pi_{\mathcal{C}},\mathcal{B}(\Pi_{\mathcal{C}}),\mathbb{P}_{s_{I}}^{\sigma})$ by Eq. (7), and let $(\mathcal{F}_{t})_{t=0}^{\infty}$ be the canonical filtration in $(\Pi_{\mathcal{C}},\mathcal{B}(\Pi_{\mathcal{C}}),\mathbb{P}_{s_{I}}^{\sigma})$ . For each $t\in\mathbb{N}$ and $k\in\{1,\ldots,n\}$ for which $\mathbb{E}_{\mathbb{P}}[\mathbf{X}_{t}[k]]$ exists, the conditional expectation $\mathbb{E}[\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}]$ is realized by a function $f:\Pi_{\mathcal{C}}\to\mathbb{R}$ such that $f(s_{0}s_{1}\ldots)=\mathbb{X}_{\sigma}\boldsymbol{\eta}[k](s_{0}\ldots s_{t})$ . ∎

Proof.

For simplicity, WLOG assume $\boldsymbol{\eta}=\eta$ is 1-dimensional. For each $t\in\mathbb{N}$ , let $\mathbb{P}_{t}$ be the marginal of $\mathbb{P}$ over $\Pi_{\mathcal{C}}^{t+1}$ . and let $g:\mathcal{S}\to\mathbb{R}$ be such that $\int_{ws\in\Pi_{\mathcal{C}}^{t+2}}g(s)d\mathbb{P}_{t+1}$ exists. Then the following equation is known to hold [BertsekasS07, Proposition 7.28]: for each $A\in\mathcal{B}(\Pi_{\mathcal{C}}^{t+1})$ ,

\displaystyle\int_{w\in A}\biggl{(}\int_{s\in\mathcal{S}}g(s)dp^{\sigma}(w)\biggr{)}d\mathbb{P}_{t}=\int_{ws\in A\times\mathcal{S}}g(s)d\mathbb{P}_{t+1}.

(8)

Recall that $\mathbb{P}_{t}$ is the marginal of $\mathbb{P}$ over $\Pi_{\mathcal{C}}^{t+1}$ ; therefore $\mathbb{E}_{\mathbb{P}}[X]=\int X_{t+1}d\mathbb{P}=\int_{ws\in\Pi_{\mathcal{C}}^{t+2}}\eta(s)d\mathbb{P}_{t+1}$ exists, and hence $g$ in (8) can be instantiated by $\eta$ . Recall $\mathbb{X}_{\sigma}\eta$ is defined by $\mathbb{X}_{\sigma}\eta(w)=\int\eta(s)dp^{\sigma}(w)$ ; we also have $[A\times\mathcal{S}]=[A]$ ; and recall again the relationship between $\mathbb{P}$ and $\mathbb{P}_{t}$ . With all of these in mind, (8) implies

\int_{[A]}fd\mathbb{P}=\int_{[A]}Xd\mathbb{P},

which proves the claim—recall $\mathcal{F}^{\prime}$ was given as $\mathcal{F}^{\prime}=\{[A]\mid A\in\mathcal{B}(\Pi_{\mathcal{C}}^{t+1})\}$ . ∎

{myremark}

A similar fact as Prop. A is claimed in [ChatterjeeGNZZ21arxiv], as a part of soundness proof of their proposed LexRSM [ChatterjeeGNZZ21arxiv, Thm. 2]. However, there is a certain ambiguity in their proof, and therefore, we prove Prop. A independently.

More concretely, in their definition of generalized LexRSM (GLexRSM) [ChatterjeeGNZZ21arxiv, Thm. 2], they require a GLexRSM $(\mathbf{X}_{t})_{t=0}^{\infty}$ that the conditional expectation $\mathbb{E}[\mathbf{X}_{t+1}\cdot{\bf 1}_{A}\mid\mathcal{F}_{t}]$ exists for each $A\in\mathcal{F}_{t}$ . Then in the proof of [ChatterjeeGNZZ21arxiv, Thm. 2], they claim that this conditional expectation exists whenever $(\mathbf{X}_{t})_{t=0}^{\infty}$ is induced from an MM $\boldsymbol{\eta}$ under a scheduler $\sigma$ and an initial state $s_{I}$ . They claim it by saying that a similar function as $f$ in Prop. A satisfies the axiom of conditional expectation [ChatterjeeGNZZ21arxiv, p. 26]; however, our claim in Prop. A is that this holds only when the expectation of $\mathbf{X}_{t}[k]$ exists for each $k$ , and this existence is not explicitly discussed in their proof. In this paper, we show the existence of $\mathbf{X}_{t}[k]$ under the linearity assumption on MMs and pCFGs (Prop. C.1.3).

On Footnote 5. In this paper, we did not formally define the notion of Lexicographic Ranking Function (LexRF) for non-probabilistic program. If we define it as an MM over non-probabilistic CFG (cf. Non-probabilistic settings, and instantiation of SC-LexRF in §3), then what we need to change in the definition of LexRSM map is the meaning of the next-time operator $\overline{\mathbb{X}}$ . More concretely, we would be defining the ranking condition by the following non-probabilistic maximal pre-expectation $\overline{\mathbb{Y}}_{\tau}\eta$ that would be defined as

\displaystyle\overline{\mathbb{Y}}_{\tau}\eta(\ell,\boldsymbol{x})=\begin{cases}\eta(\ell^{\prime},u(\boldsymbol{x}),\boldsymbol{x}_{-i})&\text{if }\tau\in\Delta_{d},\\ \sup_{x_{i}^{\prime}\in u}\eta(\ell^{\prime},x_{i}^{\prime},\boldsymbol{x}_{-i})&\text{if }\tau\in\Delta_{n},\end{cases}

where $\ell^{\prime}$ is the successor location of $\ell$ via $\tau$ . But this is exactly the same as $\overline{\mathbb{X}}_{\tau}\eta$ when considered over non-probabilistic CFG; therefore, LexRFs can be naturally understood as LexRSMs over non-probabilistic CFG.

Similarly, if we define a LexRF as a stochastic process over the trivial probability space, then we would be changing the inequality (1) in the ranking condition to the following:

\displaystyle\mathbf{X}_{t+1}[k](\omega)\leq\mathbf{X}_{t}[k](\omega)-c\cdot{\bf 1}_{\llbracket k=\mathsf{Lv}_{t}\rrbracket}(\omega).

But this is exactly what (1) means over the trivial probability space, whose $\Omega$ is a singleton. Therefore, when we say a LexRF in this paper, we understand it as a LexRSM (map) over a CFG or the trivial probability space.

Appendix B Omitted details of Section 4

Proof of Thm. 4. Any instance over the trivial probability space satisfies the conditions (a) and (b) of SC-LexRSM (Def. 3.2), hence so does the $\varepsilon$ -fixing $((\tilde{\boldsymbol{x}}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ . On the ranking condition, take any $t\in\mathbb{N}$ and $k\in\{1,\ldots,\mathsf{Lv}_{t}\}$ . Notice that, if $k=\mathsf{Lv}_{t}$ , then $\tilde{\boldsymbol{x}}_{t}[k]=\boldsymbol{x}_{t}[k]\geq 0$ holds by the non-negativity condition of SC-LexRSM; with this in mind, the inequality $\tilde{\boldsymbol{x}}_{t+1}[k]\leq\tilde{\boldsymbol{x}}_{t}[k]-c\cdot{\bf 1}_{\llbracket\mathsf{Lv}_{t}=k\rrbracket}$ is easily derived from the ranking condition of SC-LexRSM (with some case distinctions). ∎

Proof of Thm. 4. We write $\mathsf{DONE}_{k}^{t}\equiv\mathbf{X}_{t}[k]<0\lor k>\mathsf{Lv}_{t}$ to denote the fixing condition (recall Def. 4). Our goal is to show the following, where $\tilde{\mathcal{I}}=((\tilde{\mathbf{X}}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ is the $\varepsilon$ -fixing of $\mathcal{I}$ :

\forall t\in\mathbb{N}.\forall\omega\in\Omega.\forall k\in\{1,\ldots,\mathsf{Lv}_{t}(\omega)\}.\mathbb{E}[\tilde{\mathbf{X}}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)\leq\tilde{\mathbf{X}}_{t}[k](\omega)-{\bf 1}_{\llbracket k=\mathsf{Lv}_{t}\rrbracket}(\omega)\quad(\mathbb{P}\mbox{-a.s.}).

To this end, we first observe the LHS of the inequality is transformed as follows:

\mathbb{E}[\tilde{\mathbf{X}}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)=\mathbb{E}[{\bf 1}_{\llbracket\lnot\mathsf{DONE}_{k}^{t+1}\rrbracket}\cdot\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)-\varepsilon\cdot\mathbb{P}(\mathsf{DONE}_{k}^{t+1}).

By LW and SC non-negaitvity, we have $\mathsf{DONE}_{k}^{t}\Leftrightarrow k>\mathsf{Lv}_{t}$ for each $t$ and $k$ ; hence we have, for $t\in\mathbb{N},\omega\in\llbracket\mathsf{Lv}_{t}\neq 0\rrbracket$ , and $k\in\{1,\ldots,\mathsf{Lv}_{t}(\omega)\}$ ,

	$\displaystyle\mathbb{E}[{\bf 1}_{\llbracket{\lnot\mathsf{DONE}_{k}^{t+1}}\rrbracket}\cdot\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)$	$\displaystyle=\mathbb{E}[\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)-\mathbb{E}[{\bf 1}_{\llbracket\mathsf{DONE}_{k}^{t+1}\rrbracket}\cdot\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)$
		$\displaystyle=\mathbb{E}[\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)-\mathbb{E}[{\bf 1}_{\llbracket k>\mathsf{Lv}_{t+1}\rrbracket}\cdot\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)$
		$\displaystyle\leq\mathbb{E}[\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega),$

where the last inequality is due to the expected leftward non-negativity. Hence we have the following $\mathbb{P}$ -a.s., for $t\in\mathbb{N},\omega\in\llbracket\mathsf{Lv}_{t}\neq 0\rrbracket$ , and $k\in\{1,\ldots,\mathsf{Lv}_{t}(\omega)\}$ :

$\displaystyle\mathbb{E}[\tilde{\mathbf{X}}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)$	$\displaystyle\leq\mathbb{E}[\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)$
	$\displaystyle\leq\mathbf{X}_{t}[k](\omega)-{\bf 1}_{\llbracket k=\mathsf{Lv}_{t}\rrbracket}(\omega)$	(ranking condition)
	$\displaystyle=\tilde{\mathbf{X}}_{t}[k](\omega)-{\bf 1}_{\llbracket k=\mathsf{Lv}_{t}\rrbracket}(\omega).$	( $\omega\in\llbracket k\leq\mathsf{Lv}_{t}\rrbracket=\llbracket\lnot\mathsf{DONE}_{k}^{t}\rrbracket$ )	∎

Proof of Thm. 4. We use the proof structure in [ChatterjeeGNZZ21] that utilizes Borel-Cantelli Lemma. The main nontrivial part in our setting is how to properly design the non measure-zero set to derive a contradiction (i.e. eq. (9)), which involves an additional complication due to $\varphi_{t,k}$ .

{mylemma}

[Borel-Cantelli lemma] Let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, and let $(\varphi_{t})_{t=0}^{\infty}$ be a sequence of $\mathcal{F}$ -measurable predicates such that $\sum_{t}\mathbb{P}(\varphi_{t})<\infty$ . Then $\mathbb{P}(\overset{\infty}{\exists}t.\varphi_{t})=0$ . ∎

Proof.

Assume the contrary, i.e., $\mathbb{P}(T=\infty)>0$ . Under the assumption, it can be shown that there exist $t_{0}\in\mathbb{N}$ , $k_{0}\in\{1,\ldots,n\}$ and $M\in\mathbb{R}$ such that

\displaystyle\mathbb{P}(\forall t\geq t_{0}.[\mathbf{X}_{t}[k_{0}]>\bot\land k_{0}\leq\mathsf{Lv}_{t}\land\lnot\varphi_{t,k}]\land\overset{\infty}{\exists}t.k_{0}=\mathsf{Lv}_{t}\land\mathbf{X}_{t_{0}}[k_{0}]\leq M)>0.

(9)

Here, predicate in the LHS of (9) is the conjunction of three predicates. Roughly speaking, the first says “after the time $t_{0}$ , the ranking condition is always imposed in the dimension $k_{0}$ ”; the second says “the value of $\mathbf{X}_{t}[k_{0}]$ decreases by 1 in expectation infinitely often”; the third is a technical one that takes care of the case where $\mathbb{E}[\mathbf{X}_{0}[k_{0}]]=+\infty$ .

As a technical setup, we observe the “weak ranking condition” of $\mathcal{I}$ implies the following (cf. the proof of Prop. C.1.1):

	$\displaystyle\forall t\in\mathbb{N}.\forall\omega\in\llbracket\mathsf{Lv}_{t}\neq 0\rrbracket.\mathbf{X}_{t}[\mathsf{Lv}_{t}(\omega)](\omega)>\bot\quad(\mathbb{P}\mbox{-a.s.}),$		(10)
	$\displaystyle\forall t\in\mathbb{N}.\forall\omega\in\llbracket\mathsf{Lv}_{t}\neq 0\rrbracket.\forall k\in\{1,\ldots,\mathsf{Lv}_{t}(\omega)-1\}.\mathbf{X}_{t}[k](\omega)=\bot\Rightarrow\mathbf{X}_{t+1}[k](\omega)=\bot\quad(\mathbb{P}\mbox{-a.s.}).$		(11)

The proof of the statement is as follows; below, we omit the valuation element $\omega\in\Omega$ of predicates for brevity, i.e., we write e.g., $\varphi$ instead of $\varphi(\omega)$ . First, let $\psi_{k}\equiv\overset{\infty}{\forall}t.k\leq\mathsf{Lv}_{t}\land\overset{\infty}{\exists}t.k=\mathsf{Lv}_{t}$ ; then by a standard measure-theoretic argument, there exists $k_{0}\in\{1,\ldots,n\}$ such that $\mathbb{P}(\psi_{k_{0}})>0$ holds. Meanwhile, for each $\leq k\in\{1,\ldots,n\}$ , we have the following $\mathbb{P}$ -a.s.:

$\displaystyle\psi_{k}$	$\displaystyle\implies\overset{\infty}{\forall}t.\bigl{[}[\mathbf{X}_{t}[k]=\bot\Rightarrow\mathbf{X}_{t+1}[k]=\bot]\land k\leq\mathsf{Lv}_{t}\bigr{]}\land\overset{\infty}{\exists}t.k=\mathsf{Lv}_{t}$	(condition (10) and (11))
	$\displaystyle\implies\overset{\infty}{\forall}t.[\mathbf{X}_{t}[k]>\bot\land k\leq\mathsf{Lv}_{t}]\land\overset{\infty}{\exists}t.\mathsf{Lv}_{t}=k$	(see below)
	$\displaystyle\implies\overset{\infty}{\forall}t.[\mathbf{X}_{t}[k]>\bot\land k\leq\mathsf{Lv}_{t}\land\lnot\varphi_{t,k}]\land\overset{\infty}{\exists}t.\mathsf{Lv}_{t}=k.$	(contraposition of (2))

Here, the second implication holds because $\bigl{[}\overset{\infty}{\forall}t.[\mathbf{X}_{t}[k]=\bot\Rightarrow\mathbf{X}_{t+1}[k]=\bot]\land\overset{\infty}{\exists}t.k=\mathsf{Lv}_{t}\bigr{]}\Rightarrow\overset{\infty}{\forall}t.\mathbf{X}_{t}[k]>\bot$ is always true. Indeed, the LHS implies $\overset{\infty}{\forall}t.[\mathbf{X}_{t}[k]=\bot\Rightarrow\mathbf{X}_{t+1}[k]=\bot]\land\overset{\infty}{\exists}t.\mathbf{X}_{t}[k]>\bot$ by condition (10), from which the RHS easily follows. Hence $\mathbb{P}(\psi_{k_{0}})>0$ implies

\alpha=\mathbb{P}(\overset{\infty}{\forall}t.[\mathbf{X}_{t}[k_{0}]>\bot\land k_{0}\leq\mathsf{Lv}_{t}\land\lnot\varphi_{t,k_{0}}]\land\overset{\infty}{\exists}t.k_{0}=\mathsf{Lv}_{t})>0.

Let $\alpha_{t^{\prime}}=\mathbb{P}(\forall t\geq t^{\prime}.[\mathbf{X}_{t}[k_{0}]>\bot\land k_{0}\leq\mathsf{Lv}_{t}\land\lnot\varphi_{t,k_{0}}]\land\overset{\infty}{\exists}t.k_{0}=\mathsf{Lv}_{t})$ ; by Monotone Convergence Theorem we have $\alpha=\lim_{t^{\prime}\to\infty}\alpha_{t^{\prime}}$ , and hence there exists $t_{0}$ such that $\alpha_{t_{0}}>0$ . Via a similar argument, we can show the existence of $M\in\mathbb{R}$ for which the inequality (9) holds.

Now let $\Phi_{t}\equiv\forall t^{\prime}\in\{0,\ldots,t\}.\bigl{[}\mathbf{X}_{t^{\prime}+t_{0}}[k_{0}]>\bot\land k_{0}\leq\mathsf{Lv}_{t^{\prime}+t_{0}}\land\lnot\varphi_{t^{\prime}+t_{0},k_{0}}\bigr{]}.$ Also let $T^{\prime}(\omega)=\min\{t\mid\lnot\Phi_{t}(\omega)\}\cdot{\bf 1}_{\llbracket\mathbf{X}_{t_{0}}[k_{0}]\leq M\rrbracket}(\omega)$ . Then the inequality (9) is rewritten as $\mathbb{P}(T^{\prime}=\infty\land\overset{\infty}{\exists}t.k_{0}=\mathsf{Lv}_{t})>0.$ Define a stochastic process $(Y_{t})_{t=0}^{\infty}$ stopped at $T^{\prime}$ by $Y_{0}(\omega)=\bot$ for $\omega\in\llbracket T^{\prime}=0\rrbracket$ ; and $Y_{t}(\omega)=\mathbf{X}_{t+t_{0}}[k_{0}](\omega)$ for each $t\in\mathbb{N}$ and $\omega\in\llbracket T^{\prime}\geq t\land T^{\prime}\neq 0\rrbracket$ ; and $Y_{t}(\omega)=Y_{T^{\prime}(\omega)}(\omega)$ otherwise. Then we have $\mathbb{E}[Y_{t}]\geq\bot$ for each $t$ by construction. Also notice that $\mathbb{E}[Y_{0}]\leq M$ holds. Now, for each $t\in\mathbb{N}$ , observe the following holds:

	$\displaystyle T^{\prime}>t$	$\displaystyle\implies\mathbf{X}_{t+t_{0}}[k_{0}]>\bot\land k_{0}\leq\mathsf{Lv}_{t+t_{0}}\land\lnot\varphi_{t+t_{0},k_{0}}$
		$\displaystyle\implies k_{0}\leq\mathsf{Lv}_{t+t_{0}}\land\lnot(\mathbf{X}_{t+t_{0}}[k_{0}]>\bot\land\varphi_{t+t_{0},k_{0}}).$

Therefore, by the “weak ranking condition” of $\mathcal{I}$ , we have

\displaystyle\int_{\llbracket T^{\prime}>t\rrbracket}\mathbb{E}[\mathbf{X}_{t+t_{0}+1}[k_{0}]\mid\mathcal{F}_{t+t_{0}}]d\mathbb{P}\leq\int_{\llbracket T^{\prime}>t\rrbracket}\mathbf{X}_{t+t_{0}}[k_{0}]-{\bf 1}_{\llbracket k_{0}=\mathsf{Lv}_{t+t_{0}}\rrbracket}d\mathbb{P}.

Also observe $Y_{t^{\prime}}(\omega)=\mathbf{X}_{t^{\prime}+t_{0}}[k_{0}](\omega)$ holds for each $t^{\prime}\in\{0,\ldots,t+1\}$ for $\omega\in\llbracket T^{\prime}>t\rrbracket$ . Hence we have $\int_{\llbracket T^{\prime}>t\rrbracket}Y_{t+1}d\mathbb{P}=\int_{\llbracket T^{\prime}>t\rrbracket}\mathbb{E}[Y_{t+1}\mid\mathcal{F}_{t+t_{0}}]d\mathbb{P}\leq\int_{\llbracket T^{\prime}>t\rrbracket}Y_{t}-{\bf 1}_{\llbracket k_{0}=\mathsf{Lv}_{t+t_{0}}\rrbracket}d\mathbb{P}$ for each $t\in\mathbb{N}$ , where the first equality is by definition of the conditional expectation. As we have $Y_{t+1}(\omega)=Y_{t}(\omega)$ for $\omega\in\llbracket T^{\prime}\leq t\rrbracket$ (recall $(Y_{t})_{t=0}^{\infty}$ is stopped at $T^{\prime}$ ), we have

\bot\leq\mathbb{E}[Y_{t+1}]\leq\mathbb{E}[Y_{t}]-\mathbb{P}(T^{\prime}>t\land k_{0}=\mathsf{Lv}_{t+t_{0}}).

Hence we have $\sum_{t\in\mathbb{N}}\mathbb{P}(T^{\prime}>t\land k_{0}=\mathsf{Lv}_{t+t_{0}})\leq\mathbb{E}[Y_{0}]<\infty.$ By Borel-Cantelli lemma (Lem. B), we have $\mathbb{P}(\overset{\infty}{\exists}t.(T^{\prime}>t\land k_{0}=\mathsf{Lv}_{t+t_{0}}))=\mathbb{P}(T^{\prime}=\infty\land\overset{\infty}{\exists}t.k_{0}=\mathsf{Lv}_{t})=0$ , which is a contradiction. ∎

Appendix C Omitted details of Section 5

Proof of Thm. 5. Over a non-probabilistic CFG, the ranking condition of $\boldsymbol{\eta}$ implies

\forall\tau\neq\tau_{\mathrm{out}}.\forall s\in\llbracket I\land G(\tau)\rrbracket.\forall k\in\{1,\ldots,\mathsf{Lv}(\tau)\}.\forall s^{\prime}\in\mathrm{succ}_{\tau}(s).\bigl{[}\boldsymbol{\eta}[k](s^{\prime})\leq\boldsymbol{\eta}[k](s)-\mathbf{1}_{k=\mathsf{Lv}(\tau)}\bigr{]}.

From this condition, stability at negativity of $\boldsymbol{\eta}$ clearly follows. ∎

C.1 Proof of Thm. 5

The proof of Thm. 5 is quite involved, so we make a devoted subsection for it. We first define the notion of LLexRSM as an instance; we do this only in appendices as it is basically an intermediate notion to connect the LLexRSM condition of an MM $\boldsymbol{\eta}$ and $(\varepsilon,\gamma)$ -fixability of its induced instance.

{mydefinition}

[LLexRSM] Suppose the following are given: a probability space $(\Omega,\mathcal{F},\mathbb{P})$ ; a filtration $(\mathcal{F}_{t})_{t=0}^{\infty}$ on $\mathcal{F}$ ; and a stopping time $T$ w.r.t. $(\mathcal{F}_{t})_{t=0}^{\infty}$ . An instance $((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ is called a Lazy Lexicographic Ranking SuperMartingale (LLexRSM) for $T$ if it is an SC-LexRSM for $T$ , and additionally satisfies the following:

	(stability at negativity)	$\displaystyle\forall t\in\mathbb{N}.\forall\omega\in\Omega.\forall k\in\{1,\ldots,\mathsf{Lv}_{t}(\omega)-1\}.$
		$\displaystyle\qquad\qquad\qquad\qquad\quad\mathbf{X}_{t}[k](\omega)<0\Rightarrow\mathbf{X}_{t+1}[k](\omega)<0\lor k>\mathsf{Lv}_{t+1}(\omega).$

C.1.1 Preparation 1: carving out the LLexRSM conditions from fixability

Our plan of soundness proof is the following: if an instance $\mathcal{I}=((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ is induced by an LLexRSM map under linearity and well-behavedness assumptions (in which case $\mathcal{I}$ is an LLexRSM, as we show in Prop. C.1.3), then $\mathcal{I}$ is $(\varepsilon,\gamma)$ -fixable, and hence the underlying stopping time is AST. It turns out that each of the LLexRSM conditions of $\mathcal{I}$ —ranking condition, SC non-negativity and stability at negativity—contributes to its fixability in a rather independent way. More concretely, for given $t\in\mathbb{N}$ and $k\in\{1,\ldots,n\}$ , split the set $\llbracket k\leq\mathsf{Lv}_{t}\rrbracket\subseteq\Omega$ (i.e., the set of samples $\omega\in\Omega$ on which the ranking condition is imposed in dimension $k$ at time $t$ ) into the following three:

\displaystyle\tilde{\Omega}_{1}^{t,k}=\llbracket\tilde{\mathbf{X}}_{t}[k]=-\varepsilon\land k=\mathsf{Lv}_{t}\rrbracket,\quad\tilde{\Omega}_{2}^{t,k}=\llbracket\tilde{\mathbf{X}}_{t}[k]=-\varepsilon\land k<\mathsf{Lv}_{t}\rrbracket,\quad\tilde{\Omega}_{3}^{t,k}=\llbracket\tilde{\mathbf{X}}_{t}[k]\geq 0\land k\leq\mathsf{Lv}_{t}\rrbracket.

Here, $\tilde{\mathcal{I}}=((\tilde{\mathbf{X}}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ is the $\varepsilon$ -fixing of $\mathcal{I}$ for a fixed $\varepsilon>0$ . Then the $(\varepsilon,\gamma)$ -fixability of $\mathcal{I}$ over $\tilde{\Omega}_{1}^{t,k}$ , $\tilde{\Omega}_{2}^{t,k}$ and $\tilde{\Omega}_{3}^{t,k}$ are derived from its SC non-negativity, stability at negativity and ranking condition, respectively. In this section, we show that the conditions are $\mathbb{P}$ -a.s. equivalent in the first two cases (Prop. C.1.1); We handle the third case later, which is much more nontrivial.

Let $\Omega^{t,k}\subseteq\Omega$ be given for each $t\in\mathbb{N}$ and $k\in\{1,\ldots n\}$ . We say an instance $((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ satisfies the ranking condition over $\Omega^{t,k}$ when it satisfies the following:

\forall t\in\mathbb{N}.\forall k\in\{1,\ldots,n\}.\forall\omega\in\llbracket T>t\rrbracket\cap\Omega^{t,k}.\mathbb{E}[\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)\leq\mathbf{X}_{t}[k](\omega)-{\bf 1}_{\llbracket\mathsf{Lv}_{t}=k\rrbracket}(\omega)\ (\mathbb{P}\mbox{-a.s.})

Observe it is the usual ranking condition (Def. 3.2) when $\Omega^{t,k}=\llbracket k\leq\mathsf{Lv}_{t}\rrbracket$ . Now the following holds.

{myproposition}

[LLexRSM conditions as partial fixability] Let an instance $\mathcal{I}=((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ in a probability space $(\Omega,\mathcal{F},\mathbb{P})$ be given, and let $\tilde{\mathcal{I}}=((\tilde{\mathbf{X}}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ be its $\varepsilon$ -fixing for some $\varepsilon>0$ . Then the following hold.

(a)

$\mathcal{I}$ is SC non-negative $\mathbb{P}$ -a.s. iff $\tilde{\mathcal{I}}$ satisfies the ranking condition over $\llbracket\tilde{\mathbf{X}}_{t}[k]=-\varepsilon\land k=\mathsf{Lv}_{t}\rrbracket$ .
(b)

$\mathcal{I}$ is stable at negativity $\mathbb{P}$ -a.s. iff $\tilde{\mathcal{I}}$ satisfies the ranking condition over $\llbracket\tilde{\mathbf{X}}_{t}[k]=-\varepsilon\land k<\mathsf{Lv}_{t}\rrbracket$ . ∎

Proof.

Let $\mathcal{J}=((\mathbf{Y}_{t})_{t=0}^{\infty},(\mathsf{Lv}^{\prime}_{t})_{t=0}^{\infty})$ be a uniformly well-founded instance with a bottom $\bot$ . For given $t\in\mathbb{N}$ and $k\in\{1,\ldots,n\}$ , split the set $\llbracket k\leq\mathsf{Lv}^{\prime}_{t}\rrbracket$ into the following:

\displaystyle\Omega_{1}^{t,k}=\llbracket\mathbf{Y}_{t}[k]=\bot\land k=\mathsf{Lv}^{\prime}_{t}\rrbracket,\Omega_{2}^{t,k}=\llbracket\mathbf{Y}_{t}[k]=\bot\land k<\mathsf{Lv}^{\prime}_{t}\rrbracket,\Omega_{3}^{t,k}=\llbracket\mathbf{Y}_{t}[k]>\bot\land k\leq\mathsf{Lv}^{\prime}_{t}\rrbracket.

(12)

Then it can be shown that the ranking condition for $\mathcal{J}$ over $\Omega_{1}^{t,k}$ and $\Omega_{2}^{t,k}$ are equivalent to (13) and (14) below, respectively:

	$\displaystyle\forall t\in\mathbb{N}.\forall\omega\in\llbracket\mathsf{Lv}_{t}\neq 0\rrbracket.\mathbf{Y}_{t}[\mathsf{Lv}^{\prime}_{t}(\omega)](\omega)>\bot\quad(\mathbb{P}\mbox{-a.s.}),$		(13)
	$\displaystyle\forall t\in\mathbb{N}.\forall k\in\{1,\ldots,n\}.\forall\omega\in\llbracket\mathbf{Y}_{t}[k]=\bot\land k<\mathsf{Lv}^{\prime}_{t}\rrbracket.\mathbf{Y}_{t+1}[k](\omega)=\bot\quad(\mathbb{P}\mbox{-a.s.}).$		(14)

Indeed, we note that the inequality of the ranking condition is never satisfied over $\Omega_{1}^{t,k}$ ; therefore, $\mathcal{J}$ satisfies the ranking condition over $\Omega_{1}^{t,k}$ iff $\mathbb{P}(\Omega_{1}^{t,k})=0$ for each $t$ and $k$ , which is equivalent to (13). The equivalence with (14) is shown from the fact that the following holds $\mathbb{P}$ -a.s. over $\Omega_{2}^{t,k}$ :

\mathbb{E}[\mathbf{Y}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)\leq\mathbf{Y}_{t}[k](\omega)-{\bf 1}_{\llbracket\mathsf{Lv}^{\prime}_{t}=k\rrbracket}(\omega)\Leftrightarrow\mathbb{E}[\mathbf{Y}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)=\bot\Leftrightarrow\mathbf{Y}_{t+1}[k](\omega)=\bot.

Now, if $\mathcal{J}$ is the $\varepsilon$ -fixing $\tilde{\mathcal{I}}$ of some $\mathcal{I}$ (whose bottom is $-\varepsilon$ ), then it is easy to check $\tilde{\mathcal{I}}$ satisfies (13) iff $\mathcal{I}$ is SC non-negative $\mathbb{P}$ -a.s.; similarly, $\tilde{\mathcal{I}}$ satisfies (14) iff $\mathcal{I}$ is stable at negativity $\mathbb{P}$ -a.s. ∎

{myremark}

We note Prop. C.1.1 implies that LLexRSM generalizes $\varepsilon$ -fixable LexRSM modulo $\mathbb{P}$ -a.s., i.e. any $\varepsilon$ -fixable LexRSM with $\varepsilon>0$ satisfies the ranking conition, SC non-negativity and stability at negativity $\mathbb{P}$ -a.s. This generalization is also strict, in the sense that there is an LLexRSM that is not $\varepsilon$ -fixable for any $\varepsilon>0$ (Fig. 3 constitutes such a one).

C.1.2 Preparation 2: Well-behaved distributions and its properties

Below we give the formal definition of well-behaved distributions. {mydefinition}[well-behaved distributions] We say a distribution $p\in\mathcal{D}(\mathbb{R})$ is well-behaved if the following holds: for any $a\in\mathbb{R}\setminus\{0\}$ , there exist constants $C_{1}\in(0,1)$ and $C_{2}>0$ such that

\displaystyle\forall b\in\mathbb{R}.\biggl{[}p(ax+b<0)\leq C_{1}\Rightarrow\int ax+bdp\geq\int\max\{0,ax+b\}dp-C_{2}\cdot p(ax+b<0)\biggr{]}.

(15)

Condition (15) is a canonical realization of what we need for the soundness proof. Recall our goal is to show that, if a linear MM $\boldsymbol{\eta}$ satisfies the ranking condition, then any of its induced instance $\mathcal{I}$ is $(\varepsilon,\gamma)$ -fixable over $\tilde{\Omega}_{3}^{t,k}=\llbracket\tilde{\mathbf{X}}_{t}[k]\geq 0\land k\leq\mathsf{Lv}_{t}\rrbracket$ . Condition (15) is designed so that this property holds whenever the underlying pCFG updates variables according to well-behaved distributions: roughly speaking, the antecedent part of (15) reads “ $\varphi_{t,k}(\omega)$ in (3) is false”; and the consequent part of (15) reads “at $\omega$ , the ranking condition for $\mathcal{I}$ in the dimension $k$ at time $t$ implies the ranking condition for its $\varepsilon$ -fixing $\tilde{\mathcal{I}}$ for the same $k$ and $t$ ”.

We formally define our well-behavedness condition of pCFGs below, together with linearity. {mydefinition} For $\tau\in\Delta$ , let $u_{\tau}$ be the second component of $\mathit{Up}(\tau)$ . We say a pCFG $\mathcal{C}$ is linear if, for each $\tau\in\Delta_{d}$ , the function $u_{\tau}:\mathbb{R}^{|V|}\to\mathbb{R}$ is linear; we say $\mathcal{C}$ is well-behaved if, for each $\tau\in\Delta_{p}$ , the distribution $u_{\tau}$ is well-behaved. An MM $\boldsymbol{\eta}$ is linear if $\lambda\boldsymbol{x}.\boldsymbol{\eta}(\ell,\boldsymbol{x})$ is linear for each $\ell\in L$ .

Below we prove two important classes of distributions are well-behaved. We do not use it in the proof of Thm. 5; we prove it to demonstrate the applicability of LLexRSM.

{myproposition}

The following hold.

(a)

Any $p\in\mathcal{D}(\mathbb{R})$ with a bounded support is well-behaved. Moreover, for $c>0$ , let $P=\{p\in\mathcal{D}(\mathbb{R})\mid\mathrm{supp}(p)\subseteq[-c,c]\}$ ; then for $a\in\mathbb{R}\setminus\{0\}$ , there exist $C_{1}$ and $C_{2}$ such that (15) holds for any $p\in P$ .
(b)

For any $\mu\in\mathbb{R}$ and $\sigma^{2}>0$ , the normal distribution $\mbox{Norm}(\mu,\sigma^{2})$ with the mean $\mu$ and the standard deviation $\sigma^{2}$ is well-behaved¹¹¹¹11 We use $\mu$ and $\sigma^{2}$ to represent the mean and standard deviation, following the standard of statistics; there is no relevance to measures and schedulers. . ∎

Proof of Prop. C.1.2. We prove for the case $a<0$ only; the proof is similar for the case $a>0$ . A proof of (a) is as follows. Fix $y_{0}\in\mathbb{R}$ and $y_{1}>0$ , and take any $p\in\mathcal{D}(\mathbb{R})$ such that $\mathrm{supp}(p)\subseteq[y_{0},y_{0}+y_{1}]$ . Then for a given $b\in\mathbb{R}$ , $p(ax+b<0)<1$ implies $ay_{0}+b\geq 0$ , and hence $\inf\{ax+b\mid x\in\mathrm{supp}(p)\}\geq a(y_{0}+y_{1})+b\geq ay_{1}$ . Then we have $\int_{\mathbb{R}}\min\{0,ax+b\}dp=\int_{\llbracket ax+b<0\rrbracket}ax+bdp\geq ay_{1}\cdot p(ax+b<0).$ Therefore, we let $C_{1}$ be any value in $(0,1)$ and $C_{2}=-ay_{1}$ and the proof is done.

A proof of (11) is as follows. We only prove the case where $\mu=0$ and $\sigma^{2}=1$ ; the proof is similar for the general case. Let $p=\mbox{Norm}(0,1)$ . First observe, for $y\in\mathbb{R}$ ,

\sqrt{2\pi}\int_{y}^{\infty}xdp=\int_{y}^{\infty}xe^{-\frac{x^{2}}{2}}dx=\biggl{[}-e^{-\frac{x^{2}}{2}}\biggr{]}_{y}^{\infty}=e^{-\frac{y^{2}}{2}}.

Also there is a known bound $\int_{y}^{\infty}e^{-\frac{x^{2}}{2}}dx>\frac{y}{y^{2}+1}e^{-\frac{y^{2}}{2}}$ , see [AbramowitzS:book]. By these, for $b<0$ we have

\frac{\int_{\frac{b}{a}}^{\infty}ax-bdp}{\int_{\frac{b}{a}}^{\infty}dp}=-b+a\cdot\frac{\int_{\frac{b}{a}}^{\infty}xe^{-\frac{x^{2}}{2}}dx}{\int_{\frac{b}{a}}^{\infty}e^{-\frac{x^{2}}{2}}dx}>-b+a\cdot\frac{(b/a)^{2}+1}{(b/a)}=\frac{a^{2}}{b}.

Thus in particular, for $b\leq-1$ , we have $\mbox{(LHS)}>-a^{2}$ . Now, for a given $a\in\mathbb{R}\setminus\{0\}$ , let $C_{1}=p(ax+1<0)$ . Then we have $p(ax+b<0)\leq C_{1}\Rightarrow\int_{\llbracket ax+b<0\rrbracket}ax+bdp\geq-a^{2}\cdot p(ax+b<0)$ , and hence, we let $C_{2}=a^{2}$ and we are done. ∎

C.1.3 Preparation 3: LLexRSM map induces LLexRSM

As written in From RSM maps to RSMs of §3.2, properties of $\boldsymbol{\eta}$ such as ranking condition and non-negativity are inherited to its induced instance if the expectation of $\mathbf{X}_{t}[k]$ exists for each $t$ , $k$ . This existence is actually not so trivial when $\boldsymbol{\eta}$ is unbounded from below. Here we ensure LLexRSM map induces LLexRSM in our setting (Prop. C.1.3).

We say a distribution $d\in\mathcal{D}(\mathbb{R})$ is integrable (for linear functions) if it satisfies $\mathbb{E}_{d}[\lambda x.|x|]<\infty$ . We note this property of $d$ implies $\mathbb{E}_{d}[|\eta|]<\infty$ for any linear function $\eta:\mathbb{R}\to\mathbb{R}$ ; if $d$ is not integrable, then $\mathbb{E}_{d}[|\eta|]=\infty$ holds for any $\eta$ that is a non-constant linear function. In [ChatterjeeGNZZ21arxiv], each variable sampling distribution of a pCFG is assumed to be integrable; below we prove that stochastic processes induced by an MM $\boldsymbol{\eta}$ has a finite expectation at each $t$ under this assumption, plus the linearity assumption on MMs and pCFGs.

{myproposition}

Let $\mathcal{C}$ be a linear pCFG with its state set $\mathcal{S}$ , and let $\boldsymbol{\eta}:\mathcal{S}\to\mathbb{R}^{n}$ be a linear MM. Let $(\mathbf{X}_{t})_{t=0}^{\infty}$ be induced by $\boldsymbol{\eta}$ under a $\Delta$ -deterministic scheduler $\sigma$ and an initial state $s_{0}$ , and let $(\Pi_{\mathcal{C}},\mathcal{B}(\Pi_{\mathcal{C}}),\mathbb{P})$ be the dynamics of $\mathcal{C}$ under $\sigma$ and $s_{0}$ . Suppose, for each transition $\tau\in\Delta_{p}$ such that $\mathit{Up}(\tau)=(i,u)$ , the distribution $u$ is integrable. Then we have $\mathbb{E}_{\mathbb{P}}[|\mathbf{X}_{t}[k]|]<\infty$ for each $k\in\{1,\ldots,n\}$ and $t\in\mathbb{N}$ .

Proof.

Without loss of generality we assume $\boldsymbol{\eta}$ is 1-dimensional, and denote it by $\eta_{0}$ . Let $\mathbb{P}_{t}$ be the marginal of $\mathbb{P}$ over $\Pi_{\mathcal{C}}^{t+1}$ (the set of finite paths with lengths $t+1$ ). We recall $\mathbb{E}_{\mathbb{P}}[|X_{t}|]=\int_{s_{0}\ldots s_{t}\in\Pi_{\mathcal{C}}^{t+1}}|\eta_{0}(s_{t})|d\mathbb{P}_{t}$ holds by definition of $X_{t}$ . Hence, it suffices to show the following: for each $t\in\mathbb{N}$ and any 1-dimensional linear MM $\eta$ , the integral $\int_{s_{0}\ldots s_{t}\in\Pi_{\mathcal{C}}^{t+1}}\eta(s_{t})d\mathbb{P}_{t}$ is finite.

This is shown by induction on $t$ . The base case is true as $\mathbb{P}_{0}=\delta_{s_{0}}$ is Dirac. For the step case, for each $\tau\in\Delta$ and $s_{t}\in\llbracket G(\tau)\rrbracket$ we have $\underline{\mathbb{X}}_{\tau}\eta(s_{t})\leq\mathbb{X}_{\sigma}\eta(s_{0}\ldots s_{t})\leq\overline{\mathbb{X}}_{\tau}\eta(s_{t})$ ; here, the function $\underline{\mathbb{X}}_{\tau}\eta$ is so-called minimal pre-expectation [TakisakaOUH21, ChatterjeeGNZZ21arxiv], which is identical to $\overline{\mathbb{X}}_{\tau}\eta$ except that the supremum is substituted with the infimum. The functions $\underline{\mathbb{X}}_{\tau}\eta$ and $\overline{\mathbb{X}}_{\tau}\eta$ are linear over $\llbracket G(\tau)\rrbracket$ whenever $\eta$ and $\mathcal{C}$ are linear, and $\mathcal{C}$ satisfies the integrability assumption. Hence $\int_{w\in\Pi_{\mathcal{C}}^{t+1}}\mathbb{X}_{\sigma}\eta(w)d\mathbb{P}_{t}$ is finite by induction hypothesis, and this value is equal to $\int_{s_{0}\ldots s_{t+1}\in\Pi_{\mathcal{C}}^{t+2}}\eta(s_{t+1})d\mathbb{P}_{t+1}$ , hence finiteness of the latter is proved. ∎

Now we have the desired proposition as follows. {myproposition} Let $\mathcal{C}$ be a linear, well-behaved pCFG, and let $\boldsymbol{\eta}$ be a linear LLexRSM map over $\mathcal{C}$ with a level map $\mathsf{Lv}$ supported by an invariant $I$ . Then for any $\Delta$ -deterministic strategy $\sigma$ and an initial state $s_{I}$ of $\mathcal{C}$ , the instance $\mathcal{I}=((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ induced by $\boldsymbol{\eta}$ and $\mathsf{Lv}$ under $\sigma$ and $s_{I}$ is an LLexRSM for the termination time $T_{\mathrm{term}}^{\mathcal{C}}$ .

Proof.

The proof of adaptedness of $(\mathbf{X}_{t})_{t=0}^{\infty}$ and heredity of ranking and SC non-negativity conditions from LexRSM map to LexRSM is identical to the one in existing works [AgrawalCP18, ChatterjeeGNZZ21], so we omit it.

Existence of $\mathbb{E}_{\mathbb{P}}[\mathbf{X}_{t}[k]]$ is shown by Prop. C.1.3. Indeed, any well-behaved distribution is integrable, as we show below. If $p\in\mathcal{D}(\mathbb{R})$ is well-behaved, then take $C_{1}$ in Eq. (15) for $a=1$ , and take sufficiently large $b\in\mathbb{R}$ so that $p(x+b)\leq C_{1}$ . Then we have

\int_{\llbracket x+b<0\rrbracket}x+bdp=\int_{\llbracket x<-b\rrbracket}xdp+b\cdot p(x+b<0)\geq-C_{2}\cdot p(x+b<0),

which proves $\int_{\llbracket x<-b\rrbracket}xdp>-\infty$ . Similarly, by letting $a=-1$ in Eq. (15), we can derive an inequality $\int_{\llbracket x>b^{\prime}\rrbracket}xdp<\infty$ for some $b^{\prime}\in\mathbb{R}$ . As we have $\int_{\llbracket-b\leq x\leq b^{\prime}\rrbracket}|x|dp^{\prime}<\infty$ for any $p^{\prime}\in\mathcal{D}(\mathbb{R})$ , the claim holds.

Stability at negativity of the induced instance $\mathcal{I}$ is proved as follows. For $\omega=s_{0}s_{1}\ldots\in\Pi_{\mathcal{C}}$ , let $\tau_{t}(\omega)=\sigma_{\Delta}(s_{0}\ldots s_{t})$ . Then the stability at negativity of $\mathcal{I}$ is described as follows: For a given $t\in\mathbb{N}$ , $\omega=s_{0}s_{1}\ldots\in\Pi_{\mathcal{C}}$ , and $k\in\{1,\ldots,\mathsf{Lv}(\tau_{t}(\omega))-1\}$ , it holds that

\displaystyle\boldsymbol{\eta}[k](s_{t})<0\Rightarrow\boldsymbol{\eta}[k](s_{t+1})<0\lor k>\mathsf{Lv}(\tau_{t+1}(\omega)).

(16)

This is derived from the stability at negativity of $\boldsymbol{\eta}$ ; indeed, if $s_{t}$ is a terminal state, then $\mathsf{Lv}_{t}(\omega)=0$ and there is nothing to be checked. Otherwise, we have $s_{t}\in\llbracket I\rrbracket$ as $s_{0}s_{1}\ldots\in\Pi_{\mathcal{C}}$ means $s_{t}$ is reachable from an initial state $s_{0}$ ; we also have $s_{t}\in\llbracket G(\tau_{t}(\omega))\rrbracket$ because $\tau_{t}(\omega)$ is chosen by $\sigma$ at $s_{t}$ ; and hence, Eq. (16) follows from the stability at negativity of $\boldsymbol{\eta}$ . ∎

C.1.4 The main proof

Now we wrap up everything into the proof of the main theorem. The hardest part is the proof of Thm. 5, which we give below.

Proof of Thm. 5. Let $\mathcal{I}=((\mathbf{X}_{t})_{t=0}^{\infty},(\mathsf{Lv}_{t})_{t=0}^{\infty})$ be the induced instance. Then by Prop. C.1.3 $\mathcal{I}$ is an LLexRSM for the termination time $T_{\mathrm{term}}^{\mathcal{C}}$ . Therefore, by Prop. C.1.1, for any $\varepsilon>0$ , the $\varepsilon$ -fixing $\tilde{\mathcal{I}}$ of $\mathcal{I}$ satisfies the ranking condition over $\llbracket\tilde{\mathbf{X}}_{t}[k]=-\varepsilon\land k\leq\mathsf{Lv}_{t}\rrbracket$ . Hence, the proof is done if we find $\varepsilon>0$ and $\gamma\in(0,1)$ under which $\tilde{\mathcal{I}}$ satisfies the ranking condition over $\llbracket\tilde{\mathbf{X}}_{t}[k]\geq 0\land k\leq\mathsf{Lv}_{t}\land\lnot\varphi_{t,k}\rrbracket$ , where $\varphi_{t,k}(\omega)\equiv\mathbb{P}[\tilde{\mathbf{X}}_{t+1}[k]=-\varepsilon\mid\mathcal{F}_{t}](\omega)\geq\gamma,$ as in Eq. (3). More explicitly, our goal is to find $\varepsilon>0$ and $\gamma\in(0,1)$ under which the following holds $\mathbb{P}$ -a.s. for each $t\in\mathbb{N}$ and $k\in\{1,\ldots,n\}$ :

\displaystyle\tilde{\mathbf{X}}_{t}[k](\omega)\geq 0\land k\leq\mathsf{Lv}_{t}(\omega)\land\lnot\varphi_{t,k}(\omega)\Rightarrow\mathbb{E}[\tilde{\mathbf{X}}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)\leq\tilde{\mathbf{X}}_{t}[k](\omega)-{\bf 1}_{\llbracket\mathsf{Lv}_{t}=k\rrbracket}(\omega).

(17)

Observe that the following are equivalent: (a) $\tilde{\mathbf{X}}_{t}[k](\omega)\geq 0$ , (b) $\mathbf{X}_{t}[k](\omega)\geq 0\land k\leq\mathsf{Lv}_{t}(\omega)$ , and (c) $\tilde{\mathbf{X}}_{t}[k](\omega)=\mathbf{X}_{t}[k](\omega)\geq 0\land k\leq\mathsf{Lv}_{t}(\omega)$ . Hence, (17) is equivalent to

\displaystyle\mathbf{X}_{t}[k](\omega)\geq 0\land k\leq\mathsf{Lv}_{t}(\omega)\land\lnot\varphi_{t,k}(\omega)\Rightarrow\mathbb{E}[\tilde{\mathbf{X}}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)\leq\mathbf{X}_{t}[k](\omega)-{\bf 1}_{\llbracket\mathsf{Lv}_{t}=k\rrbracket}(\omega).

(18)

We prove (18) via two case distinctions. If $\omega$ satisfies $\mathbb{P}[\tilde{\mathbf{X}}_{t+1}[k]=-\varepsilon\mid\mathcal{F}_{t}](\omega)=0$ , then we have $\tilde{\mathbf{X}}_{t+1}[k](\omega)\geq 0$ $\mathbb{P}$ -a.s., and hence $\mathbb{E}[\tilde{\mathbf{X}}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)=\mathbb{E}[\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)$ $\mathbb{P}$ -a.s. Therefore, in this case, (18) is proved by the ranking condition for $\mathcal{I}$ .

The rest of the proof is devoted to the case where $\mathbb{P}[\tilde{\mathbf{X}}_{t+1}[k]=-\varepsilon\mid\mathcal{F}_{t}](\omega)>0$ holds (thus it is always assumed below). For $\omega=s_{0}s_{1}\ldots\in\Omega$ , let $\tau_{t}(\omega)=\sigma_{\Delta}(s_{0}\ldots s_{t})$ (i.e., the transition chosen by $\sigma$ given the history $s_{0}\ldots s_{t}$ ; recall $\sigma$ is $\Delta$ -deterministic). We first show that $\tau_{t}(\omega)\not\in\Delta_{d}$ holds when $\gamma$ is sufficiently small. Observe that, by Prop. A, the value $\mathbb{P}[\tilde{\mathbf{X}}_{t+1}[k=-\varepsilon\mid\mathcal{F}_{t}](\omega)=\mathbb{P}[\mathbf{X}_{t+1}[k]<0\lor k>\mathsf{Lv}_{t+1}\mid\mathcal{F}_{t}](\omega)$ $\mathbb{P}$ -a.s. represents the probability that either $\boldsymbol{\eta}[k](s^{\prime})<0$ or $k>\mathsf{Lv}(\sigma_{\Delta}(s_{0}\ldots s_{t}s^{\prime}))$ holds, where $s^{\prime}$ is the successor of $s_{t}$ under $\tau_{t}(\omega)$ . Also observe that, through a deterministic transition $\tau=(\delta,u)\in\Delta_{d}$ , the successor state $(\ell^{\prime},\boldsymbol{x}^{\prime})$ is determined once the successor location $\ell^{\prime}$ is sampled from the distribution $\delta$ . Hence, if $\tau_{t}(\omega)=(\ell,\delta)\in\Delta_{d}$ , then it should hold $\mathbb{P}$ -a.s. that $\mathbb{P}[\tilde{\mathbf{X}}_{t+1}[k]=-\varepsilon\mid\mathcal{F}_{t}](\omega)=\sum_{s^{\prime}\in S^{\prime}}\delta(s^{\prime})$ for some $S^{\prime}\subseteq\mathrm{supp}(\delta)$ . Therefore, if $\gamma<\delta(\ell^{\prime})$ holds for every $(\ell,\delta)\in\Delta$ and $\ell^{\prime}\in\mathrm{supp}(\delta)$ , then it is $\mathbb{P}$ -a.s. true that $\mathbb{P}[\tilde{\mathbf{X}}_{t+1}[k]=-\varepsilon\mid\mathcal{F}_{t}](\omega)\in(0,\gamma)$ implies $\tau_{t}(\omega)\not\in\Delta_{d}$ .

Now observe that the consequent part of (18) can be rewritten as follows, by explicitly writing $\tilde{\mathbf{X}}_{t}$ down: here we use the notation $\mathsf{DONE}_{k}^{t}\equiv\mathbf{X}_{t}[k]<0\lor k>\mathsf{Lv}_{t}$ to describe the fixing condition (recall Def. 4).

\mathbb{E}[{\bf 1}_{\llbracket\lnot\mathsf{DONE}_{k}^{t+1}\rrbracket}\cdot\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)-\varepsilon\cdot\mathbb{P}[\mathsf{DONE}_{k}^{t+1}\mid\mathcal{F}_{t}](\omega)\leq\mathbf{X}_{t}[k](\omega)-{\bf 1}_{\llbracket k=\mathsf{Lv}_{t}\rrbracket}(\omega).

Meanwhile, the antecedent part of (18) implies the following, due to the ranking condition of $\mathcal{I}$ :

\mathbb{E}[{\bf 1}_{\llbracket\lnot\mathsf{DONE}_{k}^{t+1}\rrbracket}\cdot\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)+\mathbb{E}[{\bf 1}_{\llbracket\mathsf{DONE}_{k}^{t+1}\rrbracket}\cdot\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)\leq\mathbf{X}_{t}[k](\omega)-{\bf 1}_{\llbracket k=\mathsf{Lv}_{t}\rrbracket}(\omega).

Hence, to prove (18), it suffices to show $-\mathbb{E}[{\bf 1}_{\llbracket\mathsf{DONE}_{k}^{t+1}\rrbracket}\cdot\mathbf{X}_{t+1}[k]\mid\mathcal{F}_{t}](\omega)\leq\varepsilon\cdot\mathbb{P}[\mathsf{DONE}_{k}^{t+1}\mid\mathcal{F}_{t}](\omega)$ whenever the antecedent of (18) and $\tau_{t}(\omega)\not\in\Delta_{d}$ holds. In particular, it suffices to show

\displaystyle\begin{split}\mathbf{X}_{t}[k](\omega)\geq 0\land k\leq\mathsf{Lv}_{t}(\omega)&\land\varphi_{t,k}(\omega)\land\tau_{t}(\omega)\not\in\Delta_{d}\\ &\Rightarrow-\mathbb{E}[\min\{0,\mathbf{X}_{t+1}[k]\}\mid\mathcal{F}_{t}](\omega)\leq\varepsilon\cdot\mathbb{P}[\mathbf{X}_{t+1}[k]<0\mid\mathcal{F}_{t}](\omega).\end{split}

(19)

Let $P\subseteq\mathcal{D}(\mathbb{R})$ be the set of all distributions over $\mathbb{R}$ that can be used for variable updates in the pCFG $\mathcal{C}$ ; more precisely, let $P=\{u_{\tau}\mid\tau\in\Delta_{p}\}\cup\bigcup_{\tau\in\Delta_{n}}\{u^{\prime}\mid\mathrm{supp}(u^{\prime})\subseteq u_{\tau}\}$ . For a given $a\in\mathbb{R}\setminus\{0\}$ , there are numbers $C_{1}(a)\in(0,1)$ and $C_{2}(a)>0$ such that (15) holds under $C_{1}=C_{1}(a)$ and $C_{2}=C_{2}(a)$ for any $p\in P$ ; this is derived from Prop. C.1.2.(a). In what follows, we fix such $C_{1}(a)$ and $C_{2}(a)$ for each $a\in\mathbb{R}\setminus\{0\}$ . Also, for $\ell^{\prime}\in L$ and $\boldsymbol{x}_{-i}\in\mathbb{R}^{|V|-1}$ , define $\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}:\mathbb{R}\to\mathbb{R}^{n}$ by $\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}(x_{i})=\boldsymbol{\eta}(\ell^{\prime},x_{i},\boldsymbol{x}_{-i})$ . Notice that $\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}[k]$ is a 1-dimensional linear function of the form $\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}[k](x_{i})=a_{\ell^{\prime},k}x_{i}+b$ , where the coefficient $a_{\ell^{\prime},k}$ only depends on $\ell^{\prime}$ and $k$ , and is independent of $\boldsymbol{x}_{-i}$ . Let $C_{1}=\min_{\ell^{\prime},k}C_{1}(a_{\ell^{\prime},k})$ and $C_{2}=\max_{\ell^{\prime},k}C_{2}(a_{\ell^{\prime},k})$ ; as $L$ is finite, $C_{1}\in(0,1)$ and $0<C_{2}<\infty$ .

Now fix $\tau=(\ell,\delta)\in\Delta_{p}$ and $u=u_{\tau}$ . By the well-behavedness of the pCFG $\mathcal{C}$ , we have the following for each $k\in\{1,\ldots,n\}$ :

\displaystyle\forall\ell^{\prime}.\forall\boldsymbol{x}_{-i}.\ \biggl{[}u(\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}[k]<0)\leq C_{1}\Rightarrow\int_{\mathbb{R}}\min\{0,\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}[k]\}du\geq-C_{2}\cdot u(\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}[k]<0)\biggr{]}.

(20)

Also, by Prop. A we have the following $\mathbb{P}$ -a.s. for any $t\in\mathbb{N}$ and $k\in\{1,\ldots,n\}$ ; if $\tau_{t}(\omega)=\tau$ , then

\mathbb{P}[\mathbf{X}_{t+1}[k]<0\mid\mathcal{F}_{t}](\omega)=\sum_{\ell^{\prime}\in L}\delta(\ell^{\prime})\cdot u(\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}[k]<0),

where $\omega=s_{0}s_{1}\ldots$ and $s_{t}=(\ell,x_{i},\boldsymbol{x}_{-i})$ . Hence, for $\gamma\in(0,1)$ that satisfies $\gamma<\delta(\ell^{\prime})\cdot C_{1}$ for each $\ell^{\prime}\in\mathrm{supp}(\delta)$ , we have the following $\mathbb{P}$ -a.s. for any $t\in\mathbb{N}$ : if $\tau_{t}(\omega)=\tau$ , then

\displaystyle\mathbb{P}[\mathbf{X}_{t+1}[k]<0\mid\mathcal{F}_{t}](\omega)<\gamma\Rightarrow\forall\ell^{\prime}\in\mathrm{supp}(\delta).u(\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}[k]<0)\leq C_{1}.

(21)

Now, for any $t\in\mathbb{N}$ , let $\omega=s_{0}s_{1}\ldots$ be given, let $s_{t}=(\ell,x_{i},\boldsymbol{x}_{-i})$ , and suppose $\tau_{t}(\omega)=\tau$ and $\lnot\varphi_{t,k}(\omega)$ holds (observe $\mathbf{X}_{t+1}[k]<0\Rightarrow\tilde{\mathbf{X}}_{t+1}[k]=-\varepsilon$ always holds, and thus $\lnot\varphi_{t,k}(\omega)$ implies $\mathbb{P}[\mathbf{X}_{t+1}[k]<0\mid\mathcal{F}_{t}](\omega)<\gamma$ ). Then we have

$\displaystyle\mathbb{E}[\min\{0,\mathbf{X}_{t+1}[k]\}\mid\mathcal{F}_{t}](\omega)$	$\displaystyle=\sum_{\ell^{\prime}\in L}\delta(\ell^{\prime})\cdot\int_{\mathbb{R}}\min\{0,\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}[k]\}du$	(Prop. A)
	$\displaystyle\geq\sum_{\ell^{\prime}\in L}\delta(\ell^{\prime})\cdot-C_{2}\cdot u(\boldsymbol{\eta}_{\ell^{\prime},\boldsymbol{x}_{-i}}[k]<0)$	(conditions (20) and (21))
	$\displaystyle=-C_{2}\cdot\mathbb{P}[\mathbf{X}_{t+1}[k]<0\mid\mathcal{F}_{t}](\omega).\quad\mbox{($\mathbb{P}$-a.s.)}$	(Prop. A)

Now we let $\varepsilon=C_{2}$ and $\gamma\in(0,1)$ be a number that satisfies $\gamma<\delta(\ell^{\prime})\cdot C_{1}$ for each $(\ell,\delta)\in\Delta$ and $\ell^{\prime}\in\mathrm{supp}(\delta)$ (such a $\gamma$ can be taken from $(0,1)$ as $\Delta$ and $L$ are finite). The argument above shows that the following holds $\mathbb{P}$ -a.s.;

	$\displaystyle\mathbf{X}_{t}[k](\omega)\geq 0\land k\leq\mathsf{Lv}_{t}(\omega)$	$\displaystyle\land\varphi_{t,k}(\omega)\land\tau_{t}(\omega)\not\in\Delta_{d}$
		$\displaystyle\Rightarrow-\mathbb{E}[\min\{0,\mathbf{X}_{t+1}[k]\}\mid\mathcal{F}_{t}](\omega)\leq\varepsilon\cdot\mathbb{P}[\mathbf{X}_{t+1}[k]<0\mid\mathcal{F}_{t}](\omega).$

This holds $\mathbb{P}$ -a.s. for any $\tau\in\Delta_{p}$ ; it can be shown via the similar argument that this also holds $\mathbb{P}$ -a.s. for each $\tau\in\Delta_{n}$ . Hence (19) holds $\mathbb{P}$ -a.s. ∎

Having Thm. 5 proved, soundness of LLexRSM maps now easily follows.

Proof of Thm. 5. By Thm. 5, for any $\Delta$ -deterministic scheduler $\sigma$ and an initial state $s_{I}$ , the instance $\mathcal{I}$ induced by $\boldsymbol{\eta}$ and $\mathsf{Lv}$ under $\sigma$ and $s_{I}$ is $(\varepsilon,\gamma)$ -fixable for some $\varepsilon>0$ and $\gamma\in(0,1)$ . By Cor. 4, this proves that $\mathcal{C}$ is AST under $\sigma$ and $s_{I}$ . As $\mathcal{C}$ is AST whenever it is AST for each $\Delta$ -deterministic $\sigma$ and an initial state $s_{I}$ [ChatterjeeGNZZ21arxiv, Prop. 1], the claim follows. ∎

Appendix D Omitted details of Section 6

Proof of Thm. 6. Over a non-probabilistic CFG, the ranking condition of $\boldsymbol{\eta}:\mathcal{S}\to\mathbb{R}^{n}$ implies

\forall\tau\neq\tau_{\mathrm{out}}.\forall s\in\llbracket I\land G(\tau)\rrbracket.\forall k\in\{1,\ldots,\mathsf{Lv}(\tau)\}.\forall s^{\prime}\in\mathrm{succ}_{\tau}(s).\bigl{[}\boldsymbol{\eta}[k](s^{\prime})\leq\boldsymbol{\eta}[k](s)-\mathbf{1}_{k=\mathsf{Lv}(\tau)}\bigr{]}.

This condition clearly implies the pointwise unafecting condition (5) at every $k\in\{1,\ldots,n\}$ , and hence, $\boldsymbol{\eta}$ satisfies MCLC. ∎

Line-by-line explanation of the synthesis algorithm. The pseudocode of our algorithm is given in Alg. 1. whose summary is as follows. Similar to existing LexRSM synthesis algorithms [ChatterjeeGNZZ21, AgrawalCP18], it constructs a LexRSM $(\eta_{1},\cdots,\eta_{d})$ in an iterative way. At the $k$ -th iteration, the algorithm attempts to construct $\eta_{k}$ that ranks transitions in $U\subseteq\Delta$ , i.e, those which are not ranked by $\eta_{1},\ldots,\eta_{k-1}$ (Line 1-1). It first tries to construct such $\eta_{k}$ under the non-negativity condition (4). This is done by solving the LP problem $\mathcal{L}\mathcal{P}_{U}^{1}$ (Line 1), which looks for a 1-dimensional MM $\eta$ such that

(a)

for every $\tau\in U$ and $s\in\llbracket I\land G(\tau)\rrbracket$ , we have $\overline{\mathbb{X}}_{\tau}\eta(s)\leq\eta(s)$ and $\eta(s)\geq 0$ ; and
(b)

for as many $\tau\in U$ as possible, we have $s\in\llbracket I\land G(\tau)\rrbracket\Rightarrow\overline{\mathbb{X}}_{\tau}\eta(s)\leq\eta(s)-1$ (i.e., $\eta$ ranks $\tau$ ).

The LP problem $\mathcal{L}\mathcal{P}_{U}^{1}$ is obtained by the reduction of conditions (a-b) via Farkas’ lemma. If the solution $\eta$ of $\mathcal{L}\mathcal{P}_{U}^{1}$ ranks at least one transition in $U$ , then the algorithm lets $\eta_{k}=\eta$ , adds it to the output, and eliminates ranked transitions from $U$ (line 1-1); otherwise, it tries to construct $\eta_{k}$ under the pointwise unaffecting condition (5). This is done by solving $\mathcal{L}\mathcal{P}_{U,\mathcal{T}}^{2}$ for each $\mathcal{T}\in\mbox{Class}(U)$ (line 1), where $\mbox{Class}(U)\subseteq 2^{U}\setminus\{\emptyset\}$ is a user-defined parameter; there, the LP problem $\mathcal{L}\mathcal{P}_{U,\mathcal{T}}^{2}$ looks for $\eta$ that exactly ranks transitions in $\mathcal{T}$ , that is,

(a’)

for every $\tau\in\mathcal{T}$ and $s\in\llbracket I\land G(\tau)\rrbracket$ , we have $\overline{\mathbb{X}}_{\tau}\eta(s)\leq\eta(s)-1$ and $\eta(s)\geq 0$ ; and
(b’)

for every $\tau\in\Delta\setminus\mathcal{T}$ , $s\in\llbracket I\land G(\tau)\rrbracket$ and $s^{\prime}\in\mathrm{succ}_{\tau}(s)$ , we have $\eta(s^{\prime})\leq\eta(s)$ .

Once $\mathcal{L}\mathcal{P}_{U,\mathcal{T}}^{2}$ is solved for any $\mathcal{T}\in\mbox{Class}(U)$ , the algorithm does a similar update as line 1-1 and breaks (line 1-1); if it fails to solve $\mathcal{L}\mathcal{P}_{U,\mathcal{T}}^{2}$ for every $\mathcal{T}$ , then it concludes a failure and terminates (line 1-1). If $\eta_{k}$ is computed (line 1 or 1), the algorithm goes to the next iteration after updating $U$ ; the iteration continues until $U$ is empty.

1 Input: A pCFG

\mathcal{C}

with an invariant

I

;

2 Initialize

U\leftarrow

all generalized transitions of pCFG

\mathcal{C}

;

d\leftarrow 0

;

3 while $U$ is not empty do

d\leftarrow d+1

;

ranked\leftarrow False

;

5 Construct and solve

\mathcal{LP}^{1}_{U}

;

6 if No solution to $\mathcal{LP}^{1}_{U}$ then

7 for each $\mathcal{T}\in\mbox{Class}(U)$ do

8 Construct and solve

\mathcal{LP}^{2}_{U,\mathcal{T}}

;

9 if Exist solution to $\mathcal{LP}^{2}_{U,\mathcal{T}}$ then

\eta_{d}\leftarrow

RF from Solution of

\mathcal{LP}^{2}_{U,\mathcal{T}}

;

ranked\leftarrow True

;

U\leftarrow U\backslash\mathcal{T}

;

12 break;

14 if not $ranked$ then

15 Return FALSE;

17 else

\eta_{d}\leftarrow

ranking function from Solution of

\mathcal{LP}^{1}_{U}

;

U\leftarrow U\backslash\{\tau\mid\eta_{d}\mbox{ ranks $\tau$}\}

;

Return

(\eta_{1},\eta_{2},\cdots,\eta_{d})

Algorithm 1 synthesis algorithm of linear SC-LexRSM map with MCLC.

{mytheorem}

For any $\mbox{\rm Class}(U)$ , the algorithm returns a linear SC-LexRSM map with MCLC for a linear, well-behaved pCFG $\mathcal{C}$ whenever it reports a success; it also decides if $\mathcal{C}$ admits a linear LW-LexRSM map, and whenever $\mathcal{C}$ does, the output $\boldsymbol{\eta}$ has the minimal dimension among those. When $\mbox{\rm Class}(U)=2^{U}\setminus\{\emptyset\}$ , the algorithm decides in NP if $\mathcal{C}$ admits a linear SC-LexRSM with MCLC. ∎

Proof.

Fix a linear, well-behaved pCFG $\mathcal{C}$ . Suppose $\mathcal{C}$ admits a linear SC-LexRSM $\boldsymbol{\eta}=(\eta_{1},\ldots,\eta_{n})$ with MCLC, and Alg. 1 has generated $(\hat{\eta}_{1},\ldots,\hat{\eta}_{m})$ (which possibly does not rank all transitions) and terminated. Let $U_{k}\subseteq\Delta$ and $\hat{U}_{k}\subseteq\Delta$ be the set of transitions unranked by $(\eta_{1},\ldots,\eta_{k})$ and $(\hat{\eta}_{1},\ldots,\hat{\eta}_{k})$ , respectively. We prove that, for each $k\in\{0,\ldots,n\}$ , there exists $\hat{k}\in\{0,\ldots,m\}$ such that $\hat{U}_{\hat{k}}\subseteq U_{k}$ if either of the following holds;

1.

The LexRSM $\boldsymbol{\eta}$ satisfies leftward non-negativity, or
2.

$2^{U}\setminus\{\emptyset\}$ .

We also show that $k\leq\hat{k}$ additionally holds for each $k$ in Case 1. The base case is true because $U_{0}=\hat{U}_{0}$ . For the step case, if we have $\hat{U}_{\hat{k}}\subseteq U_{k+1}$ then we are done. If not,

•

in Case 1, it must be the case that $\hat{U}_{\hat{k}+1}$ exists and $\hat{U}_{\hat{k}+1}\subseteq U_{k+1}$ ; indeed, $\mathcal{LP}^{1}_{U_{k}}$ is no easier than $\mathcal{LP}^{1}_{\hat{U}_{\hat{k}}}$ , so Alg. 1 should find a solution of $\mathcal{LP}^{1}_{\hat{U}_{\hat{k}}}$ whenever $U_{k+1}$ exists. Then by a similar analysis as [ChatterjeeGNZZ21arxiv], $\hat{\eta}_{\hat{k}+1}+\eta_{k+1}$ must be also in the solution space of $\mathcal{LP}^{1}_{\hat{U}_{\hat{k}}}$ , and this would rank strictly more transitions than $\hat{\eta}_{\hat{k}+1}$ if $\hat{U}_{\hat{k}+1}\subseteq U_{k+1}$ does not hold. This contradicts the maximality property of $\mathcal{LP}^{1}_{U}$ . Hence the claim follows.
•

in Case 2, if $\eta_{k+1}$ satisfies the condition by Definition 6.(4), then we have the same argument as Case 1. Suppose $\eta_{k+1}$ satisfies the condition by Definition 6.(5), and let $\mathcal{T}$ be the set of transitions ranked by $\eta_{k+1}$ . Then for any $U\subseteq U_{k}$ , the function $\eta_{k+1}$ must be in the solution space of $\mathcal{LP}^{2}_{U,U\cap\mathcal{T}}$ . Because Alg. 1 does the brute-force search, this means that Alg. 1 never returns FALSE before ranking every transition in $\mathcal{T}$ . As $\Delta$ is finite, we eventually observe $\hat{U}_{l}\subseteq U_{k+1}$ for some $l$ .

Because $\boldsymbol{\eta}$ is an SC-LexRSM, We have $U_{n}=\emptyset$ ; and we proved there is an $l$ such that $\hat{U}_{l}\subseteq U_{n}$ when either Case 1 or Case 2 is true, and we also have $l\leq n$ in Case 1. Hence the theorem is proved. ∎

Appendix E Full Experiment Result

Full experiment result. Experiments are performed on Ryzen7 6800H, 16GB RAM machine, WSL-Ubuntu 20.04 platform.

Table 2: Full Experiment Result. Ticks in “p.l.” and “p.a.” indicate the benchmark has a probabilistic loop and assignment, respectively. In the “dim.” column, a number indicates that the algorithm found a LexRSM with that dimension; a cross indicates a failure;

\times

* means the computation is aborted by our experiment platform due to out of memory; “N/A” means we did not run the experiment. The “time” column shows the computation time in seconds.

Benchmark Spec.

Synthesis result

Baselines

Our Algs

STR

LWN

SMC

EMC

Model

p.l.

p.a.

dim.

time

dim.

time

dim.

time

dim.

time

aaron2

\times

\times

0.12

0.06

0.05

0.06

aaron2

\surd

\times

0.05

0.06

aaron2

\surd

\surd

0.05

0.06

alain

\times

\times

0.08

0.07

alain

\surd

\times

0.09

0.11

0.10

alain

\surd

\surd

0.16

0.15

0.17

\times

\times

0.08

0.07

0.08

\surd

\times

0.07

0.08

0.07

0.08

\surd

\surd

0.08

0.09

catmouse

\times

\times

0.06

0.05

catmouse

\surd

\times

0.05

0.06

0.05

0.06

catmouse

\surd

\surd

0.05

0.06

complex

\times

\times

\times

0.08

\times

0.06

0.16

254.03

complex

\surd

\times

\times

0.06

\times

0.07

0.20

4300.40

complex

\surd

\surd

\times

0.07

\times

0.07

0.09

281.59

counterex1a

\times

\times

\times

0.07

\times

0.06

\times

0.20

\times

18.38

counterex1b

\times

\times

0.07

0.06

0.07

counterex1b

\surd

\times

0.07

0.08

counterex1b

\surd

\surd

0.08

0.07

0.08

counterex1c

\times

\times

\times

0.08

\times

0.07

\times

0.14

\times

3.96

counterex1c

\surd

\times

\times

0.07

\times

0.08

\times

0.17

\times

17.07

counterex1c

\surd

\surd

\times

0.08

\times

0.08

\times

0.17

\times

17.43

cousot9

\times

\times

\times

0.06

0.07

cousot9

\surd

\times

\times

0.06

\times

0.07

0.10

0.17

easy1

\times

\times

0.05

0.04

0.05

0.04

easy1

\surd

\times

0.04

0.05

0.04

0.05

easy1

\surd

\surd

0.05

easy2

\times

\times

0.05

easy2

\surd

\times

0.06

0.05

0.06

easy2

\surd

\surd

0.06

exmini

\times

\times

0.05

0.06

exmini

\surd

\times

0.06

0.05

0.06

exmini

\surd

\surd

0.06

insertsort

\times

\times

0.06

0.07

insertsort

\surd

\times

0.07

0.08

insertsort

\surd

\surd

0.07

loops

\times

\times

\times

0.06

\times

0.05

0.09

3.99

ndecr

\times

\times

0.05

ndecr

\surd

\times

0.05

0.06

ndecr

\surd

\surd

0.05

0.06

nestedLoop

\times

\times

\times

0.07

\times

0.07

\times

0.11

\times

263.17

nestedLoop

\surd

\times

0.08

0.09

nestedLoop

\surd

\surd

\times

0.15

\times

0.15

0.18

535.30

perfect

\times

\times

0.06

0.07

0.08

perfect

\surd

\times

0.07

0.08

perfect

\surd

\surd

0.08

perfect1

\times

\times

0.06

0.07

perfect1

\surd

\times

0.07

0.08

perfect1

\surd

\surd

0.08

perfect2

\times

\times

0.06

0.07

perfect2

\surd

\times

0.07

0.08

perfect2

\surd

\surd

\times

0.08

\times

0.07

\times

0.11

\times

0.20

random1d

\times

\times

0.06

0.05

0.06

random1d

\surd

\times

0.05

0.06

random1d

\surd

\surd

0.06

0.05

0.06

0.07

real2

\times

\times

\times

0.06

\times

0.05

\times

0.15

\times

66.48

realbubble

\times

\times

0.08

0.07

0.11

realbubble

\surd

\times

0.07

0.08

realbubble

\surd

\surd

0.09

0.08

0.09

realheapsort

\times

\times

\times

0.09

0.08

0.09

0.10

realheapsort

_step1

\times

\times

\times

0.07

0.08

realheapsort

_step1

\surd

\surd

\times

0.07

0.08

realheapsort

_step2

\times

\times

0.06

0.05

0.06

realselect

\times

\times

0.07

0.08

realselect

\surd

\times

0.07

0.08

realselect

\surd

\surd

0.09

0.08

0.09

0.10

realshellsort

\times

\times

0.05

0.04

0.05

realshellsort

\surd

\times

0.05

0.04

0.05

realshellsort

\surd

\surd

\times

0.06

0.07

rsd

\times

\times

\times

0.06

\times

0.05

\times

0.14

\times

2.66

rsd

\surd

\times

\times

0.06

\times

0.06

\times

0.16

\times

10.57

rsd

\surd

\surd

0.07

serpent

\times

\times

\times

0.06

\times

0.06

0.08

30.59

serpent

\surd

\times

0.06

0.07

serpent

\surd

\surd

0.08

0.09

sipma91

\times

\times

0.07

0.08

sipma91

\surd

\times

0.06

sipma91

\surd

\surd

0.08

0.09

0.08

sipmabubble

\times

\times

0.07

0.08

sipmabubble

\surd

\times

0.08

0.07

0.08

sipmabubble

\surd

\surd

0.08

0.07

0.08

sipma

mergesort

\times

\times

\times

213.99

\times

185.24

\times

183.39

\times

\times

speedDis1

\times

\times

\times

0.06

\times

0.06

0.09

0.19

speedDis2

\times

\times

\times

0.06

\times

0.06

0.09

0.10

speedFails1

\times

\times

0.05

0.06

speedFails1

\surd

\times

0.06

0.05

0.06

speedFails1

\surd

\surd

0.05

0.06

speedFails2

\times

\times

\times

0.05

\times

0.05

\times

0.07

\times

0.07

speedFails2

\surd

\times

\times

0.06

\times

0.05

\times

0.08

\times

0.17

speedFails2

\surd

\surd

\times

0.05

\times

0.05

\times

0.08

\times

0.18

speedFails4

\times

\times

\times

0.05

\times

0.05

\times

0.09

\times

0.30

speedNested

Multiple

\times

\times

0.06

0.07

speedNested

Multiple

\surd

\times

0.08

0.06

0.07

speedNested

Multiple

\surd

\surd

0.07

0.06

0.07

speedNested

MultipleDep

\times

\times

0.06

0.07

speedNested

MultipleDep

\surd

\times

0.09

0.06

0.07

speedNested

MultipleDep

\surd

\surd

0.08

speedSimple

Multiple

\times

\times

\times

0.06

\times

0.06

0.08

0.10

speedSimple

MultipleDep

\times

\times

\times

0.06

\times

0.06

0.09

0.10

speedSingle

Single

\times

\times

0.05

0.06

speedSingle

Single

\surd

\times

0.10

0.06

0.05

0.06

speedSingle

Single

\surd

\surd

0.06

speedSingle

Single2

\times

\times

0.06

speedSingle

Single2

\surd

\times

0.08

0.06

0.07

speedSingle

Single2

\surd

\surd

\times

0.08

\times

0.08

0.17

1.79

speedpldi2

\times

\times

0.06

0.05

0.06

speedpldi2

\surd

\times

0.08

0.05

0.06

speedpldi2

\surd

\surd

0.07

0.06

0.07

speedpldi3

\times

\times

\times

0.07

0.06

0.07

speedpldi3

\surd

\times

\times

0.08

\times

0.06

0.10

0.18

speedpldi3

\surd

\surd

0.08

0.09

speedpldi4

\times

\times

0.06

0.05

0.06

speedpldi4

\surd

\times

0.09

0.05

0.06

speedpldi4

\surd

\surd

0.06

0.07

0.06

terminate

\times

\times

0.05

0.06

terminate

\surd

\times

0.08

0.05

0.06

terminate

\surd

\surd

0.06

unperfect

\times

\times

0.07

0.08

0.07

unperfect

\surd

\times

0.10

0.08

unperfect

\surd

\surd

\times

0.09

\times

0.10

\times

0.34

\times

473.42

wcet0

\times

\times

0.06

0.08

0.06

wcet0

\surd

\times

0.09

0.07

0.06

0.07

wcet0

\surd

\surd

0.08

0.10

wcet1

\times

\times

0.06

0.07

0.06

wcet1

\surd

\times

0.11

0.07

0.06

0.07

wcet1

\surd

\surd

0.08

0.09

0.07

0.08

wcet2

\times

\times

0.05

0.06

wcet2

\surd

\times

0.09

0.06

wcet2

\surd

\surd

0.06

0.07

0.06

while2

\times

\times

0.06

0.07

while2

\surd

\times

0.11

0.06

0.07

while2

\surd

\surd

0.07

0.08

wise

\times

\times

\times

0.05

\times

0.05

\times

0.09

\times

0.29

counterexStr1

\times

\surd

N/A

0.08

0.06

0.18

counterexStr2

\times

\surd

\times

0.06

\times

0.10

0.13

0.36

Lexicographic Ranking Supermartingales with Lazy Lower Bounds

Abstract

1 Introduction

2 Key Observations with Examples

3 Preliminaries

3.1 Syntax and Semantics of Probabilistic Programs

3.2 Lexicographic Ranking Supermartingales

4 Fixable LexRSMs

5 Lazy LexRSM and Its Soundness

6 Automated Synthesis Algorithm of LexRSM

7 Experiments

8 Related Work

9 Conclusion

Acknowledgment

References

Appendix

Appendix A Omitted Details of Section 3

Proof.

Appendix B Omitted details of Section 4

Proof.

Appendix C Omitted details of Section 5

C.1 Proof of Thm. 5

C.1.1 Preparation 1: carving out the LLexRSM conditions from fixability

Proof.

C.1.2 Preparation 2: Well-behaved distributions and its properties

C.1.3 Preparation 3: LLexRSM map induces LLexRSM

Proof.

Proof.

C.1.4 The main proof

Appendix D Omitted details of Section 6

Proof.

Appendix E Full Experiment Result

Lexicographic Ranking Supermartingales with
Lazy Lower Bounds