Utrecht University, Netherlands and https://www.uu.nl/staff/IvanderGiessen/i.vandergiessen@uu.nl Utrecht University, Netherlands and https://www.uu.nl/medewerkers/RJalaliKeshavarzr.jalalikeshavarz@uu.nl TU Wien, Austria and https://sites.google.com/site/kuznets/roman@logic.at \Copyright2021 Iris van der Giessen, Raheleh Jalali, and Roman Kuznets \ccsdesc[500]Theory of Computation—Logic—Proof theory \ccsdesc[500]Theory of Computation—Logic—Modal and temporal logics \fundingIris van der Giessen and Raheleh Jalali acknowledge the support of the Netherlands Organization for Scientific Research under grant 639.073.807. Roman Kuznets is funded by the Austrian Science Fund (FWF) ByzDEL project P 33600. \hideLIPIcs
Uniform interpolation via nested sequents and hypersequents
Abstract
A modular proof-theoretic framework was recently developed to prove Craig interpolation for normal modal logics based on generalizations of sequent calculi (e.g., nested sequents, hypersequents, and labelled sequents). In this paper, we turn to uniform interpolation, which is stronger than Craig interpolation. We develop a constructive method for proving uniform interpolation via nested sequents and apply it to reprove the uniform interpolation property for normal modal logics , , and . We then use the know-how developed for nested sequents to apply the same method to hypersequents and obtain the first direct proof of uniform interpolation for via a cut-free sequent-like calculus. While our method is proof-theoretic, the definition of uniform interpolation for nested sequents and hypersequents also uses semantic notions, including bisimulation modulo an atomic proposition.
keywords:
uniform interpolation, nested sequents, hypersequents, modal logic1 Introduction
Uniform interpolation is stronger than Craig interpolation and provides a simulation of quantifiers in a logic. Similar to Craig interpolation, uniform interpolation is useful in computer science, for example, in quantifier elimination procedures [11] or in knowledge representation to perform tasks such as forgetting irrelevant information in descriptive logics [16]. This shows the practical value of uniform interpolation. The goal of this paper is to expand the reach of proof-theoretic method of proving uniform interpolation.
A propositional (modal) logic admits the Craig interpolation property (CIP) if for any formulas and such that , there is an interpolant containing only atomic propositions that occur in both and such that and . One could say that the purpose of the interpolant is to state the reason follows from by using the common language of the two. Logic has the uniform interpolation property (UIP) if for each formula and each atomic proposition there are uniform interpolants and containing only atomic propositions occurring in except for such that for all formulas not containing :
It is well known that this property is stronger than Craig interpolation. Indeed, by computing uniform interpolants consecutively, it is possible to remove a given set of atomic propositions and construct a formula that would uniformly serve as a Craig interpolant for a fixed and all with a given common language.
Analytic sequent calculi can be used to prove the CIP constructively. For the UIP, terminating cut-free sequent calculi play a similar role. Whereas for the CIP the syntactic proofs are often straightforward, the case of the UIP is much more complicated. Pitts provided a first syntactic proof of this kind, establishing the UIP for [24]. Bílková successfully adjusted the method to (re)prove the UIP for several modal logics including , and [2]. Iemhoff provided a modular method for (intuitionistic) modal logics and intermediate logics with sequent calculi consisting of so-called focused rules, among others establishing the UIP for [14, 15].
There are also algebraic and model-theoretic methods. The UIP for and is due to Shavrukov [28] and Ghilardi [12] respectively. Interestingly, modal logics and do not enjoy the UIP [2, 13] despite enjoying the CIP. Visser provided purely semantic proofs for , , and based on bounded bisimulation up to atomic propositions [31]. This method was later applied to prove the stronger Lyndon UIP for a wide range of modal logics [17]. The semantic interpretation of uniform interpolation is called bisimulation quantifiers, see [7] for an extended explanation. Bisimulations will also play a role in the current paper.
The proof-theoretic approach has two advantages. First, it enables one to find interpolants constructively rather than merely prove their existence.111More precisely, the method enables one to find interpolants efficiently rather than by an exhaustive search of all formulas, the search that terminates due to the proven existence of an interpolant. Second, it can turn uniform interpolation into a powerful tool in the study of existence of proof systems. Negative results are obtained in [14, 15] stating that logics without the UIP cannot have certain natural sequent calculi. As a consequence, and do not possess such proof systems. Similar negative results are obtained for modal and substructural logics in [29] and [30] using the CIP and UIP. These methods are based on calculi with regular sequents.
The goal of this paper is to extend the same line of research to multisequent formalisms such as hypersequents and nested sequents222Nested sequents are also known as tree-hypersequents [25] or deep sequents [5] in the literature.. Such forms of sequent calculi have recently been adapted to prove the CIP of modal logics via nested sequents [10] and hypersequents [18]. A modular proof-theoretic framework encompassing these and also labelled sequents was provided in [19]. The same ideas were extended to multisequent calculi for intermediate logics [21]. The method combines syntactic and semantic reasoning. Generalized Craig interpolants are constructed using the calculus in a purely syntactic manner, but the method’s correctness uses semantic notions from Kripke models of the underlying logic.
This paper extends this method providing proof-theoretic proofs for the UIP based on nested sequents for , , and and on hypersequents for . The UIP for these logics has been known, but we provide a new method that can hopefully be extended to other logics. Similar to [19], we combine syntactic and semantic reasoning. We use the terminating calculi to define the uniform interpolants and then provide model modifications and use bisimulations to prove the correctness of these interpolants.
Bílková [3] also provided a syntactic method for uniform interpolation for based on nested sequents. She presented proofs based on two nested calculi for : one with a standard modality and another that is based on a different modal language with a cover modality . Bílková’s method for nested sequents is closely related to her work based on regular sequents in [2]. The main difference with our method is that we exploit the treelike structure of nested sequents reflecting the treelike models for by incorporating semantic arguments while the algorithm for the interpolant computation remains fully syntactic. We intend our method to form a good basis for generalizing to other logics with multisequent calculi.
The paper is organized as follows. In Sect. 2, we introduce the nested sequent calculi for , , and , as well as model modifications invariant under bisimulation. In Sect. 3, we present our method to prove uniform interpolation for , , and . In Sect. 4, we show how the method can be adjusted to hypersequents for . Section 5 concludes the paper and maps the immediate next steps.
2 Preliminaries
Definition 2.1.
Modal formulas in negation normal form are defined by the following grammar where and are Boolean constants, is an atomic proposition from a countable set Prop, and is the negation of for each . The set Lit of literals consists of all atomic propositions and their negations, with used to denote its elements. Literals and Boolean constants are atomic formulas.
We define (or ) recursively as usual using De Morgan’s laws to push the negation inwards. .
Definition 2.2.
Nested sequents are recursively defined in the following form:
is a nested sequent where are modal formulas for and are nested sequents for . We call brackets a structural box. The formula interpretation of a nested sequent is defined recursively by
One way of looking at a nested sequent is to consider a tree of ordinary (one-sided) sequents, i.e., of multisets of formulas. Each structural box in the nested sequent creates a child in the tree. In order to be able to reason about formulas in a particular tree node, we introduce labels. A label is a finite sequence of natural numbers. We denote labels by ; a label (or simply ) denotes the label extended by the natural number .
Definition 2.3 (Labeling).
For a nested sequent and label we define a labeling function to recursively label structural boxes in nested sequents as follows:
Let be the set of labels occurring in plus label (for formulas outside all structural boxes). Define the labeled nested sequent , and let .333Labeled nested sequents are closely related to labelled sequents from [23] but retain the nested notation.
Formulas in a nested sequent are labeled according to the labeling of the structural boxes containing them. We write iff the formula occurs in outside all structural boxes. Otherwise, whenever occurs in within a structural box labeled .
The set can be considered as the set of nodes of the corresponding tree of , with being the root of this tree. Often, we do not distinguish between a nested sequent and its labeled sequent . For example, we say that if .
Example 2.4.
Consider a nested sequent . The corresponding labeled nested sequent is with . The corresponding tree is pictured as follows, where each node is labeled on the left and marked by its formulas on the right (in particular, here and , but ):
Following [5], we will work with contexts in rules to signify that the rules can be applied in an arbitrary node of the nested sequent. We will work with unary contexts which are nested sequents with exactly one hole, denoted by the symbol . Such contexts are denoted by . The insertion of a nested sequent into a context is obtained by replacing the occurrence with . The hole can be labeled the same way as formulas. We write to denote the label of the hole.
Example 2.5.
is a context. Its labeled context is . Let . Then equals from Example 2.4.
Definition 2.6 (Variables).
Whether is a formula, or a sequence/set/multiset of formulas, or a nested sequent/context, or some other formula-based object, we denote by the set of atomic propositions occurring in (note that may also occur in the form of ).
In this paper we use nested sequent calculi for classical modal logics , , and from [5]. Recall that consists of all classical tautologies, the -axiom and is closed under modus ponens (from and , infer ) and necessitation (from , infer ). Further, and . We now introduce nested sequent calculi and then Kripke semantics for these logics.
The terminating nested sequent calculus NK for the modal logic K consists of all rules in the first two rows in Fig. 1 plus the rule k. This calculus is an extension of the multiset-based version from [5] to the language with Boolean constants and , necessitating an addition of the rule for handling these (cf. also the treatment of Boolean constants in [10]). The nested calculus ND (NT) for the modal logic D (T) is obtained by adding to NK the rule (). As shown in [5], the nested sequent calculi , , and are sound and complete for modal logics , , and respectively, i.e., a nested sequent is derivable in (, ) if and only if its formula interpretation is a theorem of (, ).
Definition 2.7 (Saturation).
Consider a sequent , i.e., . The formula is -saturated in if the following conditions hold depending on the form of :
-
•
is an atomic formula;
-
•
if , then both and ;
-
•
if , then either or ;
-
•
if , then there is a label such that .
The formula of the form is
-
•
-saturated in w.r.t. if ;
-
•
-saturated in if there is a label ;
-
•
-saturated in if .
A nested sequent is -saturated if it is neither of the form for some atomic proposition nor of the form ; all its formulas are -saturated w.r.t. every child of ; and all its other formulas are -saturated in . A nested sequent is -saturated -saturated if it is -saturated and all its formulas are -saturated -saturated in .
Example 2.8.
The sequent is -saturated but neither -saturated nor -saturated. Indeed, for the logic we would need to be present for some and for the logic we would need to have in order to saturate .
The rules from Fig. 1 with embedded contraction are sometimes called Kleene’d rules. Following [5], in order to ensure finite proof search, we only apply a rule when the principal formula in the conclusion is not saturated w.r.t. this rule, i.e., , , and are not -saturated, in the rule k is not -saturated w.r.t. the label of the bracket containing , in the rule d is not -saturated, and in the rule t is not -saturated. Since for Kleene’d rules principal formulas are preserved in the premises, the number of applications of each of the rules , , and is bounded. The way to think of a saturated sequent is that in a bottom-up proof search when we reach a saturated sequent, it does not make sense to apply more rules as these would only lead to duplications.
Intuitively, nested sequents capture the tree structure of Kripke models for modal logics. We define truth for nested sequents in Kripke models and then recall relevant facts about bisimulations and introduce model modifications that we use in the proof of uniform interpolation.
Definition 2.10.
A Kripke model is a triple , where is a set of worlds or nodes, , and is a valuation function mapping each atomic proposition to a set of worlds from . If , we say that is accessible from , or that is a parent of , or that is a child of . We define by induction on the construction of as usual: and ; for , we have iff and iff ; we have iff and (or) ; finally, iff whenever and iff for some . A formula is valid in , denoted , when for all .
A model is a submodel of when , , and for each . A submodel generated by , denoted , is the smallest submodel of such that and whenever and .
We will use models based on finite intransitive directed trees, usually denoting the root . For , the accessibility relation is required to be reflexive, i.e., . For , the accessibility relation must be serial, i.e., . Note that such seriality implies reflexivity of the leaves of the tree. Finally, we assume to be irreflexive for . From now on we call these models -models, -models, and -models respectively.
Theorem 2.11.
If , then iff for each -model .
Following [19], we now extend the definitions of truth and validity to nested sequents.
Definition 2.12.
A (treelike) multiworld interpretation of a nested sequent into a model is a function from labels in to worlds of such that whenever . Then
is valid in , denoted , means that for all multiworld interpretations of into .
The following lemma, which can be easily proved by induction on the structure of , implies completeness for validity of nested sequents.
Lemma 2.13.
For a nested sequent and a model , we have iff
Proof 2.14.
By induction on the structure of , we prove that implies for one direction and that implies for some such that for the other direction. Let be of the form .
First suppose . Then for all we have , in particular, for all . In addition, we show that for all . To prove this, we define as follows: for each ; in particular, . It is easy to see that is a multiworld interpretation of into and that . Thus, by induction hypothesis, , i.e., . Since , it follows that . We conclude that .
Now suppose . For each , there is a world such that and . By induction hypothesis, there exists a multiworld interpretation of into such that and . Define as follows: and . We immediately have .
We now define bisimulations modulo an atomic proposition , similar to the ones from [7, 31], where uniform interpolation is studied on the basis of bisimulation quantifiers. While those papers focus on purely semantic methods, we embed the semantic tool of bisimulation into our constructive proof-theoretic approach in Sect. 3. Our bisimulations behave largely like standard bisimulations except they do not have to preserve the truth of formulas with occurrences of .
Definition 2.15 (Bisimilarity).
A bisimulation up to an atomic proposition between models and is a non-empty binary relation such that the following conditions hold for all and with :
- atomsp.
-
iff for all ;
- forth.
-
if , then there exists such that and ; and
- back.
-
if , then there exists such that and .
When , we write . Further, we write for functions and with a common domain if there is a bisimulation up to between and such that for each .
The main property of bisimulations is truth preservation for modal formulas. The following theorem is proved the same way as [4, Theorem 2.20].
Theorem 2.16.
If , then for all formulas with , we have iff .
We are interested in manipulations of treelike models that preserve bisimulations up to , in particular, in duplicating a part of a model or replacing it with a bisimilar model.
Definition 2.17 (Model transformations).
Let be an intransitive tree (possibly with some reflexive worlds), be its subtree with root , and be another tree with root . A model is the result of replacing the subtree with in if
A model is the result of duplicating cloning in if another copy444Here , , , and . of is inserted alongside as a subtree of , i.e., if
Lemma 2.18.
In the setup from Def. 2.17, let be a bisimulation demonstrating that . Then, for obtained by replacing with in we have that for all and that whenever . Moreover, if both and are -models -models, -models, then so is .
For obtained by duplicating in , we have for all and, in addition, for all . If is a -model -model, -model not rooted at , so is .
The same holds for cloning if , except that cloning does not preserve -models.
Proof 2.19.
It is easy to see that one bisimulation witnesses all the stated bisimilarities in each case: for replacing or for duplicating and cloning. Both the tree structure and reflexivity of worlds are preserved by all operations. Leaves are preserved by replacement and duplication, whereas cloning turns a leaf into a non-leaf without removing its reflexivity as required in -models.
3 Uniform interpolation for nested sequents
In this section we prove the uniform interpolation theorem for , , and via their nested sequent calculi , and respectively. We define a new notion of uniform interpolation for nested sequents in Def. 3.15 that involves Kripke semantics. We then prove in Lemma 3.13 that this implies the standard definition of uniform interpolation.
Definition 3.1 (Uniform interpolation property).
A logic in a language containing an implication and Boolean constants and (primary or defined) has the uniform interpolation property, or UIP, if for every formula in the logic and atomic proposition , there exist formulas and such that
-
(i)
and ,
-
(ii)
and
-
(iii)
for each formula with :
For classical-based logics, the existence of left-interpolants ensures the existence of right-interpolants, and vice versa. Assuming is defined for each formula , we can define . Thus, from now on, we focus on .
We import some of the notation from [19] in order to formulate the uniform interpolation property for nested sequents.
Definition 3.2.
Multiformulas are defined by the grammar
where is a label and is a formula. We write for the set of labels occurring in .
Remark 3.3.
The symbol is pronounced ‘mho’, which is the reverse of ‘ohm’ the same way as is the reverse of , the symbol for ohm in physics.
Definition 3.4 (Suitability).
A multiworld interpretation of a sequent is suitable for a multiformula if , in which case we call it a multiworld interpretation of into .
Definition 3.5 (Truth for multiformulas).
Let be a multiworld interpretation of a multiformula into a model . We define recursively as follows:
enumerate iff , iff for both , iff for at least one .
Note that , meaning that is also a multiworld interpretation of each into .
We define the label-erasing function from multiformulas to formulas, as well as multiformula equivalence, and list some of the latter’s easily provable properties.
Definition 3.6.
The function from multiformulas to formulas is defined as follows:
Definition 3.7 (Multiformula equivalence).
Multiformulas and are equivalent, denoted , iff and for any multiworld interpretation of into a model .
Lemma 3.8 (Equivalence property).
For any multiformula , label , and formulas and ,
-
•
,
-
•
, and
-
•
.
Lemma 3.9 (Normal forms).
For each multiformula , there exists an equivalent multiformula in SDNF SCNF such that is a -disjunction -conjunction of -conjunctions -disjunctions of labeled formulas such that each disjunct conjunct contains exactly one occurrence of each label .
Proof 3.10.
Since and behave classically, one can employ the standard transformation into the DNF/CNF. In order to ensure one label per disjunct/conjunct rule, multiple labels can be combined using Lemma 3.8, whereas missing labels can be added in the form of ().
We now introduce the uniform interpolation property for nested sequents. Here, the uniform interpolants are multiformulas instead of formulas.
Definition 3.11 (NUIP).
Let a nested sequent calculus be sound and complete w.r.t. a logic . We say that NL has the nested-sequent uniform interpolation property, or NUIP, if for each nested sequent and atomic proposition there exists a multiformula , called a nested uniform interpolant, such that
-
(i)
and ;
-
(ii)
for each multiworld interpretation of into an -model
-
(iii)
for each nested sequent with and and for each multiworld interpretation of into an -model ,
for some multiworld interpretation of into some -model .
The condition on labels in (i) ensures that interpretations of are suitable for .
Remark 3.12.
Lemma 3.13.
If a nested calculus NL has the NUIP, then its logic has the UIP.
Proof 3.14.
To show the existence of , consider a nested uniform interpolant of the nested sequent , with . By Lemma 3.9, w.l.o.g. we can assume that . Let . We establish the UIP properties based on the corresponding NUIP properties.
For UIP(ii) we use a semantic argument. Assume towards a contradiction that , in which case by completeness for some -model and . Consider a multiworld interpretation of sequent into such that . Then but , in contradiction to NUIP(ii). Hence, as required.
Finally, for UIP(iii), let and suppose . Once again, by completeness, for some -model and . Consider the nested sequent , with , and a multiworld interpretation of sequent into with . Then and . By NUIP(iii), there must exist an -model and a multiworld interpretation of sequent into such that and . In other words, and . Thus, by soundness of , we have , thus completing the proof of UIP(iii).
Since we use bisimulations up to to find a model in the NUIP(iii) condition, we replace it with a (possibly) stronger condition (iii)′:
Definition 3.15 (BNUIP).
A nested sequent calculus has the bisimulation nested-sequent uniform interpolation property, or BNUIP, if, in addition to conditions NUIP(i)–(ii) from Def. 3.11,
- (iii)′
-
for each -model and multiworld interpretation of into , if , then there are an -model and multiworld interpretation of into such that and .
It easily follows from Theorem 2.16 that, like formulas, both nested sequents and multiformulas are invariant under bisimulations:
Lemma 3.16.
Let be a sequent multiformula not containing and let and be multiworld interpretations of into and respectively such that . Then iff iff .
Proof 3.17.
Proof 3.19.
Let be a nested sequent with and . Let and . By BNUIP(iii)′ we find an -model and from into such that and . By Lemma 3.16, we also conclude .
Corollary 3.20.
If a nested calculus NL has the BNUIP, then its logic has the UIP.
3.1 Uniform interpolation for
matches | equals |
---|---|
where is the smallest integer such that and the SCNF | |
of is , | |
In this section, we present our method of constructing nested uniform interpolants satisfying BNUIP for the calculus . It is based on Pitts’s method [24]. Interpolants are defined recursively on the basis of the terminating calculus from Fig. 1. If is not -saturated, is defined recursively in Table 1 based on the form of . For rows 2–5, we assume that the formula displayed in the left column is not -saturated in , whereas for in the last row we assume it not to be -saturated w.r.t. in .555Strictly speaking, this is a non-deterministic algorithm. Since the order does not affect our results, we do not specify it. However, it is more efficient to apply rows 1–2 of Table 1 first and row 5 last. Each row in the table corresponds to a rule in the proof search, where the left column in the table corresponds to the conclusion of a rule and the right column uses the premise(s) of the rule.
For -saturated , we define recursively as follows:
(1) |
where . Since here we apply to the multiformula with 1 being its only label, we have iff for such multiformulas . (As usual, we define the empty disjunction to be false, which in this format means .)
The construction of is well-defined (modulo a chosen order) because it terminates w.r.t. the following ordering on nested sequents. For a nested sequent , let be the number of its distinct diamond subformulas. Let be the ordering in which the rules of terminate (see Lemma 2.9). Consider the lexicographical ordering based on the pair . For each row in Table 1, stays the same but the recursive calls are for premise(s) lower w.r.t. ordering . The recursive call in (1) for -saturated sequents, on the other hand, decreases because the set of diamond subformulas of is strictly smaller than that of . When for a -saturated , the second disjunct of (1) is empty and, thus, no new recursive calls are generated.
Example 3.21.
We use Lemmas 3.8 and 3.9 as necessary.
-
1.
The algorithm for calls the calculation of , which in turn calls . The latter sequent is -saturated, and the algorithm returns , the first disjunct corresponding to the empty disjunction of literals other than and and the second one representing the absent diamond formulas. Computing its SCNF we get . Applying the transformation from the penultimate row of Table 1, we first get
and finally . It is easy to check that is indeed a bisimulation nested uniform interpolant of the nested sequent w.r.t. , and, accordingly, is a uniform interpolant of the formula .
-
2.
Consider the nested sequent . In the absence of boxes, the algorithm amounts to processing the -saturated sequents in the leaves of the proof-search tree
We have
Since formulas and can be simplified to and respectively, putting everything together yields , which is equivalent to since can never be true. Again, it is easy to verify that is a bisimulation nested uniform interpolant of w.r.t. . For instance, if is false at , then one can falsify the sequent by making true at and false everywhere else in the irreflexive intransitive finite treelike model.
Theorem 3.22.
The nested calculus has the BNUIP.
Proof 3.23.
It is easy to see that BNUIP(i) is satisfied. To prove BNUIP(ii), let be a nested sequent and be a multiworld interpretation of into a -model such that (by BNUIP(i), is suitable for ). We show by induction on the nested sequent ordering . Considering the construction of , we treat the cases of Table 1 first and deal with the case of -saturated last.
-
•
For rows 1–2 of Table 1, both and hold in all models, under all interpretations.
-
•
For row 3, if and , by induction hypothesis, we have . Then since either of or implies .
-
•
For row 4, if and , by induction hypothesis, and . Hence, .
-
•
For row 6, if and , by induction hypothesis, . Since implies , it follows that .
-
•
For row 5, let , and for some , and
(2) For any such that , define a multiworld interpetation of into . It follows from (2) that, for each , either for some or , meaning that . By induction hypothesis, whenever . Clearly, if . Otherwise, there exists a such that and . For this world implies , which yields because agrees with on all labels from .
-
•
Finally, let be -saturated and from (1). Clearly, if we have for some . Thus, it only remains to consider the case when for some . Then for some such that and, accordingly, for . By induction hypothesis (for smaller ), , and, hence, for some . Now follows from . This case concludes the proof for (ii).
It only remains to prove BNUIP(iii)′. Let be a multiworld interpretation of into a -model such that . We must find another multiworld interpretation into some -model such that and . We construct these and while simultaneously proving BNUIP(iii)′ by induction on the lexicographic order . Recall that -models (and their submodels) are irreflexive intransitive trees.
-
•
Let be -saturated and for from (1). We first briefly sketch the construction and the proof. The labeled literals from (1) are used to determine the requisite truth values of atomic propositions other than in the worlds from . With that in place, saturation conditions typically take care of the appropriate truth values for compound formulas, with the exception of diamond formulas. By contrast, truth values of are not (and cannot be) specified in . To refute , they must generally be adjusted on a world-by-world basis, which prompts the additional requirement that be injective666It must be injective as a function, i.e., implies . in order to avoid incompatible requirements on the truth value of in a world that originates from distinct nodes and . Finally, for to be false at a world , one must falsify in all children of , including those outside . This is achieved by replacing subtrees rooted in these “out-of-range” children with bisimilar models obtained by the induction hypothesis from the right disjunct of (1), as schematically depicted in Fig. 2. We now describe it in detail and prove that it falsifies .
-
(1)
First, we make the interpretation injective. It is easy to see (though tedious to describe in detail) that by a breadth-first recursion on nodes in , one can duplicate according to Def. 2.17 whenever for some to obtain a model and an injective multiworld interpretation of into it such that . Thus, whenever and by Lemma 3.16.
Figure 2: Main transformations for constructing model : circles represent worlds in . -
(2)
Then we deal with out-of-range children. A model is constructed from by applying the following -processing step for each node that contains at least one formula of the form (nodes can be chosen in any order). Start by setting and :
-
–
-processing step for : Since , it follows from the second disjunct in (1) that . Thus, for any child of in , and, accordingly, for the multiworld interpretation of sequent into the subtree of with root . By the induction hypothesis for a smaller , there exists a -model with root such that and . Let be the result of replacing each subtree for children of not in with in according to Def. 2.17. Note that all these subtrees are disjoint because the models are intransitive trees and, hence, these replacements do not interfere with one another. Note also that since is downward closed and the roots of the replaced subtrees are outside, no world from the range is modified. Thus, remains an injective interpretation into . Finally, it follows from Lemma 2.18 that . Hence, .
Let be the model obtained after replacements for all ’s are completed (again they do not interfere with each other). Then and, for each out-of-range child of in , the world is a child of in and . This accounts for all children of in .
-
–
-
(3)
It remains to adjust the truth values of . We define by modifying the valuation of as follows:
For , it immediately follows from the definition that
(3) (4) (the latter follows from the injectivity of and being -saturated). Moreover, since subtrees are disjoint from ,
(5)
After these three steps, we have a model that satisfies (3), (4), and (5). It remains to prove that by showing that for all , which is done by induction on the structure of . For this is trivial, while cannot occur in a -saturated sequent. For , this follows from (3) and (4). For any other literal , according to (1), because , which transfers to and by bisimilarity up to . For compound formulas other that diamonds, the statement follows by the saturation of . For instance, if , we get for some label by -saturation. By induction hypothesis, . Since , we conclude as required. Finally, let . To falsify at , we need to show that whenever . If for some label , saturation ensures that , hence, by induction hypothesis. The only other children of are , for which follows from (5). This completes the proof of BNUIP(iii)′ for -saturated sequents.
-
(1)
-
•
Now we treat all sequents that are not -saturated based on Table 1. , which cannot be false, thus, BNUIP(iii)′ for them is vacuously true.
-
•
For non-saturated , , and , the requisite statement easily follows by induction hypothesis. For instance, for the last of the three, one obtains such that . Since consists of some of these formulas in the same nodes, clearly it is also falsified by .
-
•
For the remaining case, assume , i.e.,
(6) where
(7) By (6), for some , we have and for all . The former means that for some such that . Therefore, a multiworld interpretation of into falsifies (7), and, by induction hypothesis, there is a multiworld interpretation into a -model such that and . For , it is easy to see that and because all formulas from are present in .
This concludes the proof of BNUIP(iii)′, as well as of BNUIP.
This implies the UIP for , first proved by Ghilardi [12].
Corollary 3.24.
Logic has the uniform interpolation property.
Remark 3.25.
Note that the structure of models as irreflexive intransitive trees was substantially used to ensure that the replacements applied to the original model do not interfere with each other. The fact that each world has at most one parent provided the modularity necessary to implement various requirements on the sequent-refuting model.
Example 3.26.
In Example 3.21 we saw that . We now use this example to demonstrate the importance of injectivity in BNUIP(iii)′. Indeed, suppose , i.e., has at least one child. Assume this is the only child, as in a model depicted on the left:
For a saturation of this sequent, we found an interpolant in SCNF: namely, . A multiworld interpretation mapping both and to the only child of yields the picture on the right. Clearly, the SCNF is false: . But, without forcing to be injective, it is impossible to make false at : whichever truth value has at , it makes one of the boxes true.
3.2 Uniform interpolation for and
matches | equals |
---|---|
in logic | |
in logic | where the SDNF of |
is |
The proof for can be adjusted to prove the same result for and .
Theorem 3.27.
The nested sequent calculi and have the BNUIP.
Proof 3.28.
We follow the structure of the proof of Theorem 3.22 for and only describe deviations from it. If is not -saturated (-saturated), then cases in Table 1 are appended with the bottom row (top row) of Table 2, which is applied only if is not -saturated (-saturated) in . For -/-saturated , we define by (1) as in the previous section. BNUIP(i) is clearly satisfied by either row in Table 2.
Let us first show BNUIP(ii) for . Although -models are reflexive, this does not affect the reasoning for either saturated sequents or non-saturated box formulas. The only new case is applying the top row of Table 2 to a non--saturated in . Assume for a -model . By induction hypothesis, . Since implies by reflexivity, the desired follows.
For BNUIP(iii)′ for -saturated sequents, we have to modify the construction in step (1) on p. 1 of an injective multiworld interpretation into a new -model out of the given into where . In the case of , the breadth-first order of injectifying the interpretations of sequent nodes could only yield one situation of being conflated with some already processed : namely, when is a sibling. This can still happen for -models and is processed the same way. But, due to reflexivity, there is now another possibility: conflating with the parent . In this case, cloning is used (see Fig. 4) instead of or in addition to duplication, which produces a bisimilar -model by Lemma 2.18. Having intransitive trees that are reflexive rather than irreflexive in step (2) on p. 2 does not affect the argument. The proof that for the given -saturated in step (3) on p. 3 requires an adjustment only for the case of . It is additionally necessary to show that for the reflexive loop at . This is resolved by observing that due to -saturation and, hence, must also be false in by induction hypothesis.
Finally, for BNUIP(iii)′ for non--saturated sequents, we gain a new case when the top row of Table 2 is used, but it is clear that obtained by induction hypothesis directly implies . This completes the proof of BNUIP for .
For BNUIP(ii) for , the only new case is applying the bottom row of Table 2 to a non--saturated in . Let
for some multiworld interpretation into a -model where
Then, for some , we have for all and . Thus, for some such that . Since is not -saturated in , it follows that is a multiworld interpretation of into such that . By induction hypothesis, , from which it easily follows that .
For BNUIP(iii)′ for -saturated sequent, we must change step (1) to preserve -models. By Lemma 2.18, duplication used for preserves -models when applied to non-leaves of -models because they are irreflexive. Now consider the case when is a leaf of a model , but node has children in the sequent tree, which can only map to . To ensure injectivity, we construct an intermediate model separating from its children as follows (see Fig. 4):
Accordingly, if is a descendant of this (or itself) or if is not a descendant of any of . By reasoning similar to Lemma 2.18, it is easy to show that is a -model and with all being bisimilar to . The replacements of step (2) preserve -models by Lemma 2.18. Step (3) requires no changes either. The only subtlety in the proof that for a -saturated is for . The argument for does work the same way as in for the following reason. Since this is -saturated, node must have a child in the sequent tree. Injectivity of the constructed means that is not a leaf in the -model and, hence, not reflexive.
The only remaining new case is the application of the bottom row of Table 2 for a non--saturated , i.e., when node is a leaf of the sequent tree, in BNUIP(iii)′. Let
By seriality of , there exists a world such that . Then is a multiworld interpretation of into such that
By induction hypothesis, there is a multiworld interpretation of into some -model such that and . Similar to the case of for , restricting this to the labels of yields a multiworld interpretation bisimilar to and refuting .
4 Uniform interpolation for S5
The uniform interpolation property easily follows for logics satisfying local tabularity and the Craig interpolation property [6]. A logic is locally tabular if there are only finitely many pairwise nonequivalent formulas for each finite set of atomic propositions. Examples of locally tabular logics are classical propositional logic and . In this case, the left interpolant can be taken to be the disjunction of all formulas without implying (accordingly, the right interpolant is the conjunction of all formulas without implied by ).
Although proving uniform interpolation for is therefore simple, we want to use our method applied to a hypersequent calculus for , which provides a direct construction for the interpolants. Important for our method are the form of Kripke models and the structure of the proof system. For , , and we used intransitive treelike models and nested sequents mimicking this treelike structure, which fit well with the recursive step of our method. is complete with respect to single finite clusters, i.e., finite models with the total accessibility relation. In the rest of this section we only work with these kinds of models, i.e., it is assumed that .
Cut-free hypersequent calculi for were first (independently) introduced in [1, 22, 26]. A hypersequent has the form where ’s are multisets of formulas in negation normal form, and its corresponding formula . We use letters and to denote hypersequents. Among the many existing hypersequent calculi, we use the one closest to tableaus. The hypersequent rules for used here are presented in Fig. 5. These modal rules can be found (as derived rules) in [9]. They are the sequent-style equivalent of what Fitting called there the “Simple Tableau System,” i.e., prefixed tableaus with prefixes being integers rather than sequences of integers, and are used to reduce hypersequent completeness to tableau completeness. The same rules can be obtained by Kleene’ing the hypersequent calculus from [27] as explained in [20, Sect. 5] (strictly speaking, rules in [20] are grafted hypersequent rules for K5, but the crown rules for these grafted hypersequents are exactly the hypersequent rules for ; another minor difference is that we are using one-sides sequents and negation normal form). Being Kleene’d, these rules form a terminating calculus for under the proviso that and be applied only if the principal in their conclusion is saturated w.r.t. the component of the active formula and that all the other rules are applied only when their principal formula is not saturated in the conclusion, as defined presently.
Definition 4.1 (Saturation in hypersequents).
A formula is saturated in a hypersequent if it satisfies the following conditions according to the form of :
-
•
is an atomic formula;
-
•
if , then both and are in ;
-
•
if , then at least one of or is in ;
-
•
if , then is either in or in ;
The formula is saturated with respect to a sequent component of if is in that sequent component. A hypersequent is saturated if all diamond formulas in it are saturated w.r.t. each sequent component of , all other formulas are saturated, and, additionally, is neither of the form nor of the form for any atomic proposition .
Labels for hypersequents are natural numbers. For a hypersequent we use the set of labels . We define multiworld interpretations and multiformulas for hypersequents by analogy with nested sequents, but now using natural numbers as labels.777Strictly speaking, these labels impose an ordering on the sequent components turning it into a sequence of sequents rather than a multiset of sequents. Since permuting sequent components is both trivial and tedious, we continue with the multiset representation, stating labels explicitly if necessary.
Definition 4.2.
A cluster-like multiworld interpretation of a hypersequent into an -model is a function .
Within this section, by “multiworld interpretation” we always mean “cluster-like multiworld interpretation.” Note that there is no restriction on the image of , because we work with -models where all worlds are related to each other. For a fixed multiworld interpretation , we usually write instead of and represent the whole by . A multiworld interpretation is injective if the worlds are pairwise disjoint. The rest of the definitions and results for hypersequents are completely analogous to the nested sequent setting (modulo the change of labels into natural numbers). The analog of Def. 2.12 is
Definition 4.3.
Let be a model with worlds and let be a hypersequent. We say that iff
A hypersequent is valid in a model , denoted , when for all multiworld interpretations of into .
We have completeness for the validity of hypersequents, i.e., for all hypersequents and -models .
A multiformula is similarly defined as in Def. 3.2, where we now use natural numbers as labels instead of sequences of natural numbers, i.e., use instead of . All definitions and lemmas about multiformulas based on nested sequents also apply to the hypersequent setting (Def. 3.4 until Lemma 3.9).
Uniform interpolation for hypersequents is defined in the same way as for nested sequents. All definitions and lemmas between Def. 3.11 and Cor. 3.20 are naturally adapted to the hypersequent setting. Instead of NUIP and BNUIP we now speak of the hypersequent uniform interpolation property (HUIP) and the bisimulation hypersequent uniform interpolation property (BHUIP) respectively.
So far, everything goes analogously to the nested sequent case. Even defining the uniform interpolants seems to work analogously. However, when performing the inductive proof (analogous to Theorem 3.22) ensuring that those are actual uniform interpolants, one runs into a problem in the recursive case for saturated sequents. Roughly speaking, the problem is caused by the fact that in -models, the truth of a formula in one world generally depends on all the worlds, including its immediate “parent.” Contrast this with treelike models where the truth of a formula in a world is fully determined by its descendants which are disjoint from its parent, as well as from its siblings and their descendants. The reason this feature of cluster-like models is problematic is that changing the valuation of in a later recursive call may conflict with valuations of necessitated by the preceding one.
To circumvent this problem, we use a special property of : every modal formula is -equivalent to a formula of modal depth 1 (see [8, Sect. 5.13], where Fitting proved this in order to establish Craig interpolation for ). This means that we can restrict ourselves to formulas where each literal or is under the scope of at most one modality. Therefore, after stripping this one modality away, the resulting formulas are purely propositional, meaning that no further recursive calls are needed and, at the same time, that their truth values depends on the valuation in only one world instead of all worlds in the model. This resolves the aforementioned conflict between recursive calls.
matches | equals |
---|---|
where the SCNF of | |
is | |
So from now on, we only consider hypersequents , where each contains only formulas of modal depth . With that in mind, we define multiformula interpolants for hypersequents . If is not saturated, is defined in Table 3 following the finite proof-search tree of the hypersequent. In particular, , , and must be non-saturated; in the rule for , w.l.o.g. we assume to be the largest label; the penultimate row is applied only if is not saturated w.r.t. its own component; and the last row is only applied if is not saturated w.r.t. the component containing the displayed .
For saturated , we define
(8) |
where represents the uniform interpolant of a propositional formula w.r.t. classical propositional logic. Any known algorithm for its computation can be used. The construction of is well-defined because the recursion in Table 3 terminates by the termination of the rules.
Theorem 4.4.
Logic has the BHUIP.
Proof 4.5.
We follow the proof of Theorem 3.22 showing the three condition for BHUIP. It is easily seen that does not contain and that its labels are from .
For BHUIP(ii), let be a multiworld interpretation of a hypersequent , and of the multiformula , into an -model . We use induction to show
First we treat some cases from Table 3 and then we consider the case where is saturated.
-
•
Both and hold in all models, under all interpretations.
-
•
Boolean cases work the same way as for nested sequents.
-
•
The case of is also very similar. The only difference from the nested case for is that instead of considering only children of the node where needs to be true in a treelike model, here we have to consider all worlds in the model. Otherwise, the reasoning is the same.
- •
- •
-
•
Finally, if is saturated, let for from (8). As for nested sequents, the case of with is straightforward. It remains to consider the case when, This means that there is a such that . Since is a propositional tautology for any by Def. 3.1, we have for some . Therefore for all , including the label of the component containing . Thus, .
For BHUIP(iii)′, let a multiworld interpretation of into an -model such that We need to find worlds from another -model such that and We define and and prove BHUIP(iii)′ by simultaneous recursion. We first consider the case where is saturated, then we show several cases following Table 3.
-
•
For being saturated, we assume for from (8). We have three steps in the construction of model , which can be compared to the steps of the construction in Theorem 3.22.
-
(1)
Whenever , duplicate this world, until all ’s are distinct. Clearly, this yields a -bisimilar model with and an injective multiworld interpretation of into such that .
-
(2)
Now we construct a model from by changing valuations of in all worlds . It follows from the last disjunct in (8) that for all such . It is a straightforward consequence of Def. 3.1 for the purely propositional formula that it is possible to modify the valuation in such a way that for the resulting we have for all worlds . Changing only truth values of results in a -bisimilar model.
-
(3)
Finally, we define model to be the same as model except for valuations of as follows: . Note that the resulting model is still -bisimilar and, moreover, still holds for all .
This finishes the construction.
Now we prove that whenever by induction on the structure of .
-
–
We leave the cases for , , , , , and , which are analogous to , to the reader.
-
–
If , then by saturation, there is a label such that . By induction hypothesis, . Therefore, .
-
–
If , then for each we have to prove . First, consider for some . Since is saturated, . By induction hypothesis . Otherwise, if , the falsity of was assured in step (3). Thus, .
-
(1)
There is nothing new for non-saturated cases from Table 3. Most of them work the same way as for , with the exception of the penultimate row that works the same way as for and uses reflexivity of -models.
5 Conclusion
We have developed a constructive method of proving uniform interpolation based on generalized sequent calculi such as nested sequents and hypersequents. While this is an important and natural step to further exploit these formalisms, much remains to be done. This method works well for the non-transitive logics , , and but meets with difficulties, e.g., for , which is also known to enjoy uniform interpolation. And while we successfully adapted the method to hypersequents to cover this logic, the adaptation relies on the reduction to uniform interpolation for classical propositional logic and, thus, is not fully recursive. There are other logics in the so-called modal cube between and with the UIP, for which it remains to find the right formalism and adaptation of our method. Another natural direction of future work is intermediate logics, where exactly seven logics are known to have the UIP.
References
- [1] Arnon Avron. The method of hypersequents in the proof theory of propositional non-classical logics. In Wilfrid Hodges, Martin Hyland, Charles Steinhorn, and John Truss, editors, Logic: from Foundations to Applications: European Logic Colloquium, pages 1–32. Clarendon Press, 1996.
- [2] Marta Bílková. Interpolation in Modal Logics. PhD thesis, Charles University Prague, 2006. Available from: https://dspace.cuni.cz/handle/20.500.11956/15732.
- [3] Marta Bílková. A note on uniform interpolation proofs in modal deep inference calculi. In Nick Bezhanishvili, Sebastian Löbner, Kerstin Schwabe, and Luca Spada, editors, Logic, Language, and Computation, 8th International Tbilisi Symposium on Logic, Language, and Computation, TbiLLC 2009, Bakuriani, Georgia, September 2009, Revised Selected Papers, volume 6618 of Lecture Notes in Artificial Intelligence, pages 30–45. Springer, 2011. doi:10.1007/978-3-642-22303-7_3.
- [4] Patrick Blackburn, Maarten de Rijke, and Yde Venema. Modal Logic, volume 53 of Cambridge Tracts in Theoretical Computer Science. Cambridge University Press, 2001. doi:10.1017/CBO9781107050884.
- [5] Kai Brünnler. Deep sequent systems for modal logic. Archive for Mathematical Logic, 48(6):551–577, July 2009. doi:10.1007/s00153-009-0137-3.
- [6] Alexander Chagrov and Michael Zakharyaschev. Modal Logic, volume 35 of Oxford Logic Guides. Clarendon Press, 1997.
- [7] Giovanna D’Agostino. Uniform interpolation, bisimulation quantifiers, and fixed points. In Balder D. ten Cate and Henk W. Zeevat, editors, Logic, Language, and Computation, 6th International Tbilisi Symposium on Logic, Language, and Computation, TbiLLC 2005, Batumi, Georgia, September 2005, Revised Selected Papers, volume 4363 of Lecture Notes in Artificial Intelligence, pages 96–116. Springer, 2007. doi:10.1007/978-3-540-75144-1_8.
- [8] Melvin Fitting. Proof Methods for Modal and Intuitionistic Logics, volume 169 of Synthese Library. Springer, 1983. doi:10.1007/978-94-017-2794-5.
- [9] Melvin Fitting. Modal proof theory. In Patrick Blackburn, Johan van Benthem, and Frank Wolter, editors, Handbook of Modal Logic, volume 3 of Studies in Logic and Practical Reasoning, pages 85–138. Elsevier, 2007. doi:10.1016/S1570-2464(07)80005-X.
- [10] Melvin Fitting and Roman Kuznets. Modal interpolation via nested sequents. Annals of Pure and Applied Logic, 166(3):274–305, March 2015. doi:10.1016/j.apal.2014.11.002.
- [11] Dov M. Gabbay, Renate A. Schmidt, and Andrzej Szalas. Second-Order Quantifier Elimination: Foundations, Computational Aspects and Applications, volume 12 of Studies in Logic. College Publications, 2008.
- [12] Silvio Ghilardi. An algebraic theory of normal forms. Annals of Pure and Applied Logic, 71(3):189–245, February 1995. doi:10.1016/0168-0072(93)E0084-2.
- [13] Silvio Ghilardi and Marek Zawadowski. Undefinability of propositional quantifiers in the modal system S4. Studia Logica, 55(2):259–271, June 1995. doi:10.1007/BF01061237.
- [14] Rosalie Iemhoff. Uniform interpolation and sequent calculi in modal logic. Archive for Mathematical Logic, 58(1–2):155–181, February 2019. doi:10.1007/s00153-018-0629-0.
- [15] Rosalie Iemhoff. Uniform interpolation and the existence of sequent calculi. Annals of Pure and Applied Logic, 170(11):102711, November 2019. doi:10.1016/j.apal.2019.05.008.
- [16] Patrick Koopmann. Practical Uniform Interpolation for Expressive Description Logics. PhD thesis, University of Manchester, 2015. Available from: https://www.research.manchester.ac.uk/portal/files/54574261/FULL_TEXT.PDF.
- [17] Taishi Kurahashi. Uniform Lyndon interpolation property in propositional modal logics. Archive for Mathematical Logic, 59(5–6):659–678, August 2020. doi:10.1007/s00153-020-00713-y.
- [18] Roman Kuznets. Craig interpolation via hypersequents. In Dieter Probst and Peter Schuster, editors, Concepts of Proof in Mathematics, Philosophy, and Computer Science, volume 6 of Ontos Mathematical Logic, pages 193–214. De Gruyter, 2016. doi:10.1515/9781501502620-012.
- [19] Roman Kuznets. Multicomponent proof-theoretic method for proving interpolation properties. Annals of Pure and Applied Logic, 169(12):1369–1418, December 2018. doi:10.1016/j.apal.2018.08.007.
- [20] Roman Kuznets and Björn Lellmann. Grafting hypersequents onto nested sequents. Logic Journal of IGPL, 24(3):375–423, June 2016. doi:10.1093/jigpal/jzw005.
- [21] Roman Kuznets and Björn Lellmann. Interpolation for intermediate logics via hyper- and linear nested sequents. In Guram Bezhanishvili, Giovanna D’Agostino, George Metcalfe, and Thomas Studer, editors, Advances in Modal Logic, Volume 12, pages 473–492. College Publications, 2018. Available from: http://www.aiml.net/volumes/volume12/Kuznets-Lellmann.pdf.
- [22] G. E. Minc. On some calculi of modal logic. In V. P. Orevkov, editor, The Calculi of Symbolic Logic. I, volume 98 (1968) of Proceedings of the Steklov Institute of Mathematics, pages 97–124. AMS, 1971. Originally published in Russian. Available from: http://mi.mathnet.ru/eng/tm/v98/p88.
- [23] Sara Negri and Jan von Plato. Proof Analysis: A Contribution to Hilbert’s Last Problem. Cambridge University Press, 2011. doi:10.1017/CBO9781139003513.
- [24] Andrew M. Pitts. On an interpretation of second order quantification in first order intuitionistic propositional logic. Journal of Symbolic Logic, 57(1):33–52, March 1992. doi:10.2307/2275175.
- [25] F. Poggiolesi. The method of tree-hypersequents for modal propositional logic. In David Makinson, Jacek Malinowski, and Heinrich Wansing, editors, Towards Mathematical Philosophy, Papers from the Studia Logica conference Trends in Logic IV, volume 28 of Trends in Logic, pages 31–51. Springer, 2009. doi:10.1007/978-1-4020-9084-4_3.
- [26] Garrel Pottinger. Uniform, cut-free formulations of , , and . Journal of Symbolic Logic, 48(3):900, September 1983. Abstract. doi:10.2307/2273495.
- [27] Greg Restall. Proofnets for S5: Sequents and circuits for modal logic. In Costas Dimitracopoulos, Ludomir Newelski, Dag Normann, and John R. Steel, editors, Logic Colloquium 2005, volume 28 of Lecture Notes in Logic, pages 151–172. ASL, 2007. doi:10.1017/CBO9780511546464.012.
- [28] V. Yu. Shavrukov. Subalgebras of diagonalizable algebras of theories containing arithmetic. Dissertationes Mathematicae, 323, 1993. Available from: http://matwbn.icm.edu.pl/ksiazki/rm/rm323/rm32301.pdf.
- [29] Amirhossein Akbar Tabatabai and Raheleh Jalali. Universal proof theory: Semi-analytic rules and Craig interpolation. E-print 1808.06256, arXiv, 2018. arXiv:1808.06256.
- [30] Amirhossein Akbar Tabatabai and Raheleh Jalali. Universal proof theory: Semi-analytic rules and uniform interpolation. E-print 1808.06258, arXiv, 2018. arXiv:1808.06258.
- [31] Albert Visser. Uniform interpolation and layered bisimulation. In Petr Hájek, editor, Gödel ’96: Logical Foundations of Mathematics, Computer Science and Physics — Kurt Gödel’s Legacy, volume 6 of Lecture Notes in Logic, pages 139–164. ASL, 1996. doi:10.1017/9781316716939.010.