Modal Separation of Fixpoint Formulae
Abstract
Modal separability for modal fixpoint formulae is the problem to decide for two given modal fixpoint formulae whether there is a modal formula that separates them, in the sense that and . We study modal separability and its special case modal definability over various classes of models, such as arbitrary models, finite models, trees, and models of bounded outdegree. Our main results are that modal separability is PSpace-complete over words, that is, models of outdegree , ExpTime-complete over unrestricted and over binary models, and 2-ExpTime-complete over models of outdegree bounded by some . Interestingly, this latter case behaves fundamentally different from the other cases also in that modal logic does not enjoy the Craig interpolation property over this class. Motivated by this we study also the induced interpolant existence problem as a special case of modal separability, and show that it is coNExpTime-complete and thus harder than validity in the logic. Besides deciding separability, we also investigate the problem of efficient construction of separators. Finally, we consider in a case study the extension of modal fixpoint formulae by graded modalities and investigate separability by modal formulae and graded modal formulae.
Keywords and phrases:
Modal Logic, Fixpoint Logic, Separability, InterpolationCopyright and License:
![[Uncaptioned image]](x1.png)
2012 ACM Subject Classification:
Theory of computation Modal and temporal logicsEditors:
Olaf Beyersdorff, Michał Pilipczuk, Elaine Pimentel, and Nguyễn Kim ThắngSeries and Publisher:

1 Introduction
For given logics , the -separability problem for is to decide given two -formulae whether there is an -formula that separates and in the sense that and . Obviously, a separator can only exist when and are mutually exclusive, and the problem is only meaningful when is less expressive than . Intuitively, a separator formulated in a “simpler” logic explains a given inconsistency in a “complicated” logic . Note that, for logics closed under negation, -separability generalizes the -definability problem for : decide whether a given -formula is equivalent to an -formula. Indeed, is equivalent to an -formula iff and are -separable. Since separability is more general than definability, solving it requires an even better understanding of the logics under consideration. Both separability and definability are central problems with many applications in computer science. As seminal work let us only mention definability and separability of regular word languages by first-order logic [26, 29, 9].
In this paper we study definability and separability of formulae of the modal -calculus [27, 20] by formulae in propositional modal logic . is the extension of with fixpoints that encompasses virtually all specification languages such as [12] and and [3]. Let us consider an example.
Example 1.
Consider the following properties of vertex-labelled trees:
-
: there is an infinite path starting in the root on which each point satisfies ;
-
: on every path there are only finitely many points satisfying ;
-
: on every path at most two points satisfy .
The properties are expressible in but not in , and both and are mutually exclusive. The properties are separated by the -formula which expresses that there is a path starting with three points satisfying . On the other hand, no -formula separates . The intuitive reason for this is that any -formula only sees trees up to depth , and one can find two trees with properties which nonetheless look the same up to depth . ∎
We explore the definability and separability problems over several classes of models relevant for computer science: all models, words, trees of bounded or unbounded outdegree; as well as restrictions of all these classes to finite models. On top of analyzing the decision problems, we also address the problem of constructing efficient definitions and separators whenever they exist. The starting point for our research is the seminal paper of Otto [24], where he solves modal definability over models of bounded and unbounded outdegree. In this paper, we continue this line of research and establish a fairly complete and interesting picture. Table 1 summarizes our results. We now explain its content further.
all models | words | binary trees | -ary trees, | |
---|---|---|---|---|
-definability | ExpTime [24] | PSpace | ExpTime [24] | ExpTime [24] |
-separability | ExpTime | PSpace | ExpTime | 2-ExpTime |
separator construction | double exp. | single exp. | double exp. | triple exp. |
interpolant existence | always | always | always | coNExpTime |
The first line essentially repeats Otto’s results; we only add the observation that -definability over words is PSpace-complete. Interestingly, separability is substantially more difficult. The case of words is the easiest one, both in terms of computational complexity and required arguments. Next come the cases of binary and of unrestricted trees. These two classes possess some nice structural properties which (although true for different reasons) enable a common algorithmic treatment. Finally, the cases of trees with outdegree bounded by a number enter the stage. These trees lack the good properties essential for previous constructions which results in higher computational complexity. The hardness result for is interesting for two reasons. First, as it is entirely standard to encode trees of higher outdegree into binary ones, one could expect the ternary (and higher) case to have the same complexity as the binary one. And second, even though there are known cases when separation is provably harder than definability (regularity of visibly pushdown languages is decidable [23, Theorem 19] but regular separability thereof is not [19, Theorem 2.4]), to the best of our knowledge our results are the only such case known in logic.
The complexity landscape for deciding separability is also reflected in the maximal sizes of the separators that we construct. Relying on the well-known connection of to automata, we provide effective constructions for the cases of all models, words, and binary trees. It is worth mentioning that equally effective constructions for definability over all models are given in [22], but they do not work for separability. The ternary case follows from a general argument. Our construction of separators over words is optimal. Under mild assumptions (there are at least two modalities) the constructions over binary and over unrestricted trees are optimal as well, but we leave it open whether these assumptions are needed for the lower bounds. In the case of ternary and higher outdegree trees we only conjecture optimality of the constructed separators.
Finally, we observe that lacks the Craig interpolation property over trees of outdegree bounded by . Recall that a Craig interpolant for in some logic is a formula only using the common symbols of and and such that . A logic satisfies the Craig interpolation property (CIP) if a Craig interpolant of always exists. It is known that enjoys CIP over all models and over words [15] and it follows from our techniques that this transfers to binary trees. In contrast and as mentioned above, over ternary and higher-arity trees lacks the CIP. It is worth mentioning that modal logic over frames of arity bounded by some has been studied under the name [4]. Our results imply that enjoys CIP iff . Motivated by the lack of CIP over higher-arity trees, we study the induced interpolant existence problem – determining whether two given -formulae admit a Craig interpolant – as a special case of separability. We show it to be coNExpTime-complete over higher arity trees, and thus harder than validity. Interpolant existence has recently been studied for other logics without CIP [18, 1].
As an application of our results for -ary trees with we additionally present a case study: separability in the graded setting in which we allow counting modalities saying “there are at least children such that […]” [11]. Counting modalities are a standard extension of modal logic that is especially relevant in applications in knowledge representation for conceptual modeling [2]. We show that -separability of graded is 2-ExpTime-complete, while it is ExpTime-complete if we allow counting modalities also in the separator. The intuitive reason for the hardness in the former case is that trees of bounded arity are definable in graded . This former case is also related with a recent study about separating logics supporting counting quantifiers by logics without these [21].
It is worth to mention that -definability of -formulae generalizes the boundedness problem which asks whether a formula with a single fixpoint is equivalent to a modal formula. Boundedness has been studied for other logics such as monadic-second order logic [6], datalog [16], and the guarded fragment of first-order logic [5]. Our paper is an extension of the preliminary paper [17].
The paper is organized as follows. After this introduction 1, we set notation and recall basic facts in the preliminary Section 2. Next, we introduce some topic-specific terminology, discuss a relevant construction of Otto, and solve the case of all models in Section 3. In the following Sections 4 and 5 we deal with unary and binary trees, and in Section 6 we solve the most challenging case of trees of outdegree bounded by . Section 7 applies our results to the case with graded modalities. The last Section 8 contains conclusions and final remarks.
2 Preliminaries
We recall the main notions about modal logic and the modal -calculus . For the rest of this paper fix disjoint, countably infinite sets of atomic propositions and of variables. The syntax of is given by the rule
where and . We assume that formulae of are in a normal form such that every appears at most once in a formula, and if it does appear then its appearance has a unique superformula beginning with or . Modal logic is defined as the fragment of with no fixpoint operators and nor variables. Both in and , we use abbreviations like (for for some ), (for a formula with leading ’s), and . We denote with the set of propositions that occur in , and recall that the modal depth of an formula is the maximal nesting of . With we denote the class of all -formulae of modal depth at most , and with we denote its subclass restricted to signature . The size of a formula is the length of represented as a string. This choice of the simplest possible measure of size does not matter for most of our results. We will briefly discuss alternative notions of size in the concluding Section 8.
Both and are interpreted in pointed Kripke structures. More formally, a model is a quadruple consisting of a set called its universe, a distinguished point called the root, an accessibility relation , and a valuation .
The semantics of can be defined in multiple equivalent ways. The one most convenient for us is through parity games (see [32] for an introduction). Given a model and a formula we define a semantic game played between players and . The positions are . The moves depend on the topmost connective. From a position of the shape or it is allowed to move to either or . From and the allowed moves lead to all such that . In position or the game stops and wins iff satisfies the formula component or , respectively. From and the game moves to , and from to where is the unique superformula of beginning with or . owns positions whose formula component has or as the topmost connective and owns all other positions. wins an infinite play if the outermost subformula seen infinitely often in begins with . We say that satisfies and write if wins the game from position . Since is by definition pointed, we abbreviate with .
The same symbol denotes entailment: means that every model of is a model of . In the case only models from some fixed class are considered we talk about satisfiability and entailment over . Let be a subset of such as or . If two models and satisfy the same formulae of then we call them -equivalent and write .
In the paper we will study models of bounded and unbounded outdegree. The outdegree of a point in a model is the number of successors of in the underlying directed graph . We say that has finite outdegree if every point has finite outdegree and bounded outdegree if there is a finite uniform upper bound on the outdegree of its points. In the latter case, we will call -ary, and binary or ternary if or . If , then we call a word. A -ary model is full if each of its nodes is either a leaf (i.e. has no children) or has precisely children. A model is a tree if is a (directed) tree with root . We denote with the class of all -ary tree models. Both and are invariant under bisimulation, and every (-ary) model is bisimilar to a (-ary) tree. Hence, we do not loose generality by only looking at tree models.
A prefix of a tree is a subset of its universe closed under taking ancestors. When no confusion arises we identify a prefix with the induced subtree of that has as its universe. The depth of a point is the distance from the root. The prefix of depth (or just -prefix) is the set of all points at depth at most and is denoted by (and the corresponding subtree by ).
Bisimulations
We define bisimulations and bisimilarity for trees, assuming for convenience that bisimulations only link points at the same depth. Let be trees and a relation between and that relates only points of the same depth. Then, is a bisimulation between and if it links the roots , and for every the following conditions are satisfied:
-
,
-
for every with there is a with and , and
-
for every with there is a with and .
A functional bisimulation (also known as bounded morphism) is a function whose graph is a bisimulation. If is a functional bisimulation from to then we write and call a bisimulation quotient of . The bisimilarity quotient of is a quotient of such that if then . It follows from analogous results for arbitrary models that every tree has a unique (up to isomorphism) bisimilarity quotient and that two trees are bisimilar iff their bisimilarity quotients are isomorphic.
Further, for every and every subset of the signature we consider a restricted variant of bisimulations called -bisimulations. In a -bisimulation the condition is only checked with respect to and the and conditions only for points at depth smaller than . Formally, a relation is a -bisimulation if it is a bisimulation between the -prefixes of the -reducts of . We call a -bisimulation between a -isomorphism if it is bijective on the -prefixes of . We write if there exists a -bisimulation between and and if there is a -isomorphism between them. Crucially, over every class of models and for every finite the equivalences and coincide, for every .
Automata
We exploit the well-known connection of and automata that read tree models. A nondeterministic parity tree automaton (NPTA) is a tuple where is a finite set of states, is the initial state, for some finite set , assigns each state a priority, and is a transition function of type:
where denotes the set of all tuples over of length at most . A run of on a tree is an assignment sending the root of the tree to and consistent with in the sense that for every point with children . On occasion when considering trees of unbounded outdegree we will use automata with transition function of type . Then, consistency of with means that for every with a set of children. In either case, we call the run accepting if for every infinite path in the sequence satisfies the parity condition. We write in case has an accepting run on . An automaton that is identical to except that the original initial state is replaced with is denoted . The size of an automaton is the number of its states and is denoted by .
An NPTA is equivalent to a formula over a class of trees when iff for every tree . We rely on the following classical result (see for example the discussion in [31] and the well-presented Dealternation Theorem 5.7 in [7]):
Theorem 2.
For every -formula and class of trees, we can construct an NPTA with exponentially many states equivalent to over . The construction takes exponential time when for some , and doubly exponential time in the unrestricted case.
3 Foundations of Separability
We start with recalling the notion of separability and discuss some of its basic properties.
Definition 3.
Assume a subset of all formulae. Given , an -separator of is a formula with and . If additionally for some signature , is called an -separator.
The -separability problem is to determine, given formulae and a signature , if they admit an -separator . -definability is the special case of -separability in which , since an -separator of is equivalent to . All notions can be relativized to a class of models by considering entailment over that class. We investigate -separability and -definability over different classes of models. The reader may have expected the problems to be defined without restrictions on , but in fact such versions of the problems are special instances of our problems with . Conversely, all lower bounds already hold for such special instances.
We start with observing that, by the tree model property and the finite model property of , is an -separator of (over all models) iff is an -separator of over trees iff is an -separator of over finite models. Thus, separability coincides over all these classes. Moreover, with the help of the -formula expressing the existence of an infinite path originating in the root, -separability over finite trees reduces to -separability over all models. More formally:
Lemma 4.
Let and . Then is an -separator of over finite trees iff is an -separator of . This is also true inside , for .
This lemma allows us to transfer all upper bounds obtained in the paper also to the restrictions of the classes to finite models. The lower bounds do not follow from this lemma, but analyzing the proofs yields that they actually work as well. Thus, in the rest of the paper we focus on the classes of all models and , for .
The starting point for the technical developments in the paper are model-theoretic characterizations for separability. Similar to what has been done in the context of interpolation, see for example [28], they are given in terms of joint consistency, which we introduce next. Let be a binary relation on some class of models, such as -isomorphism or -equivalence . We call two formulae joint consistent up to (in short joint -consistent) if there are models and with . For technical reasons we will sometimes also talk about joint consistency of automata in place of formulae . Joint -consistency over a class of models is defined by only looking at models from . Clearly, if and then joint -consistency over implies joint -consistency over . We use the following standard equivalence:
are not -separable over are joint -consistent over . | () |
for every , , finite , and class . The implication from right to left is immediate. The opposite one follows from the observation that for every and finite there are only finitely many equivalence classes of , and each such class is fully described with a single modal formula.
Let us illustrate how Equivalence () is used to solve -separability. Let and be -formulae expressing the respective properties and from Example 1. Let be an infinite path in which every point satisfies , and let be a finite path of length in which every point satisfies . Then, for each the models witness joint -consistency of . By Equivalence () this means that are not -separable for any , and thus not -separable at all.
Definability is a special case of separability. Since the tools used for solving definability are a starting point for our work, we recall them now.
Modal Definability: A Recap
In his seminal paper [24] Otto showed that -definability of -formulae is ExpTime-complete over all models and over for every .
Theorem 5 ([24, Main Theorem and Proposition 5]).
Over the class of all models, as well as over for every , -definability of -formulae is ExpTime-complete.
We start by recalling and rephrasing Otto’s construction and fixing a small mistake in the original proof. The lower bound follows by an immediate reduction from satisfiability of -formulae. We look at the upper bound. The first step is the following lemma, which is the heart of [24, Lemma 2].
Lemma 6.
For every and the following are equivalent:
-
1.
are joint -consistent over .
-
2.
are joint -consistent over .
The lemma is true, but its proof in [24] is mistaken. The problem there is that the construction duplicates subtrees and hence may turn -ary models into ones with outdegree greater than . We present an easy alternative proof.
Proof.
Only the implication 1 2 is nontrivial. To prove it assume -ary , with and assume towards contradiction that are not -consistent over . We have where is the -reduct of , and is the bisimilarity quotient of its -prefix . By the assumption that are not joint -consistent, implies . By invariance of under , this in turn implies . We construct symmetrically. By definition, means that and are bisimilar, which is equivalent to saying that their bisimilarity quotients and are isomorphic, and hence -isomorphic. Thus, witness joint -consistency of over , a contradiction.
Proposition 7.
For every parity automata and : are joint -consistent over for all iff are joint -consistent over for . The latter condition can be checked in time polynomial in .
Proof (Sketch).
Due to well-known relativization techniques we do not loose generality by only running on full -ary trees with no leaves. Let be a language of finite full -ary trees over such that iff is a prefix of a reduct of a model of . Let be an analogous language for . The tallness of a finite tree is the minimal distance from the root to a leaf. Observe that are -consistent over iff contains a tree of tallness . Thus, it suffices to check if contains trees of arbitrarily high tallness. To that end construct an automaton recognizing of size polynomial in . An easy pumping argument shows that the language of contains trees of arbitrarily high tallness iff it contains a tree of tallness . To test the latter condition it is enough to inductively compute a sequence of subsets of states of , where is the set of all states such that recognizes a tree of tallness at least .
We are ready to solve -definability over in exponential time. Assume -formula . For every , we know by Equivalence () that is equivalent over to some iff are not joint -consistent over . By Lemma 6 this is equivalent to the lack of joint -consistency of over . By Theorem 2 we can compute exponentially-sized automata , equivalent to and over . It follows that is not -definable over iff are joint -consistent over for every . The last condition is decided using Proposition 7. The runtime of our algorithm is polynomial in , and thus exponential in . This proves the part of Theorem 5 about . The remaining part concerning unrestricted models is a special case of Theorem 9, which we will prove next.
Modal Separation: the Unrestricted Case
Over unrestricted models, separability turns out to be only slightly more complicated than definability. Lemma 6 becomes false if is replaced with arbitrary (which would be the statement relevant for separability). We have the following lemma, however.
Lemma 8.
For every and the following are equivalent:
-
1.
are joint -consistent over all models.
-
2.
are joint -consistent over , where .
Proof.
The implication (1)(2) is immediate. To prove the other one (1)(2) consider an intermediate property:
are joint -consistent over all models. | (1.5) |
The implication (1)(1.5) can be read off from Otto’s original proof. The remaining one (1.5)(2) is a special case of a stronger claim which we prove later: the implication (3)(4) of Lemma 27. Lemma 8 allows us to solve -separability in exponential time.
Theorem 9.
Over all models, -separability of -formulae is ExpTime-complete.
Proof.
The proof is almost the same as our proof of Theorem 5. The only difference is that we consider an arbitrary in place of , and hence use Lemma 8 in place of Lemma 6. Apart from deciding separability we also construct separators when they exists. Given a subset of formulae, , and , we call an -uniform consequence of if for every such that . The notion relativizes to a fixed class of models by only considering entailment over that class. Observe that if are -separable and is an -uniform consequence of then is an -separator for . The same is true over any class .
Note that it follows from the proof of Theorem 9 that if are -separable then they admit a separator of modal depth at most exponential in . It follows that constructing an -separator for boils down to constructing an -uniform consequence of . A naive construction which always works is to take the disjunction of all -types consistent with over . Here, by an -type we mean a maximal consistent subset of . Since up to equivalence there are only finitely many formulae in , each -type can be represented as a single -formula and the mentioned disjunction is well-defined. This construction is non-elementary in over all models and doubly exponential in over models of bounded outdegree.
We present an efficient construction of -uniform consequences. The construction works over unrestricted models, over and over but not over for . Since in the following Section 4 we will provide a more efficient construction for , now we only look at the unrestricted and binary case. For convenience, we construct -uniform consequences of automata instead of formulae, with definition adapted in an obvious way.
Proposition 10.
Let be the class of all models or . Assume an NPTA over , a signature and . An -uniform consequence of over can be constructed in time if is the class of all models and in time if .
Proof.
Let be an NPTA. Let be an automaton of the same size recognizing -reducts of models of . A formula is an -uniform consequence of over iff it is an -uniform consequence of over . Thus, it suffices to construct the latter.
We construct for every and by induction on . For the base case we put:
For the induction step define:
where is an abbreviation for . Assume is either the class of all models or . The construction preserves the following invariant:
(1) |
for every structure . Hence, is an -uniform consequence of over . It is routine to check that in either case the formula has the right size.
The proof of (1) proceeds by an easy induction, with slightly different details for the cases of binary and of unrestricted models. It is worth to point out, however, that the implication from left to right would not be valid over with .
Given the exponential construction of automata from Theorem 2 and the exponential upper bound on modal depth of separators, Proposition 10 yields an efficient construction of separators.
Theorem 11.
If are -separable, then one can compute an -separator in time doubly exponential in .
It is not difficult to show that, in the presence of at least two accessibility relations , the construction is optimal: one can express in that the model embeds a full binary tree of depth and in which each inner node has both a - and a -successor. Using standard techniques, one can show that any modal formula expressing this property is of doubly exponential size [13]. Whether having two accessibility relations is necessary for this lower bound is an interesting question which we leave open.
It is interesting to note that the separators we compute are not the logically strongest separators and, in fact, strongest separators do not even have to exist.
Example 12.
Consider from before and For every , the modal formula separates from , and whenever .
The remaining open cases are the problems of -separability (and separator construction) over for . We investigate the cases of unary (), binary (), and higher maximal outdegree () in turn. We emphasize that the outdegree is not a part of the input but rather a property of the considered class of models.
4 Unary Case
We first investigate -separability over , that is, models that are essentially words. Note that satisfiability of over words is PSpace-complete (an upper bound follows, e.g., via the translation to automata and the lower bound is inherited from [30, Theorem 4.1]) which suggests that also definability and separability could be easier. Indeed, we show:
Theorem 13.
-definability and -separability of -formulae is PSpace-complete over .
Proof.
The lower bound is by a reduction from satisfiability, and applies to definability.
Given formulae and a subset of the signature , consider the set of finite words . Let be a similar language defined for . Two unary models are bisimilar iff they are identical. Hence, by Equivalence () the formulae are not -separable over iff is infinite. It is standard to define a finite automaton recognizing and check if its language is infinite (which is equivalent to checking if contains input longer than ). To do it in polynomial space, we nondeterministically guess the long input, letter by letter, and only remember the current state and a binary counter measuring the length of the input guessed so far.
We conclude this section with proving that -separators can be constructed in exponential time and are thus of at most exponential size. Note that this is optimal, since over , is exponentially more succinct than . Indeed, it is standard to implement an exponential counter using a polynomially sized -formula.
Theorem 14.
If are -separable over , then one can compute an -separator in time exponential in .
As argued in the previous section, it suffices to construct an -uniform consequence of the NPTA equivalent to , which we do next.
Proposition 15.
Let be an NPTA over with states, , and a signature. An -uniform consequence of over can be constructed in time polynomial in , , and .
Proof.
As argued in the previous section, it suffices to construct an -uniform consequence of the NPTA which recognizes precisely the -reducts of models of . Let have states . By construction of , we have . As an auxiliary step, we define for every and a formula such that for every :
(2) |
The are defined inductively with the base cases () read off from , and using divide and conquer in the inductive step (), to keep the formulae small. More formally, we define for and all by taking:
It is routine to verify that satisfies (2) and is of size . Based on the , one can define a formula that describes all possible prefixes of length of models of , and thus is the sought -uniform consequence of . One can think of as the disjunction of formulae for the initial state of , but the full construction is slightly more involved since models accepted by might be also shorter than .
5 Binary Case
We next handle the binary case . The key observation here is that, between full binary trees, bisimilarity entails isomorphism.
Proposition 16.
Assume full binary trees . If and are -bisimilar then they are -isomorphic.
Proof.
By definition a -bisimulation between two models is a bisimulation between their reducts to , and -isomorphism is such a bisimulation which is additionally bijective. It therefore suffices to show that if are full binary trees and is a bisimulation between them then there is a bijective bisimulation . We pick such inductively starting with the pair of roots . The key observation is that if has children and has children and then either (i) and or (ii) and (the cases are not exclusive). Proposition 16 can be used to prove the Craig interpolation property of over and implies the following separability-variant of Lemma 6 over .
Lemma 17.
For every and the following are equivalent:
-
1.
are joint -consistent over .
-
2.
are joint -consistent over .
Proof.
We show only the nontrivial implication 1 2. Assume binary , with . Let and be full binary trees obtained from and by duplicating subtrees. By Proposition 16, which proves 2. Similarly to the definability case, Lemma 17 combined with Equivalence () and Proposition 7 immediately give an exponential procedure for separability. Since the lower bound is inherited from definability, we get the following result.
Theorem 18.
-separability and -definability of -formulae is ExpTime-complete over .
Theorem 19.
If are -separable over , then one can compute an -separator in time doubly exponential in .
6 Ternary and Beyond
In this section we address the case of models with outdegree bounded by a number . We illustrate that this case behaves differently as it lacks the Craig interpolation property.
Example 20.
Consider -formulae and . Clearly, over . Observe that models in Figure 1 witness that are joint -consistent and thus joint -consistent for every . By Equivalence () there is no -separator, which is nothing else than a Craig interpolant. ∎
Motivated by the lack of the Craig interpolation property, we study the -interpolant existence problem: given and signature , decide whether there is an -separator of , that is, with . Craig -interpolant existence is the special case in which . Observe that -interpolant existence is the special case of -separability of -formulae in which the input to the separability is restricted to -formulae. We show that already -interpolant existence over is harder than -separability of -formulae over arbitrary models.
Theorem 21.
For , -interpolant existence over is coNExpTime-complete. Hardness already applies to Craig -interpolant existence over .
Proof.
The upper bound is easy to establish based on the observation that of modal depth at most do not admit an -separator over iff they are joint -consistent over . The witness of joint -consistency of can assumed to be of depth . Such models are of exponential size (they have at most points) and can thus be guessed by a non-deterministic exponential time bounded Turing machine.
The lower bound is more intriguing and relies on an extension of Example 20. Reconsidering the example it is important to note that in every witness of joint -consistency of , there are two successors of that are bisimilar to the same successor of . We extend the idea and enforce exponentially many bisimilar points. More precisely, consider families , of modal formulae inductively defined as follows:
Clearly, the size of is polynomial in . Moreover, by induction on , it is readily verified that for every , for every with , , and every -bisimulation witnessing , there are points in depth in and a point in depth in such that for all and such that distinct can be distinguished by some proposition in . Intuitively, this means that enforce in joint -consistent models that contains points which are all linked to the same point in . We exploit this link to synchronize information between the , following a strategy that has recently been used to show coNExpTime-hardness for interpolant existence in some description logics [1].
We reduce a NExpTime-complete tiling problem [14]: Given a set of tile types and horizontal and vertical compatibility relations , and some in unary, decide whether one can tile the torus with tiles from complying with . Given , we define formulae , of modal depth and with common signature such that
has a solution are joint -consistent. |
To explain the idea, let witness joint -consistency of . The gadget formulae enforce points in depth in which are all linked via the bisimulation to a single point in . These points shall represent the cells of the torus. The intended solution of the tiling problem is represented via propositions , for each . To synchronize them we proceed as follows. Using the propositions (which are not in ), we can associate coordinates to each point in the torus. To understand the purpose of , suppose for a moment that the outdegree of the points and the is at most (instead of 3). Then we could proceed by enforcing (via ) below each with coordinates three successors such that
-
, , have coordinates , , and , respectively;
-
the coordinates of the are made visible using propositions in ;
-
satisfy for such that and .
These three successors stipulate bisimilar successors of . Since each point in the torus is stipulated three times as successor of some and since the outdegree of is restricted to , the three copies of the same point satisfy the same proposition . By the last item above, the selected propositions comply with and thus represent a solution to the tiling problem. Now, since the outdegree below is at most 3 (and not as assumed), the have to be a bit more complicated, but the idea remains the same.
We show next that the situation for the full separability problem is even worse.
Theorem 22.
For every , -separability of -formulae over is 2-ExpTime-complete.
Thus, over for , -separability is provably harder than -definability, c.f. Theorem 5. Both the upper and the lower bound of Theorem 22 are non-trivial; we provide proof sketches in the following two subsections. Before doing that let us conclude this part with separator construction.
Theorem 23.
If are -separable over , , then one can compute an -separator in time triply exponential in .
Proof.
(Sketch) It follows from the upper bound proof of Theorem 22 that, if admit an -separator, then they admit one of modal depth bounded doubly exponentially in . Observe that over the signature of and there are only triple exponentially many trees of fixed outdegree and double exponential depth, and that each such tree is characterized by a modal formula of triply exponential size. The sought separator is then the disjunction of all such formulae consistent with .
6.1 Lower Bound for Theorem 22
We reduce the word problem of exponentially space bounded alternating Turing machines (ATMs), which is known to be 2-ExpTime-complete [8]. Informally, the states of such ATMs are partitioned into universal states and existential states . Configurations of ATMs are defined as usual, but computations are not sequences of configurations but trees of configurations such that an existential configuration has exactly one successor labeled with a universal configuration and a universal configuration has exactly two successors labeled with existential configurations. A computation tree for an input is a tree whose root is labeled with the initial configuration and such that successor nodes contain successor configurations. is accepted if there is a computation tree in which each path is infinite (this acceptance condition is slightly non-standard, but eases the proof).
The reduction relies on the same gadget formulae as used in the proof of Theorem 21 and additionally uses ideas for showing 2-ExpTime-hardness for recently studied interpolant existence problems for description logics [1]. For a given ATM and input of length , we construct formulae such that
are joint -consistent for every iff accepts . |
This suffices by Equivalence (). The signature will consist of , , and propositions for every possible cell content of , that is, . Additionally, and will use auxiliary propositions, e.g., to encode counters. The only purpose of is to mention the propositions in ; the main work is done by .
To explain the idea, let us consider witnesses for joint -consistency of for sufficiently large . By the properties of , we find points in depth in which are bisimilar to a single point in depth in . Recall that in every , we have access to its index via a counter using propositions . Now, is a -formula with the following properties, see also Figure 2 for illustration.
-
enforces the “skeleton” of a computation tree for , in which each configuration is modeled by a path of length (using an exponential counter), and in which universal and existential configurations alternate.
-
also enforces that each point of the skeleton is labeled with some cell content via -propositions , but without any synchronization except the initial configuration.
-
makes sure that below the positions of successor configurations are coordinated.
The key point is that this enforces (due to bisimilarity) a computation tree below in which, due to the last item above, all positions of configurations are coordinated.
We remark that the hardness also holds when is not part of the input: one can reduce separability of by -formulae to separability of by (arbitrary) -formulae.
6.2 Upper Bound for Theorem 22
We show that over models of outdegree at most , -separability of fixpoint formulae can be solved in doubly exponential time. Let us start with establishing a technical but useful fact. For every language of -ary trees denote the language:
of bisimulation quotients of trees from .
Proposition 24.
For every NPTA , an NPTA recognizing can be computed in time exponential in the size of .
Proof.
Fix an NPTA . For every , we characterize existence of -ary with with the following parity game . The game has the set as positions. The pair consisting of the root of and is the initial position. From a position first chooses and a surjective map where is the set of children of . Then responds with a choice of and the next round starts in position . The game is a parity game: the ranks are inherited from in the sense that the rank of equals . It is easy to show that:
(3) |
for every . Using (3) we prove Proposition 24. It suffices to construct an automaton which accepts iff wins . To that end, using standard techniques we encode ’s positional strategies for as colorings of with and construct, in time exponential in , an automaton recognizing models labelled with such winning positional strategies. We then obtain recognizing by projecting out the additional colors from .
With the help of Proposition 24 we prove Theorem 22. Fix , -formulae and and signature . By Equivalence (), it suffices to check if and are jointly -consistent over for every . However, unlike with definability or in the binary case, we cannot conclude joint -consistency from joint -consistency. Instead, we use Proposition 24 to directly decide joint -consistency for all . For a language , define the language:
of finite -ary trees which are bisimulation quotients of finite prefixes of models from . By Proposition 24 and the closure properties of parity automata, for every one can construct in exponential time an automaton recognizing .
We prove the upper bound from Theorem 22. Using Theorem 2 compute automata accepting -reducts of models of . Compute recognizing and . Recall that any two trees are bisimilar iff they have isomorphic bisimulation quotients. It follows that admit a -separator over iff are joint -consistent iff are joint consistent. By Proposition 7, the latter condition holds for all iff it holds for and this can be tested in time polynomial in . Since are exponential, and are doubly exponential in the size of , this gives the upper bound from Theorem 22.
7 Case Study: Graded Modalities
In this section we apply our techniques and results to the case with graded modal operators. Formally, we extend with formulae of the shape and , where and the grade is a natural number. Intuitively, is true in a point if has at least successors satisfying and dually, is true in if all but at most successors satisfy [11, 25]. We denote with and the extension of and , respectively, with such graded modalities. Clearly, for any , is -definable by the formula , which is an additional motivation to study and .
Indeed, using the results and techniques from the previous section one can easily prove that -separability of -formulae (defined as expected) is 2-ExpTime-complete.
Theorem 25.
-separability of -formulae is 2-ExpTime-complete.
Proof.
For the lower bound, we reduce -separability of -formulae over in spirit similar to Lemma 4. Since the former problem is 2-ExpTime-hard by Theorem 22, the latter is as well. Recall the formula defining . Then, for any -formulae and , we have that is an -separator of over iff is an -separator of .
Towards the upper bound, suppose . Using standard arguments, one can show that are -separable over all models iff they are -separable over , where and is the greatest grade occurring in . We then construct NPTA equivalent to over -ary trees via (an analogue for of) Theorem 2 and proceed with as described in the upper bound proof of Theorem 22.
Interestingly, the problem becomes easier if we allow grades in the separating formula.
Theorem 26.
-separability of -formulae is ExpTime-complete.
The lower bound follows by the usual reduction from satisfiability. We thus focus on the upper bound. Similarly to the non-graded case, we establish first a model-theoretic characterization, based on the appropriate notion of bisimilarity that characterizes the expressive power of [10]. A relation between models is a graded bisimulation if it satisfies () and graded variants of the () and () conditions of bisimulations. The graded () condition says that if then for every and pairwise different children of , there are pairwise different children of satisfying for all . The graded () condition is symmetric. It is a -graded bisimulation if the graded () and () conditions need to be satisfied only for . We denote with (resp., ) the fact that there is a graded bisimulation (resp., a -graded bisimulation) between and that relates their roots. Variants with bounded depth and/or given signature are defined and denoted as expected.
Lemma 27.
For every with maximal grade , signature , and , the following are equivalent:
-
1.
are not -separable (over all models).
-
2.
are joint -consistent (over all models).
-
3.
are joint -consistent (over all models).
-
4.
are joint -consistent over for .
Using Lemma 27, one can solve -separability of formulae in exponential time, following the approach described in Section 3. More precisely, given , we construct NPTA equivalent to over -ary trees, as in Lemma 27, and decide whether are joint -consistent over for all via Proposition 7.
Let us provide some details on the proof of the central Lemma 27.
Proof.
We show the implications in turn. The implication is immediate.
For , suppose are not -separable. Hence, for every there is a pair of models and with . One can encode with an -sentence that two models and are depth- trees, is a prefix of some and of some . If is a fresh binary symbol, then it is also possible to encode with an (infinite) set of -sentences that is a graded bisimulation between and . Every finite fragment of only mentions finitely many grades and hence by assumption is satisfiable. Thus, by compactness of , the entire is satisfiable. This gives us with extensions and .
For , fix witnesses of joint -consistency, that is, and there are extensions of with and . By the Löwenheim-Skolem property of we may assume that both models are at most countable. It remains to apply the known fact that countable trees and satisfy iff and are isomorphic. For the sake of completeness, we add a brief justification of this latter statement. Assume and with respective children and such that . For every -equivalence class of the corresponding equivalence class has the same cardinality as . This is immediate for finite , and for infinite it follows because in countable models every two infinite subsets have the same cardinality. This allows us to inductively pick a bijective subrelation of between and which is still a graded bisimulation.
For , fix witnesses of joint -consistency, that is, and there are extensions of with and . We trim and so that the outdegree becomes at most . Without loosing generality we assume that the prefixes of and are not only isomorphic but identical. The semantics of every in a model is captured by a parity game whose positions are . We extend the definition of the game to . The set of positions and the winning condition are defined as in the classical case, and so are the moves for all the positions with topmost connective other than the graded modalities. In the classical game, from chooses a child of and the next position is . In , first chooses a subset of size of children of , then chooses one of these children and the next round starts at . Dually, in first picks a subset of at most ’s children, then responds with a choice of some not in and the next position is . It is tedious but straightforward to check that wins the game from iff is true at , as in the classical case. Note that if we take a submodel of which contains at least the root and all -witnesses (that is, points chosen by a winning strategy in for positions of shape ) then (the restriction of) to is a winning strategy for .
Let and be positional winning strategies for in the semantic games and . We take submodel of as follows. In the -prefix we take the root and all -witnesses for both and . In the rest of the model we only take -witnesses for . A submodel of is defined symmetrically. It follows that and .
Recall that is the maximal grade appearing in and . Since the respective sets of positions of and are and , for every point there are at most -witnesses chosen by from a position which has on the first coordinate. Consequently, the outdegree of and is not greater than . This proves Lemma 27.
8 Conclusion
We have presented an in-depth study of modal separation of -formulae over different classes of structures. For us, the most interesting results are the differences that are obtained over classes of bounded outdegree for different bounds , , . Without much effort our results on trees of bounded outdegrees can be transferred to infinite words and to ranked trees, via reductions similar to Lemma 4.
Throughout the paper we used the simplest possible measure of formula size: the length of a formula written as a string. Alternative more succinct measures, such as the number of non-isomorphic subformulae (DAG-size), are also interesting. Thus, a natural question is to what extent our results depend on the choice of size measure. In principle, using a more succinct measure makes the problems of definability and separability harder. However, all our decision procedures, with an exception of Theorem 21, are automata-based. Consequently, these procedures carry over with unchanged complexity to any size measure for which the translation from logic to nondeterministic automata has the same complexity as in Theorem 2. In the remaining case of Theorem 21 a weaker assumption suffices: the modal depth of a formula is at most polynomial in its size. Both the mentioned assumptions are arguably modest.
A place where the choice of size measure matters a little more is the construction of modal definitions and separators. In the cases of unrestricted, unary (), and high outdegree models ( for ) the constructed formulae have DAG-size essentially the same as size: doubly, singly, and triply exponential, respectively. Interestingly, however, in the binary case our formulae have only singly exponential DAG-size, which is easily seen to be optimal and contrasts with their doubly exponential size. This demonstrates that the lower bounds for size of modal definitions over cannot work for DAG-size. The same lower bound construction fails for DAG-size over unrestricted models, although there the exact DAG-size complexity of optimal separators remains unknown.
We mention some interesting open problems. First, the relative succinctness of over is to the best of our knowledge open in the setting with only one accessibility relation. Second, as we have mentioned in Section 3, the separators we compute are not necessarily the logically strongest ones. The logically strongest separators of are precisely the -uniform consequences of (if they exist) and are a natural object of study. Clearly, modal definability of is a sufficient condition, but not a necessary one. Let and for some . Then is not equivalent to a modal formula, but is a strongest separator. In the context of , open questions are -definability (and separability) and -definability (and separability) of -formulae. We conjecture them to be easier than 2-ExpTime. Finally, let us mention that definability of -formulae by safety formulae has been studied in [22]. It would be natural to investigate separability there as well.
References
- [1] Alessandro Artale, Jean Christoph Jung, Andrea Mazzullo, Ana Ozaki, and Frank Wolter. Living without Beth and Craig: Definitions and interpolants in description and modal logics with nominals and role inclusions. ACM Trans. Comput. Log., 24(4):34:1–34:51, 2023. doi:10.1145/3597301.
- [2] Franz Baader, Ian Horrocks, Carsten Lutz, and Ulrike Sattler. An Introduction to Description Logic. Cambridge University Press, 2017. URL: http://www.cambridge.org/de/academic/subjects/computer-science/knowledge-management-databases-and-data-mining/introduction-description-logic?format=PB#17zVGeWD2TZUeu6s.97.
- [3] Christel Baier and Joost-Pieter Katoen. Principles of model checking. MIT Press, 2008.
- [4] Fabio Bellissima. On the lattice of extensions of the modal logics KAlt. Arch. Math. Log., 27(2):107–114, 1988. doi:10.1007/BF01620760.
- [5] Michael Benedikt, Balder ten Cate, Thomas Colcombet, and Michael Vanden Boom. The complexity of boundedness for guarded logics. In 30th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS, pages 293–304. IEEE Computer Society, 2015. doi:10.1109/LICS.2015.36.
- [6] Achim Blumensath, Martin Otto, and Mark Weyer. Decidability results for the boundedness problem. Log. Methods Comput. Sci., 10(3), 2014. doi:10.2168/LMCS-10(3:2)2014.
- [7] Mikołaj Bojańczyk and Wojciech Czerwiński. Automata Toolbox. University of Warsaw, 2018. URL: https://www.mimuw.edu.pl/~bojan/papers/toolbox.pdf.
- [8] Ashok K. Chandra, Dexter Kozen, and Larry J. Stockmeyer. Alternation. J. ACM, 28(1):114–133, 1981. doi:10.1145/322234.322243.
- [9] Sang Cho and Dung T. Huynh. Finite-automaton aperiodicity is PSpace-complete. Theor. Comput. Sci., 88(1):99–116, 1991. doi:10.1016/0304-3975(91)90075-D.
- [10] Maarten de Rijke. A note on graded modal logic. Stud Logica, 64(2):271–283, 2000. doi:10.1023/A:1005245900406.
- [11] Kit Fine. In so many possible worlds. Notre Dame J. Formal Log., 13(4):516–520, 1972. doi:10.1305/NDJFL/1093890715.
- [12] Michael J. Fischer and Richard E. Ladner. Propositional dynamic logic of regular programs. J. Comput. Syst. Sci., 18(2):194–211, 1979. doi:10.1016/0022-0000(79)90046-1.
- [13] Tim French, Wiebe van der Hoek, Petar Iliev, and Barteld P. Kooi. On the succinctness of some modal logics. Artif. Intell., 197:56–85, 2013. doi:10.1016/j.artint.2013.02.003.
- [14] Martin Fürer. The computational complexity of the unconstrained limited domino problem (with implications for logical decision problems). In Logic and Machines: Decision Problems and Complexity, Proceedings of the Symposium ”Rekursive Kombinatorik”, volume 171 of Lecture Notes in Computer Science, pages 312–319. Springer, 1983. doi:10.1007/3-540-13331-3_48.
- [15] Dov M. Gabbay. Craig’s interpolation theorem for modal logics. In Conference in Mathematical Logic — London ’70, pages 111–127, Berlin, Heidelberg, 1972. Springer Berlin Heidelberg.
- [16] Gerd G. Hillebrand, Paris C. Kanellakis, Harry G. Mairson, and Moshe Y. Vardi. Undecidable boundedness problems for datalog programs. J. Log. Program., 25(2):163–190, 1995. doi:10.1016/0743-1066(95)00051-K.
- [17] Jean Christoph Jung and Jędrzej Kołodziejski. Modal separability of fixpoint formulae. In Proceedings of the 37th International Workshop on Description Logics (DL 2024), volume 3739 of CEUR Workshop Proceedings. CEUR-WS.org, 2024. URL: https://ceur-ws.org/Vol-3739/paper-5.pdf.
- [18] Jean Christoph Jung and Frank Wolter. Living without Beth and Craig: Definitions and interpolants in the guarded and two-variable fragments. In Proceedings of Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), pages 1–14. IEEE, 2021. doi:10.1109/LICS52264.2021.9470585.
- [19] Eryk Kopczynski. Invisible pushdown languages. In Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), pages 867–872. ACM, 2016. doi:10.1145/2933575.2933579.
- [20] Dexter Kozen. Results on the propositional mu-calculus. Theor. Comput. Sci., 27:333–354, 1983. doi:10.1016/0304-3975(82)90125-6.
- [21] Louwe Kuijer, Tony Tan, Frank Wolter, and Michael Zakharyaschev. Separating counting from non-counting in fragments of two-variable first-order logic (extended abstract). In Proc. of DL 2024, 2024.
- [22] Karoliina Lehtinen and Sandra Quickert. Deciding the first levels of the modal mu alternation hierarchy by formula construction. In Proceedings of Annual Conference on Computer Science Logic CSL, volume 41 of LIPIcs, pages 457–471. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2015. doi:10.4230/LIPICS.CSL.2015.457.
- [23] Christof Löding and Christopher Spinrath. Decision problems for subclasses of rational relations over finite and infinite words. Discrete Mathematics & Theoretical Computer Science, Vol. 21 no. 3, January 2019. doi:10.23638/DMTCS-21-3-4.
- [24] Martin Otto. Eliminating recursion in the -calculus. In Proceedings of 16th Annual Symposium on Theoretical Aspects of Computer Science (STACS), volume 1563 of Lecture Notes in Computer Science, pages 531–540. Springer, 1999. doi:10.1007/3-540-49116-3_50.
- [25] Martin Otto. Graded modal logic and counting bisimulation. CoRR, abs/1910.00039, 2019. arXiv:1910.00039.
- [26] Thomas Place and Marc Zeitoun. Separating regular languages with first-order logic. Log. Methods Comput. Sci., 12(1), 2016. doi:10.2168/LMCS-12(1:5)2016.
- [27] Vaughan R. Pratt. A decidable mu-calculus: Preliminary report. In Proceedings of 22nd Annual Symposium on Foundations of Computer Science (FOCS), pages 421–427. IEEE Computer Society, 1981. doi:10.1109/SFCS.1981.4.
- [28] Abraham Robinson. A result on consistency and its application to the theory of definition. Journal of Symbolic Logic, 25(2):174–174, 1960. doi:10.2307/2964240.
- [29] Marcel Paul Schützenberger. On finite monoids having only trivial subgroups. Inf. Control., 8(2):190–194, 1965. doi:10.1016/S0019-9958(65)90108-7.
- [30] A. Prasad Sistla and Edmund M. Clarke. The complexity of propositional linear temporal logics. J. ACM, 32(3):733–749, 1985. doi:10.1145/3828.3837.
- [31] Moshe Y. Vardi. Reasoning about the past with two-way automata. In Proceedings of International Colloquium Automata, Languages and Programming (ICALP), volume 1443 of Lecture Notes in Computer Science, pages 628–641. Springer, 1998. doi:10.1007/BFB0055090.
- [32] Yde Venema. Lectures on the modal -calculus, 2020.