Boolean Basis and Succinctness of Modal Logic via Hella-Vilander Games
Abstract
The Hella-Vilander game for modal logic is a model comparison game that captures the formula size necessary to separate sets of pointed Kripke structures. We introduce the -ON game as a modification of this game. Our game captures the necessary number of modal operators, i.e., and instead of formula size. We use our game to show that the bi-implication , sometimes also called equivalence, enables us to write modal logic formula with significantly fewer modal operators. With this we show, that with bi-implications we can also write significantly shorter modal logic formulas. This result holds even if only special classes of Kripke structures are considered. To be more precise we show that there is an exponential succinctness gap between modal logic and its extension with bi-implication on the class of structures with a transitive and reflexive accessibility relation, as well as on the class of structures with a symmetrical and reflexive accessibility relation. Lastly we show that for the class of structures with a transitive and symmetrical accessibility relation this succinctness gap disappears.
Keywords and phrases:
succinctness, modal logic, model comparison games2012 ACM Subject Classification:
Theory of computation Modal and temporal logicsEditors:
Stefano Guerrini and Barbara KönigSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Model comparison games, like the Ehrenfeucht-Fraïssé game [4, 6], are a type of logic games. They can be used to prove that it is difficult to express certain properties in a logic. For example the game defined by Adler and Immerman in [1] has been used to prove lower bounds for formula size in FO2(TC) and the logic CTL. Extended syntax trees [8] can be seen as an equivalent method to this game, even though they drop the game setting. The game and the syntax tree method can also be modified to work on other logics, see [7, 12, 13] for examples. In [9] Hella and Vilander introduce their own game, which builds on the Adler-Immerman game. This Hella-Vilander game is defined for modal logic and the -calculus, but it can also be adapted for other logics. The main idea of this game is as follows: players (Spoiler, Samson) and (Duplicator, Delilah) play on two sets of structures and with a resource parameter . Player spends the parameter in order to make his moves, while answers. has to play a winning move before runs out. In such a play has a winning strategy if, and only if, there is a formula of size at most of the corresponding logic, that separates and .
There are also model comparison games that only count the number of occurrences of specific characters, instead of the more general formula size. See [3, 5] for examples. As part of their conclusion Hella and Vilander suggest that their game can be modified to count a more specific parameter as well. One such possible parameter is the number of modal operators used, i.e., and . This can be done, because every move of the Hella-Vilander game corresponds to some character of a separating formula. So by simply adjusting the way the parameter has to be spend, we can create a game for the number of modal operators instead of formula size. The main problem with this approach is that some moves do not reduce the parameter at all. Firstly this opens up the possibility of endless games, where cannot win, but he can ensure he does not lose either. Secondly it makes the game inefficient, since moves that do not change the parameter are inherently less interesting.
In order to deal with these problems we look at another model-comparison game used by Vinall-Smeeth in [14]. He calls this game the -QVT game. However in this paper we use the parameter with a different meaning. So in order to avoid confusion we will use the name -QVT game when referring to this game instead. The -QVT counts the number of quantifiers in formulas of the -variable fragment of first-order logic. It can be seen as a modification of the Adler-Immerman game, where after every move all structures without a partially isomorphic counterpart in the respective other set are removed. This is possible, because these are exactly the structures that can be removed in the Adler-Immerman game without playing any quantifier-moves. With this idea we can create our own game, which counts the number of modal-operators of separating modal logic formulas. We call this game the -Operator-Number game.
Afterwards we use our new game in order to look at how different sets of Boolean connectives can be used to write more succinct formulas. This topic was already researched by Berkholz, Kuske and Schwarz in [2]. They show that all complete operator sets, which only contain so-called locally monotone operators, have the same succinctness up to a polynomial. On the other hand all sets that contain some non locally monotone operator also coincide in succinctness. In this paper we use the De Morgan basis, consisting of , and , as a candidate from the first class and the De Morgan basis extended by the bi-implication as a candidate of the second. Berkholz et al. also show that for modal logic there is an exponential succinctness gap between these two classes. They prove this by working directly with formulas and using intricate and ad hoc arguments. This gap still exists if only reflexive Kripke structures are taken into consideration. However if the focus is tightened even further to only Kripke structures with an equivalence relation as accessibility relation, then this succinctness gap is closed.
From these findings some interesting follow-up questions emerge: Is it possible to remove without significantly increasing the number of modal operators? This question is interesting because modal logic formulas with few modal operators are easier to work with. However we answer this question negatively by showing that there is an exponential operator-succinctness gap between modal logic and modal logic with bi-implication as well. Another interesting follow-up question is the following: What about other classes of Kripke-structures? We want to answer this question in this paper. In particular we show that for the class of Kripke structures with a reflexive and transitive accessibility relation, as well as the class of structures with a reflexive and symmetrical relation this exponential succinctness gap exists. This directly implies that the gap also exists for the class of transitive and the class of symmetrical structures. For all of these classes there is an exponential operator-succinctness gap as well. Only for the class of structures with a transitive and symmetrical relation this gap disappears. In this way our work offers a more complete and systematic examination of the connection between succinctness and the basis of boolean connectives in modal logic on special classes of Kripke structures.
2 Preliminaries
2.1 Modal logic with Bi-implication and Kripke structures
Let be an infinite set of propositional variables, also called labels. We define ML as the set of all modal logic formulas over using and as constants, and as modal operators, as well as , , and as Boolean connectives. Formally ML is the set of all formulas generated by the following grammar:
where is a propositional variable. Now modal logic can be defined as a restriction of modal logic with bi-implication. So ML is defined as the set of all formulas from ML that do not contain any occurrences of .
Let be a connected, directed, labelled graph, where is the set of possible worlds or nodes, is a binary accessibility relation and is a labelling function, assigning sets of propositional variables to each node. For a node we call the colour of . If , the set of all labels used in , is finite, we say is a finitely labelled graph. The tuple is called a pointed Kripke structure or just Kripke structure. It should be noted here, that our naming conventions might differ from the norm. In the literature the labelled graph is sometimes referred to as a Kripke structure, while is called a pointed Kripke structure. However in this paper we do not view as a structure. Also our conventions allow us to use the terms pointed Kripke structure, Kripke structure or sometimes just structure interchangeably when referring to .
Let be a set of nodes. We write to mean the class of structures . We define as the set of all successors of a node . This notation can also be extended to sets of nodes with and structures with . The extension to classes of structures is defined in the obvious way. We can now start giving the definition of the semantics of modal logic with bi-implication on Kripke structures.
The satisfaction relation is defined inductively as follows:
-
1.
and for all
-
2.
iff
-
3.
iff
-
4.
iff or
-
5.
iff and
-
6.
iff
-
7.
iff holds for some
-
8.
iff holds for all
Next we extend this satisfaction relation to classes of structures. Let be a class of pointed Kripke structures. Then if, and only if, for all . We say a formula separates two classes of pointed Kripke structures and if either and , or and . So must be true for all structures of one class and for no structure of the other class. In this case we will also call a separating formula. It should be noted here that our definition differs slightly from the definition used by Hella and Vilander. In their definition is a separating formula only if and . However it is easy to see that if is a separating formula by our definition then or is a separating formula by their definition. For and we might also say separates and instead of and .
We say that two formulas and are equivalent, in writing , if every pointed Kripke structure satisfies if, and only if, it satisfies . Let be some class of Kripke structures. Then two formulas and are equivalent on , in writing , if every pointed Kripke structure from satisfies if, and only if, it satisfies .
2.2 Formula size
The size of a formula is the number of symbols needed to write it, excluding brackets. This can be expressed quite easily using the following inductive definition:
-
1.
if
-
2.
if , or
-
3.
if , or
Similarly the operator-number of is the number of modal operators, i.e. and , used in the formula. The inductive definition is as follows:
-
1.
if
-
2.
if
-
3.
if or
-
4.
if , , or
The term succinctness is used to compare logics based on the length of formulas needed to express the same property. Given two logics , with the same expressiveness and a function , we say is -times more succinct than if, and only if, for every there is a formula with , so that for every -formula with . If is an exponential function, we also say is exponentially more succinct than . Let be a class of structures, then succinctness on is defined by taking the definition of succinctness and replacing with .
Succinctness is based on formula size, but we can define a corresponding measurement for operator-number. We say that is -times more operator-succinct than if, and only if, for every there is a formula with , so that for every -formula with . Exponential operator-succinctness and operator-succinctness on special classes of structures are defined in the obvious way.
Counter intuitively it is theoretically possible that there are two logics and where is exponentially more succinct than and is exponentially more succinct than . This is because there might be some properties that can be expressed more succinctly in , while some other properties can be expressed more succinctly in . However this cannot happen for the logics examined in this paper. Since every ML formula is also an ML formula, ML cannot be more succinct than ML.
3 The operator-number game
Let be a finitely labelled graph and let be two sets of nodes of the graph. Then is an unordered pair. From this point on, we will use just the term pair to refer to unordered pairs. We say such a pair is tidy, if for every there is a with and vice versa. For any sets and , can be tidied up by removing all nodes without a fitting node in and all without a fitting node in . Let , where and are the largest subsets of and respectively, so that is tidy.
Definition 1.
Let be a finitely labelled graph, two sets of nodes and . The -Operator-Number game, or -ON game for short, played from and with parameter has two players and . An -position of the game consists of a tidy pair of sets of nodes , as well as a parameter . The starting position is . A turn for starts from an -position and proceeds in one of the following two ways:
-
1.
Split-move: may only play this type of move if . He picks a set and splits it into two disjoint and non-empty sets and . Then he chooses two parameters with . Let be the set that fulfils . This play by creates the -position . Afterwards plays by taking the set and picking a subset of it. Then she chooses and the game continues from the -position .
-
2.
Operator-move: may only play this type of move if . He picks a set and then picks a function , where for every . Let be the image of under and let be the set with . Then the -position after this move by is . Afterwards plays by taking the set and then determining a set of successors of the nodes from . The game continues from the position .
Player wins a play of the game, if the play arrives at a position with . Otherwise if cannot make another move, player wins.
When looking at an operator-move we might say picks as a successor of instead of picks a function with . We usually make play in such a way that no nodes are removed during tidy-up. For this reason we will only mention the tidy-up process if nodes are getting removed. The composition of a -position depends on the type of move played by . However in contrast to -positions we mainly view -positions as intermediate positions. So it is not necessary to give a full definition for them.
The parameter is reduced in every move and the play cannot continue, once the parameter is depleted. So a play with starting parameter can go on for rounds at most. In particular every play has to be finite. This implies in every play either player or player wins. So our game is a finite, chance-less two-player game with complete information, that cannot end in a draw. Then by Zermelo’s Theorem for every position of the -ON game one of the players, or , has a positional winning strategy.
Such a positional strategy for a player, or , is a partial map or . For a strategy takes as input an -position and outputs a -position, that can be reached from the input position via a single move by . It is a winning strategy, if every play where plays according to is won by him. Similarly a strategy for takes a -position as input and gives an -position, reachable in one move by , as output. It is a winning strategy, if wins every play in which she plays . If player has a winning strategy for some position , we will say this position is a winning position for player . The definition of winning positions for is done in the obvious way.
As previously stated, the main influence for our game is the Hella-Vilander game from [9], which in turn builds on the Adler-Immerman game from [1]. Before we show that our game can be used to prove lower bounds for operator-number, we want to discuss how our game differs from these games.
The -ON game is played by two players, (Samson) and (Delilah), just like the game defined by Hella and Vilander. However instead of playing on two sets of structures, in our game the players play on unordered pairs of sets of nodes. This shows two significant differences between our game and their game. Firstly we are using nodes, where they used structures. This change however is only notational, because we will still implicitly talk about pointed Kripke structures. We will use the node as a representation of the structure . In order to do so, we have to fix a single graph for the entire game. So every pointed Kripke structure that appears in a play must be based on the same labelled graph. While this appears like a real restriction, it is possible to merge multiple labelled graphs into one without changing the validity of formulas at any world.
The second significant difference is that we use unordered pairs instead of ordered pairs or tuples. This change allows us to switch around the elements of our pairs at will. In the game by Adler and Immerman, such a switch is used as a not-move. So basically in our game, we can simulate a not-move without actually making a move. This is possible, because we are only interested in the number of operators and not in the number of negations. We also add an additional step to every move, where we tidy up the pair of our position. By doing so we remove all elements, that could have been removed in the Hella-Vilander game by playing a number of turns without playing an operator-move. A similar approach is used in the -QVT game from [14]. In this game structures without a partially isomorphic counterpart in the other set are removed after every step. A problem with this method occurs when tidying up would skip an infinite number of moves. Vinall-Smeeth solves this problem by only considering finite structures. However for our purposes a weaker condition is sufficient. We only demand that our graphs are finitely labelled.
As a last major difference: In the Hella-Villander game, player is forced to play following the so-called oblivious strategy. This means she always has to answer with the largest possible set. This works, because this oblivious strategy is always optimal. So if can win the game, she can do so by playing obliviously. While this simplifies the definition of the game, it would also complicate our proofs. Simply put we only want to prove that has a winning strategy. Whether this strategy is also optimal is of no importance to us. Sometimes playing in a suboptimal way might lead to smaller positions, which can simplify our proofs.
Our aim is to prove that the -ON game can be used to characterize the number of operators needed in order to separate sets of nodes. As a stepping stone towards this goal, we state the following two lemmas.
Lemma 2.
Let be a finitely labelled graph and be sets of nodes with . If there is a separating formula for and , then there is also a separating formula for and with .
Lemma 3.
Let be a finitely labelled graph, be sets of nodes that form a tidy pair and let be an ML-formula without any modal operators. Then if, and only if, .
Now we want to prove the following Correctness- and Completeness-Theorem for our game. We do so by proving both directions separately, beginning with the proof of correctness.
Lemma 4.
Let be a finitely labelled graph, two sets of nodes and . If has a winning strategy in the -ON game played from and with parameter , then there is an ML-formula separating and with .
Proof.
For this proof we assume that has a winning strategy for the -ON game played from and with parameter . Because of Lemma 2 we may assume that and already form a tidy pair. Then the starting position of this game is . We construct the separating formula by induction over . First we look at the base case . Since the parameter is already used up, cannot make a move at all. So the only possible way for him to win is, if the current position already fulfils . Then is a trivial separating formula.
We now assume, the implication holds for all parameters and we look at some winning strategy for the -ON game starting from . We have to differentiate between the types of moves prescribes as the next move from this position.
We begin by assuming that prescribes a split-move. In this case picks a set from , without loss of generality he picks . Then splits it into two sets and and he splits into and , both of which have to be at most . Since is a winning strategy for , it has to win against every possible strategy of , in particular against the following ones: answers by picking and then picks either or . The game continues from or respectively. From both positions is still a winning strategy. So thanks to our induction hypothesis and Lemma 2, for we have separating formulas for and with . We may assume , because otherwise we could use instead. Then is a separating formula for and with .
Next we look into prescribing an operator-move. Once again we may assume that plays on . For every node in he picks a successor and forms . The strategy has to win against a strategy of that simply picks . So can also win from the position . This means there has to be a formula with , and . So every node in has at least one successor, namely those picked by during the construction of , that satisfies . On the other hand no node in can have such a successor, since all of them are part of . So is a separating formula for and with at most operators. This finishes our prove of the first implication.
Next we prove the completeness of our game.
Lemma 5.
Let be a finitely labelled graph, an ML-formula, two sets of nodes and , so that is a separating formula for and with . Then has a winning strategy in the -ON game played from and with parameter .
Proof.
To begin with we may assume that the separating formula only contains the connectives and , as well as the operator , since otherwise we can use the well known equivalences and in order to remove all occurrences of and . None of these changes increase the number of modal operators, so still holds. The -ON game played from and with parameter has the starting position . If is a separating formula for and , it is also a separating formula for the tidied up subsets of and . So we may assume that is already tidy, meaning . We construct , the winning strategy of , by playing according to . This construction is done by induction over the formula size of .
For the base case, we look at all atomic formulas, so . Because of Lemma 3, and agree on . So the only way for to be a separating formula is . In this case is already a winning -position and does not have to make a move in order to win the play.
We now assume the implication holds for all formulas with and we look at a separating formula for and with and . We have to differentiate based on the outermost operator of .
We begin with . Because of our definition of separating formula, also is a separating formula for and with and . So thanks to our induction hypothesis, already has a winning strategy for the -ON game played on and with parameter .
Next we look at with and . This also implies . So every node in one of the sets, w.l.o.g. we assume , has a successor that satisfies , while no node in the other set, , has such a successor. Then prescribes an operator-move, where picks for every a successor with and collects them into the set . Then answers by gathering some nodes into a set . However none of the nodes picked by can satisfy . The game continues from the position , where is a separating formula for and and thus also for the sets after tidying. So because of our induction hypothesis, for every choice made by during this turn, has a winning strategy from the resulting position. So simply copies the fitting strategy for ’s choice of .
For we have to make an additional differentiation. It is possible that one of the sub-formulas, w.l.o.g. we assume , does not contain any modal operators. Then one of the sets, we assume , satisfies , while the other satisfies its negation, so . This implies and . So because of Lemma 3, we also know , which in turn implies . This means is also a separating formula for and with and . So our hypothesis offers a winning strategy for in .
Lastly we look at , where both and contain at least one modal operator. This implies . Once again we assume and thus . Let and be the disjoint subsets of with and . If one of these sets, w.l.o.g. we assume , is empty, then and thus holds. So is a shorter separating formula for and and according to our induction hypothesis has a winning strategy for the position . Otherwise we know and . So may play a split-move where he splits into and . Then he picks the new parameters as and , which implies . Player answers by picking a set . Independent from her choice holds. The game continues from either or . For the first case, can use as a separating formula for and . Similarly is a separating formula for and . So by our induction hypothesis every possible follow-up position is a winning position for and thus is as well.
Theorem 6.
Let be a finitely labelled graph, two sets of nodes and . Then has a winning strategy in the -ON game played from and with parameter if, and only if, there is an ML-formula with , that is a separating formula for and .
4 Succinctness between ML[] and ML
We begin this section by proving some simple upper bounds for succinctness.
Lemma 7.
The logic ML is at most exponentially more succinct and at most exponentially more operator-succinct than ML.
Proof.
It is quite easy to see that the succinctness gap between ML and ML is at most exponential. We can simply replace every sub-formula with . Every time this is done, the formula size is doubled at most. Since the number of bi-implications cannot exceed the formula size, this procedure leads to an exponential size increase at worst.
However this method cannot provide an upper bound for the operator-succinctness. While every such substitution only doubles the numbers of operators as well, we cannot bound the number of operators in the resulting formula by some function of the number of operators in the original formula. E.g. for a formula with only one modal operator that appears in the scope of nested bi-implications this method would result in a formula with modal operators.
However there is another method, that guarantees an at most exponential increase in operator-number. This method is based on the construction used by Spira in [11]. We can use the equivalence , which holds as long as does not appear in in the scope of any operator. This allows us to inductively move all modal operators outside of the scope of any bi-implication. Every time this is done, the number of modal operators is doubled, which leads to an exponential increase. Then the bi-implications can be removed in the known way without increasing the number of modal operators any further. A more in-depth proof of this is given as part of the appendix.
In the following Section 4.1 we show that there is an exponential succinctness gap between ML and ML on the class of Kripke structures with a transitive and reflexive accessibility relation. Afterwards in Section 4.2 we show that this exponential succinctness gap also exists on the class of Kripke structures with a symmetrical and reflexive accessibility relation. Since these two claims are proven in similar ways, we want to use this opportunity to give a general overview of these proofs.
To begin with we construct an infinite graph as well as two series of nodes and . Then for every we give an ML-formula . This formula separates and . The number of modal operators in is bound from above by a polynomial over . Then we give some parameter that is exponential in and show that is a winning position for player in the game. It follows that cannot have a winning strategy for this position. So by Theorem 6, there cannot be an ML-formula that separates from with implying that every ML-formula equivalent to must be of size exponential in at least. This proves an exponential succinctness gap between ML and ML for all classes of Kripke structures belongs to.
Now we construct the infinite labelled graph , which will be the basis for all of our other graphs. The nodes of this graph are formally defined as . The edges of this graph are as follows:
The graph can be separated into segments. Then segment contains all nodes with index , as well as and . So the latter two nodes are shared nodes between segment and segment . The nodes and edges of segment are presented graphically in Figure 1. All edges of connect nodes within a segment and all segments have equivalent edges. So this figure fully represents the nodes and edges of . The numbers on the left side of the figure count the layers of the graph. Segment contains all nodes of layers , and , as well as the nodes and from layer . Lastly we define the labelling function of . The labels we use are . The only node labelled with is . For every the node is the only node of segment labelled with . Lastly we use the remaining labels , and in order to label the layers of the graph in an alternating pattern. More precisely for every we label all nodes of layer with , all nodes of layer with and all nodes of layer with . The formal definition of the labelling function is given by the following list:
-
for all ,
-
for all and
-
-
for all ,
-
for all ,
Claim 8.
Let be the ML formula inductively defined by and . Then is a separating formula for and with for every .
Proof.
This proof is done via induction on . For the base case the formula holds only in , because is the only node labelled with . Next let be some positive number. We assume that our claim holds for . So and . Now we look at the other nodes of segment . We know , while no other node of layer is labelled with . This implies , , , as well as . So and model , while and do not. Because and are the only successors of , this also means . On the other hand, is a predecessor of and is a predecessor of . So neither nor satisfy this formula. Lastly is a successor of , so . But and are the only successors of , so , which proves the claim.
It should be noted here that the formula does not contain any occurrences of either , or . However we need those labels in order to adapt to the modified graphs we use in the following proofs.
4.1 Transitive and reflexive graphs
The graph is defined as an extension of . The nodes and labels stay the same. First we add a self-loop for every node. So is a new edge for every node . Then we add a new edge from a node to a node , for all and where is a node on layer and is a node on layer with . So every node is now a predecessor to all nodes two or more layers below. So formally . It is easy to see that is a reflexive graph. On top of that it is also transitive. Let and be edges of the graph. If either of them is a self loop, then is the same as the other edge. Otherwise must be on a lower layer than and must be on a lower layer than . So is at least two layers below and is an edge in .
Lemma 9.
There are two polynomial functions , so that for every there is an ML-formula , which separates from , with and .
Proof.
This is proven, by simply adapting the formula from Claim 8 to separate and in instead of in . In every edge goes down exactly one layer. But all edges added to when constructing go down at least two layers or are self-loops. So we need a way to determine what layer a node belongs to. We look at the formula defined by and with and . This formula states, that there is a path of length that starts at the current node and on this path the labels , and appear in an alternating pattern. Such a path cannot use any self-loops, so every edge has to move down at least one layer. Then only nodes of layer or above satisfy this formula. Similarly only nodes of layer or below satisfy . Now we can use the formula in order to check whether or not a node belongs to layer .
Then for any node of layer holds if, and only if, has a successor exactly one layer below that satisfies . Similarly holds if, and only if, every successor of that is exactly one layer below the current layer satisfies . So we can add conditions like these to in order to only consider edges that go down exactly one layer, which are all edges from , while ignoring all edges from . We define inductively with and . Then for all nodes of layer , holds if, and only if . So by Claim 8, is a separating formula for and . For a rough estimation of the size of we can assert that the number of times a sub-formula occurs in is linear in . Also the size of such a formula is linear in , which is bounded from above by . So the overall size of is quadratic in , which is polynomial. A more in-depth examination leads to the findings and .
Lemma 10.
There is an exponential function , so that for every and for every ML-formula which separates from , we have .
Proof.
First we define . It is easy to see that holds. So from now on we may assume . Then we show that there is no separating ML-formula for and with at most modal operators. This is done by playing the -ON game on and with parameter and showing that has a winning strategy for these games. In order to do so we define the following sets of positions of the game.
Then is defined as the union of these sets. For it is easy to see, that and thus holds. It is also easy to see that no contains a position with . So does not contain any immediate winning position for . Player can also stop player from leaving . More precisely if makes a move from any position in , then can answer in such a way that the follow-up position is also in . We can prove this fact by going through all positions within and checking every possible move that can make. In this shortened proof we will showcase this for the set . The remaining cases are given as part of the appendix.
So let with and be some position from . In this position player can play an operator-move, where he picks some successor of . If he picks a node that is also a successor of or , then can answer by picking the same node. In this case the next position satisfies . So it belongs to . Then cannot leave by picking a shared successor, meaning a node that is a successor of both sets of the position. So for in particular we can ignore all moves where picks a node two or more layers below any of the current nodes. Every successor of besides itself is also a successor of or . If picks as its own successor then can answer by also using the self-loops of and . So this move leads to the new position , which is also a part of . Alternatively can also play an operator-move where he picks successors of and . Once again if picks a node that can also be picked by , then the next position is a member of . The only other options of are to pick or as a successor of and or as a successor of . If he uses the self-loop both times, answers with her own self-loop and the follow-up position belongs to . If choose and then answers . This moves the play to , a position from . On the other hand if picked and , then answers with , which leads to , a member of . Lastly if he choose and , picks . The new position belongs to . So cannot leave from by playing an operator-move.
At can also play a split-move. In this case he has to split into and . He also splits the parameter into and . One of these two new parameters is at most half of the original parameter . So can pick in such a way, that . Then she answers with . The new position of the play is either or . So we have shown that cannot leave by making a move in a position from .
The same holds for all other positions from . Since cannot leave and does not contain any immediate winning position, cannot win from any of these positions. Then all positions in are winning positions for . This includes . So there cannot be a separating formula for and with or less modal operators.
Theorem 11.
The logic ML is exponentially more operator-succinct and exponentially more succinct than ML on the class of Kripke structures with a transitive and reflexive accessibility relation.
Proof.
Let be some natural number. Then is the ML-formula given in Lemma 9. We know and for some polynomial functions and . We also know that separates the two transitive and reflexive Kripke structures and from one another. Let be some ML-formula equivalent to on the class of Kripke structures with a transitive and reflexive accessibility relation. Then has to be a separating formula for and as well. So, according to Lemma 10, for some exponential function . This implies as well. We define as . Then holds. So ML is -times more operator-succinct than ML and thus exponentially more operator-succinct. We also define as . We know holds. This implies that ML is -times more succinct than ML, so it is exponentially more succinct as well.
From this theorem it is easy to infer, that the succinctness gap also exists on all classes of structures, which are supersets of the class of Kripke structures with a transitive and reflexive accessibility relation. We say an accessibility relation is serial, if it satisfies the condition . It is called weakly dense, if it satisfies . Both of these conditions are true for every reflexive relation.
Corollary 12.
The logic ML is exponentially more succinct than ML on the following classes of pointed Kripke structures:
-
1.
The class of all pointed Kripke structures.
-
2.
The class of all pointed Kripke structures with a transitive accessibility relation.
-
3.
The class of all pointed Kripke structures with a reflexive accessibility relation.
-
4.
The class of all pointed Kripke structures with a serial accessibility relation.
-
5.
The class of all pointed Kripke structures with a weakly dense accessibility relation.
4.2 Symmetrical and reflexive graphs
The graph is also constructed as an extension of . We do not change the nodes or their labels. We only add edges to . For every edge we add its inverse to our new graph. For nodes and , that belong to the same level, we also add the edges and . This explicitly includes all possible self-loops. So formally . It is easy to see that is both symmetrical and reflexive.
Lemma 13.
There are two polynomial functions , so that for every there is an ML-formula , which separates from , with and .
Proof.
Once again this is proven by adapting from Claim 8. All edges in go down exactly one layer. But all of the edges added to during the construction of either go up one layer or stay on the same layer. Since we used the labels , and in an alternating pattern, the current layer, the layer one below and the layer one above are all coloured differently. This means we can use these labels to make sure that only the original edges from are considered. We define via and . For all nodes of layer we know that holds if, and only if, . So, by Claim 8, is a separating formula for and with and . So and fulfil all conditions of the lemma.
Lemma 14.
There is an exponential function , so that for every and for every ML-formula , which separates from , we have .
Proof.
This proof is very similar to that of Lemma 10. We also use , so holds. Then for we show that is a winning position for player in the -ON game. For this purpose we define the following sets of positions of the -ON game.
Then is defined as the union of these sets. For it is easy to see that and thus holds. It is also easy to see, that no contains a position with , so does not contain any immediate winning position for . Next we show that for every move of from a position of player can defend in such a way that the next position is in as well. In this shortened version of our proof we will only give a proper explanation for . The description of the other cases is once again moved to the appendix.
So we have to look at positions with and . Once again if picks a shared successor, can move the game to a position from . For this happens every time picks a node of the same layer as any node in the other set. In particular if uses the self-loop on any nodes. Since every successor of is also a successor of or , player cannot leave by playing an operator-move on . Similarly when playing on we only have to look at picking or as a successor of , as well as or as a successor of . This is because every other successor is also a successor of . If picked at all, then answers with . If he choose or in addition to , then these nodes will be removed when tidying up, because they are labelled differently from and . So in this case the game continues with , a position from . On the other hand, if picked both and as successors, answers with . Then the follow-up position is from . So cannot leave by playing an operator-move at . Lastly we look at what happens when plays a split-move. He has to split into and and into and . Then can answer in such a way that and the follow-up position is either or . With this we have shown that cannot leave in one move from .
The same goes for all other positions from . So can stop from leaving and there is no immediate winning position for in . Then all position in , including , have to be winning positions for player . This is why there cannot be a separating formula for and with or less modal operators.
Theorem 15.
The logic ML is exponentially more operator-succinct and exponentially more succinct than ML on the class of Kripke structures with a symmetrical and reflexive accessibility relation.
Proof.
The proof of this theorem can be done analogous to the proof of Theorem 11. Simply replace all mentions of transitivity with reflexivity and use Lemma 13 and Lemma 14 instead of Lemma 9 and Lemma 10 respectively.
Since the succinctness gap exists when looking only at symmetrical and reflexive graphs, it has to exist for any class of graphs that contains all symmetrical and reflexive graphs.
Corollary 16.
The logic ML is exponentially more succinct than ML on the class of Kripke structures with a symmetrical accessibility relation.
4.3 Transitive and symmetrical graphs
So far in this paper we have shown that adding bi-implications allows us to write more succinct formulas. Even when only looking at special classes of Kripke structures, like those with a transitive accessibility relation and those with a symmetrical one. In this section we are going to show that the same does not hold if we combine these two conditions. More precisely we show that there is no exponential succinctness gap between ML and ML on the class of Kripke structures with a transitive and symmetrical accessibility relation.
We start with a few remarks about our notation in this section. Let be the class of all Kripke structures with a transitive, symmetrical and reflexive accessibility relation. We write if and are equivalent on all pointed Kripke structures with a transitive and symmetrical accessibility relation. Similarly we use when talking about structures from and for graphs with an empty accessibility relation.
All connected, transitive and symmetrical graphs, that are not reflexive as well, contain a single node and no edges. So holds if, and only if, both and hold. This fact offers us a way to find formulas that are equivalent on transitive and symmetrical structures, which we will use to prove the following Theorem.
Theorem 17.
For every ML-formula there is an ML-formula with and the size of is polynomial in the size of .
Proof.
In [2] it is shown, that ML has polynomial translations with regards to in ML. So for every ML-formula there is an ML-formula with and the size of is polynomial in the size of . We now look at the graph with only one node and no edges. We know as well as for all formulas . We can use this to replace all operators in , obtaining a operator free formula with . Since this formula does not contain any operators, we can treat it as a formula of propositional logic. So as described by Pratt in [10], we can use Spira’s construction from [11] in order to construct an ML-formula with . The size of is polynomial in the size of , which in turn is bounded from above by the size of . Now we just need to combine these formulas to obtain . In order to do so we use the formula , which is true on all graphs from and false for all edgeless graphs, as well as its negation . We construct the wanted ML-formula as . This formula fulfils the following:
Which also implies . The size of this formula is , so the sum of two polynomials over and a constant, which is also polynomial in the size of .
5 Conclusion
The main result of this paper is the development of a new model comparison game, the -Operator-Number game. This game can be used to prove lower bounds for the number of modal operators necessary to express certain properties in modal logic. An interesting follow-up question to this might be: For which logics and which parameters can a similar game be designed? If there is a version of the Hella-Vilander game for some logic, it should be possible to adjust that game to only count all occurrences of one specific type of operator. This should work for every type of operator of that logic. The more interesting part of that question is whether or not some process similar to our idea of ’tidying up’ is possible. A positive answer is given for FOl in [14]. However additional research is necessary to achieve a better understanding of this topic.
We also give a more complete overview of the succinctness of modal logic with bi-implication on different classes of Kripke structures. For every possible combination of the properties reflexive, transitive and symmetrical, we have either proven or disproven the existence of an exponential succinctness gap between ML and ML on the corresponding class of structures.
A possible future task could be to look into even more classes of Kripke structures, for example those with a function or a partial function as accessibility relation. Although these examples might require a quite different proof from the ones presented in this paper. Instead we could also attempt to adjust our proofs for a different logic. Our current goal is to do this for the logic CTL. This logic can be seen as an extension of ML, where more complex operators are added. Our hope is that we can show that bi-implications can be used to write exponentially shorter formulas in CTL as well.
References
- [1] Micah Adler and Neil Immerman. An lower bound on formula size. ACM Transactions on Computational Logic, 4(3):296–314, 2003. doi:10.1145/772062.772064.
- [2] Christoph Berkholz, Dietrich Kuske, and Christian Schwarz. Boolean basis, formula size, and number of modal operators. Logical Methods in Computer Science, Volume 21, Issue 3, July 2025. doi:10.46298/lmcs-21(3:10)2025.
- [3] Marco Carmosino, Ronald Fagin, Neil Immerman, Phokion Kolaitis, Jonathan Lenchner, and Rik Sengupta. Multi-structural games and beyond. Logical Methods in Computer Scienc, 20:27:1–27:40, 2024. doi:10.48550/arXiv.2301.13329.
- [4] Andrzej Ehrenfeucht. An application of games to the completeness problem for formalized theories. Fundamenta Mathematicae, 49:129–141, 1960. URL: https://eudml.org/doc/213582.
- [5] Ronald Fagin, Jonathan Lenchner, Kenneth W. Regan, and Nikhil Vyas. Multi-structural games and number of quantifiers. Logical Methods in Computer Science, 21(1), 2025. doi:10.46298/lmcs-21(1:10)2025.
- [6] Roland Fraïssé. On some classifications of relationship systems. Publications scientifiques de l’Université d’Alger, Série A, Sciences mathématiques, 1954.
- [7] Tim French, Wiebe van der Hoek, Petar Iliev, and Barteld Kooi. Succinctness of epistemic languages. In Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence (IJCAI-11), pages 881–886, 2011. doi:10.5591/978-1-57735-516-8/IJCAI11-153.
- [8] Martin Grohe and Nicole Schweikardt. The succinctness of first-order logic on linear orders. Logical Methods in Computer Science, 1(1), 2005. doi:10.2168/LMCS-1(1:6)2005.
- [9] Lauri T. Hella and Miikka S. Vilander. Formula size games for modal logic and -calculus. Journal of Logic and Computation, 29(8):1311–1344, 2019. doi:10.1093/logcom/exz025.
- [10] Vaughan R. Pratt. The effect of basis on size of boolean expressions. In Proceedings - Annual IEEE Symposium on Foundations of Computer Science, FOCS, pages 119–121, October 1975. doi:10.1109/SFCS.1975.29.
- [11] Philip Spira. On time-hardware complexity tradeoffs for boolean functions. In Proceedings of the 4th Hawaii Symposium on System Sciences, 1971, pages 525–527, 1971.
- [12] Wiebe van der Hoek, Petar Iliev, and Barteld Kooi. On the relative succinctness of two extensions by definitions of multimodal logic. Lecture Notes in Computer Science, 7318 LNCS:323–333, 2012. doi:10.1007/978-3-642-30870-3_33.
- [13] Hans van Ditmarsch, Jie Fan, Wiebe van der Hoek, and Petar Iliev. Some exponential lower bounds on formula-size in modal logic. In Advances in Modal Logic, volume 10, pages 139–157, August 2014. URL: https://www.scopus.com/inward/record.uri?eid=2-s2.0-84924671231&partnerID=40&md5=3f43caf649380733487e57e7338bd34d.
- [14] Harry Vinall-Smeeth. From quantifier depth to quantifier number: Separating structures with variables. In Proceedings - Symposium on Logic in Computer Science, 2024. doi:10.1145/3661814.3662125.
Appendix A Appendix
A.1 Proof of Lemma 2
Claim of Lemma 2. Let be a finitely labelled graph and be sets of nodes with . If there is a separating formula for and , then there is also a separating formula for and with .
Proof.
Without loss of generality we assume and . In order to prove this lemma, we look at the sets and . We know that no node in has the same colour as any node in and no node in has the same color as any node in . It is easy to see that two pointed Kripke structures based on differently coloured nodes can be separated by a literal. In this case let be a literal with and . Then for every we define . All nodes in agree on all labels that are not used in . Then every formula in must be either an atomic formula from or the negation of such an atomic formula. So even though might be infinite, will always be finite, since is finite. So we can construct the formula , which separates from . Next we look at the set and the formula . This formula holds in , while its negation holds in .
In a similar fashion we define and for every . Next we construct and . This formula holds in and its negation holds in . With this we can finally define the wanted formula . Without loss of generality we assume . Then is a separating formula for and . Since neither nor contain any operators, holds as well.
A.2 Proof of Lemma 3
Claim of Lemma 3. Let be a finitely labelled graph, be sets of nodes that form a tidy pair and let be an ML-formula without any modal operators. Then if, and only if, .
Proof.
If , then the Lemma is trivially true. Otherwise let . For every there is a with . Then and agree on all atomic formulas and thus on all operator-free ML-formulas, so especially on . This means for every and thus . The converse implication follows from .
A.3 Proof of Lemma 7
During the proof of Lemma 7 we sketch a method that removes all from an ML-formula, while guaranteeing an exponential increase at most in the number of modal operators. Here we give a proper formal definition of this construction. We also prove its correctness.
Proof.
We recursively define a function MLML. The base case consist of all formulas without any operators and in this case . For any other formula there must be a sub-formula so that appears in outside of the scope of any other operator. Then can be written as for some formula . We define as .
Next we want to show that fulfils the following three conditions for every ML formula : Firstly , secondly no operator in appears in the scope of any bi-implication and thirdly . We do this via induction on the number of operators in . As our base case we look at all ML formulas with no modal operators. Then we know , so the first condition is obviously fulfilled. Because does not contain any operators, the second condition is also trivially true and because of , the third condition is also fulfilled.
For the induction step we look at all formulas with for some . In this case let and be the sub-formulas with , that were used when constructing . Because of and we may assume as our induction hypothesis that both and fulfil all three conditions. Then we know . Because of and , this is equivalent to . No operator in or in appears in the scope of any bi-implication. So the same must hold for , since no new bi-implication is added during its construction. For the number of operators we know and . So by our induction hypothesis we know that . The value of the right hand side of this inequality becomes largest if all but one operators of belong to the same sub-formula. So it is bounded from above by . For this is in turn bounded from above by . So the third condition is fulfilled as well.
In total we have shown that for every ML-formula there exists an equivalent ML-formula where no operator appears in the scope of a bi-implication and the number of operators in is at most exponential in the number of operators in . Lastly we can replace every sub-formula in with , in order to transform into an ML-formula, without increasing the number of operators any further. This construction proves, that the operator-succinctness gap between ML and ML is at most exponential.
A.4 Proof of Lemma 10
During the proof of Lemma 10 we claimed that player can ensure that a play of the game never leaves the set . Here we want to give a complete proof of this fact.
Proof.
We begin with some general facts. Since is reflexive, player can always play an operator-move, where he picks every node as its own successor. However can answer by doing the same. So the follow up position consists of the same pair of sets as the current position only with the parameter reduced by one. So the new position belongs to the same set as the old position. Also cannot play a split-move, as long as both sets of the current position contain only one element each.
We start with . So let be some position and let be some node shared between and . For , can play a split-move in this position. He picks and then splits this set into and . Then the shared node has to be in one of these two sets, w.l.o.g. we can assume . Then can ensure for her answer and the game continues from the position . The node is also a shared node for and . For this reason cannot be removed by tidying up, because there will always be a node of the same colour in the respective other set, namely itself. So in the position after the move, the two sets still have as a shared node. Then this new position lies in as well and thus in . On the other hand might play an operator-move instead. In this case he picks and then for every he picks a successor . Because of , we know . So can ensure . Then is a shared node for and and thus it is not removed by tidying up. This means that the follow-up position is also a member of and of . So can ensure that the -ON game cannot move from a position inside of to a position outside of .
During an operator-move, if tries to pick some node as with , the can ensure . So is a shared node for the new position and thus this position is a part of . Then cannot leave by playing like this. So from now on we may ignore all of his moves where he picks some shared successor. This is especially relevant, because in all nodes that are two or more layers below the current node are successors of the current node. So every move where picks some node that is two or more layers below any node from leads to a position from . For the rest of this proof we may ignore every move of with this property.
Next we look at . Let be a position with and . For we know . So cannot play any moves, meaning he cannot leave . So from now on we can assume . Then has to play an operator-move in such a position. However every successor of is either itself or a successor of as well. Also the only successor of that is not a successor of as well is . So has to pick as a successor of . To this answers with and the follow-up position is a part of .
We already gave a detailed explanation for during the main proof, so we will skip it here.
Now we look at , which contains positions with and . Every successor of is also a successor of as well, including itself. So we can ignore all operator-moves, where tries to pick successors for and . Similarly every proper successor of is also a successor of . This means we only need to look at picking as a successor of itself and as successor of . Then answers with and the new position belongs to . can also play a split-move, where he either splits into and or into and . Either way he also has to split into and . Because of , can always pick a so that . The set picks depends on the choice of . She answers with to and the other way around. Then the follow-up position is a member of . On the other hand picks as an answer to and vice versa. In this case the play moves to , which belongs to . Either way cannot leave from .
For the next set, , we look at positions with and . The arguments for these positions are very similar to those used for . Every successor of is also a successor of and every proper successor of is a successor of . So the only operator-move we need to look at is picking as a successor of and as its own successor. To this answers with The follow-up position is and belongs to . can also either split into and or he can split into and . Either way can always guarantee for some . Then she can answer in such a way that the next position is either from or from . So cannot leave from either.
contains all positions with and . Every proper successor of or is also a successor of or and vice versa. So we already know that every operator-move leads to a position from . However can also play a split move. He either splits into and or he splits into and . He also splits the parameter into and . Then can always pick in such a way that . She can also choose in such a way that the follow-up position is either , which belongs to , or , a member of .
The next set we look at is , which contains positions with and . Here has to play an operator-move. Since we can ignore all shared successors, the only moves we need to look at are picking as a successor of and picking as a successor of . In any case answers with the respective other node. Then the new position is .
In positions from has to play an operator-move as well. The only moves we need to check are picking as a successor of or as a successor of . In either case can guarantee , a position from , as the follow-up position.
The set contains positions with and . In such a position has to play an operator-move. Aside from picking shared successors or using only self-loops, can only pick as a successor of or as a successor of . Then answers with whatever node did not pick and the next position is , a member of .
Lastly we look at the set , so positions with and . We only need to check what happens when picks either as a successor of or as a successor of . Independent of his choice, can guarantee as the new position.
A.5 Proof of Lemma 14
During the proof of Lemma 14 we stated that player can ensure that it is impossible to leave the set by playing a move of the -ON game. Here we want to give a complete proof of this fact.
Proof.
During the proof of Lemma 10 we showed that in player cannot leave . The argumentation we used was completely independent from the actual edges of the graph. So we can use exactly the same argumentation here to show that cannot leave in either. This also means that cannot leave by picking some shared successor. For in particular this includes all moves, where he tries to pick a node of the current layer.
The next set we want to look at is . So let be a position with and . For we know . As already mentioned cannot make another move for , meaning he cannot leave . For he cannot play a split-move, since both sets have only one element each. So he has to play an operator-move instead. Recall that the nodes and belong segment as well as to segment and since is a symmetrical graph, these nodes also have successors in the layer above, layer . Then can pick , or as a successor of . Alternatively he may pick or as a successor of . Every other successor of or is a shared successor. If he choose either or , then answers with a set that contains only the respective other node. This leads to the follow-up position . Because of this position belongs to . Alternatively if picked or , then also answers with the respective other node. Then the follow-up position is , which belongs to . Lastly if decided to move from to , then answers with the set which progresses the play to the position . This position belongs to . So independent of the choices made by , the play always continues from some position within .
We already checked during the main proof, so we skip it here.
We continue by analysing . Let be some position with and . If plays an operator-move from this position, he has to pick as a successor of both and , since every other successor is a shared successor. To this, answers with the set . Then the follow-up position belongs to . However for , can also play a split-move. In this case he has to split either into and or into and . He also splits into and . Let be the smaller of these two. Then we know . Then answers by picking the side with . The set she picks depends on what choice was made by for . She answers with to and vice versa. Similarly she picks as an answer to and the other way around. So the follow-up position is either , which is a member of , or , which is a member of . So cannot leave from either.
The next set is . So we look at a position with and . In this position, cannot play a split-move, since both sets contain only one node each. This will be the case for all sets from this point on, so we will stop mentioning it going forward. Now player has to play an operator-move, where he picks or as a successor of or as a successor of . If he picked , answers with . This moves the game to . Because of , which implies , this position belongs to . On the other hand, if choose or , then answers with the set only containing the respective other node. Then the follow-up position is , which belongs to . Then can force any play from to stay in .
We continue our proof with . The positions within this set are the shape of with and . When playing an operator-move, may pick or as a successor of or he may pick or as a successor of . Then answers with to , with to , with to and lastly with to . Then the play continues with , a position from , or with , a position from . This finishes the case of .
Now we look at the set , which contains all positions with and . Here may pick as a successor of or alternatively or as a successor of . If he picked then answers with . Then the follow-up position is a member of . If choose or instead, then answers by picking a singleton of the respective other node. In this case the game moves on to the position , which belongs to .
The set consists of all positions with and . Then can play an operator-move by picking , or as a successor of . Alternatively he may also pick or . If choose or , then answers with or respectively. This creates the follow-up position , which is a part of . If picked , then answers with , creating instead. This position belongs to . On the other hand if picked or , then answers with a set only containing the respective other node. In this case the follow-up position belongs to .
Lastly we look at the set . This set contains all positions with and . From such a position, may play an operator-move where he picks either or as a successor of or , or as a successor of . If he choose or , then answers with the respective other node as a singleton. The follow-up position is a member of . If choose instead, then answers with and the new position is , a member of . On the other hand if picked or , then picks and respectively. Then the game continues with , a position from . So all follow-up positions belong to .
