Abstract 1 Introduction 2 Preliminaries 3 The operator-number game 4 Succinctness between ML[] and ML 5 Conclusion References Appendix A Appendix

Boolean Basis and Succinctness of Modal Logic via Hella-Vilander Games

Sebastian Pfau ORCID Technische Universität Ilmenau, Germany
Abstract

The Hella-Vilander game for modal logic is a model comparison game that captures the formula size necessary to separate sets of pointed Kripke structures. We introduce the -ON game as a modification of this game. Our game captures the necessary number of modal operators, i.e., and instead of formula size. We use our game to show that the bi-implication , sometimes also called equivalence, enables us to write modal logic formula with significantly fewer modal operators. With this we show, that with bi-implications we can also write significantly shorter modal logic formulas. This result holds even if only special classes of Kripke structures are considered. To be more precise we show that there is an exponential succinctness gap between modal logic and its extension with bi-implication on the class of structures with a transitive and reflexive accessibility relation, as well as on the class of structures with a symmetrical and reflexive accessibility relation. Lastly we show that for the class of structures with a transitive and symmetrical accessibility relation this succinctness gap disappears.

Keywords and phrases:
succinctness, modal logic, model comparison games
Copyright and License:
[Uncaptioned image] © Sebastian Pfau; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Modal and temporal logics
Editors:
Stefano Guerrini and Barbara König

1 Introduction

Model comparison games, like the Ehrenfeucht-Fraïssé game [4, 6], are a type of logic games. They can be used to prove that it is difficult to express certain properties in a logic. For example the game defined by Adler and Immerman in [1] has been used to prove lower bounds for formula size in FO2(TC) and the logic CTL. Extended syntax trees [8] can be seen as an equivalent method to this game, even though they drop the game setting. The game and the syntax tree method can also be modified to work on other logics, see [7, 12, 13] for examples. In [9] Hella and Vilander introduce their own game, which builds on the Adler-Immerman game. This Hella-Vilander game is defined for modal logic and the μ-calculus, but it can also be adapted for other logics. The main idea of this game is as follows: players S (Spoiler, Samson) and D (Duplicator, Delilah) play on two sets of structures 𝔸 and 𝔹 with a resource parameter k. Player S spends the parameter k in order to make his moves, while D answers. S has to play a winning move before k runs out. In such a play S has a winning strategy if, and only if, there is a formula φ of size at most k of the corresponding logic, that separates 𝔸 and 𝔹.

There are also model comparison games that only count the number of occurrences of specific characters, instead of the more general formula size. See [3, 5] for examples. As part of their conclusion Hella and Vilander suggest that their game can be modified to count a more specific parameter as well. One such possible parameter is the number of modal operators used, i.e., and . This can be done, because every move of the Hella-Vilander game corresponds to some character of a separating formula. So by simply adjusting the way the parameter k has to be spend, we can create a game for the number of modal operators instead of formula size. The main problem with this approach is that some moves do not reduce the parameter at all. Firstly this opens up the possibility of endless games, where S cannot win, but he can ensure he does not lose either. Secondly it makes the game inefficient, since moves that do not change the parameter are inherently less interesting.

In order to deal with these problems we look at another model-comparison game used by Vinall-Smeeth in [14]. He calls this game the k-QVT game. However in this paper we use the parameter k with a different meaning. So in order to avoid confusion we will use the name l-QVT game when referring to this game instead. The l-QVT counts the number of quantifiers in formulas of the l-variable fragment of first-order logic. It can be seen as a modification of the Adler-Immerman game, where after every move all structures without a partially isomorphic counterpart in the respective other set are removed. This is possible, because these are exactly the structures that can be removed in the Adler-Immerman game without playing any quantifier-moves. With this idea we can create our own game, which counts the number of modal-operators of separating modal logic formulas. We call this game the -Operator-Number game.

Afterwards we use our new game in order to look at how different sets of Boolean connectives can be used to write more succinct formulas. This topic was already researched by Berkholz, Kuske and Schwarz in [2]. They show that all complete operator sets, which only contain so-called locally monotone operators, have the same succinctness up to a polynomial. On the other hand all sets that contain some non locally monotone operator also coincide in succinctness. In this paper we use the De Morgan basis, consisting of , and ¬, as a candidate from the first class and the De Morgan basis extended by the bi-implication as a candidate of the second. Berkholz et al. also show that for modal logic there is an exponential succinctness gap between these two classes. They prove this by working directly with formulas and using intricate and ad hoc arguments. This gap still exists if only reflexive Kripke structures are taken into consideration. However if the focus is tightened even further to only Kripke structures with an equivalence relation as accessibility relation, then this succinctness gap is closed.

From these findings some interesting follow-up questions emerge: Is it possible to remove without significantly increasing the number of modal operators? This question is interesting because modal logic formulas with few modal operators are easier to work with. However we answer this question negatively by showing that there is an exponential operator-succinctness gap between modal logic and modal logic with bi-implication as well. Another interesting follow-up question is the following: What about other classes of Kripke-structures? We want to answer this question in this paper. In particular we show that for the class of Kripke structures with a reflexive and transitive accessibility relation, as well as the class of structures with a reflexive and symmetrical relation this exponential succinctness gap exists. This directly implies that the gap also exists for the class of transitive and the class of symmetrical structures. For all of these classes there is an exponential operator-succinctness gap as well. Only for the class of structures with a transitive and symmetrical relation this gap disappears. In this way our work offers a more complete and systematic examination of the connection between succinctness and the basis of boolean connectives in modal logic on special classes of Kripke structures.

2 Preliminaries

2.1 Modal logic with Bi-implication and Kripke structures

Let P be an infinite set of propositional variables, also called labels. We define ML[] as the set of all modal logic formulas over P using and as constants, and as modal operators, as well as ¬, , and as Boolean connectives. Formally ML[] is the set of all formulas generated by the following grammar:

φ::=p¬φ(φφ)(φφ)(φφ)φφ

where pP is a propositional variable. Now modal logic can be defined as a restriction of modal logic with bi-implication. So ML is defined as the set of all formulas from ML[] that do not contain any occurrences of .

Let =(W,R,V) be a connected, directed, labelled graph, where W is the set of possible worlds or nodes, RW×W is a binary accessibility relation and V:W𝒫(P) is a labelling function, assigning sets of propositional variables to each node. For a node wW we call V(w)P the colour of w. If V(W):=wWV(w), the set of all labels used in , is finite, we say is a finitely labelled graph. The tuple (,w) is called a pointed Kripke structure or just Kripke structure. It should be noted here, that our naming conventions might differ from the norm. In the literature the labelled graph is sometimes referred to as a Kripke structure, while (,w) is called a pointed Kripke structure. However in this paper we do not view as a structure. Also our conventions allow us to use the terms pointed Kripke structure, Kripke structure or sometimes just structure interchangeably when referring to (,w).

Let AW be a set of nodes. We write (,A) to mean the class of structures {(,w)wA}. We define w={wW(w,w)R} as the set of all successors of a node w. This notation can also be extended to sets of nodes with A=wAw and structures with (,w)=(,w)={(,w)ww}. The extension to classes of structures is defined in the obvious way. We can now start giving the definition of the semantics of modal logic with bi-implication on Kripke structures.

The satisfaction relation is defined inductively as follows:

  1. 1.

    (,w) and (,w)⊧̸ for all (,w)

  2. 2.

    (,w)p iff pV(w)

  3. 3.

    (,w)¬φ iff (,w)⊧̸φ

  4. 4.

    (,w)(φψ) iff (,w)φ or (,w)ψ

  5. 5.

    (,w)(φψ) iff (,w)φ and (,w)ψ

  6. 6.

    (,w)φψ iff (,w)(φψ)(¬φ¬ψ)

  7. 7.

    (,w)φ iff (,v)φ holds for some vw

  8. 8.

    (,w)φ iff (,v)φ holds for all vw

Next we extend this satisfaction relation to classes of structures. Let 𝔄 be a class of pointed Kripke structures. Then 𝔄φ if, and only if, (,w)φ for all (,w)𝔄. We say a formula φ separates two classes of pointed Kripke structures 𝔄 and 𝔅 if either 𝔄φ and 𝔅¬φ, or 𝔅φ and 𝔄¬φ. So φ must be true for all structures of one class and for no structure of the other class. In this case we will also call φ a separating formula. It should be noted here that our definition differs slightly from the definition used by Hella and Vilander. In their definition φ is a separating formula only if 𝔄φ and 𝔅¬φ. However it is easy to see that if φ is a separating formula by our definition then φ or ¬φ is a separating formula by their definition. For 𝔄={𝒜} and 𝔅={} we might also say φ separates 𝒜 and instead of 𝔄 and 𝔅.

We say that two formulas φ and ψ are equivalent, in writing φψ, if every pointed Kripke structure (,w) satisfies φ if, and only if, it satisfies ψ. Let 𝒞 be some class of Kripke structures. Then two formulas φ and ψ are equivalent on 𝒞, in writing φ𝒞ψ, if every pointed Kripke structure (,w)𝒞 from 𝒞 satisfies φ if, and only if, it satisfies ψ.

2.2 Formula size

The size of a formula φ is the number of symbols needed to write it, excluding brackets. This can be expressed quite easily using the following inductive definition:

  1. 1.

    |φ|=1 if φ{,}P

  2. 2.

    |φ|=|ψ|+1 if φ=¬ψ, φ=ψ or φ=ψ

  3. 3.

    |φ|=|ψ|+|χ|+1 if φ=ψχ, φ=ψχ or φ=ψχ

Similarly the operator-number of φ is the number of modal operators, i.e. and , used in the formula. The inductive definition is as follows:

  1. 1.

    |φ|O=0 if φ{,}P

  2. 2.

    |φ|O=|ψ|O if φ=¬ψ

  3. 3.

    |φ|O=|ψ|O+1 if φ=ψ or φ=ψ

  4. 4.

    |φ|O=|ψ|O+|χ|O if φ=ψχ, φ=ψχ, or φ=ψχ

The term succinctness is used to compare logics based on the length of formulas needed to express the same property. Given two logics L1, L2 with the same expressiveness and a function f:, we say L1 is f-times more succinct than L2 if, and only if, for every n there is a formula φL1 with |φ|n, so that |ψ|f(|φ|) for every L2-formula ψ with ψφ. If f is an exponential function, we also say L1 is exponentially more succinct than L2. Let 𝒞 be a class of structures, then succinctness on 𝒞 is defined by taking the definition of succinctness and replacing with 𝒞.

Succinctness is based on formula size, but we can define a corresponding measurement for operator-number. We say that L1 is f-times more operator-succinct than L2 if, and only if, for every n there is a formula φL1 with |φ|On, so that |ψ|Of(|φ|O) for every L2-formula ψ with ψφ. Exponential operator-succinctness and operator-succinctness on special classes of structures are defined in the obvious way.

Counter intuitively it is theoretically possible that there are two logics L1 and L2 where L1 is exponentially more succinct than L2 and L2 is exponentially more succinct than L1. This is because there might be some properties that can be expressed more succinctly in L1, while some other properties can be expressed more succinctly in L2. However this cannot happen for the logics examined in this paper. Since every ML formula is also an ML[] formula, ML cannot be more succinct than ML[].

3 The operator-number game

Let =(W,R,V) be a finitely labelled graph and let A,BW be two sets of nodes of the graph. Then {A,B} is an unordered pair. From this point on, we will use just the term pair to refer to unordered pairs. We say such a pair {A,B} is tidy, if for every aA there is a bB with V(a)=V(b) and vice versa. For any sets A and B, {A,B} can be tidied up by removing all nodes aA without a fitting node in B and all bB without a fitting node in A. Let t({A,B})={A,B}, where AA and BB are the largest subsets of A and B respectively, so that {A,B} is tidy.

Definition 1.

Let =(W,R,V) be a finitely labelled graph, A0,B0W two sets of nodes and k0. The -Operator-Number game, or -ON game for short, played from A0 and B0 with parameter k0 has two players S and D. An S-position ({A,B},k) of the game consists of a tidy pair of sets of nodes A,BW, as well as a parameter k. The starting position is (t({A0,B0}),k0). A turn for S starts from an S-position and proceeds in one of the following two ways:

  1. 1.

    Split-move: S may only play this type of move if k2. He picks a set X{A,B} and splits it into two disjoint and non-empty sets X1 and X2. Then he chooses two parameters k1,k21 with k1+k2=k. Let Y be the set that fulfils {X,Y}={A,B}. This play by S creates the D-position (X1,X2,Y,k1,k2). Afterwards D plays by taking the set Y and picking a subset YY of it. Then she chooses i{1,2} and the game continues from the S-position (t({Xi,Y}),ki).

  2. 2.

    Operator-move: S may only play this type of move if k1. He picks a set X{A,B} and then picks a function f:XX, where f(x)x for every xX. Let X={f(x)xX} be the image of X under f and let Y be the set with {X,Y}={A,B}. Then the D-position after this move by S is (X,Y,k1). Afterwards D plays by taking the set Y and then determining a set YY of successors of the nodes from Y. The game continues from the position (t({X,Y}),k1).

Player S wins a play of the game, if the play arrives at a position ({A,B},k) with A=B=. Otherwise if S cannot make another move, player D wins.

When looking at an operator-move we might say S picks x as a successor of x instead of S picks a function f with f(x)=x. We usually make D play in such a way that no nodes are removed during tidy-up. For this reason we will only mention the tidy-up process if nodes are getting removed. The composition of a D-position depends on the type of move played by S. However in contrast to S-positions we mainly view D-positions as intermediate positions. So it is not necessary to give a full definition for them.

The parameter k is reduced in every move and the play cannot continue, once the parameter is depleted. So a play with starting parameter k can go on for k rounds at most. In particular every play has to be finite. This implies in every play either player S or player D wins. So our game is a finite, chance-less two-player game with complete information, that cannot end in a draw. Then by Zermelo’s Theorem for every position of the -ON game one of the players, S or D, has a positional winning strategy.

Such a positional strategy for a player, S or D, is a partial map fS or fD. For S a strategy fS takes as input an S-position and outputs a D-position, that can be reached from the input position via a single move by S. It is a winning strategy, if every play where S plays according to fS is won by him. Similarly a strategy fD for D takes a D-position as input and gives an S-position, reachable in one move by D, as output. It is a winning strategy, if D wins every play in which she plays fD. If player S has a winning strategy for some position ({A,B},k), we will say this position is a winning position for player S. The definition of winning positions for D is done in the obvious way.

As previously stated, the main influence for our game is the Hella-Vilander game from [9], which in turn builds on the Adler-Immerman game from [1]. Before we show that our game can be used to prove lower bounds for operator-number, we want to discuss how our game differs from these games.

The -ON game is played by two players, S (Samson) and D (Delilah), just like the game defined by Hella and Vilander. However instead of playing on two sets of structures, in our game the players play on unordered pairs of sets of nodes. This shows two significant differences between our game and their game. Firstly we are using nodes, where they used structures. This change however is only notational, because we will still implicitly talk about pointed Kripke structures. We will use the node w as a representation of the structure (,w). In order to do so, we have to fix a single graph for the entire game. So every pointed Kripke structure that appears in a play must be based on the same labelled graph. While this appears like a real restriction, it is possible to merge multiple labelled graphs into one without changing the validity of formulas at any world.

The second significant difference is that we use unordered pairs instead of ordered pairs or tuples. This change allows us to switch around the elements of our pairs at will. In the game by Adler and Immerman, such a switch is used as a not-move. So basically in our game, we can simulate a not-move without actually making a move. This is possible, because we are only interested in the number of operators and not in the number of negations. We also add an additional step to every move, where we tidy up the pair of our position. By doing so we remove all elements, that could have been removed in the Hella-Vilander game by S playing a number of turns without playing an operator-move. A similar approach is used in the l-QVT game from [14]. In this game structures without a partially isomorphic counterpart in the other set are removed after every step. A problem with this method occurs when tidying up would skip an infinite number of moves. Vinall-Smeeth solves this problem by only considering finite structures. However for our purposes a weaker condition is sufficient. We only demand that our graphs are finitely labelled.

As a last major difference: In the Hella-Villander game, player D is forced to play following the so-called oblivious strategy. This means she always has to answer with the largest possible set. This works, because this oblivious strategy is always optimal. So if D can win the game, she can do so by playing obliviously. While this simplifies the definition of the game, it would also complicate our proofs. Simply put we only want to prove that D has a winning strategy. Whether this strategy is also optimal is of no importance to us. Sometimes D playing in a suboptimal way might lead to smaller positions, which can simplify our proofs.

Our aim is to prove that the -ON game can be used to characterize the number of operators needed in order to separate sets of nodes. As a stepping stone towards this goal, we state the following two lemmas.

Lemma 2.

Let =(W,R,V) be a finitely labelled graph and A,B,A,BW be sets of nodes with t({A,B})={A,B}. If there is a separating formula φ for (,A) and (,B), then there is also a separating formula φ for (,A) and (,B) with |φ|O=|φ|O.

Lemma 3.

Let =(W,R,V) be a finitely labelled graph, A,BW be sets of nodes that form a tidy pair and let φ be an ML-formula without any modal operators. Then (,A)φ if, and only if, (,B)φ.

Now we want to prove the following Correctness- and Completeness-Theorem for our game. We do so by proving both directions separately, beginning with the proof of correctness.

Lemma 4.

Let =(W,R,V) be a finitely labelled graph, A,BW two sets of nodes and k. If S has a winning strategy in the -ON game played from A and B with parameter k, then there is an ML-formula φ separating (,A) and (,B) with |φ|Ok.

Proof.

For this proof we assume that S has a winning strategy for the -ON game played from A and B with parameter k. Because of Lemma 2 we may assume that A and B already form a tidy pair. Then the starting position of this game is ({A,B},k). We construct the separating formula φ by induction over k. First we look at the base case k=0. Since the parameter is already used up, S cannot make a move at all. So the only possible way for him to win is, if the current position already fulfils {A,B}={,}. Then φ= is a trivial separating formula.

We now assume, the implication holds for all parameters ki and we look at some winning strategy fS for the -ON game starting from ({A,B},i+1). We have to differentiate between the types of moves fS prescribes as the next move from this position.

We begin by assuming that fS prescribes a split-move. In this case S picks a set from {A,B}, without loss of generality he picks A. Then S splits it into two sets A1 and A2 and he splits i+1 into k1 and k2, both of which have to be at most i. Since fS is a winning strategy for S, it has to win against every possible strategy of D, in particular against the following ones: D answers by picking B=B and then picks either 1 or 2. The game continues from (t({A1,B}),k1) or (t({A2,B}),k2) respectively. From both positions fS is still a winning strategy. So thanks to our induction hypothesis and Lemma 2, for j{1,2} we have separating formulas φj for (,Aj) and (,B) with |φj|Okj. We may assume (,Aj)φj, because otherwise we could use ¬φj instead. Then φ=φ1φ2 is a separating formula for (,A) and (,B) with |φ|O=|φ1|O+|φ2|Ok1+k2=i+1.

Next we look into fS prescribing an operator-move. Once again we may assume that S plays on A. For every node in A he picks a successor and forms A. The strategy fS has to win against a strategy of D that simply picks B=B. So S can also win from the position (t({A,B}),i). This means there has to be a formula φ with (,A)φ, (,B)¬φ and |φ|Oi. So every node in A has at least one successor, namely those picked by S during the construction of A, that satisfies φ. On the other hand no node in B can have such a successor, since all of them are part of B. So φ=φ is a separating formula for (,A) and (,B) with at most i+1 operators. This finishes our prove of the first implication.

Next we prove the completeness of our game.

Lemma 5.

Let =(W,R,V) be a finitely labelled graph, φ an ML-formula, A,BW two sets of nodes and k, so that φ is a separating formula for (,A) and (,B) with |φ|Ok. Then S has a winning strategy in the -ON game played from A and B with parameter k.

Proof.

To begin with we may assume that the separating formula φ only contains the connectives and ¬, as well as the operator , since otherwise we can use the well known equivalences ψχ¬(¬ψ¬χ) and ψ¬¬ψ in order to remove all occurrences of and . None of these changes increase the number of modal operators, so |φ|Ok still holds. The -ON game played from A and B with parameter k has the starting position (t({A,B}),k). If φ is a separating formula for (,A) and (,B), it is also a separating formula for the tidied up subsets of A and B. So we may assume that {A,B} is already tidy, meaning t({A,B})={A,B}. We construct fS, the winning strategy of S, by playing according to φ. This construction is done by induction over the formula size of φ.

For the base case, we look at all atomic formulas, so φP{,}. Because of Lemma 3, (,A) and (,B) agree on φ. So the only way for φ to be a separating formula is A=B=. In this case ({A,B},k) is already a winning S-position and S does not have to make a move in order to win the play.

We now assume the implication holds for all formulas ψ with |ψ|i and we look at a separating formula φ for (,A) and (,B) with |φ|=i+1 and |φ|Ok. We have to differentiate based on the outermost operator of φ.

We begin with φ=¬ψ. Because of our definition of separating formula, ψ also is a separating formula for (,A) and (,B) with |ψ|=i and |ψ|Ok. So thanks to our induction hypothesis, S already has a winning strategy for the -ON game played on A and B with parameter k.

Next we look at φ=ψ with |ψ|=i and |ψ|Ok1. This also implies k1. So every node in one of the sets, w.l.o.g. we assume A, has a successor that satisfies ψ, while no node in the other set, B, has such a successor. Then fS prescribes an operator-move, where S picks for every aA a successor a with (,a)ψ and collects them into the set A. Then D answers by gathering some nodes bB into a set B. However none of the nodes picked by D can satisfy ψ. The game continues from the position (t({A,B}),k1), where ψ is a separating formula for A and B and thus also for the sets after tidying. So because of our induction hypothesis, for every choice made by D during this turn, S has a winning strategy from the resulting position. So fS simply copies the fitting strategy for D’s choice of B.

For φ=ψχ we have to make an additional differentiation. It is possible that one of the sub-formulas, w.l.o.g. we assume χ, does not contain any modal operators. Then one of the sets, we assume A, satisfies ψχ, while the other satisfies its negation, so (,B)¬ψ¬χ. This implies (,B)¬ψ and (,B)¬χ. So because of Lemma 3, we also know (,A)¬χ, which in turn implies (,A)ψ. This means ψ is also a separating formula for A and B with |ψ|i and |ψ|Ok. So our hypothesis offers a winning strategy for S in ({A,B},k).

Lastly we look at φ=ψχ, where both ψ and χ contain at least one modal operator. This implies k2. Once again we assume (,A)ψχ and thus (,B)¬ψ¬χ. Let A1 and A2 be the disjoint subsets of A with A1:={aA(,a)ψ} and A2:={aA(,a)¬ψχ}=AA1. If one of these sets, w.l.o.g. we assume A2, is empty, then A=A1 and thus (,A)ψ holds. So ψ is a shorter separating formula for (,A) and (,B) and according to our induction hypothesis S has a winning strategy for the position ({A,B},k). Otherwise we know A1 and A2. So S may play a split-move where he splits A into A1 and A2. Then he picks the new parameters as k1=|ψ|O and k2=kk1, which implies k2|χ|O. Player D answers by picking a set BB. Independent from her choice (,B)¬ψ¬χ holds. The game continues from either (t({A1,B}),k1) or (t({A2,B}),k2). For the first case, S can use ψ as a separating formula for A1 and B. Similarly χ is a separating formula for A2 and B. So by our induction hypothesis every possible follow-up position is a winning position for S and thus ({A,B},k) is as well.

Lemma 4 and Lemma 5 can be combined into the following Correctness- and Completeness-Theorem.

Theorem 6.

Let =(W,R,V) be a finitely labelled graph, A,BW two sets of nodes and k. Then S has a winning strategy in the -ON game played from A and B with parameter k if, and only if, there is an ML-formula φ with |φ|Ok, that is a separating formula for (,A) and (,B).

4 Succinctness between ML[] and ML

We begin this section by proving some simple upper bounds for succinctness.

Lemma 7.

The logic ML[] is at most exponentially more succinct and at most exponentially more operator-succinct than ML.

Proof.

It is quite easy to see that the succinctness gap between ML and ML[] is at most exponential. We can simply replace every sub-formula φψ with (φψ)(¬φ¬ψ). Every time this is done, the formula size is doubled at most. Since the number of bi-implications cannot exceed the formula size, this procedure leads to an exponential size increase at worst.

However this method cannot provide an upper bound for the operator-succinctness. While every such substitution only doubles the numbers of operators as well, we cannot bound the number of operators in the resulting formula by some function of the number of operators in the original formula. E.g. for a formula φ with only one modal operator that appears in the scope of n nested bi-implications this method would result in a formula with 2n modal operators.

However there is another method, that guarantees an at most exponential increase in operator-number. This method is based on the construction used by Spira in [11]. We can use the equivalence α(β)(α()β)(α()¬β), which holds as long as β does not appear in α in the scope of any operator. This allows us to inductively move all modal operators outside of the scope of any bi-implication. Every time this is done, the number of modal operators is doubled, which leads to an exponential increase. Then the bi-implications can be removed in the known way without increasing the number of modal operators any further. A more in-depth proof of this is given as part of the appendix.

In the following Section 4.1 we show that there is an exponential succinctness gap between ML and ML[] on the class of Kripke structures with a transitive and reflexive accessibility relation. Afterwards in Section 4.2 we show that this exponential succinctness gap also exists on the class of Kripke structures with a symmetrical and reflexive accessibility relation. Since these two claims are proven in similar ways, we want to use this opportunity to give a general overview of these proofs.

To begin with we construct an infinite graph as well as two series of nodes (an)n and (bn)n. Then for every n we give an ML[]-formula φn. This formula separates (,an) and (,bn). The number of modal operators in φn is bound from above by a polynomial over n. Then we give some parameter kn that is exponential in n and show that ({{an},{bn}},kn) is a winning position for player D in the ON game. It follows that S cannot have a winning strategy for this position. So by Theorem 6, there cannot be an ML-formula ψ that separates (,an) from (,bn) with |ψ|Okn implying that every ML-formula equivalent to φn must be of size exponential in n at least. This proves an exponential succinctness gap between ML and ML[] for all classes of Kripke structures belongs to.

Now we construct the infinite labelled graph =(W,R,V), which will be the basis for all of our other graphs. The nodes of this graph are formally defined as W:={a0,b0}i1{ai,bi,1i,2i,3i,4i,5i,6i,7i,8i,9i}. The edges of this graph are as follows:

R=i1{(ai,1i),(ai,2i),(ai,3i),(bi,1i),(bi,3i),(1i,4i),(1i,6i),(2i,4i),(2i,7i),(3i,5i),(3i,7i),(4i,8i),(4i,ai1),(5i,8i),(5i,bi1),(6i,ai1),(6i,9i),(7i,bi1),(7i,9i)}

The graph can be separated into segments. Then segment i contains all nodes with index i, as well as ai1 and bi1. So the latter two nodes are shared nodes between segment i and segment i1. The nodes and edges of segment i are presented graphically in Figure 1. All edges of connect nodes within a segment and all segments have equivalent edges. So this figure fully represents the nodes and edges of . The numbers on the left side of the figure count the layers of the graph. Segment i contains all nodes of layers 3i1, 3i2 and 3(i1), as well as the nodes ai and bi from layer 3i. Lastly we define the labelling function of . The labels we use are {p0,p1,q0,q1,q2}P. The only node labelled with p0 is a0. For every i1 the node 8i is the only node of segment i labelled with p1. Lastly we use the remaining labels q0, q1 and q2 in order to label the layers of the graph in an alternating pattern. More precisely for every i we label all nodes of layer 3i with q0, all nodes of layer 3i1 with q1 and all nodes of layer 3i2 with q2. The formal definition of the labelling function is given by the following list:

  • V(9i)=V(bi1)={q0} for all i1,

  • V(8i)={p1,q0} for all i1 and

  • V(ai1)={{p0,q0}for i=1{q0}for all i>1

  • V(1i)=V(2i)=V(3i)={q1} for all i1,

  • V(4i)=V(5i)=V(6i)=V(7i)={q2} for all i1,

Figure 1: The ith segment of including labels.
Claim 8.

Let φi be the ML formula inductively defined by φ0:=p0 and φi:=((p1)(φi1)). Then φi is a separating formula for (,ai) and (,bi) with (,ai)φi for every i.

Proof.

This proof is done via induction on i. For the base case i=0 the formula φi=p0 holds only in (,a0), because a0 is the only node labelled with p0. Next let i1 be some positive number. We assume that our claim holds for i1. So (,ai1)φi1 and (,bi1)¬φi1. Now we look at the other nodes of segment i. We know (,8i)p1, while no other node of layer 3(i1) is labelled with p1. This implies (,4i)(p1)(φi1), (,5i)(p1)¬(φi1), (,6i)¬(p1)(φi1), as well as (,7i)¬(p1)¬(φi1). So (,4i) and (,7i) model (p1)(φi1), while (,5i) and (,6i) do not. Because 4i and 7i are the only successors of 2i, this also means (,2i)((p1)(φi1)). On the other hand, 1i is a predecessor of 6i and 3i is a predecessor of 5i. So neither (,1i) nor (,3i) satisfy this formula. Lastly 2i is a successor of ai, so (,ai)((p1)(φi1))=φi. But 1i and 3i are the only successors of bi, so (,bi)¬φi, which proves the claim.

It should be noted here that the formula φi does not contain any occurrences of either q0, q1 or q2. However we need those labels in order to adapt φi to the modified graphs we use in the following proofs.

4.1 Transitive and reflexive graphs

The graph t,r=(W,Rt,r,V) is defined as an extension of =(W,R,V). The nodes and labels stay the same. First we add a self-loop for every node. So (x,x) is a new edge for every node x. Then we add a new edge from a node x to a node y, for all x and y where x is a node on layer i and y is a node on layer j with ij+2. So every node is now a predecessor to all nodes two or more layers below. So formally Rt,r=R{(x,x)xW}{(x,y)x belongs to layer iy belongs to layer jij+2}. It is easy to see that t,r is a reflexive graph. On top of that it is also transitive. Let (x,y) and (y,z) be edges of the graph. If either of them is a self loop, then (x,z) is the same as the other edge. Otherwise y must be on a lower layer than x and z must be on a lower layer than y. So z is at least two layers below x and (x,z) is an edge in t,r.

Lemma 9.

There are two polynomial functions g1,g2:, so that for every n there is an ML[]-formula ψn, which separates (t,r,an) from (t,r,bn), with |ψn|O=g1(n)n and |ψn|=g2(n)n.

Proof.

This is proven, by simply adapting the formula φn from Claim 8 to separate an and bn in t,r instead of in . In every edge goes down exactly one layer. But all edges added to R when constructing Rt,r go down at least two layers or are self-loops. So we need a way to determine what layer a node belongs to. We look at the formula χi+ defined by χ0+:= and χi+1+:=(qjχi+) with i and j=imod3. This formula states, that there is a path of length i that starts at the current node and on this path the labels q0, q1 and q2 appear in an alternating pattern. Such a path cannot use any self-loops, so every edge has to move down at least one layer. Then only nodes of layer i or above satisfy this formula. Similarly only nodes of layer i or below satisfy χi:=¬χi+1+. Now we can use the formula χi:=χi+χi in order to check whether or not a node belongs to layer i.

Then for any node w of layer i1 (,w)(χi1φ) holds if, and only if, w has a successor exactly one layer below that satisfies φ. Similarly (,w)(¬χi1φ) holds if, and only if, every successor of w that is exactly one layer below the current layer satisfies φ. So we can add conditions like these to φn in order to only consider edges that go down exactly one layer, which are all edges from R, while ignoring all edges from Rt,rR. We define ψn inductively with ψ0:=p0 and ψi:=(χ3i1(¬χ3i2(((p1χ3(i1))((ψi1χ3(i1)))))). Then for all nodes w of layer 3n, (t,r,w)ψn holds if, and only if (,w)φn. So by Claim 8, ψn is a separating formula for (t,r,an) and (t,r,bn). For a rough estimation of the size of φn we can assert that the number of times a sub-formula χi occurs in φn is linear in n. Also the size of such a formula is linear in i, which is bounded from above by 3n1. So the overall size of φn is quadratic in n, which is polynomial. A more in-depth examination leads to the findings |φn|O=24n2+10n=:g1(n) and |φn|=36n2+17n+1=:g2(n).

Lemma 10.

There is an exponential function h:, so that for every n and for every ML-formula χn which separates (t,r,an) from (t,r,bn), we have |χn|O>h(n).

Proof.

First we define h(n)=2n2. It is easy to see that |χ0|O>1=h(0) holds. So from now on we may assume n1. Then we show that there is no separating ML-formula for {(t,r,an)} and {(t,r,bn)} with at most h(n) modal operators. This is done by playing the t,r-ON game on {an} and {bn} with parameter h(n) and showing that D has a winning strategy fD for these games. In order to do so we define the following sets of positions of the game.

  • WD0:={({A,B},k)A,BW,AB,k}

  • WD1:={({{ai},{bi}},k)i1,k2i2}

  • WD2:={({{2i},{1i,3i}},k)i2,k2i3}

  • WD3:={({{2i,4i},{1i,5i}},k)i2,k2i4}

  • WD4:={({{2i,7i},{3i,6i}},k)i2,k2i4}

  • WD5:={({{4i,7i},{5i,6i}},k)i2,k2i4}

  • WD6:={({{1i},{2i}},k)i2,k2i1}

  • WD7:={({{2i},{3i}},k)i2,k2i1}

  • WD8:={({{4i},{5i}},k)i2,k2i11}

  • WD9:={({{6i},{7i}},k)i2,k2i11}

Then WD=0j9WDj is defined as the union of these sets. For n1 it is easy to see, that ({{an},{bn}},2n2)WD1 and thus ({{an},{bn}},2n2)WD holds. It is also easy to see that no WDj contains a position ({A,B},k) with A=B=. So WD does not contain any immediate winning position for S. Player D can also stop player S from leaving WD. More precisely if S makes a move from any position in WD, then D can answer in such a way that the follow-up position is also in WD. We can prove this fact by going through all positions within WD and checking every possible move that S can make. In this shortened proof we will showcase this for the set WD2. The remaining cases are given as part of the appendix.

So let ({{2i},{1i,3i}},k) with i2 and k2i3 be some position from WD2. In this position player S can play an operator-move, where he picks some successor of 2i. If he picks a node that is also a successor of 1i or 3i, then D can answer by picking the same node. In this case the next position ({A,B},k1) satisfies AB. So it belongs to WD0. Then S cannot leave WD by picking a shared successor, meaning a node that is a successor of both sets of the position. So for t,r in particular we can ignore all moves where S picks a node two or more layers below any of the current nodes. Every successor of 2i besides 2i itself is also a successor of 1i or 3i. If S picks 2i as its own successor then D can answer by also using the self-loops of 1i and 3i. So this move leads to the new position ({{2i},{1i,3i}},k1), which is also a part of WD2. Alternatively S can also play an operator-move where he picks successors of 1i and 3i. Once again if S picks a node that can also be picked by D, then the next position is a member of WD0. The only other options of S are to pick 1i or 6i as a successor of 1i and 3i or 5i as a successor of 3i. If he uses the self-loop both times, D answers with her own self-loop and the follow-up position belongs to WD2. If S choose 1i and 5i then D answers {2i,4i}. This moves the play to ({{2i,4i},{1i,5i}},k1), a position from WD3. On the other hand if S picked 6i and 3i, then D answers with {2i,7i}, which leads to ({{2i,7i},{3i,6i}},k1), a member of WD4. Lastly if he choose 5i and 6i, D picks {4i,7i}. The new position ({{4i,7i},{5i,6i}},k1) belongs to WD5. So S cannot leave WD from WD2 by playing an operator-move.

At ({{2i},{1i,3i}},k) S can also play a split-move. In this case he has to split {1i,3i} into {1i} and {3i}. He also splits the parameter k into k1 and k2. One of these two new parameters is at most half of the original parameter k. So D can pick j in such a way, that kj2i1. Then she answers with {2i}. The new position of the play is either ({{1i},{2i}},kj)WD6 or ({{2i},{3i}},kj)WD7. So we have shown that S cannot leave WD by making a move in a position from WD2.

The same holds for all other positions from WD. Since S cannot leave WD and WD does not contain any immediate winning position, S cannot win from any of these positions. Then all positions in WD are winning positions for D. This includes ({{an},{bn}},2n2). So there cannot be a separating formula for {an} and {bn} with 2n2 or less modal operators.

Theorem 11.

The logic ML[] is exponentially more operator-succinct and exponentially more succinct than ML on the class of Kripke structures with a transitive and reflexive accessibility relation.

Proof.

Let n be some natural number. Then ψn is the ML[]-formula given in Lemma 9. We know |ψn|O=g1(n)n and |ψn|=g2(n)n for some polynomial functions g1 and g2. We also know that ψn separates the two transitive and reflexive Kripke structures (t,r,an) and (t,r,bn) from one another. Let χn be some ML-formula equivalent to ψn on the class of Kripke structures with a transitive and reflexive accessibility relation. Then χn has to be a separating formula for (t,r,an) and (t,r,bn) as well. So, according to Lemma 10, |χn|Oh(n) for some exponential function h. This implies |χn|h(n) as well. We define f1: as f1(n)=h((g1)1(n)). Then |χn|Oh(n)h((g1)1(|ψn|O))=f1(|ψn|O) holds. So ML[] is f1-times more operator-succinct than ML and thus exponentially more operator-succinct. We also define f2: as f2(n)=h((g2)1(n)). We know |χn|f2(|ψn|) holds. This implies that ML[] is f2-times more succinct than ML, so it is exponentially more succinct as well.

From this theorem it is easy to infer, that the succinctness gap also exists on all classes of structures, which are supersets of the class of Kripke structures with a transitive and reflexive accessibility relation. We say an accessibility relation R is serial, if it satisfies the condition xy:R(x,y). It is called weakly dense, if it satisfies xy:(R(x,y)z:(R(x,z)R(z,y)). Both of these conditions are true for every reflexive relation.

Corollary 12.

The logic ML[] is exponentially more succinct than ML on the following classes of pointed Kripke structures:

  1. 1.

    The class of all pointed Kripke structures.

  2. 2.

    The class of all pointed Kripke structures with a transitive accessibility relation.

  3. 3.

    The class of all pointed Kripke structures with a reflexive accessibility relation.

  4. 4.

    The class of all pointed Kripke structures with a serial accessibility relation.

  5. 5.

    The class of all pointed Kripke structures with a weakly dense accessibility relation.

4.2 Symmetrical and reflexive graphs

The graph s,r is also constructed as an extension of =(W,R,V). We do not change the nodes or their labels. We only add edges to . For every edge (x,y)R we add its inverse (y,x) to our new graph. For nodes x and y, that belong to the same level, we also add the edges (x,y) and (y,x). This explicitly includes all possible self-loops. So formally Rs,r=R{(y,x)(x,y)R}{(x,y)i:x and y belong to layer i}. It is easy to see that s,r is both symmetrical and reflexive.

Lemma 13.

There are two polynomial functions g1,g2:, so that for every n there is an ML[]-formula ψn, which separates (s,r,an) from (s,r,bn), with |ψn|O=g1(n)n and |ψn|=g2(n)n.

Proof.

Once again this is proven by adapting φn from Claim 8. All edges in go down exactly one layer. But all of the edges added to R during the construction of Rs,r either go up one layer or stay on the same layer. Since we used the labels q0, q1 and q2 in an alternating pattern, the current layer, the layer one below and the layer one above are all coloured differently. This means we can use these labels to make sure that only the original edges from R are considered. We define ψn via ψ0:=p0 and ψi:=(q1(¬q2(((p1q0)((ψi1q0))))). For all nodes w of layer 3n we know that (s,r,w)ψn holds if, and only if, (,w)φn. So, by Claim 8, ψn is a separating formula for (s,r,an) and (s,r,bn) with |ψn|O=|ψn1|O+4 and |ψn|=|ψn1|+15. So g1(n)=4n and g2(n)=15n+1 fulfil all conditions of the lemma.

Lemma 14.

There is an exponential function h:, so that for every n and for every ML-formula χn, which separates {(s,r,an)} from {(s,r,bn)}, we have |χn|O>h(n).

Proof.

This proof is very similar to that of Lemma 10. We also use h(n)=2n2, so |χ0|O>1=h(0) holds. Then for n1 we show that ({{an},{bn}},h(n)) is a winning position for player D in the s,r-ON game. For this purpose we define the following sets of positions of the s,r-ON game.

  • WD0:={({A,B},k)A,BW,AB,k}

  • WD1:={({{ai},{bi}},k)i1,k2i2}

  • WD2:={({{2i},{1i,3i}},k)i2,k2i3}

  • WD3:={({{4i,7i},{5i,6i}},k)i2,k2i4}

  • WD4:={({{1i},{2i}},k)i2,k2i1}

  • WD5:={({{1i},{3i}},k)i2,k2i1}

  • WD6:={({{2i},{3i}},k)i2,k2i1}

  • WD7:={({{4i},{5i}},k)i2,k2i11}

  • WD8:={({{6i},{7i}},k)i2,k2i11}

Then WD:=0j8WDj is defined as the union of these sets. For n1 it is easy to see that ({{an},{bn}},2n2)WD1 and thus ({{an},{bn}},2n2)WD holds. It is also easy to see, that no WDj contains a position ({A,B},k) with A=B=, so WD does not contain any immediate winning position for S. Next we show that for every move of S from a position of WD player D can defend in such a way that the next position is in WD as well. In this shortened version of our proof we will only give a proper explanation for WD2. The description of the other cases is once again moved to the appendix.

So we have to look at positions ({{2i},{1i,3i}},k) with i2 and k2i3. Once again if S picks a shared successor, D can move the game to a position from WD0. For s,r this happens every time S picks a node of the same layer as any node in the other set. In particular if S uses the self-loop on any nodes. Since every successor of 2i is also a successor of 1i or 3i, player S cannot leave WD by playing an operator-move on {2i}. Similarly when playing on {1i,3i} we only have to look at S picking bi or 6i as a successor of 1i, as well as bi or 5i as a successor of 3i. This is because every other successor is also a successor of 2i. If S picked bi at all, then D answers with {ai}. If he choose 5i or 6i in addition to bi, then these nodes will be removed when tidying up, because they are labelled differently from ai and bi. So in this case the game continues with ({{ai},{bi}},k1), a position from WD1. On the other hand, if S picked both 5i and 6i as successors, D answers with {4i,7i}. Then the follow-up position is ({{4i,7i},{5i,6i}},k1) from WD3. So S cannot leave WD by playing an operator-move at WD2. Lastly we look at what happens when D plays a split-move. He has to split {1i,3i} into {1i} and {3i} and k into k1 and k2. Then D can answer in such a way that kj2i1 and the follow-up position is either ({{1i},{2i}},kj)WD4 or ({{2i},{3i}},kj)WD6. With this we have shown that S cannot leave WD in one move from WD2.

The same goes for all other positions from WD. So D can stop S from leaving WD and there is no immediate winning position for S in WD. Then all position in WD, including ({{an},{bn}},2n2), have to be winning positions for player D. This is why there cannot be a separating formula for {an} and {bn} with 2n2 or less modal operators.

Theorem 15.

The logic ML[] is exponentially more operator-succinct and exponentially more succinct than ML on the class of Kripke structures with a symmetrical and reflexive accessibility relation.

Proof.

The proof of this theorem can be done analogous to the proof of Theorem 11. Simply replace all mentions of transitivity with reflexivity and use Lemma 13 and Lemma 14 instead of Lemma 9 and Lemma 10 respectively.

Since the succinctness gap exists when looking only at symmetrical and reflexive graphs, it has to exist for any class of graphs that contains all symmetrical and reflexive graphs.

Corollary 16.

The logic ML[] is exponentially more succinct than ML on the class of Kripke structures with a symmetrical accessibility relation.

4.3 Transitive and symmetrical graphs

So far in this paper we have shown that adding bi-implications allows us to write more succinct formulas. Even when only looking at special classes of Kripke structures, like those with a transitive accessibility relation and those with a symmetrical one. In this section we are going to show that the same does not hold if we combine these two conditions. More precisely we show that there is no exponential succinctness gap between ML and ML[] on the class of Kripke structures with a transitive and symmetrical accessibility relation.

We start with a few remarks about our notation in this section. Let 𝒮S5 be the class of all Kripke structures with a transitive, symmetrical and reflexive accessibility relation. We write φT,Sψ if φ and ψ are equivalent on all pointed Kripke structures with a transitive and symmetrical accessibility relation. Similarly we use 𝒮S5 when talking about structures from 𝒮S5 and E for graphs with an empty accessibility relation.

All connected, transitive and symmetrical graphs, that are not reflexive as well, contain a single node and no edges. So φT,Sψ holds if, and only if, both φ𝒮S5ψ and φEψ hold. This fact offers us a way to find formulas that are equivalent on transitive and symmetrical structures, which we will use to prove the following Theorem.

Theorem 17.

For every ML[]-formula φ there is an ML-formula ψ with φT,Sψ and the size of ψ is polynomial in the size of φ.

Proof.

In [2] it is shown, that ML[] has polynomial translations with regards to 𝒮S5 in ML. So for every ML[]-formula φ there is an ML-formula ψ1 with φ𝒮S5ψ1 and the size of ψ1 is polynomial in the size of φ. We now look at the graph with only one node and no edges. We know χE as well as χE for all formulas χ. We can use this to replace all operators in φ, obtaining a operator free formula ψ with ψEφ. Since this formula does not contain any operators, we can treat it as a formula of propositional logic. So as described by Pratt in [10], we can use Spira’s construction from [11] in order to construct an ML-formula ψ2 with ψ2ψEφ. The size of ψ2 is polynomial in the size of ψ, which in turn is bounded from above by the size of φ. Now we just need to combine these formulas to obtain ψ. In order to do so we use the formula , which is true on all graphs from 𝒮S5 and false for all edgeless graphs, as well as its negation . We construct the wanted ML-formula as ψ:=(ψ1)(ψ2). This formula fulfils the following:

ψ𝒮S5(ψ1)(ψ2)𝒮S5(ψ1)(ψ2)𝒮S5ψ1𝒮S5φ
ψE(ψ1)(ψ2)E(ψ1)(ψ2)Eψ2Eφ

Which also implies ψT,Sφ. The size of this formula is |ψ1|+|ψ2|+7, so the sum of two polynomials over |φ| and a constant, which is also polynomial in the size of φ.

5 Conclusion

The main result of this paper is the development of a new model comparison game, the -Operator-Number game. This game can be used to prove lower bounds for the number of modal operators necessary to express certain properties in modal logic. An interesting follow-up question to this might be: For which logics and which parameters can a similar game be designed? If there is a version of the Hella-Vilander game for some logic, it should be possible to adjust that game to only count all occurrences of one specific type of operator. This should work for every type of operator of that logic. The more interesting part of that question is whether or not some process similar to our idea of ’tidying up’ is possible. A positive answer is given for FOl in [14]. However additional research is necessary to achieve a better understanding of this topic.

We also give a more complete overview of the succinctness of modal logic with bi-implication on different classes of Kripke structures. For every possible combination of the properties reflexive, transitive and symmetrical, we have either proven or disproven the existence of an exponential succinctness gap between ML and ML[] on the corresponding class of structures.

A possible future task could be to look into even more classes of Kripke structures, for example those with a function or a partial function as accessibility relation. Although these examples might require a quite different proof from the ones presented in this paper. Instead we could also attempt to adjust our proofs for a different logic. Our current goal is to do this for the logic CTL. This logic can be seen as an extension of ML, where more complex operators are added. Our hope is that we can show that bi-implications can be used to write exponentially shorter formulas in CTL as well.

References

  • [1] Micah Adler and Neil Immerman. An n! lower bound on formula size. ACM Transactions on Computational Logic, 4(3):296–314, 2003. doi:10.1145/772062.772064.
  • [2] Christoph Berkholz, Dietrich Kuske, and Christian Schwarz. Boolean basis, formula size, and number of modal operators. Logical Methods in Computer Science, Volume 21, Issue 3, July 2025. doi:10.46298/lmcs-21(3:10)2025.
  • [3] Marco Carmosino, Ronald Fagin, Neil Immerman, Phokion Kolaitis, Jonathan Lenchner, and Rik Sengupta. Multi-structural games and beyond. Logical Methods in Computer Scienc, 20:27:1–27:40, 2024. doi:10.48550/arXiv.2301.13329.
  • [4] Andrzej Ehrenfeucht. An application of games to the completeness problem for formalized theories. Fundamenta Mathematicae, 49:129–141, 1960. URL: https://eudml.org/doc/213582.
  • [5] Ronald Fagin, Jonathan Lenchner, Kenneth W. Regan, and Nikhil Vyas. Multi-structural games and number of quantifiers. Logical Methods in Computer Science, 21(1), 2025. doi:10.46298/lmcs-21(1:10)2025.
  • [6] Roland Fraïssé. On some classifications of relationship systems. Publications scientifiques de l’Université d’Alger, Série A, Sciences mathématiques, 1954.
  • [7] Tim French, Wiebe van der Hoek, Petar Iliev, and Barteld Kooi. Succinctness of epistemic languages. In Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence (IJCAI-11), pages 881–886, 2011. doi:10.5591/978-1-57735-516-8/IJCAI11-153.
  • [8] Martin Grohe and Nicole Schweikardt. The succinctness of first-order logic on linear orders. Logical Methods in Computer Science, 1(1), 2005. doi:10.2168/LMCS-1(1:6)2005.
  • [9] Lauri T. Hella and Miikka S. Vilander. Formula size games for modal logic and μ-calculus. Journal of Logic and Computation, 29(8):1311–1344, 2019. doi:10.1093/logcom/exz025.
  • [10] Vaughan R. Pratt. The effect of basis on size of boolean expressions. In Proceedings - Annual IEEE Symposium on Foundations of Computer Science, FOCS, pages 119–121, October 1975. doi:10.1109/SFCS.1975.29.
  • [11] Philip Spira. On time-hardware complexity tradeoffs for boolean functions. In Proceedings of the 4th Hawaii Symposium on System Sciences, 1971, pages 525–527, 1971.
  • [12] Wiebe van der Hoek, Petar Iliev, and Barteld Kooi. On the relative succinctness of two extensions by definitions of multimodal logic. Lecture Notes in Computer Science, 7318 LNCS:323–333, 2012. doi:10.1007/978-3-642-30870-3_33.
  • [13] Hans van Ditmarsch, Jie Fan, Wiebe van der Hoek, and Petar Iliev. Some exponential lower bounds on formula-size in modal logic. In Advances in Modal Logic, volume 10, pages 139–157, August 2014. URL: https://www.scopus.com/inward/record.uri?eid=2-s2.0-84924671231&partnerID=40&md5=3f43caf649380733487e57e7338bd34d.
  • [14] Harry Vinall-Smeeth. From quantifier depth to quantifier number: Separating structures with k variables. In Proceedings - Symposium on Logic in Computer Science, 2024. doi:10.1145/3661814.3662125.

Appendix A Appendix

A.1 Proof of Lemma 2

Claim of Lemma 2. Let =(W,R,V) be a finitely labelled graph and A,B,A,BW be sets of nodes with t({A,B})={A,B}. If there is a separating formula φ for (,A) and (,B), then there is also a separating formula φ for (,A) and (,B) with |φ|O=|φ|O.

Proof.

Without loss of generality we assume AA and BB. In order to prove this lemma, we look at the sets A′′=AA and B′′=BB. We know that no node in A′′ has the same colour as any node in B and no node in B′′ has the same color as any node in A. It is easy to see that two pointed Kripke structures based on differently coloured nodes can be separated by a literal. In this case let ψa,b be a literal with (,a)ψa,b and (,b)⊧̸ψa,b. Then for every aA′′ we define Ψa:={ψa,bbB}. All nodes in W agree on all labels that are not used in . Then every formula in Ψa must be either an atomic formula from V(W) or the negation of such an atomic formula. So even though B might be infinite, Ψa will always be finite, since V(W) is finite. So we can construct the formula ψa:=ψa,bΨaψa,b, which separates (,a) from (,B). Next we look at the set Ψ:={ψaaA′′} and the formula ψ:=ψaΨψa. This formula ψ holds in (,A′′), while its negation holds in (,B).

In a similar fashion we define Xb:={ψa,baA} and χb:=ψa,bXbψa,b for every bB′′. Next we construct X:={χbbB′′} and χ:=χbXχb. This formula holds in (,A) and its negation holds in (,B′′). With this we can finally define the wanted formula φ. Without loss of generality we assume (,A)φ. Then φ=(φχ)ψ is a separating formula for (,A) and (,B). Since neither ψ nor χ contain any operators, |φ|O=|φ|O holds as well.

A.2 Proof of Lemma 3

Claim of Lemma 3. Let =(W,R,V) be a finitely labelled graph, A,BW be sets of nodes that form a tidy pair and let φ be an ML-formula without any modal operators. Then (,A)φ if, and only if, (,B)φ.

Proof.

If A=B=, then the Lemma is trivially true. Otherwise let (,A)φ. For every bB there is a aA with V(a)=V(b). Then a and b agree on all atomic formulas and thus on all operator-free ML-formulas, so especially on φ. This means (,b)φ for every bB and thus (,B)φ. The converse implication follows from {A,B}={B,A}.

A.3 Proof of Lemma 7

During the proof of Lemma 7 we sketch a method that removes all from an ML[]-formula, while guaranteeing an exponential increase at most in the number of modal operators. Here we give a proper formal definition of this construction. We also prove its correctness.

Proof.

We recursively define a function f:ML[]ML[]. The base case consist of all formulas φ without any operators and in this case f(φ):=φ. For any other formula φ there must be a sub-formula χ so that χ appears in φ outside of the scope of any other operator. Then φ can be written as ψ(χ) for some formula ψ. We define f(φ) as f(φ):=(f(ψ())f(χ))(f(ψ())¬f(χ)).

Next we want to show that f(φ) fulfils the following three conditions for every ML[] formula φ: Firstly f(φ)φ, secondly no operator in f(φ) appears in the scope of any bi-implication and thirdly |f(φ)|O2|φ|O|φ|O. We do this via induction on the number of operators in φ. As our base case we look at all ML[] formulas φ with no modal operators. Then we know f(φ)=φ, so the first condition is obviously fulfilled. Because f(φ) does not contain any operators, the second condition is also trivially true and because of |f(φ)|O=|φ|O=0, the third condition is also fulfilled.

For the induction step we look at all formulas φ with |φ|O=i for some i1. In this case let ψ and χ be the sub-formulas with φ=ψ(χ), that were used when constructing f(φ). Because of |ψ|O<|φ|O and |χ|O<|φ|O we may assume as our induction hypothesis that both f(ψ) and f(χ) fulfil all three conditions. Then we know φ(ψ()χ)(ψ()¬χ). Because of f(ψ)ψ and f(χ)χ, this is equivalent to (f(ψ())f(χ))(f(ψ())¬f(χ))=f(φ). No operator in f(ψ) or in f(χ) appears in the scope of any bi-implication. So the same must hold for f(φ), since no new bi-implication is added during its construction. For the number of operators we know |φ|O=|ψ|O+|χ|O+1 and |f(φ)|O=2|f(ψ)|O+2|f(χ)|O+2. So by our induction hypothesis we know that |f(φ)|O2|ψ|O+1|ψ|O+2|χ|O+1|χ|O+2. The value of the right hand side of this inequality becomes largest if all but one operators of φ belong to the same sub-formula. So it is bounded from above by 2|φ|O(|φ|O1)+2. For |φ|O1 this is in turn bounded from above by 2|φ|O|φ|O. So the third condition is fulfilled as well.

In total we have shown that for every ML[]-formula φ there exists an equivalent ML[]-formula f(φ) where no operator appears in the scope of a bi-implication and the number of operators in φ is at most exponential in the number of operators in φ. Lastly we can replace every sub-formula ψχ in f(φ) with (ψχ)(¬ψ¬χ), in order to transform f(φ) into an ML-formula, without increasing the number of operators any further. This construction proves, that the operator-succinctness gap between ML and ML[] is at most exponential.

A.4 Proof of Lemma 10

During the proof of Lemma 10 we claimed that player D can ensure that a play of the game never leaves the set WD. Here we want to give a complete proof of this fact.

Proof.

We begin with some general facts. Since t,r is reflexive, player S can always play an operator-move, where he picks every node as its own successor. However D can answer by doing the same. So the follow up position consists of the same pair of sets as the current position only with the parameter reduced by one. So the new position belongs to the same set as the old position. Also S cannot play a split-move, as long as both sets of the current position contain only one element each.

We start with WD0. So let ({A,B},k)WD0 be some position and let wAB be some node shared between A and B. For k2, S can play a split-move in this position. He picks X{A,B} and then splits this set into X1 and X2. Then the shared node w has to be in one of these two sets, w.l.o.g. we can assume wX1. Then D can ensure wY for her answer and the game continues from the position (t({X1,Y}),k1). The node w is also a shared node for X1 and Y. For this reason w cannot be removed by tidying up, because there will always be a node of the same colour in the respective other set, namely w itself. So in the position after the move, the two sets still have w as a shared node. Then this new position lies in WD0 as well and thus in WD. On the other hand S might play an operator-move instead. In this case he picks X{A,B} and then for every xX he picks a successor f(x). Because of wY, we know f(w)Y. So D can ensure f(w)Y. Then f(w) is a shared node for X and Y and thus it is not removed by tidying up. This means that the follow-up position (t({X,Y}),k1) is also a member of WD0 and of WD. So D can ensure that the s,r-ON game cannot move from a position inside of WD0 to a position outside of WD.

During an operator-move, if S tries to pick some node x as f(x) with xY, the D can ensure xY. So x is a shared node for the new position and thus this position is a part of WD0. Then S cannot leave WD by playing like this. So from now on we may ignore all of his moves where he picks some shared successor. This is especially relevant, because in t,r all nodes that are two or more layers below the current node are successors of the current node. So every move where D picks some node that is two or more layers below any node from Y leads to a position from WD0. For the rest of this proof we may ignore every move of S with this property.

Next we look at WD1. Let ({{ai},{bi}},k) be a position with i1 and k2i2. For i=1 we know k=0. So S cannot play any moves, meaning he cannot leave WD. So from now on we can assume i>1. Then S has to play an operator-move in such a position. However every successor of bi is either bi itself or a successor of bi as well. Also the only successor of ai that is not a successor of bi as well is 2i. So S has to pick 2i as a successor of ai. To this D answers with {1i,3i} and the follow-up position ({{2i},{1i,3i}},k1) is a part of WD2.

We already gave a detailed explanation for WD2 during the main proof, so we will skip it here.

Now we look at WD3, which contains positions ({{2i,4i},{1i,5i}},k) with i2 and k2i4. Every successor of 4i is also a successor of 1i as well, including 4i itself. So we can ignore all operator-moves, where S tries to pick successors for 2i and 4i. Similarly every proper successor of 5i is also a successor of 2i. This means we only need to look at S picking 5i as a successor of itself and 6i as successor of 1i. Then D answers with {4i,7i} and the new position ({{4i,7i},{5i,6i}},k1) belongs to WD5. S can also play a split-move, where he either splits {2i,4i} into {2i} and {4i} or {1i,5i} into {1i} and {5i}. Either way he also has to split k into k1 and k2. Because of k2i4, D can always pick a j{1,2} so that kj2i11. The set D picks depends on the choice of Xj. She answers with {2i} to {1i} and the other way around. Then the follow-up position ({{1i},{2i}},kj) is a member of WD6. On the other hand D picks {5i} as an answer to {4i} and vice versa. In this case the play moves to ({{4i},{5i}},kj), which belongs to WD8. Either way S cannot leave WD from WD3.

For the next set, WD4, we look at positions ({{2i,7i},{3i,6i}},k) with i2 and k2i4. The arguments for these positions are very similar to those used for WD3. Every successor of 7i is also a successor of 3i and every proper successor of 6i is a successor of 2i. So the only operator-move we need to look at is S picking 5i as a successor of 3i and 6i as its own successor. To this D answers with {4i,7i} The follow-up position is ({{4,7},{5,6}},k1) and belongs to WD5. S can also either split {2i,7i} into {2i} and {7i} or he can split {3i,6i} into {3i} and {6i}. Either way D can always guarantee kj2i11 for some j{1,2}. Then she can answer in such a way that the next position is either ({{2i},{3i}},kj) from WD7 or ({{6i},{7i}},kj) from WD9. So S cannot leave WD from WD4 either.

WD5 contains all positions ({{4i,7i},{5i,6i}},k) with i2 and k2i4. Every proper successor of 4i or 7i is also a successor of 5i or 6i and vice versa. So we already know that every operator-move leads to a position from WD. However S can also play a split move. He either splits {4i,7i} into {4i} and {7i} or he splits {5i,6i} into {5i} and {6i}. He also splits the parameter k into k1 and k2. Then D can always pick j{1,2} in such a way that kj2i11. She can also choose Y in such a way that the follow-up position is either ({{4i},{5i}},kj), which belongs to WD8, or ({{6i},{7i}},kj), a member of WD9.

The next set we look at is WD6, which contains positions ({{1i},{2i}},k) with i2 and k2i1. Here S has to play an operator-move. Since we can ignore all shared successors, the only moves we need to look at are picking 6i as a successor of 1i and picking 7i as a successor of 2i. In any case D answers with the respective other node. Then the new position is ({{6i},{7i}},k1)WD9.

In positions ({{2i},{3i}},k) from WD7 S has to play an operator-move as well. The only moves we need to check are S picking 4i as a successor of 2i or 5i as a successor of 3i. In either case D can guarantee ({{4i},{5i}},k1), a position from WD8, as the follow-up position.

The set WD8 contains positions ({{4i},{5i}},k) with i2 and k2i11. In such a position S has to play an operator-move. Aside from picking shared successors or using only self-loops, S can only pick ai1 as a successor of 4i or bi1 as a successor of 5i. Then D answers with whatever node S did not pick and the next position is ({{ai1},{bi1}},k1), a member of WD2.

Lastly we look at the set WD9, so positions ({{6i},{7i}},k) with i2 and k2i11. We only need to check what happens when S picks either ai1 as a successor of 6i or bi1 as a successor of 7i. Independent of his choice, D can guarantee ({{ai1},{bi1}},k1)DW1 as the new position.

A.5 Proof of Lemma 14

During the proof of Lemma 14 we stated that player D can ensure that it is impossible to leave the set WD by playing a move of the s,r-ON game. Here we want to give a complete proof of this fact.

Proof.

During the proof of Lemma 10 we showed that in t,r player S cannot leave WD0. The argumentation we used was completely independent from the actual edges of the graph. So we can use exactly the same argumentation here to show that S cannot leave WD0 in s,r either. This also means that S cannot leave WD by picking some shared successor. For s,r in particular this includes all moves, where he tries to pick a node of the current layer.

The next set we want to look at is WD1. So let ({{ai},{bi}},k) be a position with i1 and k2i2. For i=1 we know k=0. As already mentioned S cannot make another move for k=0, meaning he cannot leave WD. For i2 he cannot play a split-move, since both sets have only one element each. So he has to play an operator-move instead. Recall that the nodes ai and bi belong segment i+1 as well as to segment i and since s,r is a symmetrical graph, these nodes also have successors in the layer above, layer 3i+1. Then S can pick 4i+1, 6i+1 or 2i as a successor of ai. Alternatively he may pick 5i+1 or 7i+1 as a successor of bi. Every other successor of ai or bi is a shared successor. If he choose either 4i+1 or 5i+1, then D answers with a set that contains only the respective other node. This leads to the follow-up position ({{4i+1},{5i+1}},k1). Because of k12i3 this position belongs to WD7. Alternatively if S picked 6i+1 or 7i+1, then D also answers with the respective other node. Then the follow-up position is ({{6i+1},{7i+1}},k1), which belongs to WD8. Lastly if S decided to move from ai to 2i, then D answers with the set {1i,3i} which progresses the play to the position ({{2i},{1i,3i}},k1). This position belongs to WD2. So independent of the choices made by S, the play always continues from some position within WD.

We already checked WD2 during the main proof, so we skip it here.

We continue by analysing WD3. Let ({{4i,7i},{5i,6i}},k) be some position with i2 and k2i4. If S plays an operator-move from this position, he has to pick 2i as a successor of both 4i and 7i, since every other successor is a shared successor. To this, D answers with the set {1i,3i}. Then the follow-up position ({{2i},{1i,3i}},k1) belongs to WD2. However for k2, S can also play a split-move. In this case he has to split either {4i,7i} into {4i} and {7i} or {5i,6i} into {5i} and {6i}. He also splits k into k1 and k2. Let kj be the smaller of these two. Then we know kj2i12. Then D answers by picking the side with kj. The set she picks depends on what choice was made by S for Xj. She answers with {5i} to {4i} and vice versa. Similarly she picks {6i} as an answer to {7i} and the other way around. So the follow-up position is either ({{4i},{5i}},kj), which is a member of WD7, or ({{6i},{7i}},kj), which is a member of WD8. So S cannot leave WD from WD3 either.

The next set is WD4. So we look at a position ({{1i},{2i}},k) with i2 and k2i1. In this position, S cannot play a split-move, since both sets contain only one node each. This will be the case for all sets from this point on, so we will stop mentioning it going forward. Now player S has to play an operator-move, where he picks bi or 6i as a successor of 1i or 7i as a successor of 2i. If he picked bi, D answers with {ai}. This moves the game to ({{ai},{bi}},k1). Because of i2, which implies 2i12i2, this position belongs to WD1. On the other hand, if S choose 6i or 7i, then D answers with the set only containing the respective other node. Then the follow-up position is ({{6i},{7i}},k1), which belongs to WD8. Then D can force any play from WD4 to stay in WD.

We continue our proof with WD5. The positions within this set are the shape of ({1i},{3i}},k) with i2 and k2i1. When playing an operator-move, S may pick 4i or 6i as a successor of 1i or he may pick 5i or 7i as a successor of 3i. Then D answers with {5i} to 4i, with {7i} to 6i, with {4i} to 5i and lastly with {6i} to 7i. Then the play continues with ({{4i},{5i},k1), a position from WD7, or with ({{6i},{7i}},k1), a position from WD8. This finishes the case of WD5.

Now we look at the set WD6, which contains all positions ({{2i},{3i}},k) with i2 and k2i1. Here S may pick 4i as a successor of 2i or alternatively bi or 5i as a successor of 3i. If he picked bi then D answers with {ai}. Then the follow-up position ({{ai},{bi}},k1) is a member of WD1. If S choose 4i or 5i instead, then D answers by picking a singleton of the respective other node. In this case the game moves on to the position ({{4i},{5i}},k1), which belongs to WD7.

The set WD7 consists of all positions ({{4i},{5i}},k) with i2 and k2i11. Then S can play an operator-move by picking 1i, 2i or ai1 as a successor of 4i. Alternatively he may also pick 3i or bi1. If S choose 2i or 3i, then D answers with {3i} or {2i} respectively. This creates the follow-up position ({{2i},{3i}},k1), which is a part of WD6. If S picked 1i, then D answers with {3i}, creating ({{1i},{3i}},k1) instead. This position belongs to WD5. On the other hand if S picked ai1 or bi1, then D answers with a set only containing the respective other node. In this case the follow-up position ({{ai1},{bi1}},k1) belongs to WD1.

Lastly we look at the set WD8. This set contains all positions ({{6i},{7i}},k) with i2 and k2i11. From such a position, S may play an operator-move where he picks either 1i or ai1 as a successor of 6i or 2i, 3i or bi1 as a successor of 7i. If he choose 1i or 2i, then D answers with the respective other node as a singleton. The follow-up position ({{1i},{2i}},k1) is a member of WD4. If S choose 3i instead, then D answers with {1i} and the new position is ({{1i},{3i}},k1), a member of WD5. On the other hand if S picked ai1 or bi1, then D picks {bi1} and {ai1} respectively. Then the game continues with ({{ai1},{bi1}},k1), a position from WD1. So all follow-up positions belong to WD.