A Dichotomy Theorem for Ordinal Ranks in MSO
Abstract
We focus on formulae of monadic second-order logic over the full binary tree, such that the witness is a well-founded set. The ordinal rank of such a set measures its depth and branching structure. We search for the least upper bound for these ranks, and discover the following dichotomy depending on the formula . Let be the minimal ordinal such that, whenever an instance satisfies the formula, there is a witness with . Then is either strictly smaller than or it reaches the maximal possible value . Moreover, it is decidable which of the cases holds. The result has potential for applications in a variety of ordinal-related problems, in particular it entails a result about the closure ordinal of a fixed-point formula.
Keywords and phrases:
dichotomy result, limit ordinal, countable ordinals, nondeterministic tree automataCopyright and License:
![[Uncaptioned image]](x1.png)
2012 ACM Subject Classification:
Theory of computation Automata over infinite objects ; Theory of computation Tree languagesAcknowledgements:
The authors would like to thank Marek Czarnecki for preliminary discussions on related subjects.Funding:
All authors supported by the National Science Centre, Poland (grant no. 2021/41/B/ST6/03914).Editors:
Olaf Beyersdorff, Michał Pilipczuk, Elaine Pimentel, and Nguyễn Kim ThắngSeries and Publisher:

1 Introduction
The concept of well-founded relation plays a central role in foundations of mathematics. It gives rise to ordinal numbers, which underlie the basic results in set theory, for example that any two sets can be compared in cardinality. Well-foundedness is no less important in the realm of computer science, where it often underlies the proofs of termination of non-deterministic processes, especially when no efficient bound on the length of a computation is known. In such cases, the complexity of possible executions is usually measured using an ordinal called rank. Such a rank can be seen as a measure of the depth of the considered partial order, taking into account suprema of lengths of possible descending chains. Estimates on a rank can provide upper-bounds on the computational complexity of the considered problem [26].
In this work, we adopt the perspective of mathematical foundations of program verification and model-checking. We focus on the monadic second-order logic (MSO) interpreted in the infinite binary tree (with the left and right successors as the only non-logical predicates), which is one of the reference formalisms in the area [29]. The famous Rabin Tree Theorem [25] established its decidability, but – half a century after its introduction – the theory is still an object of study. On one hand, it has led to numerous extensions, often shifting the decidability result far beyond the original theory (see e.g. [4, 24]). On the other hand, a number of natural questions regarding Rabin’s theory remain still open, including a large spectrum of simplification problems. For example, we still do not know whether we can decide if a given formula admits an equivalent form with all quantifiers restricted to finite sets. Similar questions have been studied in related formalisms like -calculus or automata; for example if we can effectively minimise the Mostowski index of a parity tree automaton [11, 15], or the -alternation depth of a -calculus formula [5].
On positive side, some decidability questions have been solved by reduction to the original theory. For example, it has been observed [22] that for a given formula , the cardinality of the family of tuples of sets satisfying can be computed; this cardinality can be either finite, , or . Later on, Bárány, Kaiser, and Rabinovich [3] proved a more general result: they studied cardinality quantifiers , stating that there are at least distinct sets satisfying , and showed that these quantifiers can be expressed in the standard syntax of MSO; thus the extended theory remains decidable.
In the present work, instead of asking how many sets witness to the formula , we ask how complex these witnesses must be in terms of their depth-and-branching structure. A set of nodes of a tree is well-founded if it contains no infinite chain with respect to the descendant order. In this case, a countable ordinal number is well defined (see Section 2 below); intuitively, the smaller , the simpler the set is, in terms of its branching structure.
We consider formulae of the form , where is an arbitrary formula of MSO (it may contain quantifiers). We assume that, whenever the formula is satisfied for some valuation of variables , the value of witnessing the formula is a well-founded set. Note that well-foundedness of a set is expressible in MSO (it suffices to say that each branch contains only finitely many nodes in ), hence the requirement can be expressed within . For a fixed formula as above, we ask what is the minimal ordinal , such that the rank of a witness can be bounded by ,
(1.1) |
where and range, as expected, over the values satisfying .
Since is a supremum of countable ordinals, its value is at most (the least uncountable ordinal). It can be achieved, for example, by the formula “ and is a well-founded set”, as there are well-founded sets of arbitrarily large countable ranks. On the other hand, for each pair of natural numbers , one can construct a formula with rank in an analogous way to Czarnecki [12], see Lemma 9.4 below. The main result of this work shows that no other ordinals can be obtained:
Theorem 1.1.
For any formula as above, the ordinal is either strictly smaller than or equal to . Moreover, it can be effectively decided which of the cases holds. In the former case, it is possible to compute a number such that .
We also show that, in contrast to the aforementioned cardinality quantifiers, the property that is smaller than cannot be expressed directly in MSO (see Corollary 9.3).
The proof of Theorem 1.1 develops the game-based technique used previously to characterise certain properties of MSO-definable tree languages (see e.g. [9, 28]). Each application of this technique requires a specific game, designed in a way which reflects the studied property. The game should have a finite arena and be played between two perfectly-informed players and . The winning condition of the game is given by a certain -regular set. Then, the seminal result of Büchi and Landweber [8] yields that the game is determined and the winner of this game can be effectively decided. The construction of the game is such that a winning strategy of each of the players provides a witness of either of the considered possibilities: if wins then and if wins then for some computable . The crucial difficulty of this approach lies in a proper definition of the game, so that both these implications actually hold.
Related work
In the context of -calculus, one asks how many iterations are needed to reach a fixed point; this aspect concerns complexity of model checking (cf. [7, 14]), as well as expressive power of the logic (cf. [7, 6]). Recall that, in an infinite structure, a least fixed point is in general reached in a transfinite number of iterations: , where, for a limit ordinal , . It is therefore natural to ask if, for a given formula, one can effectively find a closure ordinal , such that, in any model, the fixed point can be reached in , but in general not less, iterations. Fontaine [18] effectively characterised the formulae such that in each model the fixed point is reached in a finite number of steps. Czarnecki [12] observed that some formulae have no closure ordinals but, for each ordinal , there is a formula whose closure ordinal is . He also raised the following question.
Question 1.2 (Czarnecki [12]).
Is there a -calculus formula of the form that has a closure ordinal ?
Gouveia and Santocanale [19] exhibited an example of a formula with an essential alternation of the least and greatest fixed-point operators whose closure ordinal is ; clearly this limit can be achieved only in uncountable models. In general, it remains open whether a formula of the -calculus may have a countable closure ordinal . Afshari and Leigh [2] claimed a negative answer for formulas of the alternation-free fragment of -calculus; however, as the authors have later admitted [1], the proof contained some gaps. In a recent paper [1], Afshari, Barlucchi, and Leigh update the proof and extend the result to formulae of the so-called -fragment of the -calculus. More specifically, the authors consider systems of equations , where the formulae may contain closed sub-formulae of the full -calculus, but the variables do not fall in the scope of any fixed-point operators. The authors show that if a countable number of iterations suffice to reach the least fixed point of such a system in any Kripke frame then . (Note that, by Bekič principle, each component of such a solution can be expressed by a single formula with the nested -operators.)
As a direct consequence of our result, we obtain an alternative proof for formulae of the form , where does not fall in the scope of any fixed-point operator, but arbitrary closed subformulae may appear in . (This corresponds to the case of in the equation system of [1].) We show that no such formula has a closure ordinal satisfying , and moreover, it can be decided whether ; see Section 10. This is achieved via a well-known reduction of the -calculus to the MSO theory of the binary tree, based on the tree-model property [7], and a natural encoding of a tree model.
In the studies of topological complexity of MSO-definable tree languages, it has been also observed that dichotomies may help to solve decision problems. An open problem related to the aforementioned question of definability with finite-set quantifiers, is whether we can decide if a tree language belongs to the Borel hierarchy (in general, it need not). A positive answer is known if a tree language is given by a deterministic parity automaton [23], based on the following dichotomy: such a language is either -complete (very hard), or on the level (relatively low) of Borel hierarchy. Skrzypczak and Walukiewicz [28] gave a proof in the case when a tree language is given by a non-deterministic Büchi tree automaton, inspired by a rank-related dichotomy conjectured in the previous work of the first author [27, Conjecture 4.4]. There, an ordinal has been associated with each Büchi tree automaton, and it turns out (in view of [28]) that this ordinal either equals or is smaller than , in which case the tree language is Borel. It should be mentioned that a procedure to decide if a Büchi definable tree language is weakly definable was given earlier by Colcombet et al. [10].
Relation between ordinals and automata has been also considered in the studies of automatic structures. In particular, Delhommé [13] showed that automatic ordinals are smaller than while tree-automatic ordinals (defined in terms of automata on finite trees) can go higher, but not above . Later, Finkel and Todorcevic [17] showed that -tree automatic ordinals are smaller than as well. These results may appear in contrast with our more restrictive bound of ; however, representation by automatic structures is in general more powerful than expressibility in MSO, so the two approaches are not directly related.
2 Basic notions
denotes the set of natural numbers. We use standard notation for ordinal numbers, with being the least ordinal number, being the least infinite ordinal, and the least uncountable ordinal. Although and coincide as sets, we distinguish the two notations to emphasise the perspective.
Words.
An alphabet is any finite non-empty set, whose elements are called letters. A word over is a finite sequence of letters , with for , and being the length of . The empty word, denoted , is the unique word of length . By we denote the concatenation of words and , that is, .
denotes the set of all finite words over the alphabet , while denotes the set of all -words over , that is, functions . If is an infinite word and then by we denote the finite word consisting of the first letters of , that is, .
Trees.
Let and denote two distinct letters called directions. For a direction , the opposite direction is denoted . Words over the alphabet are called nodes, with the empty word often called the root. An -word is called an infinite branch. A node is called the parent of its children and .
A (full infinite binary) tree over an alphabet is a function , assigning letters to nodes . The set of all trees over is denoted . A subtree of a tree in a node is the tree over defined by taking .
Automata.
Instead of working with formulae of MSO, we use another equivalent formalism, namely non-deterministic parity tree automata [29]. An automaton is a tuple , where is an alphabet, is a finite set of states, is an initial state, is a transition relation, and is a priority mapping. A run of an automaton from a state over a tree is a tree such that and for every node the quadruple
is a transition of (i.e., belongs to ).
A sequence of states is accepting if is an even natural number. A run is accepting if for every infinite branch the sequence of states that appear in on is accepting.
A tree is accepted by an automaton from a state if there exists an accepting run of from the state over . The language of , denoted , is the set of trees which are accepted by the automaton from the initial state . A language is regular if it is for some automaton .
We now recall the famous theorem of Rabin, which allows us to transform MSO formulae into equivalent tree automata.
Theorem 2.1 ([25], see also [29]).
For every MSO formula the set of valuations satisfying is a regular language over the alphabet .
We say that an automaton is pruned if every state appears in some accepting run, that is, there exists an accepting run of over a tree such that for some node . Note that every automaton can effectively be pruned, without affecting its language, by detecting and removing states that do not appear in any accepting run.
Games.
We use the standard framework of two-player games of infinite duration (see e.g., [8, 16]). The arena of such a game is given as a graph with both players having perfect information about the current position. The winning condition of the game is given by a language of infinite plays won by one of the players.
Ranks.
Recall that an ordinal can be seen as the linearly ordered set of all ordinals smaller than . Given a set , a counting function is a function such that whenever is a proper descendant of (such a function exists for some whenever is well-founded). The rank of a well-founded set is the least ordinal for which a counting function from to exists.
Taking the automata-based perspective, instead of working with monadic variables and , we consider labellings of the tree by certain finite alphabets. In particular, a set of nodes can be identified with its characteristic function, that is, a tree over the alphabet . Such a tree is well-founded if no infinite branch contains infinitely many nodes labelled by . The set of well-founded trees is denoted . Likewise, we define the rank of , denoted , as the least for which there is a counting function from the set of -labelled nodes of to . The considered rank is analogous with the standard rank of well-founded trees (e.g., [21, Section 2.E]).
Example 2.2.
The tree with all nodes labelled by has rank .
Consider a tree where a node has label if with . It is a comb: the rightmost branch is labelled by zeros, and below its -th node we have a tooth of nodes labelled by ones, going left. The rank of is .
Such combs can be nested: suppose that starts analogously to , but below every tooth we again insert (i.e., for every ); then . Repeating this, we can insert below every tooth of , and obtain of rank , for every .
Then, we can place every at node , below a -labelled rightmost branch; the resulting tree has rank . In a similar manner we can create a tree having rank equal to any countable ordinal .
3 Problem formulation
We begin by formulating the problem under consideration. Instead of working with a formula , we assume that is some alphabet and is a regular language over the alphabet . We identify a tree over with a pair , where and such that for every node . Thus, can be seen as a relation whose elements are pairs . We additionally require that whenever then is well-founded, which means that (treated as a relation) is contained in . We say that such a relation is regular if it is regular as a language over .
Let be the projection of onto the coordinate, that is, the set of those trees for which there exists a (necessarily well-founded) tree such that . Similarly, for a tree by we denote the section of over the tree , that is, the set .
The following definition is just a reformulation of Formula 1.1 in terms of a relation .
Definition 3.1.
The closure ordinal of a relation (or of an automaton recognising it) is defined as
Example 3.2.
Consider the following automaton over the alphabet . Its states are with being initial, where the subscript provides the priority of a state (i.e., ). The transitions are, for all , , and ,
In this example, we should see -labelled nodes as separators, splitting the whole tree into -labelled fragments. Let us see when a pair can be accepted. Note first that accepts from for every tree , and for having all nodes labelled by . Next, observe that states become aligned along a single branch, either finite or infinite. If in there is a branch that infinitely often goes left, but visits only finitely many -labelled nodes, then we can align the states along this branch, and will be accepted. Indeed, just below every we have (state of large odd priority; has to occur finitely often); below every , if we go left we have , and if we go right we have (so the parity condition requires infinitely many left-turns). Another possibility is that the branch with states is finite, and below some -labelled node the state changes to . Now the run sends to every left child, and to every right child; hence every left child in should have label , and every right child – label . We can continue the zone of states until reaching a node labelled by ; such a node allows us to change the state into and accept anything below. The acceptance condition requires that there are only finitely many states (hence also nodes with label in ) on every branch; the tree is necessarily well-founded.
This determines the optimal rank of a witness for a tree . Namely, if in there is a branch that infinitely often goes left, but visits only finitely many -labelled nodes, we have a witness of rank . Otherwise, we should consider every zone of -labelled nodes in , surrounded by -labelled nodes; consider having in every left child in that zone; and take the minimum of ranks of such trees , over all choices of zones (not including the topmost zone, above the first on a branch). Thus, every witness of a tree has rank at least if and only if every such zone results in rank at least , and the former case does not hold. Such a tree exists for every , so the closure ordinal of is .
4 The dichotomy game
We now move to the definition of the game designed to decide the dichotomy from Theorem 1.1.
Note that for every regular relation there exists another regular relation with the same closure ordinal, but such that is the set of all trees over . To see this, it is enough to take . Then for and for .
Towards the proof of Theorem 1.1, we consider some relation such that and . Reformulating Theorem 1.1, we need to show that either for some , or , and we can effectively decide which case holds. We assume that is given by a pruned (i.e., all states appear in some accepting run) automaton .
First, a side is a symbol (which stands for “trunk” and “reach”), and a mode is a symbol (which stands for non-branching and binary-branching).
A quadruple such that implies (i.e., is allowed only for ) is called a selector for , where is of the form with , . Such a selector agrees with a direction if either or (i.e., both directions are fine if , but if then we require ). The output side of a selector in direction , where , is
-
if and , or and , and
-
otherwise: if and , or and .
A state-flow is a triple , where , , and . A flow is a set of state-flows. The set is called the image of . Note that the number of all possible state-flows, hence also of all possible flows, is finite.
Given a flow , we say that a flow is a back-marking of if for every pair in the image of , precisely one state-flow leading to belongs to . This state flow is called back-marked for .
Given a sequence of flows, we can define their composition as the graph with vertices and with a directed edge from to labelled by for every state-flow , . Note that in a flow there may be two state-flows with the same pairs and , with modes and , leading to two parallel edges in this graph.
Assume that we are given a set and a letter . By we denote the set of transitions of the form with .
Finally, for two sets of states we denote . Note that every subset of can be uniquely represented as for some .
We can now move to the definition of the crucial game , used to prove the desired dichotomy. The positions of are of the form (formally, one also needs additional auxiliary positions to represent the situation between particular steps of a round; we do not refer to these positions explicitly). The initial position is , where is the initial state of the automaton . The consecutive steps in a round from a position are as follows:
-
1.
declares a subset .
-
2.
declares a letter .
-
3.
declares a set of selectors, containing one selector for each .
-
4.
declares a direction .
-
5.
We define a flow as the set containing the state-flows for each selector that agrees with direction , where is the output side of the selector in the direction , and (so is the source state of and is the state sent by in direction ).
-
6.
declares some back-marking of the flow .
The new position of the game, , is the image of the flow .
Given a play of the game, the winning condition for is the disjunction A)B) of the following parts.
- A)
-
In the graph obtained as a composition of the back-markings there exists a path which infinitely many times changes sides between and .
- B)
-
In the graph obtained as a composition of the flows every infinite path either is rejecting or contains infinitely many state-flows of mode
.
Above, while saying that a path going through is rejecting, we mean that the sequence of states is rejecting, that is, is odd.
Remark 4.1.
The arena of the game is finite and the winning condition defined above is -regular. Thus, the theorem of Büchi and Landweber [8] applies: one can effectively decide the winner of . Moreover, there exists a computable bound such that whoever wins , can win using a finite-memory strategy that uses at most memory states.
The following proposition formalises the relation between and Theorem 1.1.
Proposition 4.2.
Assume that is a pruned automaton which recognises a relation such that (i.e., the projection of is full). Then, we have the following two implications:
-
1.
if player wins then ;
-
2.
if player wins then , for a number computable based on the automaton .
Depiction of a round of .
Figure 4.1 depicts an example of a round of the game . First declares a subset effectively removing the state from the side. Note that some states might repeat on both sides. Once the letter is chosen by , there are six possible transitions from the considered states: one from each of them, except the state which has two possible transitions, over and over . declares a set of selectors
Thus, the selectors for the transitions and are in the mode
When chooses a back-marking of the flow , he needs to make a choice which state-flow to select for the pair as there are two possible choices.
5 Intuitions
Let us explain the intuitions behind the game. First very generally: the game is designed so that wins when there are trees requiring witnesses of arbitrarily large ranks. Conversely, wins if there is a bound such that every tree has a witness of rank at most . But because this is a finite game with an -regular winning condition, if wins then he wins with a finite-memory winning strategy; from such a strategy we can deduce that the bound is actually of the form for some natural number depending on the size of the memory of .
Having the above in mind, let us discuss details of the game. The role of should be to show a tree (whose all witnesses have large rank), so we allow her to propose a label of a node in step 2. However, as usually in games, we do not continue in both children of the current node, but we rather ask to choose one direction ( in step 4), where has to continue creating the tree, and where thinks that it is impossible to continue.
Recall now that arbitrarily large countable ordinals can be obtained by alternately applying two operations in a well-founded way: add one, and take the supremum of infinitely many ordinals. Similarly, in a tree of a large rank, we can always find a comb: an infinite branch – a trunk – such that from infinitely many nodes on the side of this trunk we can reach a node labelled by , below which the rank is again large (but slightly smaller). In these places we can repeat this process, that is, again find an analogous comb, and so on, obtaining a tree of nested combs that is well-formed but itself has a large rank. Obviously the converse holds as well: such a tree of nested combs having large rank can exist inside only if has a large rank. Let us also remark that in the case of a finite-memory strategy of we obtain that such combs in can be nested at most times, which itself implies that the rank of is at most .
Thus, in order to show that the constructed tree allows only witnesses of large rank, shows a nested comb structure. But this has to be done for every such that , so, in a sense, for every run of over for some . As usual, we cannot require from to choose a run on the fly, during the game; an interesting (even an accepting) run of can only be fixed after the whole tree (i.e., the whole future of the play) is fixed. This means that during the game we have to trace all possible runs of . However, there are infinitely many runs, so we cannot do that. To deal with that, we keep track of all “interesting” states in the sets , , and we consider all possible transitions from them in step 3. This makes the situation of a bit worse: she has to make decisions based only on the current state, not knowing the past of the run (the same state may emerge after two different run prefixes). But it turns out that can handle that; in our proof this corresponds to positionality of strategies in an auxiliary game considered in Section 7.
Now, how exactly does show the nested comb structure?
This is done via selectors proposed in the sets .
For states in (i.e., on the “trunk” side) has to show in which direction the trunk continues.
Moreover, has to show places where on side of the trunk we can reach label followed by a nested comb; in these places plays mode
There is one more issue taken into account in the design of the game: should be obliged to produce the combs nested arbitrarily many times, but not infinitely many times. The number of nestings (i.e., of switches between sides and ) is controlled by . When he is satisfied with the nesting depth provided by for runs ending in some state, he can remove this step from the position in step 1, and let provide appropriate comb structures only from remaining states (we allow this removal only on the side, but we could equally well allow it on the side, or on both sides). The A) part of the winning condition obliges to indeed remove a state after seeing finitely many nestings. To see the usefulness of back-marking used to formulate this condition, consider a situation with two runs leading to some state: one with already many nestings of combs provided by , and other where we are still on the trunk of the first comb. Because should provide many nested combs for all runs, in such a situation should still be able to analyse the latter run. To this end, he can select the latter run as the back-marked history of the considered state, and continue waiting for further nested combs, without worrying that he will lose by the A) part of the winning condition due to the former run.
Example 5.1.
Let us see how the game behaves for the automaton from Example 3.2. Recall that the closure ordinal of is , so should be able to win.
The strategy of from a position is as follows. If contains a state , then plays letter , otherwise letter . Then proposes selectors: transitions originating from states are handled by selectors with mode and with direction being such that sends state to the opposite direction (i.e., for , etc.); transitions originating from are handled by direction , and by mode on the side and mode on the side. As we argue below, never becomes an element of , so transitions from need not to be handled.
The initial position is .
Here plays letter and some selectors with mode
It is also possible that erases a state from the side (i.e., plays being a proper subset of ). If state is erased, we end up in a position , being like the initial position. We may also have positions and , and flows , , between them. Finally, we may also reach .
Note that in our example there is always only one state-flow leading to every pair ; in consequence, every state-flow is back-marked.
Let us now check the winning condition.
One possibility is that infinitely many letters were played.
In these moments no state was present in the position, so the only infinite path in the composition of flows is the path going through appropriate states.
But after seeing every this state was , so the path is rejecting; part B) of the winning condition is satisfied
(we may also have no infinite path, if the state was removed by , but then condition B) holds even more).
The opposite case is that from some moment on, only letter was played.
We then also have an infinite path going, from some moment, through the states (this path really exists: if removes the state , then letter is played).
Note that whenever goes left, this path changes sides from to , and in the next round returns back to the side.
If this happens infinitely often, wins by condition A).
Otherwise, from some moment on constantly goes right.
After that, the path going through the states has all state-flows of mode
6 Soundness
We begin by proving Item 1 of Proposition 4.2: if wins then . To this end, assume that wins the game and fix any winning strategy for her. For every countable ordinal , our goal is to construct a tree such that whenever . We do this by inductively unravelling the fixed strategy of . During this process, we keep track of
-
the current node ,
-
the current position of the game,
-
a mapping , which assigns some ordinals to all elements of the set .
Initially , the position is the initial position of the game (i.e., ), and . Then, in every node , the letter declared by provides a label of this node in , and while moving to a child of we trace a play in which declares the respective direction . What remains is to declare the remaining choices of and say how the mapping is updated. The role of the ordinal is to provide a lower bound for ranks of witnesses such that can be accepted by from the state . While going down the tree, this ordinal decreases whenever we change the side from to ; when it becomes , removes the respective state from the set in his move. The back-markings are declared by in a way that maximises the ordinals of the respective state-flows.
It remains to show that if then . This is achieved by considering any accepting run of over . Since all possible transitions of are taken into account in each round of , one can trace the run in the simulated plays of . The policy of updating the mapping inductively assures that for every node and currently traced side we have and . Moreover, this policy guarantees that condition A) is never satisfied in these plays. Thus, condition B) must hold, inductively guaranteeing the inequality on .
7 Auxiliary game
Before moving towards a proof of the other implication, we need to be able to construct reasonable strategies of in . This is achieved by considering an auxiliary game, based directly on , when considering a single transition of at each round. The players of the game are called Automaton (responsible for choosing transitions and choices of in ) and Pathfinder (responsible for choices of in ). The game is denoted and depends on the fixed automaton , a tree , and a number . Positions of are triples , plus some additional auxiliary positions, to which we do not refer explicitly.
For a node and a state define
We assume (which is greater than all ordinals).
In positions such that the game reaches an immediate victory in which Pathfinder wins. A round from a position such that consists of the following steps:
-
1.
Automaton declares a transition from the current state over for some .
-
2.
Pathfinder declares a selector for .
-
3.
Automaton declares a direction that agrees with the selector.
Let , , and let be the output side of the selector in the direction . Now, for every possible number , the following four conditions of immediate victory may end the game, making Automaton win:
If no immediate victory happened, the game proceeds to the new position which is . Note that the conditions of immediate victory depend only on and , so can be directly hardwired in the structure of the game.
An infinite play (i.e., without any immediate victory) of is won by Pathfinder if the sequence of visited states is rejecting or a selector with mode
The main result about is the following lemma. First, the winning condition of guarantees that if Pathfinder wins the game then he wins it positionally. To prove that he wins, one assumes that Automaton wins from some position, which leads to a contradiction.
Lemma 7.1.
Pathfinder has a positional strategy in that is winning from every position of the game (we call such a strategy uniform).
8 Completeness
We now move to the proof of Item 2 of Proposition 4.2: if wins then for some computable . First observe the following important consequence of the winning condition of the game.
Proposition 8.1.
There exists a computable bound such that if wins then he has a winning strategy that guarantees the following. In every play (either finite or infinite) consistent with this strategy, every path in the graph obtained as the composition of back-markings contains less than changes of sides from to .
Compute the bound as above. Assume that wins the game and fix his winning strategy satisfying the thesis of the above Proposition. We claim that then the closure ordinal of the automaton is bounded by . Assume for the sake of contradiction that there exists a tree which does not have a witnessing tree of rank smaller than . The above assumption about implies that .
Based on the tree , we now construct a play that is consistent with the fixed ’s strategy, but is won be , leading to a contradiction. Together with the current position of the game, , we store some node of the tree, starting with . In order to construct the play, we need to provide choices of : she declares a letter based on the label of in the fixed tree , and a set of selectors based on a fixed uniform positional strategy of Pathfinder in , given by Lemma 7.1. The current node is then updated according to the direction declared by in .
For a pair denote by the number of switches from the side to the side on the back-marked history of . We keep a val-preserving invariant saying that for every pair with either
-
1.
and , or
-
2.
and .
Note that the val-preserving invariant is initially met, because . Note also that always because Proposition 8.1 implies that . The fact that the Pathfinder’s strategy is winning in can be used to deduce that the constructed play is won by in . This concludes the proof of Proposition 4.2, hence also of Theorem 1.1.
9 Definability of ranks in MSO
We now move to a study of definability of particular rank bounds in MSO.
With some analogy to the cardinality quantifier by Bárány et al. [3], one can propose a quantifier , expressing that there exists a well-founded set of rank at most such that holds. Note that this can be equivalently rewritten using the predicate , defined by . Below we show that the predicate , and consequently the respective quantifier, cannot be expressed in MSO except for the basic case of natural numbers (or ). Definability of this predicate in MSO boils down to checking if the following language is regular:
Clearly, holds for every well-founded tree, thus it remains to consider .
We first show that these languages are regular for .
Fact 9.1.
For every the language is regular.
Going beyond , the languages stop being regular. First, a simple pumping argument shows the following.
Lemma 9.2.
For all ordinals and all the language is not regular.
The argument from Lemma 9.2 applies in particular to all ordinals such that , whereas the ordinals are covered by Theorem 1.1. Thus we may conclude.
Corollary 9.3.
For an ordinal the language is regular if and only if .
The above Corollary does not exclude the possibility that a higher ordinal may be a supremum of a regular subset of .
Lemma 9.4.
For each pair of natural numbers , there exists a regular language such that .
Remark 9.5.
Clearly no countable formalism can define all the languages for . However, certain formalisms can go beyond MSO. For instance, the logic WMSO+U (for which the satisfiability problem is known to be decidable [4]) is capable of defining the language : a tree has rank at most if below every node labelled by there is a bound on the number of nodes labelled by that can appear on branches of passing through . This construction can be iterated to define the languages and possibly even beyond that.
10 Closure ordinals
In this section we show how a negative answer to Czarnecki’s question on closure ordinals can be derived from the present result. We use the standard syntax and semantics of modal -calculus [7], with its formulae constructed using the following grammar:
where is a letter from a fixed alphabet, is a variable from some fixed set of variables, and are the least and the greatest fixed-point operators, and and are the standard modalities (“exists a successor” and “for all successors”). For technical convenience we make an assumption that, in each point of our model, exactly one proposition (letter in ) is satisfied. For the sake of readability we often identify a formula with its semantics, that is, we read a closed formula as a subset of the domain, and a formula with free variables as a -ary function over its powerset.
Given an ordinal number , we use the standard notation for the -approximation of the fixed point in a given model (which amounts to the ’s iteration ). Given a model , we define the closure ordinal of in as the least ordinal such that . Then, the closure ordinal of (as considered in 1.2) is the supremum of these ordinals over all models (or if the supremum does not exist). We aim at providing an (alternative to Afshari, Barlucchi, and Leigh [1]) proof of the following Theorem.
Theorem 10.1.
Let be a -calculus formula in which the variable does not occur in scope of any fixed-point operator. Then, the closure ordinal of is either strictly smaller than , or at least and it can be effectively decided which of the cases holds.
Note that we allow arbitrary closed formulae of -calculus to be nested in ; however, we do not cover the whole -calculus, because of the restriction on occurrences of . This stays in line with the fragment considered by Afshari, Barlucchi, and Leigh [1] (as explained in Section 1), but we additionally provide a decision procedure that makes the dichotomy effective.
Towards a proof of Theorem 10.1, as a first step, we eliminate from all occurrences of that are not in scope of any modal operator; this can be done without changing the closure ordinal. Next, using standard techniques we obtain the following Lemma.
Lemma 10.2.
Let be a formula as in Theorem 10.1, with all occurrences of being in scope of a modal operator. The following three conditions are equivalent for every countable limit ordinal :
-
1.
the closure ordinal of is bounded by ;
-
2.
for every model that is a countable tree with its root in , this root belongs to ;
-
3.
for every model that is a countable tree with its root in , there exists a well-founded set containing the root, such that and .
A countable tree , occurring in Items 2 and 3 above, can be seen as a function from a prefix-closed subset to a finite alphabet . Now, recall a natural encoding of into . This encoding preserves the prefix order on and moreover preserves ranks of well-founded sets. Take the relation that contains if:
-
encodes a model ,
-
encodes a set ,
-
the root of belongs to ,
-
the root of belongs to , and .
The -calculus formulae and can be rewritten into MSO [7] and then modified to read the above encoding of in the binary tree, instead of itself. It follows that the relation is MSO-definable.
Observe that Item 3 of Lemma 10.2 can be rephrased by saying that the closure ordinal of is bounded by . Applying Theorem 1.1 to , and then Lemma 10.2, we have one of two possibilities: If the closure ordinal of is smaller than , then it is bounded by for some ; then also the closure ordinal of is bounded by . Otherwise, the closure ordinal of is , so it is not bounded by any countable limit ordinal; then also the closure ordinal of is not bounded by any countable limit ordinal, hence it is at least . This concludes the proof of Theorem 10.1.
Remark 10.3.
One may ask if Theorem 10.1 is merely a consequence or it is in some sense equivalent to our main result Theorem 1.1. To the best of our understanding, Theorem 10.1 does not transfer back to the general realm of MSO-definable relations, as in Theorem 1.1. One of the reasons is that the iterations of fixed points are required to proceed in a monotone fashion, driven by the internal formula ; while in full MSO one can express arbitrary correspondence between the parameters and a well-founded witness .
11 Conclusions
This work contributes to the study of expressive power of the MSO theory of binary tree. We investigate to what extent this theory can express properties of well-founded trees, and in particular distinguish between their ordinal ranks. We observe that the ability of explicit expression of properties of ranks is practically limited to statements of the form: all trees satisfying have , for a fixed (cf. Corollary 9.3 above). However the implicit expressive power of MSO logic goes much higher. In particular, our main result (Theorem 1.1) allows us to decide whether the property
is generally true (for all ), although the property itself is not expressible in MSO.
There is, however, a number of questions that remain to be answered. As ordinals smaller than can be effectively represented, we would like to have an effective procedure that, given a formula , computes the exact bound, that is, (a representation of) the least ordinal that can be substituted for in the construction above. Even more elementarily, given an MSO-definable set of well-founded trees, we would like to compute the supremum of ranks of trees in . These questions are subjects of ongoing research.
A more far-reaching direction is to relate the techniques of the present paper to the open problem of computing the Mostowski index, mentioned in Introduction. The parity condition itself imposes well-foundedness restriction on the occurrences of each odd label in the fragments of tree where this label is the highest. Colcombet and Löding [11] have approached the index problem (still unsolved) by reducing it to the boundedness problem for distance automata (see also Idir and Lehtinen [20] for a simplified version of this reduction). One may consider an alternative approach towards the index problem by studying the ordinal ranks which arise from the well-foundedness restriction of the parity condition.
References
- [1] Bahareh Afshari, Giacomo Barlucchi, and Graham E. Leigh. The limit of recursion in state-based systems. In FICS, 2024. URL: https://www.irif.fr/_media/users/saurin/fics2024/pre-proceedings/fics-2024-afshari-etal.pdf.
- [2] Bahareh Afshari and Graham E. Leigh. On closure ordinals for the modal mu-calculus. In CSL, volume 23 of LIPIcs, pages 30–44. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2013. doi:10.4230/LIPICS.CSL.2013.30.
- [3] Vince Bárány, Łukasz Kaiser, and Alexander M. Rabinovich. Expressing cardinality quantifiers in monadic second-order logic over trees. Fundam. Inform., 100(1-4):1–17, 2010. doi:10.3233/FI-2010-260.
- [4] Mikołaj Bojańczyk. Weak MSO+U with path quantifiers over infinite trees. In ICALP (2), volume 8573 of Lecture Notes in Computer Science, pages 38–49. Springer, 2014. doi:10.1007/978-3-662-43951-7_4.
- [5] Julian C. Bradfield. Simplifying the modal mu-calculus alternation hierarchy. In STACS, volume 1373 of Lecture Notes in Computer Science, pages 39–49. Springer, 1998. doi:10.1007/BFB0028547.
- [6] Julian C. Bradfield, Jacques Duparc, and Sandra Quickert. Transfinite extension of the mu-calculus. In CSL, volume 3634 of Lecture Notes in Computer Science, pages 384–396. Springer, 2005. doi:10.1007/11538363_27.
- [7] Julian C. Bradfield and Igor Walukiewicz. The mu-calculus and model checking. In Handbook of Model Checking, pages 871–919. Springer, 2018. doi:10.1007/978-3-319-10575-8_26.
- [8] Julius R. Büchi and Lawrence H. Landweber. Solving sequential conditions by finite-state strategies. Transactions of the American Mathematical Society, 138:295–311, 1969.
- [9] Lorenzo Clemente and Michał Skrzypczak. Deterministic and game separability for regular languages of infinite trees. In ICALP, volume 198 of LIPIcs, pages 126:1–126:15. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2021. doi:10.4230/LIPICS.ICALP.2021.126.
- [10] Thomas Colcombet, Denis Kuperberg, Christof Löding, and Michael Vanden Boom. Deciding the weak definability of Büchi definable tree languages. In CSL, volume 23 of LIPIcs, pages 215–230. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2013. doi:10.4230/LIPICS.CSL.2013.215.
- [11] Thomas Colcombet and Christof Löding. The non-deterministic Mostowski hierarchy and distance-parity automata. In ICALP (2), volume 5126 of Lecture Notes in Computer Science, pages 398–409. Springer, 2008. doi:10.1007/978-3-540-70583-3_33.
- [12] Marek Czarnecki. How fast can the fixpoints in modal mu-calculus be reached? In FICS, pages 35–39. Laboratoire d’Informatique Fondamentale de Marseille, 2010. URL: https://hal.archives-ouvertes.fr/hal-00512377/document#page=36.
- [13] Christian Delhommé. Automaticité des ordinaux et des graphes homogènes. Comptes Rendus de L’Académie des Sciences, Mathématiques, 339(1):5–10, 2004.
- [14] Javier Esparza, Stefan Kiefer, and Michael Luttenberger. Derivation tree analysis for accelerated fixed-point computation. Theor. Comput. Sci., 412(28):3226–3241, 2011. doi:10.1016/J.TCS.2011.03.020.
- [15] Alessandro Facchini, Filip Murlak, and Michał Skrzypczak. Rabin-Mostowski index problem: A step beyond deterministic automata. In LICS, pages 499–508. IEEE Computer Society, 2013. doi:10.1109/LICS.2013.56.
- [16] Nathanaël Fijalkow, Nathalie Bertrand, Patricia Bouyer-Decitre, Romain Brenguier, Arnaud Carayol, John Fearnley, Hugo Gimbert, Florian Horn, Rasmus Ibsen-Jensen, Nicolas Markey, Benjamin Monmege, Petr Novotný, Mickael Randour, Ocan Sankur, Sylvain Schmitz, Olivier Serre, and Mateusz Skomra. Games on graphs. CoRR, abs/2305.10546, 2023. doi:10.48550/arXiv.2305.10546.
- [17] Olivier Finkel and Stevo Todorcevic. Automatic ordinals. Int. J. Unconv. Comput., 9(1-2):61–70, 2013. URL: http://www.oldcitypublishing.com/journals/ijuc-home/ijuc-issue-contents/ijuc-volume-9-number-1-2-2013/ijuc-9-1-2-p-61-70/.
- [18] Gaëlle Fontaine. Continuous fragment of the mu-calculus. In CSL, volume 5213 of Lecture Notes in Computer Science, pages 139–153. Springer, 2008. doi:10.1007/978-3-540-87531-4_12.
- [19] Maria J. Gouveia and Luigi Santocanale. and the modal -calculus. Log. Methods Comput. Sci., 15(4), 2019. doi:10.23638/LMCS-15(4:1)2019.
- [20] Olivier Idir and Karoliina Lehtinen. Mostowski index via extended register games. CoRR, abs/2412.16793, 2024. doi:10.48550/arXiv.2412.16793.
- [21] Alexander Kechris. Classical descriptive set theory. Springer-Verlag, New York, 1995.
- [22] Damian Niwiński. On the cardinality of sets of infinite trees recognizable by finite automata. In MFCS, volume 520 of Lecture Notes in Computer Science, pages 367–376. Springer, 1991. doi:10.1007/3-540-54345-7_80.
- [23] Damian Niwiński and Igor Walukiewicz. A gap property of deterministic tree languages. Theor. Comput. Sci., 1(303):215–231, 2003. doi:10.1016/S0304-3975(02)00452-8.
- [24] C.-H. Luke Ong. On model-checking trees generated by higher-order recursion schemes. In LICS, pages 81–90. IEEE Computer Society, 2006. doi:10.1109/LICS.2006.38.
- [25] Michael O. Rabin. Decidability of second-order theories and automata on infinite trees. Transactions of the American Mathematical Society, 141:1–35, 1969.
- [26] Sylvain Schmitz. Algorithmic Complexity of Well-Quasi-Orders. (Complexité algorithmique des beaux pré-ordres). École normale supérieure Paris-Saclay, 2017. URL: https://tel.archives-ouvertes.fr/tel-01663266.
- [27] Michał Skrzypczak. Descriptive Set Theoretic Methods in Automata Theory – Decidability and Topological Complexity, volume 9802 of Lecture Notes in Computer Science. Springer, 2016. doi:10.1007/978-3-662-52947-8.
- [28] Michał Skrzypczak and Igor Walukiewicz. Deciding the topological complexity of Büchi languages. In ICALP, volume 55 of LIPIcs, pages 99:1–99:13. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2016. doi:10.4230/LIPICS.ICALP.2016.99.
- [29] Wolfgang Thomas. Languages, automata, and logic. In Handbook of Formal Languages (3), pages 389–455. Springer, 1997. doi:10.1007/978-3-642-59126-6_7.