Tree Algebras and Bisimulation-Invariant MSO on Finite Graphs

Colcombet, Thomas; Doumane, Amina; Kuperberg, Denis

doi:10.4230/LIPIcs.ICALP.2025.152

Tree Algebras and Bisimulation-Invariant MSO on Finite Graphs

Thomas Colcombet

CNRS, IRIF, Université Paris Cité, France Amina Doumane CNRS, LIP, ENS Lyon, France Denis Kuperberg

CNRS, LIP, ENS Lyon, France

Abstract

We establish that the bisimulation invariant fragment of MSO over finite transition systems is expressively equivalent over finite transition systems to modal $\mu$ -calculus, a question that had remained open for several decades.

The proof goes by translating the question to an algebraic framework, and showing that the languages of regular trees that are recognised by finitary tree algebras whose sorts zero and one are finite are the regular ones. This corresponds for trees to a weak form of the key translation of Wilke algebras to omega-semigroup over infinite words, and was also a missing piece in the algebraic theory of regular languages of infinite trees for twenty years.

Keywords and phrases:

MSO, mu-calculus, finite graphs, bisimulation, tree algebra

Category:

Track B: Automata, Logic, Semantics, and Theory of Programming

Funding:

Denis Kuperberg: ANR ReCiProg.

Copyright and License:

2012 ACM Subject Classification:

Theory of computation

\rightarrow

Finite Model Theory ; Theory of computation

\rightarrow

Modal and temporal logics ; Theory of computation

\rightarrow

Algebraic language theory ; Theory of computation

\rightarrow

Regular languages

Related Version:

Full Version: https://arxiv.org/abs/2407.12677

Acknowledgements:

We thank Achim Blumensath for helpful discussions, proofreading this work, and finding a major mistake in a previous version of this paper (now solved).

DOI:

10.4230/LIPIcs.ICALP.2025.152

Event:

52nd International Colloquium on Automata, Languages, and Programming (ICALP 2025)

Editors:

Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele Puppis

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

Low CO2 This is a low-CO₂ research paper. The authors did not use plane travel, neither for the development of this work nor its presentation.

1 Introduction

A well-known result of Janin and Walukiewicz [13, Thm 11] states:

Theorem 1.

For a property of transition systems, the following statements are equivalent:

$\blacksquare$

being MSO-definable and bisimulation-invariant, and
$\blacksquare$

being $\mu$ -calculus-definable¹¹1In this work, $\mu$ -calculus refers to the standard propositional modal $\mu$ -calculus, i.e. the temporal logic constructed from propositions, modalities $\square$ and $\diamond$ , boolean connectives, and least and greatest fixpoints..

In which a property of transition systems is bisimulation-invariant if for any two bisimilar transition systems, both satisfy the property, or none.

Keeping the above result in mind, it is enlightening to recall the following well-known finite model property: two $\mu$ -calculus sentences that can be separated by some transition system can always be separated by a finite one. In other words, the semantics of a $\mu$ -calculus sentence is entirely defined by what it expresses over finite transition systems. For this reason, the question of whether a version of Theorem 1 for finite transition systems would hold is extremely natural. It remained an important open question in the field for almost three decades. One contribution of this paper is to provide a positive answer to it:

Theorem 2.

For a property of finite transition systems, the following items are equivalent:

$\blacksquare$

being MSO-definable and bisimulation-invariant, and
$\blacksquare$

being $\mu$ -calculus-definable.

For both Theorem 1 and Theorem 2, the upward direction is the same. It follows from well-known facts: (1) $\mu$ -calculus sentences can be effectively translated into equivalent MSO-sentences, and (2) properties definable in $\mu$ -calculus are invariant under bisimulation.

The difficult direction is to show that bisimulation-invariant MSO-definable properties can be translated to equivalent $\mu$ -calculus sentences. The explanation of why the original approach of Janin and Walukiewicz cannot used in the finite case has been clearly phrased by Blumensath and Wolf [5]:

The above mentioned result by Janin and Walukiewicz on bisimulation-invariant monadic second-order logic has so far defied all attempts at a similar transfer to the realm of finite structures. The main reason is that the original proof is based on automata-theoretic techniques and an essential ingredient is a reduction to trees, via the unravelling operation. As this operation produces infinite trees, we cannot use it for formulae that are only bisimulation-invariant over finite transition systems.

Our proof of the downward implication of Theorem 2 uses a translation of the problem to an algebraic formalism. Though this approach is similar to other works in the field, it requires to develop several algebraic arguments that are not present in the literature.

Overview of the arguments and key contributions

We present here a high-level overview of the sequence of arguments used in our proof, putting an emphasis on lemmas that involve entirely new arguments. Due to lack of space, only subsets of these arguments will be covered in this extended abstract.

We roughly decompose the arguments in three main parts: from MSO to algebras, from algebras to yield-algebras, and from yield-algebras to $\mu$ -calculus.

PART 1: From MSO to algebras.

The first part of the proof concerns how to transform monadic second-order logic into an algebraic framework. Technically, it follows the algebraic/compositional approach, which is standard in the field, and does not require to develop new arguments.

1.

We abstract transition systems by the notion of systems. Systems are finite structures built by combining symbols from a ranked alphabet. Systems also have an initial vertex and variables $x_{1},\dots,x_{n}$ ( $n$ is the rank of the system). See Section 2.1. Finite transition systems can be seen as systems over a suitably chosen ranked alphabet. See Section 2.5.
2.

Systems are equipped with composition operations that allow to build complex systems by combining simpler ones together. In the spirit of category theory, all these operations are uniformly described by a single one, ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ . See Section 2.2.
3.

From these composition operations, one derives a notion of algebras, that can be used for recognising sets of systems. See Sections 3.1 and 3.2.
4.

At this point, the so-called “composition method” can be used for showing that MSO-definable languages of transition systems are recognisable by rankwise-finite algebras (i.e. an algebra in which all ranks are finite). See Section 3.3.
5.

We define morphisms between systems, that correspond to folding the source system to the target system (or unfolding when seen the other way round). Two systems are unfold-equivalent if they share a common unfolding. One can identify the usual notion of “infinite regular tree” as an unfold-equivalence-class of systems. See Section 2.4. We also show that unfold-equivalence is a congruence with respect to ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ (Lemma 7).

PART 2: From algebras to yield-algebras.

What we have achieved so far is that we have translated the logical hypothesis of Theorem 2, i.e. an MSO-definable bisimilar-invariant property, into a rankwise-finite algebra that recognises it. The next step toward constructing a $\mu$ -calculus formula is to add some form of non-determinism to systems. This is what we review now.

6.

We generalise the notion of systems to the richer notion of set-systems. The generalisation goes along two directions: (1) a form of non-determinism, and (2) some special vertices called “root vertices” are added (we will not discuss this point for the moment). See Section 2.1. Again, these set-systems are equipped with composition operations via a ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ operation. See Section 2.2.
7.

Using the notion of morphism, we can define what are the systems that a set-system can produce: its yields. Yields are, up to unfold-equivalence, systems that can be produced by “resolving the non-determinism”. Two set-systems are yield-equivalent if they share the same yields. Over systems, yield-equivalence coincides with unfold-equivalence. See Section 4.1.
8.

As we did for systems, we also have to study how yield-equivalence interacts with the operations of composition of set-systems, and as before, the conclusion is that yield-equivalence is a congruence with respect to ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ See Section 4.3.
9.

As a consequence, there is a notion of yield-algebra, which corresponds to set-systems modulo yield-equivalence. Such yield-algebras can be used for recognising sets of systems.
10.

A key contribution of this work is Lemma 20: A language of regular-trees which is recognisable by a rankwise-finite algebra is recognisable by a rankwise-finite yield-algebra. See Section 4.4.

PART 3: From yield-algebras to $\mu$ -calculus.

At this moment of the proof, the original MSO-definable bisimulation-invariant property has been transformed into a rankwise-finite yield-algebra. The final arguments involve a careful study of such algebraic objects that we overview in Section 5:

11.

We first identify inside yield-algebras some elements that we call deterministic. Deterministic elements form a sub-yield-algebra.
12.

We then introduce the key automaton property for a yield-algebra that recognises a language $L$ : it signifies informally that “all elements can be under-approximated by deterministic ones in a faithful way”.
13.

The reason of this definition is that if a language of regular-trees $L$ is recognised by a rankwise-finite yield-algebra that has the automaton property, there exists an automaton of infinite trees (in the classical sense) recognising $L$ . If the language is furthermore bisimilar-invariant, by adapting the techniques of [13], one can produce a $\mu$ -calculus formula along these lines.
14.

The last missing piece, and the second technical contribution of this work is Lemma 25: Syntactic rankwise-finite yield-algebras have the automaton property.

Related work

Bisimulation-invariant logics.

The seminal Hennessy-Milner theorem [12] gives a first characterisation of modal logics via bisimulation-equivalence. Van Benthem generalised it to First-Order logic (FO) in [20]. The result was then transferred to finite structures [18], and specialised to particular classes of finite structures [9, 11]. In [14], the equivalence between the bisimulation-invariant fragment of monadic path logic with CSL* is shown. An intermediate logic strictly between FO and MSO, obtained by extending FO with fixed-point operators, is considered in [16], where it is shown that the two-way bisimulation-invariant fragment of this logic is equivalent to two-way $\mu$ -calculus on finite structures. Concerning the logic MSO, as already explained, Janin and Walukiewicz prove the equivalence with $\mu$ -calculus on general structures [13]. The case of weak Monadic logic was shown to correspond to continuous $\mu$ -calculus in [8].

On finite structures, this equivalence was shown for structures of bounded Cantor-Bendixson rank in [5].

Composition method and tree algebra.

The composition method that we use here was pioneered in [10] for FO and used intensively for MSO in [19, 17]. There is a long line of research pursuing a well-behaved notion of algebra for regular languages of infinite words and trees [21, 15, 1, 6, 2, 3]. The theory has inherent difficulties. For instance [7] gives an example of rankwise-finite algebras that define non-regular languages of infinite trees. In [4], the specific notion of powerset is investigated, it shows that distributivity laws do not exist if there are non-linear identities in the theory (and this is relevant to the present work).

2 Set-systems and Systems

In this section, we introduce systems and set-systems. It may be meaningful to read this part first while concentrating on systems.

We define set-systems and systems in Section 2.1, how to compose them in Section 2.2, and the notion of [set]-context in Section 2.3. In Section 2.4 we introduce morphisms of set-systems, and the notions of folding, unfold-equivalence and regular-trees. We finally explain what are transition systems, how these can be encoded as systems, and how to phrase bisimulation in this framework (Section 2.5).

2.1 Set-Systems and Systems

A >ranked set $\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\Sigma}$ (or >alphabet) is a family of sets ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}_{n}$ indexed by natural numbers. We write $a\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ for $a\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}_{n}$ for some $n$ . For $a\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ , the >rank of ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ , noted $\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\mathrm{rk}}(a)$ , is the integer $n$ such that $a\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}_{n}$ . A >map of ranked sets $f$ from $\Sigma$ to $\Gamma$ is a map that preserves the ranks, i.e. a family of maps $f_{n}\colon\Sigma_{n}\to\Gamma_{n}$ for all $n\in\mathbb{N}$ . A ranked set ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ is >rankwise-finite if ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}_{n}$ is finite for all $n$ . Note that ranked sets equipped with their maps form a category, that we call the >category of ranked sets. We fix an infinite set $\mathrm{\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named% ]{pgfstrokecolor}{rgb}{.1,0,.75}Vars}}=\{x_{1},x_{2},\dots\}$ of >variables. Variables can be seen as having rank $0$ . For $n\in\mathbb{N}$ , we denote with $\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{% pgfstrokecolor}{rgb}{.1,0,.75}[}n{\color[rgb]{.1,0,.75}\definecolor[named]{% pgfstrokecolor}{rgb}{.1,0,.75}]}$ the set $\{1,\dots,n\}$ , and $\mathrm{{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}Vars}}[n]$ the variables $\{x_{1},\dots,x_{n}\}$ .

Figure 1: A

{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}

-set-system

S_{1}

and a

{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}

-system

S_{2}

, both over variables

\{x_{1},x_{2}\}

(i.e.

S_{1}

and

S_{2}

are of rank

2

), for

{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}

containing symbols

b_{1}

of rank

1

,

a_{2}

of rank

2

, and

c_{3}

of rank

3

. The topmost outgoing edge of a symbol has direction

1

, the next one

2

, and so on. The circle is placed before an initial vertex, the symbol is placed before a root vertex, and emphasizes the presence of multiple successors in the same direction (i.e. non-determinism). Implicitly, edges are directed from left to right unless explicitly using an arrow notation.

Let us define now the notions of set-systems and systems, illustrated on Figure 1.

Definition 3 (set-systems).

Given a ranked set ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ , a ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-system of >rank $n$ is a tuple $S=({\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{.1,0,.75}% \mathit{V}},\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor% }{rgb}{.1,0,.75}\mathit{Vi}},\immediate{\color[rgb]{.1,0,.75}\definecolor[% named]{pgfstrokecolor}{rgb}{.1,0,.75}\mathit{Vr}},{\color[rgb]{.1,0,.75}% \definecolor[named]{pgfstrokecolor}{rgb}{.1,0,.75}\mathit{label}},{\color[rgb]% {.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{.1,0,.75}\mathit{Edges}})$ , composed of

$\blacksquare$

a finite set of >vertices $\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\mathit{V}}$ , a set of >initial vertices ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{Vi}}\subseteq{\color[rgb]{0.07,0,0.525}\definecolor[named% ]{pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{V}}$ , a set of >root vertices ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{Vr}}\subseteq{\color[rgb]{0.07,0,0.525}\definecolor[named% ]{pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{V}}$ ,
$\blacksquare$

a >labelling function $\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\mathit{label}}:{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{V}}\to{\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}\Sigma}$ ; in practice, we simply write $S(s)$ for ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{label}}(s)$ ,
$\blacksquare$

an >edge relation $\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\mathit{Edges}}\subseteq{\color[rgb]{0.07,0,0.525}\definecolor[named]% {pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{V}}\times\mathbb{N}\times({\color[% rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}% \mathit{V}}\uplus\{x_{1},\dots,x_{n}\})$ consisting of >edges of the form $(v,d,f)$ with $d\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}[}{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}\mathrm{rk}}(S(v)){\color[rgb]{0.07,0,0.525}\definecolor[% named]{pgfstrokecolor}{rgb}{0.07,0,0.525}]}$ ; $d$ is called the >direction of the edge; if $f$ is a vertex, then $(v,d,f)$ is called a >transition edge; otherwise $f$ is a variable and $(v,d,f)$ is a >variable edge. We also denote ${\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{.1,0,.75}% \mathit{Edges}}(v)$ the set $\{(d,f)\mid(v,d,f)\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{Edges}}\}$ , and ${\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{.1,0,.75}% \mathit{Edges}}(v,d)=\{f\mid(v,d,f)\in{\color[rgb]{0.07,0,0.525}\definecolor[% named]{pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{Edges}}\}$ .

We may add subscripts to these elements to identify the set-system they belong to, e.g. $V_{S}$ for the set of vertices of $S$ . We will note $S\mathbin{\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[% named]{pgfstrokecolor}{rgb}{.1,0,.75}+}}S^{\prime}$ the disjoint union of $S$ and $S^{\prime}$ . A ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-system is >closed if it has rank $0$ . Two ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-systems $S,S^{\prime}$ are of the >same shape if they differ only on their labelling, i.e. ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{V}}_{S}={\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{V}}_{S^{\prime}}$ , ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{Vi}}_{S}={\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{Vi}}_{S^{\prime}}$ , ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{Vr}}_{S}={\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{Vr}}_{S^{\prime}}$ , and ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{Edges}}_{S}={\color[rgb]{0.07,0,0.525}\definecolor[named]% {pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{Edges}}_{S^{\prime}}$ .

Given a map of ranked sets $\eta$ from $\Sigma$ to $\Gamma$ , we denote $\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{% pgfstrokecolor}{rgb}{.1,0,.75}\overline{\eta}}$ the map from ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-systems to $\Gamma$ -set-systems that sends $S$ to ${\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{.1,0,.75}% \overline{\eta}}(S)=({\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{V}}_{S},{\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{Vi}}_{S},{\color% [rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}% \mathit{Vr}}_{S},\eta\circ{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{label}}_{S},{\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{% Edges}}_{S})$ .

We may drop the mention of the alphabet ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ , and simply talk about set-systems. The variables used in a [set-]system of rank $n$ are $x_{1},\dots,x_{n}$ . We sometimes use other variable names such as $x, y, z$ for convenience if there is no ambiguity about the meaning.

Definition 4 (systems).

A ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -system $T$ is a ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-system such that ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{Vi}}_{S}$ is a singleton, ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{Vr}}_{S}$ is empty, and for all vertices $v\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{V}}_{S}$ and all directions $d\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}[}{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}\mathrm{rk}}({\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{label}}(v)){\color[rgb]{0.07,0,0.525% }\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}]}$ , ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{Edges}}_{S}(v,d)$ is a singleton.

2.2 Composing systems and set-systems: the ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ operation

We describe in this section the operations used for composing set-systems (or systems), allowing to build complex ones out of simpler elements. We do it using the categorical approach of monads: the idea is to have a single complex operation of composition, called ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ , that can then be specialized into simpler ones.

The operation $\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{% pgfstrokecolor}{rgb}{.1,0,.75}\mathrm{flatten}}$ , takes a set-system of set-systems as input, and produces a set-system as output. We explain it here through an example: the following picture shows a set-system of set-systems and its >flattening. The subsystems are drawn inside the dashed boxes, and boxes are organised themselves as a set-system structure that we call the pattern²²2Note that the definition enforces that the rank of subsystems (i.e. the number of variables they use) should be consistent with the number of outgoing directions of the vertex in the pattern.. Direction names for edges going out of a node $v$ will always be implicit, the topmost one being $1$ , then $2$ , and so on, ending with ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{rk}}({\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathit{label}}(v))$ for the bottommost.

As is shown, the ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ operation glues the subsystems together, keeping their internal structures. If an edge of a subsystem ends in a variable $x_{d}$ , then it is connected instead to all the initial vertices of the subsystems reachable following direction $d$ in the pattern, as well as to the variables reachable following direction $d$ . After flattening, the initial vertices are the initial vertices of the subsystems that were themselves initial vertices in the pattern. Root vertices are treated differently: the root vertices from subsystems remain root vertices during the process, while initial vertices of subsystems that are root get promoted to roots.

Note that the ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ operation, when given a system of systems as input, produces a system. Hence, the same operation shall be used for composing systems or set-systems depending on the context. In a categorical description, ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ , together with the corresponding operation $\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{% pgfstrokecolor}{rgb}{.1,0,.75}\mathrm{unit}}$ (that turns a single symbol into a one-vertex system), equips the set-system definition (resp. system definition) with a monad structure.

2.3 Set-contexts, and contexts

A special form of composition, which is derived from ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ is the notion of set-context, which are set-systems with a hole that can be filled.

A ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-context with a $k$ -hole is a $({\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}\uplus\{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\square}_{k}\})$ -set-system, in which $\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\square}_{k}$ is a new symbol of rank $k$ , called the >hole symbol, and which labels exactly one vertex, called the >hole vertex. Set-contexts are denoted $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{k}]$ , $D[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{k}]$ , … A set-context is >closed if it is of rank $0$ . Given a ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-context $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{k}]$ and a ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-system $S$ of rank $k$ , $C[S]$ denotes the ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-system obtained by substituting $S$ for the hole. A >context is a set-context which is a system.

An example of a closed context $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{2}]$ , a system $S$ of rank $2$ , and the result $C[S]$ are pictured below. The hole vertex is depicted as an empty box.

Formally, this amounts to construct the (set-system)-set-system $S^{\prime}$ of the same shape as $C$ such that $S^{\prime}(h)=S$ for $h$ the hole vertex of $C$ , and $S^{\prime}(s)={\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}\mathrm{unit}}(C(s))$ for all other vertices $s\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathit{V}}_{C}$ . We then define $C[S]$ to be ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}(S^{\prime})$ .

Note in particular that this substitution follows the “rules of flattening”, which means that (1) if the hole vertex is initial in $S$ , then all initial vertices of $S$ remain initial in $C[S]$ , (2) all the root vertices of $S$ remain root vertices in $C[S]$ , and (3) if the hole vertex is root in $S$ , then all initial vertices of $S$ get to be promoted into root vertices in $C[S]$ .

Depending on the situations, we shall use ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ or [set-]contexts. In many situations, this is just a choice of presentation.

2.4 Morphisms, unfoldings, and regular-trees

So far, we have seen systems and set-systems as “rigid structures”. The notions of morphisms allows us to compare them in more subtle ways.

Set-systems come with a natural definition of morphisms: a >morphism of set-systems from $S$ to $S^{\prime}$ , set-systems of same rank, is a map from vertices of $S$ to vertices of $S^{\prime}$ that preserves initial vertices, root vertices and edges.

If we specialise morphisms to ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems we obtain the notions of unfolding, unfold-equivalence and regular-trees:

Definition 5 (unfolding and regular-trees).

If there is a morphism of set-systems from a ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -system $T$ to another one $T^{\prime}$ , then $T$ is called an >unfolding of $T^{\prime}$ , and $T^{\prime}$ a >folding of $T$ . Two ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems that have a common unfolding are called >unfold-equivalent³³3Showing that it is an equivalence relation requires a proof. Note that this would be equivalent to require to have a common folding, but this direction is of no use in the present work.. A ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -regular-tree is an unfold-equivalence class of ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems.

For instance, in the pictures below, the systems $S_{1}$ and $S_{2}$ have both $U$ as unfolding, and thus are unfold-equivalent. The system $F$ is a folding of both $S_{1}$ and $S_{2}$ :

$\blacktriangleright$ Remark 6.

Note that if one would “>fully unfold” a system (as is classically done), one would obtain an infinite tree which is “regular in the classical sense” (meaning that it has a finite number of subtrees), and that all regular trees (in the classical sense) can be obtained as the full unfolding of some system. It is also easy to check that two systems are unfold-equivalent if and only if they have isomorphic full unfoldings. Hence, the map which to a regular tree (in the classical sense) associates the set of systems that fully unfold to it is a bijection between regular trees (in the classical sense) and regular-trees as defined in this work.

At this point, one can prove that unfold-equivalence interacts nicely with ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ :

Lemma 7 (unfold-equivalence is a congruence).

Let $T,T^{\prime}$ be ( ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -system)-systems of the same shape such that $T(t)$ is unfold-equivalent to $T^{\prime}(t)$ for all vertices $t$ , then ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}(T)$ and ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}(T^{\prime})$ are unfold-equivalent.

Equivalently, for all ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -contexts $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{k}]$ and unfold-equivalent systems $S,S^{\prime}$ of rank $k$ , then $C[S]$ and $C[S^{\prime}]$ are unfold-equivalent.

From a categorical point of view, this means that systems modulo unfold-equivalence, i.e. regular-trees, equipped with the ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ and ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{unit}}$ operations form a monad.

2.5 Transition systems as systems

In this section, we recall what is the standard definition of a transition system, and we explain how these can be seen as systems as defined above. We also explain how bisimilarity can be phrased in this setting.

A >transition system $(V,i,T,\gamma)$ consists of a finite⁴⁴4In this work, as for systems, transition systems are finite. set of >vertices $V$ , an >initial vertex $i$ , a binary relation $T$ called the >transition relation, and a labelling function $\gamma:V\to 2^{\mathtt{P}}$ . We now explain how to encode transition systems as systems over a specific ranked alphabet ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathbf{Tr}}$ .

Let $\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{% pgfstrokecolor}{rgb}{.1,0,.75}\mathbf{Tr}}$ be the ranked alphabet which has an element $\nu_{n}$ of rank $n$ for all $\mathtt{P}$ -valuation $\nu\in 2^{\mathtt{P}}$ , and all $n\in\mathbb{N}$ . Given a closed ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathbf{Tr}}$ -system $S$ , call its >decoding the transition system $\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\mathrm{decode}}(S)$ that has the same set of vertices with same labelling, the same initial vertex, and $T(u,v)$ holds if $(u,d,v)$ is an edge for some $d$ . Conversely, $S$ is called an >encoding of ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{decode}}(S)$ . Note that the decoding is unique, while several non-isomorphic ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathbf{Tr}}$ -systems may encode the same transition system. Note also that all transition systems admit at least one encoding (we use here the assumption that transition systems are finite).

We define the >bisimilarity relation over closed ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathbf{Tr}}$ -systems as the least equivalence relation that contains unfold-equivalence and the relation “encoding the same transition system”.

The following lemma states why this is consistent with the standard terminology.

Lemma 8.

Two ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathbf{Tr}}$ -systems are bisimilar if and only if their decodings are “bisimilar in the standard sense”.

From now on, we shall only consider transition systems through this encoding as ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathbf{Tr}}$ -systems.

3 Algebras

We have seen in the previous section all the necessary material for introducing a natural notion of algebras and using them for recognising languages of systems.
In this section, we define these algebras (Section 3.1), and describe how they can be used to recognise languages of systems (Section 3.2). We then explain why MSO-definable languages of transition systems are recognised by rankwise-finite algebras (Section 3.3).

3.1 Algebras

The following definitions follow the standard approach via monads.

Definition 9 (algebras).

An >algebra $\mathcal{A}$ is a ranked set $A$ together with a map of ranked sets $\mathrm{\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{% rgb}{.1,0,.75}eval}}$ (called the >evaluation) from $A$ -systems to $A$ , such that

$\blacksquare$

$\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}({\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathrm{unit}}(a))=a$ , for all $a\in A$ ,
$\blacksquare$

$\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\overline{eva}l}}(S))=\mathrm{{\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}({% \color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525% }\mathrm{flatten}}(S))$ for all ( $A$ -system)-systems $S$ .

A >morphism from an algebra $\mathcal{A}=(A,\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}eval}})$ to an algebra $\mathcal{A}^{\prime}=(A^{\prime},\mathrm{{\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}^{\prime})$ is a map $\rho$ from $A$ to $A^{\prime}$ such that $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}^{\prime}({\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\overline{\rho}}(S))=\rho(\mathrm{{\color[% rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}% (S))$ for all $A$ -systems $S$ .

An algebra is >unfold-invariant if $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(S)=\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}(S^{\prime})$ for all unfold-equivalent $A$ -systems $S,S^{\prime}$ . Unfold-invariant algebras are called >regular-tree algebras. The ranked set of ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems equipped with ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ as evaluation is an algebra called the >free algebra generated by ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ , or the ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -free algebra.

Example 10 (regular-tree algebra).

Consider $\mathcal{A}=(A,\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}eval}})$ defined by $A_{n}=\mathcal{P}({\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor% }{rgb}{0.07,0,0.525}[}n{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}]})\uplus\{\bot\}$ , and $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}$ to be mapping an $A$ -system $T$ of rank $n$ to $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(T)=\bot$ if there is a “ $\bot$ ” element reachable from the initial vertex. Otherwise $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(T)$ is the set of indices $i\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}[}n{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}]}$ such that the variable $x_{i}$ is reachable in $T$ from the initial vertex. We leave to the reader to check that $\mathcal{A}$ satisfies the identities of algebras. This algebra is unfold-invariant, since the symbol $\bot$ (resp. the variable $x_{i}$ ) is reachable in $T$ from the initial vertex if and only if this is also the case in any unfolding of $T$ .

3.2 Languages and their recognition by algebras

Call >language of ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems a set of closed ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems. A >language of ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -regular-trees is a language of ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems $L$ which is invariant under unfold-equivalence, i.e. such that if $T,T^{\prime}$ are unfold-equivalent and $T\in L$ , then $T^{\prime}\in L$ , for all ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems $T,T^{\prime}$ .

Let $\rho$ be a morphism (called the >recognising morphism) from the ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -free algebra to an algebra $\mathcal{A}$ , and $P\subseteq A_{0}$ (called the >accepting set), the >language recognised by $(\mathcal{A},\rho,P)$ is the set of closed ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems defined as

\displaystyle{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\mathrm{Rec}}(\mathcal{A},\rho,P)

\displaystyle:=\{T\mid\text{$T$ {\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}closed} {\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}${\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}\Sigma}$-% system} such that\leavevmode\nobreak\ $\rho(T)\in P$}\}.

We say that $L$ is recognised by $\mathcal{A}$ if it is recognised by $(\mathcal{A},\rho,P)$ for some $\rho$ and $P$ .

This is the standard definition of recognition, as for word languages. The only subtlety is that we focus on closed systems, which is reflected in the fact that $P\subseteq A_{0}$ . Quite naturally, we shall be interested in languages that are recognisable by rankwise-finite algebras, i.e. algebras $\mathcal{A}=(A,\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}eval}})$ such that $A_{n}$ is finite for all $n\in\mathbb{N}$ .

Note, as expected, that the language of systems recognised by regular-tree algebras are languages of regular-trees.

Example 11 (reachable symbol).

Consider some ranked alphabet ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ , and a set of letters $R\subseteq{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ . Let also $\mathcal{A}$ be the algebra from Example 10, and define $\rho$ as the unique algebra morphism from the ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -free algebra to $\mathcal{A}$ such that for all symbols $a\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ of rank $k$ :

\displaystyle\rho(a(x_{1},\dots,x_{k}))=\begin{cases}\bot&\text{if}\leavevmode% \nobreak\ a\in R\\ \{1,\dots,k\}&\text{otherwise.}\end{cases}

This morphism sends all ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems $S$ of rank $n$ to $\bot$ if there is a symbol from $R$ reachable from the initial vertex, and otherwise the set of numbers of the variables reachable from the initial vertex otherwise. The language recognised by $(\mathcal{A},\rho,\{\bot\})$ is the set of closed ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems that contain a symbol of $R$ reachable from the initial vertex. Since this language is preserved under unfold-equivalence, it is a language of regular-trees. Similarly, the language recognised by $(\mathcal{A},\rho,\{\emptyset\})$ is the set of closed ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems that contain no symbol of $R$ reachable from the initial vertex.

3.3 Monadic second-order logic, and the composition method

We have seen in Section 2.5 how to see transition systems as ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathbf{Tr}}$ -systems. Let us explain now how logic interacts with this view. We assume the reader familiar with >monadic second-order logic (>MSO for short). The signature here is the one of transition systems (one binary relation, unary relations for each predicate, and an initial constant). We shall say that an MSO-sentence $\psi$ is >bisimulation-invariant if for all bisimilar transition systems $S,S^{\prime}$ , we have $S\models\psi$ if and only if $S^{\prime}\models\psi$ .

Using the “composition method” approach, we get the translation from logic to algebra:

Lemma 12.

Given an MSO-sentence $\varphi$ , the set $L:=\{S\text{ {\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb% }{0.07,0,0.525}${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}\mathbf{Tr}}$-system}}\mid{\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}\mathrm{decode}}(S)% \models\varphi\}$ is recognisable by a rankwise-finite algebra.

4 Yield Algebras

We aim to introduce a richer notion of algebras, called yield-algebras. For this, we need to first give some form of semantics to set-systems. This is done in Section 4.1 with the notion of yields and yield-equivalence. The notions of yield-algebras and recognition by them follows (Section 4.3). In Section 4.2, we explain a difficulty in working with this definition, and hint at a tool for circumventing it. The section culminates with Lemma 20 explaining why we can restrict our attention to rankwise-finite yield-algebras (Section 4.4).

4.1 Yields of set-systems and yield-equivalence

We shall now see how set-systems can be understood as “finite-non-deterministic machines that would non-deterministically produce systems”. This is the notion of yields, that come in two variants: init and root-yields.

We introduce the notion through an example. A set-system is pictured below, as well as one of its init-yields and one of its root-yields.

Informally, an >init-yield of a set-system $S$ is a system $T$ that unfolds to a system that can be obtained starting from some initial vertex of $S$ by resolving all the non-deterministic choices, potentially partially unravelling the set-system at the same time. A >root-yield is similar, starting from a root vertex instead of an initial vertex. Note that the definition is such that the set of init-yields (resp. root-yields) of a set-system are closed under unfold-equivalence. The real definition makes use of morphisms of set-systems. We set:

{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{.1,0,.75}% \mathrm{InitYields}}(S)=\{T\mid\text{$T$ {\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}init-yield} of% \leavevmode\nobreak\ $S$}\}\quad\text{and}\quad{\color[rgb]{.1,0,.75}% \definecolor[named]{pgfstrokecolor}{rgb}{.1,0,.75}\mathrm{RootYields}}(S)=\{T% \mid\text{$T$ {\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}root-yield} of }S\}.

Two set-systems $S,S^{\prime}$ of same rank are >yield-equivalent if ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{InitYields}}(S)={\color[rgb]{0.07,0,0.525}\definecolor[% named]{pgfstrokecolor}{rgb}{0.07,0,0.525}\mathrm{InitYields}}(S^{\prime})$ and ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{RootYields}}(S)={\color[rgb]{0.07,0,0.525}\definecolor[% named]{pgfstrokecolor}{rgb}{0.07,0,0.525}\mathrm{RootYields}}(S^{\prime})$ .

Note that if $S$ is a system, then an init-yield of $S$ is nothing but an unfold-equivalent system, and there is no root-yield. As a consequence yield-equivalence coincides with unfold-equivalence over systems.

From now, for simplification of the presentation, we shall do as if only init-yields existed.

4.2 Resolutions

This section illustrates a difficulty arising from working with yields, and briefly describes the tool of resolution that we use to circumvent it in the complete version of this work.

At a high level, we would like, given a set-context $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ , to write the yields of $C[S]$ for some set-system $S$ of rank $n$ as a “composition” of the yields of $S$ . The following example shows that it does not work.

Example 13 (yields are not compositional).

Let us work with a ranked alphabet that has symbols $b$ and $c$ of rank $0$ , and $a_{2}$ of rank $2$ . Consider the set-context $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{1}]={\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\square}_{1}(b\mathbin{{\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}+}}c)$ and the system $S=a_{2}(x_{1},x_{1})$ :

Then:

$\blacksquare$

The set-context $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{1}]$ has two init-yields: $C_{1}[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{1}]={\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\square}_{1}(b)$ and $C_{2}[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{1}]={\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\square}_{1}(c)$ .
$\blacksquare$

The system $S$ has exactly one init-yield: $S$ itself.
$\blacksquare$

The set-system $C[S]$ has four init-yields up to unfold-equivalence: $a_{2}(b,b)$ , $a_{2}(b,c)$ , $a_{2}(c,b)$ , and $a_{2}(c,c)$ .

If we combine the init-yields of $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{1}]$ and $S$ in all possible ways, we only obtain $C_{1}[S]=a_{2}(b,b)$ and $C_{2}[S]=a_{2}(c,c)$ . These are indeed init-yields of $C[S]$ , but we still miss two of them, namely $a_{2}(b,c)$ and $a_{2}(c,b)$ . The issue is that the double occurrence of $x_{1}$ in $a_{2}(x_{1},x_{1})$ forbids to use it in a combination of systems for producing, eg, $a_{2}(a,b)$ .

This situation is in fact a well-identified mathematical difficulty. For instance, this is what prevents the existence of a distributive law with the powerset monad over the monads of non-linear trees [4].

To circumvent this problem, we introduce the notion of >resolution, which combines the yield with some variable renaming.

Definition 14.

Given a ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-system $S$ of rank $n$ , a >resolution $(T,\sigma)$ of $S$ consists of:

$\blacksquare$

a map $\sigma$ from ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}[}m{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}]}$ to ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}[}n{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}]}$ for some $m$ , and
$\blacksquare$

a ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -system $T$ of rank $m$ , such that
$\blacksquare$

${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\widehat{\sigma}}(T)$ is an init-yield of $S$ , where $\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{% pgfstrokecolor}{rgb}{.1,0,.75}\widehat{\sigma}}(T)$ is $T$ with all variables $x_{i}$ renamed into $x_{\sigma(i)}$ .

In Example 13, we see that $(a_{2}(x_{1},x_{2}),\sigma)$ with the constant map $\sigma\colon{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}% {0.07,0,0.525}[}2{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}% {rgb}{0.07,0,0.525}]}\to{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}[}1{\color[rgb]{0.07,0,0.525}\definecolor[% named]{pgfstrokecolor}{rgb}{0.07,0,0.525}]}$ is a resolution of $S$ , and that it conveys the information that could not be caught by yields.

Now, it makes sense to understand a yield of $C[S]$ as a combination of resolutions of $C$ and $S$ . This notion of resolution is one of the arguments used for showing that yield-equivalence is a congruence:

Lemma 15 (yield-equivalence is a congruence).

Let $S,S^{\prime}$ be ( ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-system)-set-systems of the same shape such that $T(t)$ is yield-equivalent to $T^{\prime}(t)$ for all vertices $t$ , then ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}(S)$ and ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}(S^{\prime})$ are yield-equivalent.

4.3 Yield-algebras and recognisability

The definition of a yield-algebra corresponds to the notion of algebras naturally arising from set-systems, quotiented by yield-equivalence. Since set-systems are equipped with a form of non-determinism, yield-algebras are naturally endowed with an order $\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}$ that provides an inf-semi-lattice structure (note that the non-deterministic sum will be noted $\mathbin{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqcap}}$ , i.e. as an infimum rather than a supremum; this is a choice of presentation that corresponds to the fact that we see in this work this non-determinism as “adversarial”, i.e. controlled by an opponent).

Definition 16.

A >yield-algebra $\mathcal{\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{% rgb}{.1,0,.75}Y}}$ is a ranked set $Y$ together with a map of ranked sets $\mathrm{\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{% rgb}{.1,0,.75}eval}}$ from $Y$ -set-systems to $Y$ , such that

$\blacksquare$

$\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}({\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\mathrm{unit}}(a))=a$ , for all $a\in Y$ ,
$\blacksquare$

$\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\overline{eva}l}}(S))=\mathrm{{\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}({% \color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525% }\mathrm{flatten}}(S))$ for all ( $Y$ -set-system)-set-systems $S$ , and
$\blacksquare$

for all unfold-equivalent⁵⁵5This is a generalised version of unfold-equivalence for set-systems, which relies on the natural notion of locally surjective morphisms. $Y$ -set-systems $S,S^{\prime}$ of same rank $n$ , $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(S)=\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}(S^{\prime})$ .
$\blacksquare$

for all $Y$ -set-systems $S$ of rank $n$ and all $f\in Y_{n}$ ,

$f\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named% ]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}(T)$ for all yields $T$ of $S$ implies $f\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named% ]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}(S)$ .

where for $a,b\in Y_{n}$ , we note $a\mathbin{\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[% named]{pgfstrokecolor}{rgb}{.1,0,.75}\sqcap}}b:=\mathrm{{\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}(a% \mathbin{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}+}}b)$ , and $a\mathrel{{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\sqsubseteq}}b$ holds if $a\mathbin{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqcap}}b=a$ .

For $a\in Y_{n}$ , let $a\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{% pgfstrokecolor}{rgb}{.1,0,.75}\Uparrow}$ be $\{b\in Y_{n}\mid a\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\sqsubseteq}}b\}$ .
A >morphism from the yield-algebra $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}=(Y,\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}eval}})$ to the yield-algebra $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}^{\prime}=(Y^{\prime},\mathrm{{\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}^{\prime})$ is a map of ranked sets $\rho$ from $Y$ to $Y^{\prime}$ such that $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}^{\prime}({\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\overline{\rho}}(S))=\rho(\mathrm{{\color[% rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}% (S))$ for all $Y$ -set-systems $S$ .

Lemma 17.

The operation $\mathbin{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqcap}}$ is associative, commutative, and idempotent. The relation $\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}$ is an order, and $\mathbin{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqcap}}$ computes the infimum with respect to $\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}$ .

Note that since systems are particular cases of set-systems, yield-algebras are in particular algebras. Furthermore, since yield-equivalence coincides with unfold-equivalence over systems, yield-algebras seen as algebras are unfold-invariant, and hence regular-tree algebras. This downgrading of yield-algebras to regular-tree algebras is made implicitly in the rest of this work. This means that we can use yield-algebras for recognising languages of regular-trees:

Definition 18.

A language of regular-trees is >recognisable by a yield-algebra $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ if it is recognisable by $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ seen as an algebra using an accepting set of the form $f{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Uparrow}$ for some $f\in Y_{0}$ .

The following lemma shows the intention behind the definition.

Lemma 19.

Let $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ be a yield-algebra recognising a language $L$ of ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems, then, for all closed ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -set-systems $S$ , $f\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named% ]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}(S)$ if and only if ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{InitYields}}(S)\subseteq L$ .

4.4 rankwise-finite yield-algebras

One can now state the key lemma of this section (which involves new arguments compared to the literature).

Lemma 20.

If a language of regular-trees $L$ is recognised by a rankwise-finite algebra, then it is recognised by a rankwise-finite yield-algebra.

For other kind of algebras, such as monoids, deterministic automata over words or trees, forest algebras, …, a similar result is classically obtained by applying a form of powerset construction. Let us explain by an example why this standard approach fails here.

Example 21 (standard approach fails).

Given a rankwise-finite algebra $\mathcal{A}$ , the “standard approach” would be to add new elements to $\mathcal{A}$ in order to build a yield-algebra $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ for the same language. With this approach $\mathcal{A}$ is a sub-algebra of $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ ⁶⁶6For instance, if we apply the powerset operation to a monoid $\mathcal{M}$ , then by distributive law, we obtain a new monoid $\mathcal{P}(\mathcal{M})$ , and $\mathcal{M}$ is isomorphic to the submonoid of $\mathcal{P}(\mathcal{M})$ restricted to its singletons.. This is not possible here.

Indeed, consider the language of regular-trees $L$ over the alphabet ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ with $b,c\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}_{0}$ , $a_{1}\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}_{1}$ and $a_{2}\in{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}_{2}$ that contains the closed ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -systems that have all their leaves carrying the same letter (only $b$ ’s or only $c$ ’s). Let $\mathcal{A}$ be its syntactic-algebra⁷⁷7It always exists by generic algebraic arguments., and $P$ be the accepting set. For simplicity, we identify the letters with their image in the algebra and use $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}_{\mathcal{A}}$ as recognising morphism. We have $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}_{\mathcal{A}}(a_{1}(x_{1}))=\mathrm{{\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}_{% \mathcal{A}}(a_{2}(x_{1},x_{1}))$ since for all closed ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Sigma}$ -contexts $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{1}]$ , $C[a_{1}(x_{1})]\in L$ if and only if $C[a_{2}(x_{1},x_{1})]\in L$ . We also have $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}_{\mathcal{A}}(a_{1}(b))\in P$ , $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}_{\mathcal{A}}(a_{1}(c))\in P$ , but $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}_{\mathcal{A}}(a_{2}(b,c))\not\in P$ .

Assume now that we have built a yield-algebra $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ that has $\mathcal{A}$ as a sub-algebra, and $f{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Uparrow}$ is the accepting part. This implies that $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}_{\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}Y}}}(a_{1}(x_{1}))=\mathrm{{\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}_{% \mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}}(a_{2}(x_{1},x_{1}))$ . However, when composed with $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{1}]={\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\square}_{1}(b+c)$ (as in Example 13), we would have by Lemma 19 $f\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named% ]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}_{\mathcal{{\color[rgb]{0.07,0,0.525% }\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}Y}}}(C[a_{1}(x_{1})])$ since ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{InitYields}}(C[a_{1}(x_{1})])\subseteq L$ but $f\not\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}\sqsubseteq}}\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[% named]{pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}_{\mathcal{{\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}Y}}}(C[a_{2% }(x_{1},x_{1})])$ since ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{InitYields}}(C[a_{2}(x_{1},x_{1})])\ni a_{2}(b,c)\not\in L$ . A contradiction.

Our argument for avoiding the problem and proving Lemma 20 involves two ideas. Let $\mathcal{A}$ be a rankwise-finite regular-tree algebra recognising a language of regular-trees $L$ .

Idea 1.

Define the map $\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{% pgfstrokecolor}{rgb}{.1,0,.75}\mathrm{profile}}$ which maps each $A$ -set-systems $S$ to the set $\{(\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(T),\sigma)\mid\text{$(T,\sigma)$ is a {\color[rgb]{% 0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}resolution}% of\leavevmode\nobreak\ $S$}\}$ . The arguments of Section 4.2 can be used to show that >profile-equivalence (i.e. the equivalence relation over $A$ -set-systems “having the same ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{profile}}$ ”) is a congruence for ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ over $A$ -set-systems, i.e. for all $A$ -set-contexts $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ and $A$ -set-systems $S_{1},S_{2}$ of rank $n$ that are profile-equivalent, then $C[S_{1}]$ and $C[S_{2}]$ are profile-equivalent. It follows that $A$ -set-systems quotiented by profile-equivalence is a yield-algebra that recognises $L$ .

The problem is that it is not rankwise-finite: profile-equivalence is too fine.

Idea 2.

This is where the second argument is put in action: we consider small resolutions. A resolution $(T,\sigma)$ is >small if the map $\sigma$ satisfies the condition that for each $i$ in its codomain, $|\sigma^{-1}(i)|\leqslant|A_{1}|$ . We define $\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}\mathrm{small{-}profile}}(S)$ as ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{profile}}$ but considering only small resolutions. Two set-systems are >small–profile-equivalent if they have the same small profile. Note that over each rank, small-profile-equivalence is of finite index, is coarser than profile-equivalence, and over rank $0$ profile-equivalence and small-profile-equivalence coincide.

At this point, a key technical result is used: a “context smallification lemma”, that has as consequence that for closed $A$ -set-contexts $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ and $A$ -set-systems $S_{1},S_{2}$ of rank $n$ that are small-profile-equivalent, we obtain that $C[S_{1}]$ and $C[S_{2}]$ are profile-equivalent. Note the only difference in this statement compared to the analogue one for profile-equivalence, which is that here $C$ is assumed to be closed.

The consequence of this is that the >syntactic yield-algebra for $L$ (defined in a natural way) is a yield-algebra that recognises $L$ , and is coarser than small-profile-equivalence, and hence rankwise-finite. Lemma 20 has been established.

5 The Automaton Property

In this section, we identify a combinatorial property that allows us to translate languages recognisable by yield-algebras to automata or $\mu$ -calculus sentences.

Consider some rankwise-finite yield-algebra $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ , its >deterministic elements are the ones obtainable by composing only elements of the two lowest ranks, i.e. from $\immediate{\color[rgb]{.1,0,.75}\definecolor[named]{pgfstrokecolor}{rgb}{% .1,0,.75}Y_{\leqslant 1}}:=Y_{0}\uplus Y_{1}$ :

\mathrm{\immediate\immediate\immediate{\color[rgb]{.1,0,.75}\definecolor[named% ]{pgfstrokecolor}{rgb}{.1,0,.75}Det}}(\mathcal{{\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}Y}})_{k}:=\{\,\mathrm{{% \color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525% }eval}}(S)\mid S\text{ is a }\text{{\color[rgb]{0.07,0,0.525}\definecolor[% named]{pgfstrokecolor}{rgb}{0.07,0,0.525}${\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}Y_{\leqslant 1}}$-set-% system} of rank }k\,\}

Note that $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Det}}(\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}Y}})$ is in fact a sub-yield-algebra of $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ . The name of deterministic element comes from the following application (we assume knowledge of automata over infinite trees: our automata have finite states, use a parity condition with finitely many priorities, but there is no limit on the cardinality of the size of the input alphabet):

Lemma 22.

Let $L$ be a language of regular-trees recognised by $(\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}},\rho,f{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}\Uparrow})$ where $\rho$ maps symbols to deterministic elements. There exists a deterministic top-down parity automaton over infinite trees that accepts the full unfolding of a system $T$ if and only if $T\in L$ .

Consider now that $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ is used to recognise some language of regular-trees $L$ , using the accepting set $P=f{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\Uparrow}$ with $f\in Y_{0}$ . Let $\rho$ be the recognising morphism. The automaton property intuitively states that deterministic elements are sufficient for describing the behaviour of all elements when put in a closed context.

Definition 23.

$(\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}},P)$ have the >automaton property if for all closed $Y$ -set–contexts $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ and every $a\in Y_{n}$ with $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(C[a])\in P$ , there is a deterministic element $\delta\in\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}% {rgb}{0.07,0,0.525}Det}}(\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named% ]{pgfstrokecolor}{rgb}{0.07,0,0.525}Y}})_{n}$ such that $\delta\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}\sqsubseteq}}a$ and $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(C[\delta])\in P$ .

It is easy to verify that, if $(\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}},P)$ has the automaton property, setting $\Delta_{a}:=\{\delta\in\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}Det}}(\mathcal{{\color[rgb]{0.07,0,0.525}% \definecolor[named]{pgfstrokecolor}{rgb}{0.07,0,0.525}Y}})_{n}\mid\delta% \mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}a\}$ for all $a\in Y_{n}$ , we get that for all $Y$ -set-contexts $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ :

\displaystyle\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}(C[a])\in P\quad\text{if and only if}% \quad\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb% }{0.07,0,0.525}eval}}(C[\delta])\in P\text{ for some }\delta\in\Delta_{a}\ .\immediate\immediate

(1)

Let us very informally attempt to explain how this property allows us to build a finite state automaton over infinite trees that coincides with $L$ over regular trees. This automaton is defined as follows: it guesses a labelling of the tree by deterministic elements such that for all nodes $v$ labelled by some $a$ it associates some $\delta\in\Delta_{a}$ , and then checks that the resulting infinite tree is accepted by the automaton from Lemma 22.

Let us explain now why this automaton accepts the full unfolding of a system $T$ if and only if $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(T)\in P$ . For this, we first establish, using inductively Equation 1, that:

$\blacksquare$

$\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(T)\in P$ , if and only if
$\blacksquare$

there exists a $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Det}}(\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}Y}})$ -system $R$ of the same shape as $T$ such that $R(v)\in\Delta_{T(v)}$ for all vertices $v$ , and moreover $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(R)\in P$ . Let us call such $R$ a >direct run over $T$ .

The downward implication indicates that if $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(T)\in P$ , then the automaton accepts the full unfolding of $T$ using the full unfolding of the direct run $R$ as a witness of acceptance. Conversely, if the automaton accepts the full unfolding of $T$ , this means, by Rabin’s lemma⁸⁸8Rabin’s lemma states that a non-empty regular language of infinite trees contains a regular tree., that there is a regular witness of acceptance over the full unfolding of $T$ . This can be rephrased as the existence of a $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Det}}(\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}Y}})$ -system $R$ such that $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(R)\in P$ and $R$ is a direct run over some unfolding $T^{\prime}$ of $T$ . Hence $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(T)=\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}eval}}(T^{\prime})\in P$ .

If we combine these arguments with the ones of Janin and Walukiewicz, we get:

Lemma 24.

If $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ is a rankwise-finite yield-algebra such that $(\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}},f{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor% }{rgb}{0.07,0,0.525}\Uparrow})$ has the automaton property and recognises a bisimulation-invariant language of ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathbf{Tr}}$ -regular trees $L$ , then $L$ is definable by a $\mu$ -calculus sentence.

We obtain this by showing that if we assume that the original language $L$ is bisimulation-invariant, then the automaton built via our procedure going through deterministic elements can be turned into an automaton of the shape requested by [13, Theorem 7]. Thus, we obtain a $\mu$ -calculus sentence that defines $L$ , using directly [13].

The last missing argument, which is also a key contribution of this work is to show:

Lemma 25.

Let $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ be the syntactic yield-algebra for a language of regular-trees $L$ , and $P\subseteq\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{% pgfstrokecolor}{rgb}{0.07,0,0.525}Y}}$ be the accepting set, if $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ is rankwise-finite, then $(\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}},P)$ has the automaton property.

We recall the general intuition behind syntactic algebra. In general, the syntactic algebra for a language is the minimal one that recognises it, which is reflected by the fact that any two distinct elements in the algebra can be distinguished by the language in some context.

The proof of Lemma 25 is the second place where new arguments are used. Let $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ be a $Y$ -set-context, and $a\in Y_{n}$ , such that $C[a]\in P$ . Our goal is to find a deterministic element $\delta$ such that $\delta\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}\sqsubseteq}}a$ and $C[\delta]\in P$ . Notice that here we will informally write $C[\delta]\in P$ instead of $\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}eval}}(C[\delta])\in P$ to lighten notations.

We proceed in two major steps. First, we transform $C[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ into a “maximally difficult context” $M[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ such that $M[a]\in P$ , and $M[b]\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb% }{0.07,0,0.525}\sqsubseteq}}C[b]$ for all $b\in Y_{n}$ . This is possible since $\mathcal{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y}}$ is rankwise-finite and ordered by $\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}$ . This also requires decomposition arguments for set-contexts. In particular we show that they can always be refactored into a normal form that is easy to manipulate, as a context of rank $n$ in normal form is completely given by a tuple of $n+1$ elements of rank $1$ .

Second, we combine the element $a$ with the structure of the context $M[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ in normal form, and crucially make use of roots, to define a ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}Y_{\leqslant 1}}$ -set-system $\Delta$ of rank $n$ , i.e. whose evaluation $\delta:=\mathrm{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}eval}}(\Delta)$ is deterministic, and that satisfies $M[\delta]\in P$ . The maximality assumption in the construction of $M[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ then results in that for all $Y$ -set-context $D[{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\square}_{n}]$ , $D[\delta]\in P$ implies that $D$ is “less or as difficult” as $M$ , which in turns implies $D[a]\in P$ . By definition of the order $\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\sqsubseteq}}$ , this shows that $\delta\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{% rgb}{0.07,0,0.525}\sqsubseteq}}a$ as desired. Since we also have $M[\delta]\in P$ by construction, and $M[\delta]\mathrel{{\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor% }{rgb}{0.07,0,0.525}\sqsubseteq}}C[\delta]$ , we obtain $C[\delta]\in P$ since $P$ is upward-closed. The automaton property is proved.

6 Conclusion

Our notion of systems is equivalent to notions of tree algebras developed elsewhere in the literature. The notion of set-systems is a kind of extended powerset construction on it, expanded with root vertices, something that is new to the best of our knowledge, though close to more classical powerset constructions. One originality here is mainly that all operations are performed while keeping the structures folded, and only finitely unfolding it when necessary. More crucial is the argument that transforms algebras to yield-algebras, which circumvents the impossibility to have the powerset distribute over the monad of an algebraic theory that has non-linear identities. We are studying further this categorical aspect. Another interesting direction is to see to which extent an algebraic approach can be used to bypass automata for deciding the MSO-theory of infinite trees, i.e. a purely algebraic proof or Rabin’s theorem.

References

[1] Peter Aczel, Jiří Adámek, and Jiří Velebil. A coalgebraic view of infinite trees and iteration. Electronic Notes in Theoretical Computer Science, 44(1):1–26, 2001. CMCS 2001, Coalgebraic Methods in Computer Science (a Satellite Event of ETAPS 2001). doi:10.1016/S1571-0661(04)80900-9.
[2] Achim Blumensath. Recognisability for algebras of infinite trees. Theor. Comput. Sci., 412(29):3463–3486, 2011. doi:10.1016/J.TCS.2011.02.037.
[3] Achim Blumensath. An algebraic proof of rabin’s tree theorem. Theor. Comput. Sci., 478:1–21, 2013. doi:10.1016/J.TCS.2013.01.026.
[4] Achim Blumensath. The Power-Set Construction for Tree Algebras. Logical Methods in Computer Science, Volume 19, Issue 4, November 2023. doi:10.46298/lmcs-19(4:9)2023.
[5] Achim Blumensath and Felix Wolf. Bisimulation invariant monadic-second order logic in the finite. Theoretical Computer Science, 823:26–43, 2020. doi:10.1016/j.tcs.2020.03.001.
[6] Mikołaj Bojańczyk and Tomasz Idziaszek. Algebra for infinite forests with an application to the temporal logic EF. In Mario Bravetti and Gianluigi Zavattaro, editors, CONCUR 2009 - Concurrency Theory, 20th International Conference, CONCUR 2009, Bologna, Italy, September 1-4, 2009. Proceedings, volume 5710 of Lecture Notes in Computer Science, pages 131–145. Springer, 2009. doi:10.1007/978-3-642-04081-8_10.
[7] Mikołaj Bojańczyk and Bartek Klin. A non-regular language of infinite trees that is recognizable by a sort-wise finite algebra. Logical Methods in Computer Science, Volume 15, Issue 4, November 2019. doi:10.23638/LMCS-15(4:11)2019.
[8] Facundo Carreiro, Alessandro Facchini, Yde Venema, and Fabio Zanasi. Weak MSO: automata and expressiveness modulo bisimilarity. In Thomas A. Henzinger and Dale Miller, editors, Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), CSL-LICS ’14, Vienna, Austria, July 14 - 18, 2014, pages 27:1–27:27. ACM, 2014. doi:10.1145/2603088.2603101.
[9] Anuj Dawar and Martin Otto. Modal characterisation theorems over special classes of frames. Annals of Pure and Applied Logic, 161(1):1–42, 2009. doi:10.1016/J.APAL.2009.04.002.
[10] S. Feferman and R. Vaught. The first order properties of products of algebraic systems. Fundamenta Mathematicae, 47(1):57–103, 1959. URL: http://eudml.org/doc/213526.
[11] Helle Hvid Hansen, Clemens Kupke, and Eric Pacuit. Neighbourhood structures: Bisimilarity and basic model theory. Logical Methods in Computer Science, Volume 5, Issue 2, April 2009. doi:10.2168/LMCS-5(2:2)2009.
[12] Matthew Hennessy and Robin Milner. On observing nondeterminism and concurrency. In J. W. de Bakker and Jan van Leeuwen, editors, Automata, Languages and Programming, 7th Colloquium, Noordweijkerhout, The Netherlands, July 14-18, 1980, Proceedings, volume 85 of Lecture Notes in Computer Science, pages 299–309. Springer, 1980. doi:10.1007/3-540-10003-2_79.
[13] David Janin and Igor Walukiewicz. On the expressive completeness of the propositional mu-calculus with respect to monadic second order logic. In Ugo Montanari and Vladimiro Sassone, editors, CONCUR ’96, Concurrency Theory, 7th International Conference, Pisa, Italy, August 26-29, 1996, Proceedings, volume 1119 of Lecture Notes in Computer Science, pages 263–277. Springer, 1996. doi:10.1007/3-540-61604-7_60.
[14] Faron Moller and Alexander Moshe Rabinovich. On the expressive power of CTL*. In 14th Annual IEEE Symposium on Logic in Computer Science, Trento, Italy, July 2-5, 1999, pages 360–368. IEEE Computer Society, 1999. doi:10.1109/LICS.1999.782631.
[15] Damian Niwiński. Fixed point characterization of infinite behavior of finite-state systems. Theoretical Computer Science, 189(1):1–69, 1997. doi:10.1016/S0304-3975(97)00039-X.
[16] Maximilian Pflueger, Johannes Marti, and Egor V. Kostylev. A characterisation theorem for two-way bisimulation-invariant monadic least fixpoint logic over finite structures. In Proceedings of the 39th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2024, Tallinn, Estonia, July 8-11, 2024, pages 63:1–63:14. ACM, 2024. doi:10.1145/3661814.3662107.
[17] Alexander Rabinovich. On compositionality and its limitations. ACM Trans. Comput. Log., 8(1):4, 2007. doi:10.1145/1182613.1182617.
[18] Eric Rosen. Modal logic over finite structures. J. Log. Lang. Inf., 6(4):427–439, 1997. doi:10.1023/A:1008275906015.
[19] Saharon Shelah. The monadic theory of order. Annals of Mathematics, 102(3):379–419, 1975.
[20] Johan Van Benthem. Correspondence theory. In D. Gabbay and F. Guenthner, editors, Handbook of Philosophical Logic: Volume II: Extensions of Classical Logic, pages 167–247. Springer Netherlands, Dordrecht, 1984. doi:10.1007/978-94-009-6259-0_4.
[21] Thomas Wilke. Algebras for classifying regular tree languages and an application to frontier testability. In Andrzej Lingas, Rolf Karlsson, and Svante Carlsson, editors, Automata, Languages and Programming, pages 347–358, Berlin, Heidelberg, 1993. Springer Berlin Heidelberg. doi:10.1007/3-540-56939-1_85.

[bib.bib1] [1] Peter Aczel, Jiří Adámek, and Jiří Velebil. A coalgebraic view of infinite trees and iteration. Electronic Notes in Theoretical Computer Science, 44(1):1–26, 2001. CMCS 2001, Coalgebraic Methods in Computer Science (a Satellite Event of ETAPS 2001). doi:10.1016/S1571-0661(04)80900-9.

[bib.bib2] [2] Achim Blumensath. Recognisability for algebras of infinite trees. Theor. Comput. Sci., 412(29):3463–3486, 2011. doi:10.1016/J.TCS.2011.02.037.

[bib.bib3] [3] Achim Blumensath. An algebraic proof of rabin’s tree theorem. Theor. Comput. Sci., 478:1–21, 2013. doi:10.1016/J.TCS.2013.01.026.

[bib.bib4] [4] Achim Blumensath. The Power-Set Construction for Tree Algebras. Logical Methods in Computer Science, Volume 19, Issue 4, November 2023. doi:10.46298/lmcs-19(4:9)2023.

[bib.bib5] [5] Achim Blumensath and Felix Wolf. Bisimulation invariant monadic-second order logic in the finite. Theoretical Computer Science, 823:26–43, 2020. doi:10.1016/j.tcs.2020.03.001.

[bib.bib6] [6] Mikołaj Bojańczyk and Tomasz Idziaszek. Algebra for infinite forests with an application to the temporal logic EF. In Mario Bravetti and Gianluigi Zavattaro, editors, CONCUR 2009 - Concurrency Theory, 20th International Conference, CONCUR 2009, Bologna, Italy, September 1-4, 2009. Proceedings, volume 5710 of Lecture Notes in Computer Science, pages 131–145. Springer, 2009. doi:10.1007/978-3-642-04081-8_10.

[bib.bib7] [7] Mikołaj Bojańczyk and Bartek Klin. A non-regular language of infinite trees that is recognizable by a sort-wise finite algebra. Logical Methods in Computer Science, Volume 15, Issue 4, November 2019. doi:10.23638/LMCS-15(4:11)2019.

[bib.bib8] [8] Facundo Carreiro, Alessandro Facchini, Yde Venema, and Fabio Zanasi. Weak MSO: automata and expressiveness modulo bisimilarity. In Thomas A. Henzinger and Dale Miller, editors, Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), CSL-LICS ’14, Vienna, Austria, July 14 - 18, 2014, pages 27:1–27:27. ACM, 2014. doi:10.1145/2603088.2603101.

[bib.bib9] [9] Anuj Dawar and Martin Otto. Modal characterisation theorems over special classes of frames. Annals of Pure and Applied Logic, 161(1):1–42, 2009. doi:10.1016/J.APAL.2009.04.002.

[bib.bib10] [10] S. Feferman and R. Vaught. The first order properties of products of algebraic systems. Fundamenta Mathematicae, 47(1):57–103, 1959. URL: http://eudml.org/doc/213526.

[bib.bib11] [11] Helle Hvid Hansen, Clemens Kupke, and Eric Pacuit. Neighbourhood structures: Bisimilarity and basic model theory. Logical Methods in Computer Science, Volume 5, Issue 2, April 2009. doi:10.2168/LMCS-5(2:2)2009.

[bib.bib12] [12] Matthew Hennessy and Robin Milner. On observing nondeterminism and concurrency. In J. W. de Bakker and Jan van Leeuwen, editors, Automata, Languages and Programming, 7th Colloquium, Noordweijkerhout, The Netherlands, July 14-18, 1980, Proceedings, volume 85 of Lecture Notes in Computer Science, pages 299–309. Springer, 1980. doi:10.1007/3-540-10003-2_79.

[bib.bib13] [13] David Janin and Igor Walukiewicz. On the expressive completeness of the propositional mu-calculus with respect to monadic second order logic. In Ugo Montanari and Vladimiro Sassone, editors, CONCUR ’96, Concurrency Theory, 7th International Conference, Pisa, Italy, August 26-29, 1996, Proceedings, volume 1119 of Lecture Notes in Computer Science, pages 263–277. Springer, 1996. doi:10.1007/3-540-61604-7_60.

[bib.bib14] [14] Faron Moller and Alexander Moshe Rabinovich. On the expressive power of CTL*. In 14th Annual IEEE Symposium on Logic in Computer Science, Trento, Italy, July 2-5, 1999, pages 360–368. IEEE Computer Society, 1999. doi:10.1109/LICS.1999.782631.

[bib.bib15] [15] Damian Niwiński. Fixed point characterization of infinite behavior of finite-state systems. Theoretical Computer Science, 189(1):1–69, 1997. doi:10.1016/S0304-3975(97)00039-X.

[bib.bib16] [16] Maximilian Pflueger, Johannes Marti, and Egor V. Kostylev. A characterisation theorem for two-way bisimulation-invariant monadic least fixpoint logic over finite structures. In Proceedings of the 39th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2024, Tallinn, Estonia, July 8-11, 2024, pages 63:1–63:14. ACM, 2024. doi:10.1145/3661814.3662107.

[bib.bib17] [17] Alexander Rabinovich. On compositionality and its limitations. ACM Trans. Comput. Log., 8(1):4, 2007. doi:10.1145/1182613.1182617.

[bib.bib18] [18] Eric Rosen. Modal logic over finite structures. J. Log. Lang. Inf., 6(4):427–439, 1997. doi:10.1023/A:1008275906015.

[bib.bib19] [19] Saharon Shelah. The monadic theory of order. Annals of Mathematics, 102(3):379–419, 1975.

[bib.bib20] [20] Johan Van Benthem. Correspondence theory. In D. Gabbay and F. Guenthner, editors, Handbook of Philosophical Logic: Volume II: Extensions of Classical Logic, pages 167–247. Springer Netherlands, Dordrecht, 1984. doi:10.1007/978-94-009-6259-0_4.

[bib.bib21] [21] Thomas Wilke. Algebras for classifying regular tree languages and an application to frontier testability. In Andrzej Lingas, Rolf Karlsson, and Svante Carlsson, editors, Automata, Languages and Programming, pages 347–358, Berlin, Heidelberg, 1993. Springer Berlin Heidelberg. doi:10.1007/3-540-56939-1_85.

Tree Algebras and Bisimulation-Invariant MSO on Finite Graphs

Abstract

Keywords and phrases:

Category:

Funding:

Copyright and License:

2012 ACM Subject Classification:

Related Version:

Acknowledgements:

DOI:

Event:

Editors:

Series and Publisher:

1 Introduction

Theorem 1.

Theorem 2.

Overview of the arguments and key contributions

PART 1: From MSO to algebras.

PART 2: From algebras to yield-algebras.

PART 3: From yield-algebras to 𝝁-calculus.

Related work

Bisimulation-invariant logics.

Composition method and tree algebra.

2 Set-systems and Systems

2.1 Set-Systems and Systems

Definition 3 (set-systems).

Definition 4 (systems).

2.2 Composing systems and set-systems: the flatten operation

2.3 Set-contexts, and contexts

2.4 Morphisms, unfoldings, and regular-trees

Definition 5 (unfolding and regular-trees).

▶ Remark 6.

Lemma 7 (unfold-equivalence is a congruence).

2.5 Transition systems as systems

Lemma 8.

3 Algebras

3.1 Algebras

Definition 9 (algebras).

Example 10 (regular-tree algebra).

3.2 Languages and their recognition by algebras

Example 11 (reachable symbol).

3.3 Monadic second-order logic, and the composition method

Lemma 12.

4 Yield Algebras

4.1 Yields of set-systems and yield-equivalence

4.2 Resolutions

Example 13 (yields are not compositional).

Definition 14.

Lemma 15 (yield-equivalence is a congruence).

4.3 Yield-algebras and recognisability

Definition 16.

Lemma 17.

Definition 18.

Lemma 19.

4.4 rankwise-finite yield-algebras

Lemma 20.

Example 21 (standard approach fails).

Idea 1.

Idea 2.

5 The Automaton Property

Lemma 22.

Definition 23.

Lemma 24.

Lemma 25.

6 Conclusion

References

PART 3: From yield-algebras to $\mu$ -calculus.

2.2 Composing systems and set-systems: the ${\color[rgb]{0.07,0,0.525}\definecolor[named]{pgfstrokecolor}{rgb}{% 0.07,0,0.525}\mathrm{flatten}}$ operation

$\blacktriangleright$ Remark 6.