Labelled Well Quasi Ordered Classes of Bounded Linear Clique-Width
Abstract
We construct an algorithm that inputs an -interpretation from finite words to graphs, and decides if there exists a such that the class of graphs induced by the interpretation is not well-quasi-ordered by the induced subgraph relation when vertices are freely labelled using . In case no such exists, we also prove that the class of graphs is not well-quasi-ordered by the induced subgraph relation when vertices are freely labelled using any well-quasi-ordered set of labels. As a byproduct of our analysis, we prove that for classes of bounded linear clique-width, a weak version of a conjecture by Pouzet holds.
Keywords and phrases:
well-quasi-ordering, linear clique-width, MSO transduction, automata theoryFunding:
Aliaume Lopez: Aliaume Lopez was supported by the Polish National Science Centre (NCN) grant “Polynomial finite state computation” (2022/46/A/ST6/00072).2012 ACM Subject Classification:
Theory of computation Models of computation ; Theory of computation Automata extensionsEditors:
Paweł Gawrychowski, Filip Mazowiecki, and Michał SkrzypczakSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
A well-quasi-ordering (wqo) is a quasi-order that contains no infinite antichains (i.e., sets of pairwise incomparable elements) nor infinite decreasing sequences. A cornerstone result of structural graph theory is the Graph Minor Theorem of Robertson and Seymour [25], which states that the class of all graphs is well-quasi-ordered by the graph minor relation. This result has profound implications in graph theory, and algorithmic consequences such as the polynomial-time solvability of whether a graph can be embedded into a given surface.
The class of all (finite, undirected) graphs is not well-quasi-ordered by the induced subgraph relation, as witnessed by the antichain composed of finite (chordless) cycles, and there has been considerable interest in characterizing classes of finite graphs that are well-quasi-ordered by the induced subgraph relation. It was proven by Ding that all classes of bounded tree-depth are well-quasi-ordered for the induced subgraph relation [12, Theorem 2.2], and it is also the case for letter graphs [23]. In general, these theorems can be extended to all classes admitting “well-behaved” tree decompositions, such as classes of bounded shrub-depth [14, Corollary 3.9], or -partite cographs [15, Theorem 5.5].
All the theorems mentioned above actually prove a stronger statement: the classes at hand are well-quasi-ordered in the presence of any well-quasi-ordered labelling of the nodes. Given a quasi-ordered set and a class of graphs, we call the class of graphs obtained by freely labelling vertices using elements of , that is, graphs equipped with a labelling . The notion of induced subgraph is naturally extended to the one of induced labelled subgraph, and we defer its formal definition to Section 2. For instance, it is shown in [12, Theorem 2.1] that a class of graphs of bounded tree-depth is well-quasi-ordered by the induced labelled subgraph relation, assuming that the labels are themselves well-quasi-ordered. We say that a class of graphs is -well-quasi-ordered (-wqo) if is well-quasi-ordered for all finite sets of size at most , and that it is labelled-well-quasi-ordered (labelled-wqo) if it is well-quasi-ordered for all finite sets of labels. Finally, let us say that a class is wqo-well-quasi-ordered (wqo-wqo) if it is well-quasi-ordered for all well-quasi-ordered sets of labels. Note that wqo-wqo implies labelled-wqo, which implies -wqo for all , and that -wqo implies -wqo for all .
For hereditary classes of graphs, the notion of -well-quasi-ordered is of particular interest, as it implies the existence111The existence is non-constructive, and building a constructive algorithm requires non-trivial work, see for instance [1] in the case of letter graphs. of a fixed-parameter tractable algorithm to decide the membership of a graph in the class , such as described in [15, Corollary 5.8], and more generally that the class of graphs is described by finitely many forbidden induced subgraphs as shown by [9, Proposition 3] and [24]. In 2010, Daligault, Rao and Thomassé studied classes of graphs described by relabel functions, a generalization of -partite cographs, and characterized those that are -well-quasi-ordered [9, Theorem 3]. In this paper, they also provided a positive answer to the Pouzet conjecture [24], which states that -wqo, labelled-wqo and wqo-wqo are equivalent for classes described by relabel functions [9, p. 4].222The Pouzet conjecture states the same collapse for all classes of graphs. The classes of graphs studied in [9] are all of bounded clique-width, and it was conjectured that all -well-quasi-ordered classes of graphs are of bounded clique-width [9, Conjecture 5]. This started a series of works on the relationship between clique-width and well-quasi-orderings: it was shown that -wqo does not imply bounded clique-width [21, 8], that there are classes of graphs that are -wqo and described by finitely many forbidden (induced) subgraphs but are not -wqo [3], and that there exists (minimal) classes of unbounded clique width described by finitely many forbidden induced subgraphs [2]. We summarize the various properties of classes of graphs in Figure 1.
Contributions.
Our first contribution is a decision procedure, that allows us to recognize classes of graphs that are labelled-wqo, provided that they are described using an -interpretation from (finite) words to graphs, the definition of which we defer to Section 2.
Theorem 1.
Let be an -interpretation from words to graphs, and let be its image. There exists a computable such that the following are equivalent:
-
1.
is -well-quasi-ordered,
-
2.
is labelled-well-quasi-ordered,
-
3.
is wqo-well-quasi-ordered.
Furthermore, these properties are decidable.
We also prove that all hereditary classes of graphs that are -wqo and of bounded linear clique-width can be described by such an -interpretation, which justifies their use as an input to our decision procedure. As a corollary, we answer positively to part of Pouzet’s conjecture in the case of classes of graphs having bounded linear clique-width, a subset of classes of bounded clique-width, that is incomparable with the classes studied in [9].
Corollary 2 (Collapse for Bounded Linear Clique-Width).
For all classes of graphs of bounded linear clique-width, the following are equivalent:
-
1.
The class is labelled-well-quasi-ordered,
-
2.
The class is wqo-well-quasi-ordered.
The novelty of our approach is to leverage classical tools from automata theory (namely, the factorization forest theorem of Simon [26]), and avoid the use of complex minimal bad sequence arguments [22] that were used in [9]. We strongly believe that these results generalize to the case of classes of bounded clique-width. To justify this claim, we derive a self-contained proof of [9, Theorem 3] for classes having bounded linear clique-width. Our algebraic approach provides a new perspective on the statements of [9] by connecting it to the class of totally ordered monoids, that were already investigated in the context of regular languages for completely different reasons [18, 20].
Outline.
We provide in Section 2 basic definitions and explain how to deduce Corollary 2 from Theorem 1. In Section 3, we prove our main Theorem 1. Then, in Section 4, we discuss the relationship between our characterization and the previous work of [9], proving in Theorem 24 that their characterization can be seen as a coarser version of ours in the case of bounded linear clique-width.
2 Preliminaries
Graphs.
In this paper, graphs are all finite, simple and undirected unless otherwise explicitly stated. We denote by the set of vertices of a graph , and by the set of edges. A map between two graphs and is an embedding if it is injective and satisfies if and only if , for all . We write if there exists an embedding , which means that is an induced subgraph of , or equivalently that is an induced extension of . A class of graph is hereditary when it is closed under taking induced subgraphs. We denote by the hereditary closure of a class , obtained by considering all induced subgraphs of graphs in . The notion of induced subgraph is extended to labelled graphs as follows: a function is an embedding of into whenever is injective, if and only if , and for all .
Example 3.
The class of all finite paths is well-quasi-ordered (because a path of length is an induced subgraph of all paths of length ) but not -well-quasi-ordered (which can be seen by coloring the endpoints of the paths). The class of all finite cycles is not well-quasi-ordered (since a cycle of length is never an induced subgraph of a cycle of length ). The class of all cliques is wqo-wqo, which can be seen by representing colored cliques as finite multisets of colors, and leveraging the fact that these finite multisets are well-quasi-ordered when the colors themselves are well-quasi-ordered (see for instance [10]).
Automata Theory.
We assume that the reader is familiar with the basics of automata theory, in particular the notions of monoid morphisms, idempotents in monoids, monadic second-order () logic and first-order () logic over finite words (see e.g. [27]). We use symbols , to denote finite non-empty alphabets, and letters , , for finite words. The symbol is used to denote the empty word, and given a word , we write for the letter at position , and (resp. ) for the prefix of of length (resp. the suffix of starting at position ), note that these can be empty words. Similarly, we write for the factor of between positions and , which is empty if .
Let us recall that a monoid is a set endowed with a binary operation that is associative, has an identity element . Inside a monoid , an element is called idempotent if . A morphism of monoids is a map that preserves the multiplication and the identity element, i.e., and for all . The finite words is a monoid under concatenation, with the empty word as neutral element.
MSO Interpretations.
A (simple) -interpretation from finite words () to finite graphs is given by an formula that defines the edge relation of a graph, a domain formula that defines the domain of the output graph, and a selection formula without free variables, used to restrict the domain of the interpretation.
Let be an -interpretation from finite words to graphs. The image of a word is the graph with vertices , and edges for each pair such that and satisfies . The image of is the collection of where ranges over the words in that satisfy , we write this image . We provide examples of interpretations in Examples 4 and 5, the first example is a class that is wqo-wqo, while the second one is not even -wqo.
Example 4.
The class of all cliques can be obtained as the image of the following -interpretation: , , and .
Example 5.
The class of all finite paths can be obtained as the image of the following -interpretation: , , and .
A class of graphs has bounded linear clique-width if there exists a finite alphabet , and an -interpretation such that [7, 6]. The notion of bounded clique-width is defined similarly, using trees instead of words as input of the interpretation. We remark that there are uncountably many classes of graphs that have bounded linear clique-width, and only countably many -interpretations. This discrepancy vanishes when considering hereditary classes of graphs that are -well-quasi-ordered (Lemma 6), and we can use this to bridge the gap between the algorithm of Theorem 1 that considers -interpretations and our Corollary 2, by leveraging the interaction between hereditary closure and labelling of a class (Lemma 7).
Lemma 6 (Sandwich Lemma).
Let be a class of graphs that has bounded linear clique-width and is -well-quasi-ordered. Then, there exists an -interpretation such that . In particular, when is hereditary.
Lemma 7 (Hereditary Closure Lemma).
Let , and be a class of graphs that is -well-quasi-ordered. Then the hereditary closure of is -well-quasi-ordered.
Corollary 2 (Collapse for Bounded Linear Clique-Width). [Restated, see original statement.]
For all classes of graphs of bounded linear clique-width, the following are equivalent:
-
1.
The class is labelled-well-quasi-ordered,
-
2.
The class is wqo-well-quasi-ordered.
Proof.
Let be a class of graphs that has bounded linear clique-width,
and is labelled-well-quasi-ordered.
By Lemma 6, there
exists an -interpretation such that . Since is
labelled-well-quasi-ordered, so is by
Lemma 7.
As a consequence, is labelled-wqo (as a subset), and using
Theorem 1, we conclude that is wqo-wqo.
Because , it means that is wqo-wqo too.
††margin:
Back to Corollary˜2
on page 2
3 MSO-Interpretations and Monoids
In this section, we will develop an automata theoretic approach to the problem of characterizing images of -interpretations that are labelled-well-quasi-ordered by the induced subgraph relation. The key ingredient is to use the notion of factorization forests [26] to provide an alternative syntax of such classes of graphs where words (linear trees of unbounded depth) are replaced by forests (branching trees of bounded depth). Let us first justify that in the upcoming analysis, the domain formula and the universe formula can be safely ignored (i.e., set to be constantly true). Beware that, contrary to Lemma 6, the following construction is effective which is essential for the algorithm of Theorem 1.
Lemma 8 (Simple Interpretations).
Let be an -interpretation. There exists an -interpretation that does not use and , and such that for all sets , is well-quasi-ordered if and only if is.
Furthermore, is effectively computable from .
Monoid Interpretations.
The equivalence between regular languages, finite automata, languages recognized by finite monoids and formulas ([13]) provides the existence (and computability), for every formula , of a finite monoid , together with a surjective morphism , and a subset , such that, for all words , for all couples of positions in , we have:
As a consequence, given a finite alphabet and an edge formula defining an -interpretation, one can construct a monoid interpretation: it is described by a finite monoid , a morphism , and a subset that we call an edge selector. To a word , it associates a graph with vertices defined as the set of positions of , and creates an edge if . We provide hereafter two examples of monoid interpretations that will be used to illustrate our constructions.
Example 9.
Let , where , , and . Then, the image of the monoid interpretation is the class of finite paths.
Example 10.
Let , , , and . Then, the image of the monoid interpretation is included in the class of bipartite cliques.
Although Examples 9 and 10 share the same alphabet, similar morphisms and similar edge selectors, their behavior is drastically different. The image of is -wqo but not -wqo, while the image of is wqo-wqo.
3.1 Factorisation Forests
Let us now introduce one of the key ingredients of our proof, which is the notion of factorization forest of a finite word over a finite monoid . Initially introduced by Simon in [26], they provide a way to efficiently compute the product of factors of a word in , by grouping computations using a tree of bounded depth, as restated in Theorem 11. The main idea of this paper is to leverage the factorization forest theorem of Simon to provide a (bounded depth) tree-decomposition of the graphs in the image of a monoid interpretation. We will equip these tree-decompositions with a well-quasi-ordering (Lemma 12), which will induce a (new) well-quasi-ordered relation on the image of our monoid interpretation. At the level of graphs, this new relation will be a restriction of the induced subgraph relation, obtained by only considering embeddings that are compatible with the tree-decompositions of the graphs.
Theorem 11 ([26]).
Let be a finite alphabet, and be a morphism to a finite monoid. There exists333It can be proven that this depth is at most [5], but we will not use this fact. a computable depth such that every word can be factorized as a tree of depth at most built using:
-
Leaves: , that evaluate to ,
-
Binary nodes: , that evaluate to if evaluates to and evaluates to ,
-
Idempotent nodes: , if and are subtrees that evaluate to the same idempotent element . The idempotent nodes evaluate to their corresponding idempotent .
We call such a tree a factorization forest of . We write for the set of factorization forests of depth at most for the monoid , leaving implicit.
As the name suggests, the factorization theorem provides a factorization of a word into a tree, and its subtrees correspond to factors of . Let us illustrate in Figure 2 a factorization of a word for the monoid used in Example 9. For the interpretation of Example 10, a similar picture is obtained by replacing all ’s from Figure 2 by ’s.
Ordering Factorisation Forests.
By definition, factorization forests are nested words over the alphabet . It has been proven by Higman that if a set is well-quasi-ordered, then the set of finite words is also well-quasi-ordered [17], where if there exists an increasing map such that for all . When we say that is a (scattered) subword of .
Given a finite alphabet , a finite monoid and a surjective morphism , we define by induction on a well-quasi-ordering on the set of factorization forests of depth at most . For , the forests are all leaves, and we equip them with the equality quasi-order. For the inductive case, we let if one of the following holds:
-
(hence, both are of depth at most ),
-
, , and ,
-
and and for the subword embedding relation over .
Let us briefly remark that this definition of ordering using “nested subword embeddings” was used to verify priority channel systems in [16], and is a specialization of the notion of the gap-embedding relation on trees of [11]. Furthermore, the definition of the well-quasi-ordering can be extended to the case where nodes of the forests are labelled by elements from a quasi-ordered set as follows: whenever we compare two nodes and , we also check if their labels are comparable. The following lemma states that is a well-quasi-order for every , and follows immediately from Higman’s lemma [17].
Lemma 12 (Factorization Forests are Well-Quasi-Ordered).
Let be a finite monoid, be a finite alphabet, be a morphism, be a well-quasi-ordered set, and . The set of -labelled factorization forests of depth at most is a well-quasi-order for .
Let us fix for the rest of this section a monoid interpretation , and a constant such that every word in has a factorization forest of depth at most (using Theorem 11). Because factorization forests are an additional structure over a finite word , the interpretation can be extended to a map from factorization forests to graphs (ignoring the factorization of the underlying word). If this map was order-preserving, i.e., if implies for every , then we could immediately conclude that is well-quasi-ordered: the image of a well-quasi-ordered set by an order-preserving map is well-quasi-ordered [10]. Let us argue that it is almost the case.
By construction, if , then can be obtained from by inserting new children to idempotent nodes. In particular, it induces a map from nodes of to nodes of that respects: the least common ancestor relation, the evaluation of subtrees, the type of nodes (binary node, leaf, idempotent node). Now, if and are factorization forests of words and respectively, then can be seen as a map from positions of to positions of , hence a map from vertices of to vertices of , which is almost an embedding.
Lemma 13.
Let be a word, be a factorization of , be an idempotent element of , be an idempotent node that evaluates to in , and be a factorization of a word that evaluates to . Let be obtained by inserting as a new child of the node in , and be the corresponding map at the level of words. Then, for every pair of positions that are not in the factor of described by , we have: , , and .
As a consequence, for such pairs, if and only if .
Proof.
Let us consider two positions of the word . Assume that the insertion of is done between and (the other cases are similar). Then, one can write , where is the factor of corresponding to the idempotent node , split where the insertion of the word happens. By construction, . Because and , we immediately obtain the first and third desired equalities.
Which concludes the proof.
Let us illustrate why considering positions that are “too close” to the inserted forest is problematic in Figure 3. This is not surprising as the class of graphs described by the interpretation of Figure 3 is not -wqo, hence it is impossible for the interpretation to transport embeddings between forests to embeddings between their images.
3.2 Characterizing Labelled Well-Quasi-Ordered Classes
Generalizing the example of Figure 3, we can now define what are the formal obstructions inside the factorization forests for the map to be order-preserving. We will call them bad forest paths. To that end, let us first generalize the notion of evaluation of a monoid interpretation from factorization forests to account for a context : is the graph obtained by evaluating the factorization forest with the morphism and the edge selector , where . This will allow us to reason about the image of a subtree of a factorization forest , which corresponds to an induced subgraph of the generated graph , where and are the monoid elements obtained by evaluating the word to the left and to the right of the selected subtree. Let us also define the type of a leaf in a factorization tree of a word , as the triple , that we denote as .
Definition 14.
Let be a sequence of elements in , all evaluating to the same idempotent element and . The sequence is a good forest path if for all , there exists another sequence of elements of evaluating to , a tree , and an embedding where and , such that:
-
1.
for every element , originating from a leaf of some , that is sent to a vertex originating from via , we have ,
-
2.
the vertices originating from are mapped to vertices originating from ,
-
3.
the vertices originating from are mapped to vertices originating from ,
-
4.
there exists such that the image of does not intersect the vertices originating from .
We say that a good forest path is split by the sequence and embedding in the context . A bad forest path is a sequence of factorization forests , that is not a good forest path.
By definition, good forest paths can be split into smaller independent parts. Beware that the split can reorder the nodes in some arbitrary fashion, and may not respect the order of the leaves in the original forest. Our first result is to prove that the existence of arbitrarily long bad forest paths is sufficient to conclude that the class is not labelled-wqo. In fact, the number of colors used will be related to the size of the monoid .
Lemma 15.
Assume that bad forest paths of arbitrarily large length in . Then, is not -wqo, where .
Proof.
Assume that there exists arbitrarily long bad forest paths in . Then, there exists a sequence of forests such that for all , , where is a bad forest path, and is increasing with respect to . Since these are bad forest paths, there exists such that cannot be split in the context .
Because the monoid is finite, we can assume that all the forests evaluate to the same idempotent element , and that there exists such that and for all . Extracting a subsequence also allows us to assume that for all , is greater than the number of leaves in .
Let us now consider the sequence of graphs , that we color as follows: every vertex is colored by its leaf type in its original factorization forest , . We furthermore add three new colors, distinguishing vertices originating from the first and last factorization forest of the bad forest path and from the rest of the vertices. The final coloring corresponds to labels. We now prove that the sequence is an infinite antichain with the provided coloring, which concludes the proof.
Assume towards a contradiction that there exists and a labelled embedding . Because of the coloring chosen on and , the map respects the types of the leaves (Item 1), sends the vertices originating from to the vertices originating from (Item 2), and the vertices originating from to the vertices originating from (Item 3). Furthermore, the image of must not intersect the vertices originating from some , (Item 4), because is greater than the number of leaves in , hence the number of vertices in . We conclude that is a good forest path, which can be split in the context , contradicting our initial assumption.
Let us now prove that the absence of arbitrarily long bad forest paths is sufficient to conclude that the class is wqo-well-quasi-ordered. The key ingredient of the proof is to split the good forest paths, which has the effect of adding dummy idempotent nodes separating the leaves of our factorization forests. Adding these nodes will prevent the issues encountered in generalizing Lemma 13 to nodes that are “too close” to the inserted forest, ensuring that the case of Figure 3 never happens. A first observation is that to split a good forest path, one only needs to repeat it three times.
Lemma 16.
Let be a good forest path. Then, for all contexts , there exists an embedding satisfying Items 1, 2, 3, and 4 of Definition 14, where , and . Furthermore, the embedding can be chosen such that for all , is either in the first copy of the sequence, or in the third copy of the sequence, i.e., is uniquely determined by a map from the vertices of to .
Proof.
Let be a sequence of factorization forests that splits the good forest path in the context using an embedding , where . Let us write for the smallest index such that the image of does not intersect the vertices originating from . Let us also write the map that assigns to every vertex of the index of the factorization forest from which originates in .
Recall that every vertex of exists in three copies in (one for each repetition of the sequence of forests). We define the map as follows, given a vertex :
-
If originates from , then it is mapped to its first copy in ,
-
If originates from , then it is mapped to its last copy in ,
-
If originates from , , and , then it is mapped to its first copy in ,
-
If originates from , , and , then it is mapped to its last copy in .
It is clear that satisfies Items 1, 2, 3, and 4. What remains to be shown is that is an embedding of into , which will follow from the fact that was itself an embedding. Given two vertices four cases can be considered:
-
If and both originate from . Then, if and only if , if and only if , because for every idempotent nodes. Therefore, if and only if .
-
If and both originate from , the same argument applies but with the last copy of .
-
If and originate from a tree , , and are sent to the same copy of by . Then a similar argument applies too.
-
If and originate from a tree , , and are sent to different copies of by . Then, one idempotent element is inserted when computing the edge relation between and , which may change the result (as in Figure 3). However, if and only if because is an embedding. It follows from the same proof scheme as for Lemma 13 that if and only if , as they share the same leaf types, and are both separated by the same idempotent .
We have proven that is an embedding of into .
We are now ready to upgrade the forests in , assuming the existence of a bound on the length of bad forest paths. Let us overload notations for the binary nodes so that , , and . Note that the depth of is bounded by , where is the maximal depth of the trees .
Let us illustrate our construction on an example. Let , where are factorization forests evaluating to some idempotent element , and assuming that . The first step is to group the children of into buckets of size (that is, ): and . Now, because and are of length , they are good forest paths, and can be split by repeating their sequence three times (Lemma 16). We obtain new buckets and . We label the leaves of the trees in the sequences and as if they are in the image of the split, and otherwise. The resulting forest is obtained as follows:
| (1) |
Where the padding elements have all their leaves labelled with . The induced subgraph of obtained by removing all the vertices labelled by is isomorphic to . Furthermore, every pair of leaves of that are labelled by have the following property: either and belong to the same child of (for instance, , or ), or they are separated by a child of whose leaves are all labelled by . In particular, for the tree , inserting new children to the idempotent node does not modify the induced subgraph of the leaves labelled by .
The following technical lemma states that this construction can be applied inductively to obtain a function , for some bounded that depends on and .
Lemma 17.
Assume that there exists a bound on the length of bad forest paths in . Then, there exists a and a function that maps a factorization forest to a forest that is labelled using , and such that:
-
Every graph is obtained as the restriction of a graph to its vertices for some ,
-
For every , for every function witnessing that (as labelled forests), the restriction of to the induced subgraph of vertices of is an embedding into the induced subgraph of vertices of .
From Lemma 17, we immediately conclude that the class is wqo-well-quasi-ordered. To simplify the proof, let us recall an equivalent characterization of well-quasi-ordered sets . A sequence is good if there exists such that . A set is well-quasi-ordered if and only if every infinite sequence of elements of is good.
Corollary 18.
Assume that there exists a bound on the length of bad forest paths in . Then, is wqo-well-quasi-ordered.
Proof.
Consider an infinite sequence of graphs , with labellings , where is a well-quasi-ordered set. Let us equip the set with the equality relation, and remark that is also well-quasi-ordered when endowed with the product ordering.
By definition, every graph is the image of a word through . Because we chose the value of according to Simon’s factorization theorem (Theorem 11), every word has a factorization forest . We can construct build the sequence , where is the function provided by Lemma 17.
Because the set of -labelled factorization forests of depth at most is well-quasi-ordered for (Lemma 12), there exists an increasing pair of indices such that . This defines a function that is an embedding when restricted to the induced subgraph of vertices, and respects the labelling . Because the induced subgraph of vertices of is precisely , and the same holds for , this provides a function that is a labelled embedding of into .
The sequence is therefore a good sequence. and we have proven that is wqo-well-quasi-ordered.
We have proven that the existence of a bound on the length of bad forest paths charcterizes whether is wqo-well-quasi-ordered. To finalize the proof of our Theorem 1, we only need to show that the existence of such a bound is decidable. This is because the language of bad forest paths of depth at most is a regular language, and one can decide whether a regular language is unbounded.
Lemma 19.
Let . It is decidable whether contains bad forest paths of unbounded length.
Proof of Theorem 1 page 1.
Let be an -interpretation from to graphs. Using Lemma 8, we can compute an -interpretation that only uses the formula , such that is -wqo if and only if is -wqo for all . From , we can compute a monoid interpretation by standard techniques. Applying Theorem 11, we compute a number such that every word has a factorization forest . We then use Lemma 19 to decide whether has bad forest paths of unbounded length. If it does, then is not -well-quasi-ordered, where (Lemma 15), hence is not -wqo. If it does not, then is wqo-well-quasi-ordered (Corollary 18), and therefore so is .
In particular, we have computed a number such that is -well-quasi-ordered if and only if is wqo-well-quasi-ordered. ††margin: Back to Theorem˜1 on page 1
4 A Semigroup Approach to Well-Quasi-Orders of Relabel Functions
Classes that are labelled-well-quasi-ordered and of bounded clique-width had already been studied by [9], which leveraged the notion of -node labelled controlled graphs, a notion introduced by [28]. The class of -node labelled controlled graphs with relabel functions from to is defined inductively on -labelled graphs as follows: one can create a single vertex graph with label , one can relabel a -labelled graph with a function (applying to every vertex label of ), and one can combine two -labelled graphs and using a set by creating the disjoint union of and , and adding edges between vertices and in and respectively if .
It is a folklore result that every class of graphs that has bounded linear clique-width is a subset of some for some and some , and [9] completely characterized the pairs such that is -well-quasi-ordered or labelled-well-quasi-ordered, showing the equivalence of these two properties [9, Theorem 3]. The theorem relies on the notion of totally ordered sets of relabel functions: given two functions , they define a relation as , and study families of functions that are closed under composition, contain the identity function, and totally ordered for this relation [9, Section 2].
Theorem 20 ([9, Theorem 3, Corollary 1, Corollary 2]).
For all , for all monoids of endofunctions of , the following are equivalent and decidable properties:
-
1.
is totally ordered by the relation ,
-
2.
The class of graphs is -well-quasi-ordered.
-
3.
The class of graphs is labelled-well-quasi-ordered.
Let us mention that Theorem 20 does not allow us to decide whether a class of bounded linear clique-width is -well-quasi-ordered or not, nor does it solve the Pouzet Conjecture in that case. While it is true that every class of graphs of bounded linear clique-width is a subset of for some and some [7], no analogues of Lemma 6 exist for this representation of classes of graphs: it may be that the class of graphs is -well-quasi-ordered while the class that contains it is not. This mismatch was conjectured to never happen [9, Conjecture 4], but it remains an open question. Informally, one can state that the difference between Theorem 20 and the results of the present paper is akin to the difference between analyzing properties of transition monoids and properties of regular languages: in our case, the set of accepting states (the edge selector) is fixed, which leads to more precise results, and in particular allows us to transfer our results from images of monoid interpretations to all classes of bounded linear clique-width. We will make this intuition precise in our Theorems 24 and 25, by showing that the condition of being ordered is equivalent to the fact that there are no long bad forest paths for every choice of edge selector.
4.1 Totally Ordered Monoids
In this section, we argue that the seemingly arbitrary notion of ordering between relabel functions can be naturally understood in the language of semigroup theory, namely that they are related to the classical Green Relations [5]. The connection can be made because the sets of functions studied by [9] are closed under compositions and contain the identity function, and therefore form a (finite) monoid .
Let be a finite monoid and be an element. We denote by the bilateral ideal of , i.e., the set of elements where ranges in . Similarly, we define the left ideal and the right ideal respectively as the sets of elements and where ranges in .
Lemma 21 (Green Formulations of Total Orderings).
Let be a finite monoid. The following are equivalent:
-
1.
is totally ordered by the relation defined by if and only if ,
-
2.
is totally ordered by the relation defined by if and only if ,
-
3.
is totally ordered by the relation defined by if and only if ,
-
4.
The bilateral ideals of are totally ordered for inclusion, and for all , .
Whenever one of these conditions is satisfied, all the defined preorders coincide. Furthermore, if is a monoid of relabel functions , then the above conditions are equivalent to the condition that is totally ordered by the relation .
Let us now introduce a definition for so-called “totally ordered monoids” that is based on Item 3. Examples of such totally ordered monoids are: all groups, and all band monoids, i.e., the monoids such that for all , there exists such that .
Definition 22.
A monoid is totally ordered if for all , either or . We write if , and if .
Let us mention that totally ordered monoids are exactly those that only recognize languages having the finite power property, i.e., such that languages such that there exists such that [18, 20]. Furthermore, totally ordered monoids enjoy the following cancellation property, that was noticed by [9], and powers their combinatorial results. We will leverage this property to prove the upcoming Lemma 25, that these cancellations prevent the existence of long bad forest paths.
Lemma 23 (Cancellation Property).
Let be a totally ordered finite monoid. Then for all , if and then . Similarly, if then .
4.2 Bad Forest Paths in Totally Ordered Monoids
Let us now relate our characterization of classes of bounded linear clique-width that are labelled-well-quasi-ordered to the one introduced by [9]. This is done by showing that totally ordered monoids (i.e., the families studied in [9]), are precisely those for which the edge selector of our monoid interpretations (as defined in Section 3) does not matter: every choice leads to a wqo-well-quasi-ordered class of graphs.
Theorem 24.
Let be a finite alphabet, be a finite monoid, and be a surjective morphism. Let be the size of , and be the set of functions from to itself obtained by considering the action of on itself. Then, the following are equivalent:
-
1.
The monoid is totally ordered,
-
2.
The union is -well-quasi-ordered, where .
-
3.
The union is wqo-well-quasi-ordered, where .
-
4.
The class is wqo-well-quasi-ordered.
-
5.
The class is -well-quasi-ordered.
Notice that Theorem 24 also proves that not considering the edge selector from our interpretations (by taking the union over all possible choices) smooths the combinatorial properties of Theorem 1: from a computable such that the class is -well-quasi-ordered if and only if it is wqo-well-quasi-ordered, we drop to the constant , answering positively to the Pouzet Conjecture in this case. The proof of Theorem 24 follows straightforwardly from the following lemma that characterizes the absence of bad forest paths in the case of totally ordered monoids, regardless of the choice of the edge selector. The lemma itself uses the cancellation properties of totally ordered monoids (Lemma 23).
Lemma 25 (Bad Forest Paths in Totally Ordered Monoids).
Let be a finite alphabet, let be a totally ordered finite monoid, be an edge selector, be a surjective morphism, and be a monoid interpretation. Then, there are no bad forest paths in , for all .
Proof.
Let be factorization forest of a word . Assume that evaluates to an idempotent element . Because , and is totally ordered, there must exist a factorization such that , , and and . This is proven by induction on the size of , using the fact that or .
Let us now construct a function from the leaves of into the leaves of . To that end, let us remark that the leaves of are the positions of the word , and let us define the function as follows:
-
For every position that belongs to , we , i.e., its corresponding position in the first copy of in .
-
For every position that belongs to , we , i.e., its corresponding position in the third copy of in .
This function satisfies all requirements of the definition of a split (Definition 14), and we only have to prove that it is an embedding of into . To that end, it suffices to prove that for every decomposition , we have , , and .444This works regardless of the choice of , that can only use the values of these factors to determine the presence of an edge. Let us only deal with the case of , as the other cases are similar.
If the factor is contained in or in , then the result is immediate. Let us now assume that the range intersects both and . We can write , such that , , and . One can then compute . Our goal is to prove that . To that end, let us remark that . Let us also remark that , and , and that one can therefore apply the cancellation property of Lemma 23 to obtain the following equalities:
| left cancellation | ||||
| right cancellation | ||||
We have proven that there exists no bad forest path in .
Proof of Theorem 24 page 24.
First, the equivalence between Item 1 and Items 4 and 5 follows from Theorem 20. Furthermore, it is clear that Item 3 implies Item 2.
To prove that Item 1 implies Item 3, let us assume that is totally ordered. Then by Lemma 25, we know that for all , there are no bad forest paths in for all , and using Corollary 18 we conclude that is wqo-well-quasi-ordered. As a consequence, the finite union of these classes is also wqo-well-quasi-ordered.
To prove that Item 2 implies Item 1, let us assume that that the monoid is not totally ordered, then there exists such that and . Because the morphism is surjective, there exists words such that and . In particular, both and must be non-empty words. Let us define the edge selector , and consider the sequence of words , for some letter and for all . If there is an edge between the -th and -th letter of , then . As a consequence, the distance between the first and last positions of in the graph is at least , which tends to as increases. Furthermore, there exists a path from the first to the last position of , because for every factor of , (and similarly for ). Hence, the distance between the vertices originating from the first and last positions in is at most . By coloring the vertices of the graphs to distinguish the first and last positions of , and extracting a subsequence of such that the distance between the first and last positions in is greater than , we obtain an infinite antichain of -labelled graphs. Therefore, is not -well-quasi-ordered, which completes the proof. ††margin: Back to Theorem˜24 on page 24
5 Outlook
The automata based approach.
We believe that the results presented in this paper strongly advocate for an automata based approach to the characterization of graph classes that are labelled-well-quasi-ordered. In particular, there are two main directions that naturally emerge from our work. The first one is to investigate the case of classes of bounded clique-width, that is, moving from words to trees as input of our -interpretations. The second direction is to investigate whether the bound of our main Theorem 1 can be improved to , as it is the case in Theorem 24.
We strongly believe that our proof technique can be adapted to trees, as it is fundamentally based on two key theorems for words that are already known to be adaptable to trees: the fact that finite words are equipped with a well-quasi-ordering by Higman’s lemma [17], for which the generalization to trees is known as Kruskal’s tree embedding theorem [19]; and the fact that finite words over a finite monoids can be factored into bounded height trees, i.e. Simon’s factorization theorem [26], which has been adapted to trees by Colcombet [4].
As to the bound in Theorem 1, we conjecture that it can be improved to , based on a finer analysis of bad forest paths (Lemma 15), which would bridge the gap between our analysis and the previous results from [9], and prove that the Pouzet conjecture holds for classes of graphs of bounded linear clique width.
New decompositions.
Let us also remark that the proof of Theorem 1 actually provides a tree-decomposition of graphs that is, in some sense, successor-free: adding new nodes between two nodes in the tree does not change the semantics of the tree. We believe it is worth investigating whether these decompositions could be exploited in other contexts than labelled-well-quasi-orderings.
Interpreting paths.
Finally, we conjecture that paths are the only obstruction to being labelled-well-quasi-ordered, as all (currently known by the authors) proofs that some classes are not labelled-wqo end up extracting finite paths from the graph classes we consider (see for instance Lemmas 15 and 24, [9, Theorem 3], [12]). This leads us to formulate the following Conjecture 26.
Conjecture 26.
Let be a class of graphs. Then, is not labelled-well-quasi-ordered if and only if there exists a finite set of labels, and an order-preserving -interpretation from to the class of all finite paths.
References
- [1] Bogdan Alecu, Mamadou Moustapha Kanté, Vadim Lozin, and Viktor Zamaraev. Lettericity of graphs: an fpt algorithm and a bound on the size of obstructions, 2024. doi:10.48550/arXiv.2402.12559.
- [2] Aistis Atminas, Robert Brignall, Vadim V. Lozin, and Juraj Stacho. Minimal classes of graphs of unbounded clique-width defined by finitely many forbidden induced subgraphs. Discrete Applied Mathematics, 295:57–69, May 2021. doi:10.1016/j.dam.2021.02.007.
- [3] Robert Brignall, Michael Engen, and Vincent Vatter. A counterexample regarding labelled well-quasi-ordering. Graphs and Combinatorics, pages 1395–1409, 2018. doi:10.1007/s00373-018-1962-0.
- [4] Thomas Colcombet. A combinatorial theorem for trees. In Lars Arge, Christian Cachin, Tomasz Jurdziński, and Andrzej Tarlecki, editors, Automata, Languages and Programming, pages 901–912, Berlin, Heidelberg, 2007. Springer Berlin Heidelberg. doi:10.1007/978-3-540-73420-8_77.
- [5] Thomas Colcombet. Green’s relations and their use in automata theory. In Adrian-Horia Dediu, Shunsuke Inenaga, and Carlos Martín-Vide, editors, Language and Automata Theory and Applications, pages 1–21, Berlin, Heidelberg, 2011. Springer Berlin Heidelberg. doi:10.1007/978-3-642-21254-3_1.
- [6] Bruno Courcelle. Monadic second-order definable graph transductions: a survey. Theoretical Computer Science, 126(1):53–75, 1994. doi:10.1016/0304-3975(94)90268-2.
- [7] Bruno Courcelle, Joost Engelfriet, and Grzegorz Rozenberg. Handle-rewriting hypergraph grammars. Journal of Computer and System Sciences, 46(2):218–270, 1993. doi:10.1016/0022-0000(93)90004-G.
- [8] Konrad K. Dabrowski, Vadim V. Lozin, and Daniël Paulusma. Well-quasi-ordering versus clique-width: New results on bigenic classes. Journal of Combinatorial Theory, Series B, 130:1–18, 2018. doi:10.1016/j.jctb.2017.09.012.
- [9] Jean Daligault, Michael Rao, and Stéphan Thomassé. Well-quasi-order of relabel functions. Order, 27(3):301–315, September 2010. doi:10.1007/s11083-010-9174-0.
- [10] Stéphane Demeri, Alain Finkel, Jean Goubault-Larrecq, Sylvain Schmitz, and Philippe Schnoebelen. Algorithmic aspects of wqo theory. Course notes for the MPRI Master’s program, Paris, 2012. URL: https://cel.archives-ouvertes.fr/cel-00727025.
- [11] Nachum Derhowitz and Iddo Tzameret. Gap embedding for well-quasi-orderings1. Electronic Notes in Theoretical Computer Science, 84:80–90, 2003. WoLLIC’2003, 10th Workshop on Logic, Language, Information and Computation. doi:10.1016/S1571-0661(04)80846-6.
- [12] Guoli Ding. Subgraphs and well-quasi-ordering. Journal of Graph Theory, 16(5):489–502, November 1992. doi:10.1002/jgt.3190160509.
- [13] Samuel Eilenberg. Automata, Languages, and Machines, volume A of Automata, Languages, and Machines. Academic Press, 1974. doi:10.5555/540244.
- [14] Robert Ganian, Petr Hliněný, Jaroslav Nešetřil, Jan Obdržálek, and Patrice Ossona de Mendez. Shrub-depth: Capturing height of dense graphs. Logical Methods in Computer Science, Volume 15, Issue 1, January 2019. doi:10.23638/LMCS-15(1:7)2019.
- [15] Robert Ganian, Petr Hliněný, Jaroslav Nešetřil, Jan Obdržálek, Patrice Ossona de Mendez, and Reshma Ramadurai. When Trees Grow Low: Shrubs and Fast MSO1, pages 419–430. Springer Berlin Heidelberg, 2012. doi:10.1007/978-3-642-32589-2_38.
- [16] Christoph Haase, Sylvain Schmitz, and Philippe Schnoebelen. The power of priority channel systems. In Pedro R. D’Argenio and Hernán Melgratti, editors, CONCUR 2013 – Concurrency Theory, pages 319–333, Berlin, Heidelberg, 2013. Springer Berlin Heidelberg. doi:10.1007/978-3-642-40184-8_23.
- [17] Graham Higman. Ordering by divisibility in abstract algebras. Proceedings of the London Mathematical Society, 3:326–336, 1952. doi:10.1112/plms/s3-2.1.326.
- [18] Daniel Kirsten. The finite power problem revisited. Information Processing Letters, 84(6):291–294, 2002. doi:10.1016/S0020-0190(02)00319-8.
- [19] Joseph B. Kruskal. The theory of well-quasi-ordering: A frequently discovered concept. Journal of Combinatorial Theory, Series A, 13:297–305, 1972. doi:10.1016/0097-3165(72)90063-5.
- [20] Michal Kunc. Regular solutions of language inequalities and well quasi-orders. Theoretical Computer Science, 348(2):277–293, 2005. Automata, Languages and Programming: Algorithms and Complexity (ICALP-A 2004). doi:10.1016/j.tcs.2005.09.018.
- [21] Vadim V. Lozin, Igor Razgon, and Viktor Zamaraev. Well-quasi-ordering does not imply bounded clique-width. In Graph-Theoretic Concepts in Computer Science, pages 351–359, Berlin, Heidelberg, 2016. Springer Berlin Heidelberg. doi:10.1007/978-3-662-53174-7_25.
- [22] Crispin St. John Alvah Nash-Williams. On well-quasi-ordering transfinite sequences. Mathematical Proceedings of the Cambridge Philosophical Society, 61:33–39, 1965. doi:10.1017/S0305004100038603.
- [23] Marko Petkovšek. Letter graphs and well-quasi-order by induced subgraphs. Discrete Mathematics, 244(1–3):375–388, February 2002. doi:10.1016/s0012-365x(01)00094-2.
- [24] Maurice Pouzet. Un bel ordre d’abritement et ses rapports avec les bornes d’une multirelation. CR Acad. Sci. Paris Sér. AB, 274:A1677–A1680, 1972.
- [25] Neil Robertson and Paul D. Seymour. Graph minors. xx. wagner’s conjecture. Journal of Combinatorial Theory, Series B, 92(2):325–357, 2004. Special Issue Dedicated to Professor W.T. Tutte. doi:10.1016/j.jctb.2004.08.001.
- [26] Imre Simon. Factorization forests of finite height. Theoretical Computer Science, 72(1):65–94, 1990. doi:10.1016/0304-3975(90)90047-L.
- [27] Wolfgang Thomas. Languages, automata, and logic. In Grzegorz Rozenberg and Arto Salomaa, editors, Handbook of formal languages, pages 389–455. Springer, 1997. doi:10.1007/978-3-642-59136-5.
- [28] Egon Wanke. k-nlc graphs and polynomial algorithms. Discrete Applied Mathematics, 54(2):251–266, 1994. doi:10.1016/0166-218X(94)90026-4.
