Abstract 1 Introduction 2 Preliminaries 3 MSO-Interpretations and Monoids 4 A Semigroup Approach to Well-Quasi-Orders of Relabel Functions 5 Outlook References

Labelled Well Quasi Ordered Classes of Bounded Linear Clique-Width

Aliaume Lopez ORCID University of Warsaw, Poland
Abstract

We construct an algorithm that inputs an 𝖬𝖲𝖮-interpretation from finite words to graphs, and decides if there exists a k such that the class of graphs induced by the interpretation is not well-quasi-ordered by the induced subgraph relation when vertices are freely labelled using {1,,k}. In case no such k exists, we also prove that the class of graphs is not well-quasi-ordered by the induced subgraph relation when vertices are freely labelled using any well-quasi-ordered set of labels. As a byproduct of our analysis, we prove that for classes of bounded linear clique-width, a weak version of a conjecture by Pouzet holds.

Keywords and phrases:
well-quasi-ordering, linear clique-width, MSO transduction, automata theory
Funding:
Aliaume Lopez: Aliaume Lopez was supported by the Polish National Science Centre (NCN) grant “Polynomial finite state computation” (2022/46/A/ST6/00072).
Copyright and License:
[Uncaptioned image] © Aliaume Lopez; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Models of computation
; Theory of computation Automata extensions
Related Version:
Full Version: https://arxiv.org/abs/2405.10894
Editors:
Paweł Gawrychowski, Filip Mazowiecki, and Michał Skrzypczak

1 Introduction

A well-quasi-ordering (wqo) (X,) is a quasi-order that contains no infinite antichains (i.e., sets of pairwise incomparable elements) nor infinite decreasing sequences. A cornerstone result of structural graph theory is the Graph Minor Theorem of Robertson and Seymour [25], which states that the class of all graphs is well-quasi-ordered by the graph minor relation. This result has profound implications in graph theory, and algorithmic consequences such as the polynomial-time solvability of whether a graph can be embedded into a given surface.

The class of all (finite, undirected) graphs is not well-quasi-ordered by the induced subgraph relation, as witnessed by the antichain composed of finite (chordless) cycles, and there has been considerable interest in characterizing classes 𝒞 of finite graphs that are well-quasi-ordered by the induced subgraph relation. It was proven by Ding that all classes of bounded tree-depth are well-quasi-ordered for the induced subgraph relation [12, Theorem 2.2], and it is also the case for letter graphs [23]. In general, these theorems can be extended to all classes admitting “well-behaved” tree decompositions, such as classes of bounded shrub-depth [14, Corollary 3.9], or m-partite cographs [15, Theorem 5.5].

All the theorems mentioned above actually prove a stronger statement: the classes at hand are well-quasi-ordered in the presence of any well-quasi-ordered labelling of the nodes. Given a quasi-ordered set (X,) and a class 𝒞 of graphs, we call 𝖫𝖺𝖻𝖾𝗅X(𝒞) the class of graphs obtained by freely labelling vertices using elements of X, that is, graphs G𝒞 equipped with a labelling 𝗅𝖻𝗅G:𝖵(G)X. The notion of induced subgraph is naturally extended to the one of induced labelled subgraph, and we defer its formal definition to Section 2. For instance, it is shown in [12, Theorem 2.1] that a class of graphs of bounded tree-depth is well-quasi-ordered by the induced labelled subgraph relation, assuming that the labels are themselves well-quasi-ordered. We say that a class 𝒞 of graphs is k-well-quasi-ordered (k-wqo) if 𝖫𝖺𝖻𝖾𝗅X(𝒞) is well-quasi-ordered for all finite sets X of size at most k, and that it is labelled-well-quasi-ordered (labelled-wqo) if it is well-quasi-ordered for all finite sets X of labels. Finally, let us say that a class 𝒞 is wqo-well-quasi-ordered (wqo-wqo) if it is well-quasi-ordered for all well-quasi-ordered sets X of labels. Note that wqo-wqo implies labelled-wqo, which implies k-wqo for all k, and that k-wqo implies -wqo for all k.

For hereditary classes of graphs, the notion of 2-well-quasi-ordered is of particular interest, as it implies the existence111The existence is non-constructive, and building a constructive algorithm requires non-trivial work, see for instance [1] in the case of letter graphs. of a fixed-parameter tractable algorithm to decide the membership of a graph in the class 𝒞, such as described in [15, Corollary 5.8], and more generally that the class of graphs is described by finitely many forbidden induced subgraphs as shown by [9, Proposition 3] and [24]. In 2010, Daligault, Rao and Thomassé studied classes of graphs described by relabel functions, a generalization of m-partite cographs, and characterized those that are 2-well-quasi-ordered [9, Theorem 3]. In this paper, they also provided a positive answer to the Pouzet conjecture [24], which states that 2-wqo, labelled-wqo and wqo-wqo are equivalent for classes described by relabel functions [9, p. 4].222The Pouzet conjecture states the same collapse for all classes of graphs. The classes of graphs studied in [9] are all of bounded clique-width, and it was conjectured that all 2-well-quasi-ordered classes of graphs are of bounded clique-width [9, Conjecture 5]. This started a series of works on the relationship between clique-width and well-quasi-orderings: it was shown that 1-wqo does not imply bounded clique-width [21, 8], that there are classes of graphs that are 1-wqo and described by finitely many forbidden (induced) subgraphs but are not 2-wqo [3], and that there exists (minimal) classes of unbounded clique width described by finitely many forbidden induced subgraphs [2]. We summarize the various properties of classes of graphs in Figure 1.

Figure 1: Ellipses represent classes of graphs that are well-quasi-ordered, and the two cones represent respectively the classes of graphs of bounded clique-width and bounded linear clique-width. We chose to not represent the intermediate classes that are k-wqo for all 2<k<+ for clarity. The dots represent specific classes of graphs witnessing the non-emptiness of some intersections. Pouzet’s conjecture states that all ellipses below 2-wqo collapse for hereditary classes of graphs, and [9, Conjecture 5] states that the region obtained by removing the cone of bounded clique-width from the 2-wqo ellipse is empty. The dashed region represents the classes of graphs that we study in this paper, for which we prove the collapse to wqo-wqo in our Corollary 2.

Contributions.

Our first contribution is a decision procedure, that allows us to recognize classes of graphs that are labelled-wqo, provided that they are described using an 𝖬𝖲𝖮-interpretation from (finite) words to graphs, the definition of which we defer to Section 2.

Theorem 1.

Let I be an 𝖬𝖲𝖮-interpretation from words to graphs, and let 𝒞 be its image. There exists a computable k1 such that the following are equivalent:

  1. 1.

    𝒞 is k-well-quasi-ordered,

  2. 2.

    𝒞 is labelled-well-quasi-ordered,

  3. 3.

    𝒞 is wqo-well-quasi-ordered.

Furthermore, these properties are decidable.

We also prove that all hereditary classes of graphs that are 2-wqo and of bounded linear clique-width can be described by such an 𝖬𝖲𝖮-interpretation, which justifies their use as an input to our decision procedure. As a corollary, we answer positively to part of Pouzet’s conjecture in the case of classes of graphs having bounded linear clique-width, a subset of classes of bounded clique-width, that is incomparable with the classes studied in [9].

Corollary 2 (Collapse for Bounded Linear Clique-Width).

For all classes of graphs of bounded linear clique-width, the following are equivalent:

  1. 1.

    The class is labelled-well-quasi-ordered,

  2. 2.

    The class is wqo-well-quasi-ordered.

The novelty of our approach is to leverage classical tools from automata theory (namely, the factorization forest theorem of Simon [26]), and avoid the use of complex minimal bad sequence arguments [22] that were used in [9]. We strongly believe that these results generalize to the case of classes of bounded clique-width. To justify this claim, we derive a self-contained proof of [9, Theorem 3] for classes having bounded linear clique-width. Our algebraic approach provides a new perspective on the statements of [9] by connecting it to the class of totally ordered monoids, that were already investigated in the context of regular languages for completely different reasons [18, 20].

Outline.

We provide in Section 2 basic definitions and explain how to deduce Corollary 2 from Theorem 1. In Section 3, we prove our main Theorem 1. Then, in Section 4, we discuss the relationship between our characterization and the previous work of [9], proving in Theorem 24 that their characterization can be seen as a coarser version of ours in the case of bounded linear clique-width.

2 Preliminaries

Graphs.

In this paper, graphs are all finite, simple and undirected unless otherwise explicitly stated. We denote by 𝖵(G) the set of vertices of a graph G, and by 𝖤(G) the set of edges. A map h:GH between two graphs G and H is an embedding if it is injective and satisfies {h(u),h(v)}𝖤(H) if and only if {u,v}𝖤(G), for all (u,v)𝖵(G). We write GiH if there exists an embedding h:GH, which means that G is an induced subgraph of H, or equivalently that H is an induced extension of G. A class of graph is hereditary when it is closed under taking induced subgraphs. We denote by i𝒞 the hereditary closure of a class 𝒞, obtained by considering all induced subgraphs of graphs in 𝒞. The notion of induced subgraph is extended to labelled graphs as follows: a function f:𝖵(G)𝖵(H) is an embedding of G into H whenever f is injective, (u,v)𝖤(G) if and only if (f(u),f(v))𝖤(H), and 𝗅𝖻𝗅G(u)𝗅𝖻𝗅H(f(u)) for all (u,v)𝖵(G)2.

Example 3.

The class of all finite paths is well-quasi-ordered (because a path of length n is an induced subgraph of all paths of length mn) but not 2-well-quasi-ordered (which can be seen by coloring the endpoints of the paths). The class of all finite cycles is not well-quasi-ordered (since a cycle of length n is never an induced subgraph of a cycle of length mn). The class of all cliques is wqo-wqo, which can be seen by representing colored cliques as finite multisets of colors, and leveraging the fact that these finite multisets are well-quasi-ordered when the colors themselves are well-quasi-ordered (see for instance [10]).

Automata Theory.

We assume that the reader is familiar with the basics of automata theory, in particular the notions of monoid morphisms, idempotents in monoids, monadic second-order (𝖬𝖲𝖮) logic and first-order (𝖥𝖮) logic over finite words (see e.g. [27]). We use symbols Σ, Γ to denote finite non-empty alphabets, and letters u, v, w for finite words. The symbol ε is used to denote the empty word, and given a word wΣ, we write wi for the letter at position 1i|w|, and wi (resp. wi) for the prefix of w of length i (resp. the suffix of w starting at position i), note that these can be empty words. Similarly, we write w(i,j) for the factor of w between positions i+1 and j1, which is empty if ji+1.

Let us recall that a monoid M is a set endowed with a binary operation that is associative, has an identity element 1M. Inside a monoid M, an element eM is called idempotent if e2=e. A morphism of monoids μ:MN is a map that preserves the multiplication and the identity element, i.e., μ(1M)=1N and μ(ab)=μ(a)μ(b) for all a,bM. The finite words Σ is a monoid under concatenation, with the empty word ε as neutral element.

MSO Interpretations.

A (simple) 𝖬𝖲𝖮-interpretation from finite words (Σ) to finite graphs is given by an 𝖬𝖲𝖮 formula φedge(x,y) that defines the edge relation of a graph, a domain formula φdom(x) that defines the domain of the output graph, and a selection formula φΔ without free variables, used to restrict the domain of the interpretation.

Let I:=(φedge,φdom,φΔ) be an 𝖬𝖲𝖮-interpretation from finite words to graphs. The image of a word wΣ is the graph G with vertices {1i|w|wφdom(i)}, and edges {i,j} for each pair (i,j) such that i<j and w satisfies φedge(i,j). The image of I is the collection of I(w) where w ranges over the words in Σ that satisfy φΔ, we write this image 𝖨𝗆(I). We provide examples of interpretations in Examples 4 and 5, the first example is a class that is wqo-wqo, while the second one is not even 2-wqo.

Example 4.

The class of all cliques can be obtained as the image of the following 𝖬𝖲𝖮-interpretation: φdom(x):=, φedge(x,y):=xy, and φΔ:=.

Example 5.

The class of all finite paths can be obtained as the image of the following 𝖬𝖲𝖮-interpretation: φdom(x):=, φedge(x,y):=(xy)z.(xzy)z=xz=y, and φΔ:=.

A class of graphs has bounded linear clique-width if there exists a finite alphabet Σ, and an 𝖬𝖲𝖮-interpretation I such that 𝒞𝖨𝗆(I) [7, 6]. The notion of bounded clique-width is defined similarly, using trees instead of words as input of the interpretation. We remark that there are uncountably many classes of graphs that have bounded linear clique-width, and only countably many 𝖬𝖲𝖮-interpretations. This discrepancy vanishes when considering hereditary classes of graphs that are 2-well-quasi-ordered (Lemma 6), and we can use this to bridge the gap between the algorithm of Theorem 1 that considers 𝖬𝖲𝖮-interpretations and our Corollary 2, by leveraging the interaction between hereditary closure and labelling of a class (Lemma 7).

Lemma 6 (Sandwich Lemma).

Let 𝒞 be a class of graphs that has bounded linear clique-width and is 2-well-quasi-ordered. Then, there exists an 𝖬𝖲𝖮-interpretation I such that 𝒞𝖨𝗆(I)i𝒞. In particular, 𝒞=𝖨𝗆(I) when 𝒞 is hereditary.

Lemma 7 (Hereditary Closure Lemma).

Let k1, and 𝒞 be a class of graphs that is (k+1)-well-quasi-ordered. Then the hereditary closure of 𝒞 is k-well-quasi-ordered.

Corollary 2 (Collapse for Bounded Linear Clique-Width). [Restated, see original statement.]

For all classes of graphs of bounded linear clique-width, the following are equivalent:

  1. 1.

    The class is labelled-well-quasi-ordered,

  2. 2.

    The class is wqo-well-quasi-ordered.

Proof.

Let 𝒞 be a class of graphs that has bounded linear clique-width, and is labelled-well-quasi-ordered. By Lemma 6, there exists an 𝖬𝖲𝖮-interpretation I such that 𝒞𝖨𝗆(I)i𝒞. Since 𝒞 is labelled-well-quasi-ordered, so is i𝒞 by Lemma 7. As a consequence, 𝖨𝗆(I) is labelled-wqo (as a subset), and using Theorem 1, we conclude that 𝖨𝗆(I) is wqo-wqo. Because 𝒞𝖨𝗆(I), it means that 𝒞 is wqo-wqo too. margin: Back to Corollary˜2
on page 2

3 MSO-Interpretations and Monoids

In this section, we will develop an automata theoretic approach to the problem of characterizing images of 𝖬𝖲𝖮-interpretations that are labelled-well-quasi-ordered by the induced subgraph relation. The key ingredient is to use the notion of factorization forests [26] to provide an alternative syntax of such classes of graphs where words (linear trees of unbounded depth) are replaced by forests (branching trees of bounded depth). Let us first justify that in the upcoming analysis, the domain formula φdom and the universe formula φΔ can be safely ignored (i.e., set to be constantly true). Beware that, contrary to Lemma 6, the following construction is effective which is essential for the algorithm of Theorem 1.

Lemma 8 (Simple Interpretations).

Let I be an 𝖬𝖲𝖮-interpretation. There exists an 𝖬𝖲𝖮-interpretation I that does not use φΔ and φdom, and such that for all sets (Q,), 𝖫𝖺𝖻𝖾𝗅Q(𝖨𝗆(I)) is well-quasi-ordered if and only if 𝖫𝖺𝖻𝖾𝗅Q(𝖨𝗆(I)) is.

Furthermore, I is effectively computable from I.

Monoid Interpretations.

The equivalence between regular languages, finite automata, languages recognized by finite monoids and 𝖬𝖲𝖮 formulas ([13]) provides the existence (and computability), for every formula φ(x,y)𝖬𝖲𝖮, of a finite monoid M, together with a surjective morphism μ:ΣM, and a subset PM3, such that, for all words wΣ, for all couples i<j of positions in w, we have:

wφ(i,j)(μ(wi),μ(w(i,j)),μ(wj))P.

As a consequence, given a finite alphabet Σ and an edge formula φedge(x,y) defining an 𝖬𝖲𝖮-interpretation, one can construct a monoid interpretation: it is described by a finite monoid M, a morphism μ:ΣM, and a subset PM3 that we call an edge selector. To a word wΣ, it associates a graph G with vertices 𝖵(G) defined as the set of positions of w, and creates an edge (i,j)𝖤(G) if (μ(wi),μ(w(i,j)),μ(wj))P. We provide hereafter two examples of monoid interpretations that will be used to illustrate our constructions.

Example 9.

Let Σ:={a}, M:=({0,1,2},+2) where a+2b:=min(2,a+b), μ(a):=1, and P:={(x,y,z)M3y=0}. Then, the image of the monoid interpretation Ipaths is the class of finite paths.

Example 10.

Let Σ:={a}, M:=(/2,+), μ(a):=1, and P:={(x,y,z)M3y=0}. Then, the image of the monoid interpretation Ibp is included in the class of bipartite cliques.

Although Examples 9 and 10 share the same alphabet, similar morphisms and similar edge selectors, their behavior is drastically different. The image of Ipaths is 1-wqo but not 2-wqo, while the image of Ibp is wqo-wqo.

3.1 Factorisation Forests

Let us now introduce one of the key ingredients of our proof, which is the notion of factorization forest of a finite word over a finite monoid M. Initially introduced by Simon in [26], they provide a way to efficiently compute the product of factors of a word in M, by grouping computations using a tree of bounded depth, as restated in Theorem 11. The main idea of this paper is to leverage the factorization forest theorem of Simon to provide a (bounded depth) tree-decomposition of the graphs in the image of a monoid interpretation. We will equip these tree-decompositions with a well-quasi-ordering (Lemma 12), which will induce a (new) well-quasi-ordered relation on the image of our monoid interpretation. At the level of graphs, this new relation will be a restriction of the induced subgraph relation, obtained by only considering embeddings that are compatible with the tree-decompositions of the graphs.

Theorem 11 ([26]).

Let Σ be a finite alphabet, and μ:ΣM be a morphism to a finite monoid. There exists333It can be proven that this depth is at most 3|M| [5], but we will not use this fact. a computable depth d such that every word wΣ can be factorized as a tree of depth at most d built using:

  • Leaves: aΣ, that evaluate to μ(a),

  • Binary nodes: t1t2, that evaluate to ab if t1 evaluates to a and t2 evaluates to b,

  • Idempotent nodes: (t1,,tn), if n2 and t1,,tn are subtrees that evaluate to the same idempotent element eM. The idempotent nodes evaluate to their corresponding idempotent e.

We call such a tree a factorization forest of w. We write d(M) for the set of factorization forests of depth at most d for the monoid M, leaving Σ implicit.

As the name suggests, the factorization theorem provides a factorization of a word w into a tree, and its subtrees correspond to factors of w. Let us illustrate in Figure 2 a factorization of a word for the monoid used in Example 9. For the interpretation Ibp of Example 10, a similar picture is obtained by replacing all 2’s from Figure 2 by 0’s.

Figure 2: We represent a factorization forest for the word w=a9 using the monoid of the interpretation Ipaths of Example 9. Gray nodes are leaves, blue nodes are binary (or unary), and yellow nodes are idempotent nodes. For every node, we used as a label the evaluation of the subtree rooted at this node. We depicted the factors wi, w(i,j), and wj, where i=2 and j=7.

Ordering Factorisation Forests.

By definition, factorization forests are nested words over the alphabet Σ. It has been proven by Higman that if a set (X,) is well-quasi-ordered, then the set of finite words (X,) is also well-quasi-ordered [17], where uv if there exists an increasing map f:{1,,|u|}{1,,|v|} such that uivf(i) for all i. When uv we say that u is a (scattered) subword of v.

Given a finite alphabet Σ, a finite monoid M and a surjective morphism μ:ΣM, we define by induction on d a well-quasi-ordering d on the set of factorization forests of depth at most d. For d=0, the forests are all leaves, and we equip them with the equality quasi-order. For the inductive case, we let td+1t if one of the following holds:

  • tdt (hence, both are of depth at most d),

  • t=t1t2, t=t1t2, t1dt1 and t2dt2,

  • t=(t1,,tn) and t=(t1,,tn) and t1tndt1tn for the subword embedding relation over (d(M)).

Let us briefly remark that this definition of ordering using “nested subword embeddings” was used to verify priority channel systems in [16], and is a specialization of the notion of the gap-embedding relation on trees of [11]. Furthermore, the definition of the well-quasi-ordering d can be extended to the case where nodes of the forests are labelled by elements from a quasi-ordered set (Q,) as follows: whenever we compare two nodes t and t, we also check if their labels are comparable. The following lemma states that (d(M),d) is a well-quasi-order for every d, and follows immediately from Higman’s lemma [17].

Lemma 12 (Factorization Forests are Well-Quasi-Ordered).

Let M be a finite monoid, Σ be a finite alphabet, μ:ΣM be a morphism, (Q,) be a well-quasi-ordered set, and d. The set of Q-labelled factorization forests of depth at most d is a well-quasi-order for d.

Let us fix for the rest of this section a monoid interpretation I=(Σ,M,μ,P), and a constant d such that every word in wΣ has a factorization forest of depth at most d (using Theorem 11). Because factorization forests are an additional structure over a finite word w, the interpretation I can be extended to a map from factorization forests to graphs (ignoring the factorization of the underlying word). If this map I:d(M)I(Σ) was order-preserving, i.e., if tdt implies I(t)iI(t) for every t,td(M), then we could immediately conclude that I(Σ) is well-quasi-ordered: the image of a well-quasi-ordered set by an order-preserving map is well-quasi-ordered [10]. Let us argue that it is almost the case.

By construction, if tdt, then t can be obtained from t by inserting new children to idempotent nodes. In particular, it induces a map f from nodes of t to nodes of t that respects: the least common ancestor relation, the evaluation of subtrees, the type of nodes (binary node, leaf, idempotent node). Now, if t and t are factorization forests of words u and v respectively, then f can be seen as a map from positions of u to positions of v, hence a map from vertices of I(u) to vertices of I(v), which is almost an embedding.

Lemma 13.

Let uΣ be a word, tud(M) be a factorization of u, e be an idempotent element of M, x be an idempotent node that evaluates to e in tu, and tw be a factorization of a word w that evaluates to e. Let tv be obtained by inserting tw as a new child of the node x in u, and f:{1,,|u|}{1,,|v|} be the corresponding map at the level of words. Then, for every pair of positions i<j that are not in the factor of u described by x, we have: μ(ui)=μ(vf(i)), μ(uj)=μ(vf(j)), and μ(u(i,j))=μ(v(f(i),f(j))).

As a consequence, for such pairs, (i,j)𝖤(I(u)) if and only if (f(i),f(j))𝖤(I(v)).

Proof.

Let us consider two positions i<j of the word u. Assume that the insertion of tw is done between i and j (the other cases are similar). Then, one can write u=uiu(i,j)𝗅𝖾𝖿𝗍x𝗅𝖾𝖿𝗍x𝗋𝗂𝗀𝗁𝗍u(i,j)𝗋𝗂𝗀𝗁𝗍uj, where x𝗅𝖾𝖿𝗍x𝗋𝗂𝗀𝗁𝗍 is the factor of u corresponding to the idempotent node x, split where the insertion of the word w happens. By construction, v=uiu(i,j)𝗅𝖾𝖿𝗍x𝗅𝖾𝖿𝗍wx𝗋𝗂𝗀𝗁𝗍u(i,j)𝗋𝗂𝗀𝗁𝗍uj. Because ui=vf(i) and uj=vf(j), we immediately obtain the first and third desired equalities.

μ(v(f(i),f(j))) =μ(u(i,j)𝗅𝖾𝖿𝗍)μ(x𝗅𝖾𝖿𝗍)μ(w)μ(x𝗋𝗂𝗀𝗁𝗍)μ(u(i,j)𝗋𝗂𝗀𝗁𝗍)
=μ(u(i,j)𝗅𝖾𝖿𝗍)e2μ(u(i,j)𝗋𝗂𝗀𝗁𝗍) x is an idempotent node
=μ(u(i,j)𝗅𝖾𝖿𝗍)eμ(u(i,j)𝗋𝗂𝗀𝗁𝗍) e is idempotent
=μ(u(i,j)𝗅𝖾𝖿𝗍)μ(x𝗅𝖾𝖿𝗍x𝗋𝗂𝗀𝗁𝗍)μ(u(i,j)𝗋𝗂𝗀𝗁𝗍) x is an idempotent node
=μ(u(i,j))

Which concludes the proof.

Let us illustrate why considering positions that are “too close” to the inserted forest is problematic in Figure 3. This is not surprising as the class of graphs described by the interpretation Ipaths of Figure 3 is not 2-wqo, hence it is impossible for the interpretation Ipaths to transport embeddings between forests to embeddings between their images.

Figure 3: We represent factorization forest for the words u=a7 and v=a9, using the monoid of the interpretation Ipaths of Example 9, following the same coloring conventions as in Figure 2. The forest for v is obtained by inserting the purple subtree in the forest for u. The map f:{1,,7}{1,,9} derived from this embedding is such that f(x)=x if x2 and f(x)=x+2 otherwise. In this drawing, k=f(i)=2, j=3, and =f(j)=5. The pair (i,j) belongs to 𝖤(I(u)) because μ(u(i,j))=μ(ε)=0, but the pair (f(i),f(j))=(k,) does not belong to 𝖤(I(v)) because μ(v(k,))=μ(a2)=2.

3.2 Characterizing Labelled Well-Quasi-Ordered Classes

Generalizing the example of Figure 3, we can now define what are the formal obstructions inside the factorization forests for the map I to be order-preserving. We will call them bad forest paths. To that end, let us first generalize the notion of evaluation of a monoid interpretation from factorization forests to account for a context (ml,mr)M2: I(mltmr) is the graph obtained by evaluating the factorization forest t with the morphism μ and the edge selector Pml,mr, where (x,y,z)Pml,mr(mlx,y,zmr)P. This will allow us to reason about the image of a subtree t of a factorization forest t, which corresponds to an induced subgraph I(mltmr) of the generated graph I(t), where ml and mr are the monoid elements obtained by evaluating the word to the left and to the right of the selected subtree. Let us also define the type of a leaf i in a factorization tree td(M) of a word uΣ, as the triple (μ(u<i),μ(ui),μ(u>i)M3, that we denote as 𝗍𝗉t(i).

Definition 14.

Let (ti)1ik be a sequence of elements in d(M), all evaluating to the same idempotent element eM and t=(ti)1ik. The sequence is a good forest path if for all (ml,mr)M2, there exists another sequence (hi)1i of elements of d(M) evaluating to e, a tree h=(hi)1i, and an embedding f:𝖵(T)𝖵(H) where T=I(mltmr) and H=I(mlhmr), such that:

  1. 1.

    for every element x𝖵(T), originating from a leaf of some ti, that is sent to a vertex originating from hj via f, we have 𝗍𝗉ti(x)=𝗍𝗉hj(f(x)),

  2. 2.

    the vertices originating from t1 are mapped to vertices originating from h1,

  3. 3.

    the vertices originating from tk are mapped to vertices originating from h,

  4. 4.

    there exists 1i such that the image of f does not intersect the vertices originating from hi.

We say that a good forest path is split by the sequence (hi)1i and embedding f in the context (ml,mr). A bad forest path is a sequence of factorization forests (ti)1ik, that is not a good forest path.

By definition, good forest paths can be split into smaller independent parts. Beware that the split can reorder the nodes in some arbitrary fashion, and may not respect the order of the leaves in the original forest. Our first result is to prove that the existence of arbitrarily long bad forest paths is sufficient to conclude that the class I(Σ) is not labelled-wqo. In fact, the number of colors used will be related to the size of the monoid M.

Lemma 15.

Assume that bad forest paths of arbitrarily large length in d(M). Then, I(Σ) is not k-wqo, where k=|M|2×3.

Proof.

Assume that there exists arbitrarily long bad forest paths in d(M). Then, there exists a sequence of forests (ti)i such that for all i, ti=(hi,k)1jni, where (hi,k)1kni is a bad forest path, and ni is increasing with respect to i. Since these are bad forest paths, there exists (ai,bi)M2 such that ti cannot be split in the context (ai,bi).

Because the monoid M is finite, we can assume that all the forests ti evaluate to the same idempotent element eM, and that there exists (a,b)M2 such that ai=a and bi=b for all i. Extracting a subsequence also allows us to assume that for all i, ni+1 is greater than the number of leaves in ti.

Let us now consider the sequence of graphs Gi:=I(atib), that we color as follows: every vertex is colored by its leaf type in its original factorization forest hi,k, 1kni. We furthermore add three new colors, distinguishing vertices originating from the first and last factorization forest of the bad forest path hi,1 and hi,ni from the rest of the vertices. The final coloring corresponds to |M|3×3 labels. We now prove that the sequence (Gi)i is an infinite antichain with the provided coloring, which concludes the proof.

Assume towards a contradiction that there exists i<j and a labelled embedding f:𝖵(Gi)𝖵(Gj). Because of the coloring chosen on Gi and Gj, the map f respects the types of the leaves (Item 1), sends the vertices originating from hi,1 to the vertices originating from hj,1 (Item 2), and the vertices originating from hi,ni to the vertices originating from hj,nj (Item 3). Furthermore, the image of f must not intersect the vertices originating from some hj,k, 1knj (Item 4), because nj is greater than the number of leaves in ti, hence the number of vertices in Gi. We conclude that (hi,k)1kni is a good forest path, which can be split in the context (a,b), contradicting our initial assumption.

Let us now prove that the absence of arbitrarily long bad forest paths is sufficient to conclude that the class I(Σ) is wqo-well-quasi-ordered. The key ingredient of the proof is to split the good forest paths, which has the effect of adding dummy idempotent nodes separating the leaves of our factorization forests. Adding these nodes will prevent the issues encountered in generalizing Lemma 13 to nodes that are “too close” to the inserted forest, ensuring that the case of Figure 3 never happens. A first observation is that to split a good forest path, one only needs to repeat it three times.

Lemma 16.

Let (ti)1in be a good forest path. Then, for all contexts (a,b)M2, there exists an embedding f:𝖵(T)𝖵(T3) satisfying Items 1, 2, 3, and 4 of Definition 14, where T=I(a(t1,,tn)b), and T3=I(a(t1,,tn,t1,,tn,t1,,tn)b). Furthermore, the embedding f can be chosen such that for all x, f(x) is either x in the first copy of the sequence, or x in the third copy of the sequence, i.e., f is uniquely determined by a map from the vertices of T to {𝖿𝗂𝗋𝗌𝗍,𝗍𝗁𝗂𝗋𝖽}.

Proof.

Let (hi)1i be a sequence of factorization forests that splits the good forest path in the context (a,b) using an embedding g:𝖵(T)𝖵(H), where H=I(a(h1,,h)b). Let us write 1<N0< for the smallest index such that the image of g does not intersect the vertices originating from hN0. Let us also write ρ:𝖵(T){1,,} the map that assigns to every vertex x of T the index of the factorization forest from which g(x) originates in H.

Recall that every vertex of T exists in three copies in T3 (one for each repetition of the sequence of forests). We define the map f:𝖵(T)𝖵(T3) as follows, given a vertex i𝖵(T):

  • If i originates from t1, then it is mapped to its first copy in T3,

  • If i originates from tn, then it is mapped to its last copy in T3,

  • If i originates from tj, 1<j<n, and ρ(i)<N0, then it is mapped to its first copy in T3,

  • If i originates from tj, 1<j<n, and ρ(i)>N0, then it is mapped to its last copy in T3.

It is clear that f satisfies Items 1, 2, 3, and 4. What remains to be shown is that f is an embedding of T into T3, which will follow from the fact that g was itself an embedding. Given two vertices x,y𝖵(T) four cases can be considered:

  • If x and y both originate from t1. Then, (x,y)𝖤(T) if and only if (x,y)𝖤(I(a(t1,,tn)b)), if and only if (x,y)𝖤(I(at1eb)), because n2 for every idempotent nodes. Therefore, (x,y)𝖤(T) if and only if (f(x),f(y))𝖤(T3).

  • If x and y both originate from tn, the same argument applies but with the last copy of t.

  • If x and y originate from a tree tj, 1<j<n, and are sent to the same copy of t by f. Then a similar argument applies too.

  • If x and y originate from a tree tj, 1<j<n, and are sent to different copies of t by f. Then, one idempotent element e is inserted when computing the edge relation between f(x) and f(y), which may change the result (as in Figure 3). However, (x,y)𝖤(T) if and only if (g(x),g(y))𝖤(H) because g is an embedding. It follows from the same proof scheme as for Lemma 13 that (g(x),g(y))𝖤(H) if and only if (f(x),f(y))𝖤(T3), as they share the same leaf types, and are both separated by the same idempotent e.

We have proven that f is an embedding of T into T3.

We are now ready to upgrade the forests in d(M), assuming the existence of a bound N0 on the length of bad forest paths. Let us overload notations for the binary nodes so that (t1,,tk):=(t1,(t2,,tk)), (t1,t2):=t1t2, and (t1):=t1. Note that the depth of (t1,,tk) is bounded by k+dmax, where dmax is the maximal depth of the trees ti.

Let us illustrate our construction on an example. Let t=(t1,t2,t3,t4), where ti are factorization forests evaluating to some idempotent element eM, and assuming that N0=1. The first step is to group the children of t into buckets of size N0+1 (that is, 2): b1:=(t1,t2) and b2:=(t3,t4). Now, because b1 and b2 are of length N0+1, they are good forest paths, and can be split by repeating their sequence three times (Lemma 16). We obtain new buckets b1:=(t1,t2,t1,t2,t1,t2) and b2:=(t3,t4,t3,t4,t3,t4). We label the leaves of the trees in the sequences b1 and b2 as 𝗏𝖺𝗅𝗂𝖽 if they are in the image of the split, and 𝖽𝗎𝗆𝗆𝗒 otherwise. The resulting forest is obtained as follows:

t:=((t1,t2))Leftmost((t1,t2padding,(t1,t2,t3,t4)Group,t3,t4padding))((t3,t4))Rightmost. (1)

Where the padding elements have all their leaves labelled with 𝖽𝗎𝗆𝗆𝗒. The induced subgraph of I(t) obtained by removing all the vertices labelled by 𝖽𝗎𝗆𝗆𝗒 is isomorphic to I(t). Furthermore, every pair of leaves (x,y) of I(t) that are labelled by 𝗏𝖺𝗅𝗂𝖽 have the following property: either x and y belong to the same child of t (for instance, (t1,t2), or (t1,t2,t3,t4)), or they are separated by a child of t whose leaves are all labelled by 𝖽𝗎𝗆𝗆𝗒. In particular, for the tree t, inserting new children to the idempotent node does not modify the induced subgraph of the leaves labelled by 𝗏𝖺𝗅𝗂𝖽.

The following technical lemma states that this construction can be applied inductively to obtain a function τ:d(M)d(M), for some bounded d that depends on N0 and d.

Lemma 17.

Assume that there exists a bound N0 on the length of bad forest paths in d(M). Then, there exists a dd and a function τ:d(M)d(M) that maps a factorization forest t to a forest τ(t) that is labelled using {𝗏𝖺𝗅𝗂𝖽,𝖽𝗎𝗆𝗆𝗒}, and such that:

  • Every graph GI(Σ) is obtained as the restriction of a graph I(t) to its 𝗏𝖺𝗅𝗂𝖽 vertices for some tF,

  • For every t,td(M), for every function f witnessing that τ(t)dτ(t) (as labelled forests), the restriction of f to the induced subgraph of 𝗏𝖺𝗅𝗂𝖽 vertices of I(t) is an embedding into the induced subgraph of 𝗏𝖺𝗅𝗂𝖽 vertices of I(t).

From Lemma 17, we immediately conclude that the class I(Σ) is wqo-well-quasi-ordered. To simplify the proof, let us recall an equivalent characterization of well-quasi-ordered sets (X,). A sequence (xi)i is good if there exists i<j such that xixj. A set (X,) is well-quasi-ordered if and only if every infinite sequence of elements of X is good.

Corollary 18.

Assume that there exists a bound N0 on the length of bad forest paths in d(M). Then, I(Σ) is wqo-well-quasi-ordered.

Proof.

Consider an infinite sequence of graphs GiI(Σ), with labellings 𝗅𝖻𝗅Gi:𝖵(Gi)Q, where (Q,) is a well-quasi-ordered set. Let us equip the set {𝗏𝖺𝗅𝗂𝖽,𝖽𝗎𝗆𝗆𝗒} with the equality relation, and remark that Q×{𝗏𝖺𝗅𝗂𝖽,𝖽𝗎𝗆𝗆𝗒} is also well-quasi-ordered when endowed with the product ordering.

By definition, every graph Gi is the image of a word uiΣ through I. Because we chose the value of d according to Simon’s factorization theorem (Theorem 11), every word ui has a factorization forest tid(M). We can construct build the sequence hi:=τ(ti), where τ is the function provided by Lemma 17.

Because the set of (Q×{𝗏𝖺𝗅𝗂𝖽,𝖽𝗎𝗆𝗆𝗒})-labelled factorization forests of depth at most d is well-quasi-ordered for d (Lemma 12), there exists an increasing pair i<j of indices such that hidhj. This defines a function f:𝖵(I(hi))𝖵(I(hj)) that is an embedding when restricted to the induced subgraph of 𝗏𝖺𝗅𝗂𝖽 vertices, and respects the labelling Q×{𝗏𝖺𝗅𝗂𝖽,𝖽𝗎𝗆𝗆𝗒}. Because the induced subgraph of 𝗏𝖺𝗅𝗂𝖽 vertices of I(hi) is precisely I(ti)=I(ui)=Gi, and the same holds for I(hj)=Gj, this provides a function f:𝖵(Gi)𝖵(Gj) that is a labelled embedding of Gi into Gj.

The sequence (Gi)i is therefore a good sequence. and we have proven that I(Σ) is wqo-well-quasi-ordered.

We have proven that the existence of a bound on the length of bad forest paths charcterizes whether I(Σ) is wqo-well-quasi-ordered. To finalize the proof of our Theorem 1, we only need to show that the existence of such a bound is decidable. This is because the language of bad forest paths of depth at most d is a regular language, and one can decide whether a regular language is unbounded.

Lemma 19.

Let d. It is decidable whether d(M) contains bad forest paths of unbounded length.

Proof of Theorem 1 page 1.

Let I be an 𝖬𝖲𝖮-interpretation from Σ to graphs. Using Lemma 8, we can compute an 𝖬𝖲𝖮-interpretation I that only uses the formula φedge, such that I(Σ) is k-wqo if and only if I(Σ) is k-wqo for all k. From I, we can compute a monoid interpretation I′′=(Σ,M,μ,P) by standard techniques. Applying Theorem 11, we compute a number d such that every word uΣ has a factorization forest td(M). We then use Lemma 19 to decide whether d(M) has bad forest paths of unbounded length. If it does, then I′′(Σ) is not k-well-quasi-ordered, where k=|M|3×3 (Lemma 15), hence I(Σ) is not k-wqo. If it does not, then I′′(Σ) is wqo-well-quasi-ordered (Corollary 18), and therefore so is I(Σ).

In particular, we have computed a number k=|M|3×3 such that I(Σ) is k-well-quasi-ordered if and only if I(Σ) is wqo-well-quasi-ordered. margin: Back to Theorem˜1 on page 1

4 A Semigroup Approach to Well-Quasi-Orders of Relabel Functions

Classes that are labelled-well-quasi-ordered and of bounded clique-width had already been studied by [9], which leveraged the notion of k-node labelled controlled graphs, a notion introduced by [28]. The class 𝖭𝖫𝖢k of k-node labelled controlled graphs with relabel functions from {1,,k} to {1,,k} is defined inductively on k-labelled graphs as follows: one can create a single vertex graph with label i{1,,k}, one can relabel a k-labelled graph G with a function f (applying f to every vertex label of G), and one can combine two k-labelled graphs G and H using a set S{1,,k}2 by creating the disjoint union of G and H, and adding edges between vertices x and y in G and H respectively if (𝗅𝖻𝗅G(x),𝗅𝖻𝗅H(y))S.

It is a folklore result that every class of graphs 𝒞 that has bounded linear clique-width is a subset of some 𝖭𝖫𝖢k for some k and some , and [9] completely characterized the pairs (k,) such that 𝖭𝖫𝖢k is 2-well-quasi-ordered or labelled-well-quasi-ordered, showing the equivalence of these two properties [9, Theorem 3]. The theorem relies on the notion of totally ordered sets of relabel functions: given two functions f,g:{1,,k}{1,k}, they define a relation fg as Img(f)Img(fg), and study families of functions that are closed under composition, contain the identity function, and totally ordered for this relation [9, Section 2].

Theorem 20 ([9, Theorem 3, Corollary 1, Corollary 2]).

For all k, for all monoids of endofunctions of {1,,k}, the following are equivalent and decidable properties:

  1. 1.

    is totally ordered by the relation ,

  2. 2.

    The class of graphs 𝖭𝖫𝖢k is 2-well-quasi-ordered.

  3. 3.

    The class of graphs 𝖭𝖫𝖢k is labelled-well-quasi-ordered.

Let us mention that Theorem 20 does not allow us to decide whether a class of bounded linear clique-width is 2-well-quasi-ordered or not, nor does it solve the Pouzet Conjecture in that case. While it is true that every class of graphs of bounded linear clique-width is a subset of 𝖭𝖫𝖢k for some k and some [7], no analogues of Lemma 6 exist for this representation of classes of graphs: it may be that the class of graphs is 2-well-quasi-ordered while the class 𝖭𝖫𝖢k that contains it is not. This mismatch was conjectured to never happen [9, Conjecture 4], but it remains an open question. Informally, one can state that the difference between Theorem 20 and the results of the present paper is akin to the difference between analyzing properties of transition monoids and properties of regular languages: in our case, the set of accepting states (the edge selector) is fixed, which leads to more precise results, and in particular allows us to transfer our results from images of monoid interpretations to all classes of bounded linear clique-width. We will make this intuition precise in our Theorems 24 and 25, by showing that the condition of being ordered is equivalent to the fact that there are no long bad forest paths for every choice of edge selector.

4.1 Totally Ordered Monoids

In this section, we argue that the seemingly arbitrary notion of ordering between relabel functions can be naturally understood in the language of semigroup theory, namely that they are related to the classical Green Relations [5]. The connection can be made because the sets of functions studied by [9] are closed under compositions and contain the identity function, and therefore form a (finite) monoid M.

Let M be a finite monoid and mM be an element. We denote by (m)𝔍 the bilateral ideal of m, i.e., the set of elements xmy where x,y ranges in M. Similarly, we define the left ideal (m)𝔏 and the right ideal (m) respectively as the sets of elements xm and my where x,y ranges in M.

Lemma 21 (Green Formulations of Total Orderings).

Let M be a finite monoid. The following are equivalent:

  1. 1.

    M is totally ordered by the relation defined by xy if and only if (x)(xy),

  2. 2.

    M is totally ordered by the relation defined by xy if and only if (x)𝔏(yx)𝔏,

  3. 3.

    M is totally ordered by the relation defined by xy if and only if (x)𝔍=(xy)𝔍=(yx)𝔍,

  4. 4.

    The bilateral ideals of M are totally ordered for inclusion, and for all x,yM, (xy)𝔍=(x)𝔍(y)𝔍.

Whenever one of these conditions is satisfied, all the defined preorders coincide. Furthermore, if M is a monoid of relabel functions , then the above conditions are equivalent to the condition that is totally ordered by the relation .

Let us now introduce a definition for so-called “totally ordered monoids” that is based on Item 3. Examples of such totally ordered monoids are: all groups, and all band monoids, i.e., the monoids M such that for all a,bM, there exists cM such that a=cbc.

Definition 22.

A monoid M is totally ordered if for all a,bM, either (ab)𝔍=(a)𝔍 or (ab)𝔍=(b)𝔍. We write a𝔍b if (a)𝔍(b)𝔍, and a𝔍b if (a)𝔍=(b)𝔍.

Let us mention that totally ordered monoids are exactly those that only recognize languages L having the finite power property, i.e., such that languages such that there exists n such that L=inLi [18, 20]. Furthermore, totally ordered monoids enjoy the following cancellation property, that was noticed by [9], and powers their combinatorial results. We will leverage this property to prove the upcoming Lemma 25, that these cancellations prevent the existence of long bad forest paths.

Lemma 23 (Cancellation Property).

Let M be a totally ordered finite monoid. Then for all a,b,cM, if b𝔍a and abc=ab then bc=b. Similarly, if cba=ba then cb=b.

4.2 Bad Forest Paths in Totally Ordered Monoids

Let us now relate our characterization of classes of bounded linear clique-width that are labelled-well-quasi-ordered to the one introduced by [9]. This is done by showing that totally ordered monoids (i.e., the families studied in [9]), are precisely those for which the edge selector of our monoid interpretations (as defined in Section 3) does not matter: every choice leads to a wqo-well-quasi-ordered class of graphs.

Theorem 24.

Let Σ be a finite alphabet, M be a finite monoid, and μ:ΣM be a surjective morphism. Let k:=|M| be the size of M, and be the set of functions from {1,,k} to itself obtained by considering the action of M on itself. Then, the following are equivalent:

  1. 1.

    The monoid M is totally ordered,

  2. 2.

    The union PM3𝖨𝗆(IP) is 2-well-quasi-ordered, where IP:=(Σ,M,μ,P).

  3. 3.

    The union PM3𝖨𝗆(IP) is wqo-well-quasi-ordered, where IP:=(Σ,M,μ,P).

  4. 4.

    The class 𝖭𝖫𝖢k is wqo-well-quasi-ordered.

  5. 5.

    The class 𝖭𝖫𝖢k is 2-well-quasi-ordered.

Notice that Theorem 24 also proves that not considering the edge selector from our interpretations (by taking the union over all possible choices) smooths the combinatorial properties of Theorem 1: from a computable k such that the class is k-well-quasi-ordered if and only if it is wqo-well-quasi-ordered, we drop to the constant 2, answering positively to the Pouzet Conjecture in this case. The proof of Theorem 24 follows straightforwardly from the following lemma that characterizes the absence of bad forest paths in the case of totally ordered monoids, regardless of the choice of the edge selector. The lemma itself uses the cancellation properties of totally ordered monoids (Lemma 23).

Lemma 25 (Bad Forest Paths in Totally Ordered Monoids).

Let Σ be a finite alphabet, let M be a totally ordered finite monoid, PM3 be an edge selector, μ:ΣM be a surjective morphism, and I=(Σ,M,μ,P) be a monoid interpretation. Then, there are no bad forest paths in d(M), for all d.

Proof.

Let td(M) be factorization forest of a word uΣ. Assume that t evaluates to an idempotent element eM. Because μ(u)=e, and M is totally ordered, there must exist a factorization u=vaw such that μ(v)=m1, μ(a)=m2, and μ(w)=m3 and m2𝔍e. This is proven by induction on the size of u, using the fact that (xy)𝔍=(x)𝔍 or (xy)𝔍=(y)𝔍.

Let us now construct a function from the leaves of t into the leaves of t3:=(t,t,t). To that end, let us remark that the leaves of t3 are the positions of the word u3, and let us define the function f as follows:

  • For every position i that belongs to va, we f(i)=i, i.e., its corresponding position in the first copy of u in u3.

  • For every position i that belongs to w, we f(i)=i+2|u|, i.e., its corresponding position in the third copy of u in u3.

This function satisfies all requirements of the definition of a split (Definition 14), and we only have to prove that it is an embedding of I(t) into I(t3). To that end, it suffices to prove that for every decomposition u=uiu(i,j)uj, we have μ(u(i,j))=μ((uuu)(f(i),f(j))), μ(ui)=μ((uuu)f(i)), and μ(uj)=μ((uuu)f(j)).444This works regardless of the choice of P, that can only use the values of these factors to determine the presence of an edge. Let us only deal with the case of u(i,j), as the other cases are similar.

If the factor u(i,j) is contained in va or in w, then the result is immediate. Let us now assume that the range (i,j) intersects both va and w. We can write u=v1v2aw1w2, such that u(i,j)=v2aw1, v=v1v2, and w=w1w2. One can then compute (uuu)(f(i),f(j))=(v2aw)(vaw)(vaw1). Our goal is to prove that μ(v2aw1)=μ((v2aw)(vaw)(vaw1)). To that end, let us remark that μ(u)=e=e3=μ(uuu). Let us also remark that e𝔍μ(v2aw1)𝔍μ(v1), and e𝔍μ(v2aw1)𝔍μ(w2), and that one can therefore apply the cancellation property of Lemma 23 to obtain the following equalities:

μ(u) =μ(uuu)
μ(v1)μ(v2aw1)μ(w2) =μ(v1)μ((v2aw)(vaw)(vaw1))μ(w2)
μ(v2aw1)μ(w2) =μ((v2aw)(vaw)(vaw1))μ(w2) left cancellation
μ(v2aw1) =μ((v2aw)(vaw)(vaw1)) right cancellation

We have proven that there exists no bad forest path in d(M).

Proof of Theorem 24 page 24.

First, the equivalence between Item 1 and Items 4 and 5 follows from Theorem 20. Furthermore, it is clear that Item 3 implies Item 2.

To prove that Item 1 implies Item 3, let us assume that M is totally ordered. Then by Lemma 25, we know that for all PM3, there are no bad forest paths in d(M) for all d, and using Corollary 18 we conclude that 𝖨𝗆(IP) is wqo-well-quasi-ordered. As a consequence, the finite union of these classes is also wqo-well-quasi-ordered.

To prove that Item 2 implies Item 1, let us assume that that the monoid M is not totally ordered, then there exists m1,m2M such that (m1m2)𝔍(m1)𝔍 and (m1m2)𝔍(m2)𝔍. Because the morphism μ:ΣM is surjective, there exists words u,vΣ such that μ(u)=m1 and μ(v)=m2. In particular, both u and v must be non-empty words. Let us define the edge selector P:=M×(M(m1m2)𝔍)×M, and consider the sequence of words wn:=a(uv)na, for some letter aΣ and for all n. If there is an edge between the i-th and j-th letter of wn, then |ij|2(|u|+|v|). As a consequence, the distance between the first and last positions of wn in the graph IP(wn) is at least n/2(|u|+|v|), which tends to + as n increases. Furthermore, there exists a path from the first to the last position of wn, because for every factor u of u, μ(u)(m1m2)𝔍 (and similarly for v). Hence, the distance between the vertices originating from the first and last positions in wn is at most |wn|. By coloring the vertices of the graphs IP(wn) to distinguish the first and last positions of wn, and extracting a subsequence of wn such that the distance between the first and last positions in wn+1 is greater than |wn|, we obtain an infinite antichain of 2-labelled graphs. Therefore, 𝖨𝗆(IP) is not 2-well-quasi-ordered, which completes the proof. margin: Back to Theorem˜24 on page 24

5 Outlook

The automata based approach.

We believe that the results presented in this paper strongly advocate for an automata based approach to the characterization of graph classes that are labelled-well-quasi-ordered. In particular, there are two main directions that naturally emerge from our work. The first one is to investigate the case of classes of bounded clique-width, that is, moving from words to trees as input of our 𝖬𝖲𝖮-interpretations. The second direction is to investigate whether the bound k of our main Theorem 1 can be improved to 2, as it is the case in Theorem 24.

We strongly believe that our proof technique can be adapted to trees, as it is fundamentally based on two key theorems for words that are already known to be adaptable to trees: the fact that finite words are equipped with a well-quasi-ordering by Higman’s lemma [17], for which the generalization to trees is known as Kruskal’s tree embedding theorem [19]; and the fact that finite words over a finite monoids can be factored into bounded height trees, i.e. Simon’s factorization theorem [26], which has been adapted to trees by Colcombet [4].

As to the bound k in Theorem 1, we conjecture that it can be improved to 2, based on a finer analysis of bad forest paths (Lemma 15), which would bridge the gap between our analysis and the previous results from [9], and prove that the Pouzet conjecture holds for classes of graphs of bounded linear clique width.

New decompositions.

Let us also remark that the proof of Theorem 1 actually provides a tree-decomposition of graphs that is, in some sense, successor-free: adding new nodes between two nodes in the tree does not change the semantics of the tree. We believe it is worth investigating whether these decompositions could be exploited in other contexts than labelled-well-quasi-orderings.

Interpreting paths.

Finally, we conjecture that paths are the only obstruction to being labelled-well-quasi-ordered, as all (currently known by the authors) proofs that some classes are not labelled-wqo end up extracting finite paths from the graph classes we consider (see for instance Lemmas 15 and 24, [9, Theorem 3], [12]). This leads us to formulate the following Conjecture 26.

Conjecture 26.

Let 𝒞 be a class of graphs. Then, 𝒞 is not labelled-well-quasi-ordered if and only if there exists a finite set Q of labels, and an order-preserving 𝖬𝖲𝖮-interpretation from 𝖫𝖺𝖻𝖾𝗅Q(𝒞) to the class of all finite paths.

References

  • [1] Bogdan Alecu, Mamadou Moustapha Kanté, Vadim Lozin, and Viktor Zamaraev. Lettericity of graphs: an fpt algorithm and a bound on the size of obstructions, 2024. doi:10.48550/arXiv.2402.12559.
  • [2] Aistis Atminas, Robert Brignall, Vadim V. Lozin, and Juraj Stacho. Minimal classes of graphs of unbounded clique-width defined by finitely many forbidden induced subgraphs. Discrete Applied Mathematics, 295:57–69, May 2021. doi:10.1016/j.dam.2021.02.007.
  • [3] Robert Brignall, Michael Engen, and Vincent Vatter. A counterexample regarding labelled well-quasi-ordering. Graphs and Combinatorics, pages 1395–1409, 2018. doi:10.1007/s00373-018-1962-0.
  • [4] Thomas Colcombet. A combinatorial theorem for trees. In Lars Arge, Christian Cachin, Tomasz Jurdziński, and Andrzej Tarlecki, editors, Automata, Languages and Programming, pages 901–912, Berlin, Heidelberg, 2007. Springer Berlin Heidelberg. doi:10.1007/978-3-540-73420-8_77.
  • [5] Thomas Colcombet. Green’s relations and their use in automata theory. In Adrian-Horia Dediu, Shunsuke Inenaga, and Carlos Martín-Vide, editors, Language and Automata Theory and Applications, pages 1–21, Berlin, Heidelberg, 2011. Springer Berlin Heidelberg. doi:10.1007/978-3-642-21254-3_1.
  • [6] Bruno Courcelle. Monadic second-order definable graph transductions: a survey. Theoretical Computer Science, 126(1):53–75, 1994. doi:10.1016/0304-3975(94)90268-2.
  • [7] Bruno Courcelle, Joost Engelfriet, and Grzegorz Rozenberg. Handle-rewriting hypergraph grammars. Journal of Computer and System Sciences, 46(2):218–270, 1993. doi:10.1016/0022-0000(93)90004-G.
  • [8] Konrad K. Dabrowski, Vadim V. Lozin, and Daniël Paulusma. Well-quasi-ordering versus clique-width: New results on bigenic classes. Journal of Combinatorial Theory, Series B, 130:1–18, 2018. doi:10.1016/j.jctb.2017.09.012.
  • [9] Jean Daligault, Michael Rao, and Stéphan Thomassé. Well-quasi-order of relabel functions. Order, 27(3):301–315, September 2010. doi:10.1007/s11083-010-9174-0.
  • [10] Stéphane Demeri, Alain Finkel, Jean Goubault-Larrecq, Sylvain Schmitz, and Philippe Schnoebelen. Algorithmic aspects of wqo theory. Course notes for the MPRI Master’s program, Paris, 2012. URL: https://cel.archives-ouvertes.fr/cel-00727025.
  • [11] Nachum Derhowitz and Iddo Tzameret. Gap embedding for well-quasi-orderings1. Electronic Notes in Theoretical Computer Science, 84:80–90, 2003. WoLLIC’2003, 10th Workshop on Logic, Language, Information and Computation. doi:10.1016/S1571-0661(04)80846-6.
  • [12] Guoli Ding. Subgraphs and well-quasi-ordering. Journal of Graph Theory, 16(5):489–502, November 1992. doi:10.1002/jgt.3190160509.
  • [13] Samuel Eilenberg. Automata, Languages, and Machines, volume A of Automata, Languages, and Machines. Academic Press, 1974. doi:10.5555/540244.
  • [14] Robert Ganian, Petr Hliněný, Jaroslav Nešetřil, Jan Obdržálek, and Patrice Ossona de Mendez. Shrub-depth: Capturing height of dense graphs. Logical Methods in Computer Science, Volume 15, Issue 1, January 2019. doi:10.23638/LMCS-15(1:7)2019.
  • [15] Robert Ganian, Petr Hliněný, Jaroslav Nešetřil, Jan Obdržálek, Patrice Ossona de Mendez, and Reshma Ramadurai. When Trees Grow Low: Shrubs and Fast MSO1, pages 419–430. Springer Berlin Heidelberg, 2012. doi:10.1007/978-3-642-32589-2_38.
  • [16] Christoph Haase, Sylvain Schmitz, and Philippe Schnoebelen. The power of priority channel systems. In Pedro R. D’Argenio and Hernán Melgratti, editors, CONCUR 2013 – Concurrency Theory, pages 319–333, Berlin, Heidelberg, 2013. Springer Berlin Heidelberg. doi:10.1007/978-3-642-40184-8_23.
  • [17] Graham Higman. Ordering by divisibility in abstract algebras. Proceedings of the London Mathematical Society, 3:326–336, 1952. doi:10.1112/plms/s3-2.1.326.
  • [18] Daniel Kirsten. The finite power problem revisited. Information Processing Letters, 84(6):291–294, 2002. doi:10.1016/S0020-0190(02)00319-8.
  • [19] Joseph B. Kruskal. The theory of well-quasi-ordering: A frequently discovered concept. Journal of Combinatorial Theory, Series A, 13:297–305, 1972. doi:10.1016/0097-3165(72)90063-5.
  • [20] Michal Kunc. Regular solutions of language inequalities and well quasi-orders. Theoretical Computer Science, 348(2):277–293, 2005. Automata, Languages and Programming: Algorithms and Complexity (ICALP-A 2004). doi:10.1016/j.tcs.2005.09.018.
  • [21] Vadim V. Lozin, Igor Razgon, and Viktor Zamaraev. Well-quasi-ordering does not imply bounded clique-width. In Graph-Theoretic Concepts in Computer Science, pages 351–359, Berlin, Heidelberg, 2016. Springer Berlin Heidelberg. doi:10.1007/978-3-662-53174-7_25.
  • [22] Crispin St. John Alvah Nash-Williams. On well-quasi-ordering transfinite sequences. Mathematical Proceedings of the Cambridge Philosophical Society, 61:33–39, 1965. doi:10.1017/S0305004100038603.
  • [23] Marko Petkovšek. Letter graphs and well-quasi-order by induced subgraphs. Discrete Mathematics, 244(1–3):375–388, February 2002. doi:10.1016/s0012-365x(01)00094-2.
  • [24] Maurice Pouzet. Un bel ordre d’abritement et ses rapports avec les bornes d’une multirelation. CR Acad. Sci. Paris Sér. AB, 274:A1677–A1680, 1972.
  • [25] Neil Robertson and Paul D. Seymour. Graph minors. xx. wagner’s conjecture. Journal of Combinatorial Theory, Series B, 92(2):325–357, 2004. Special Issue Dedicated to Professor W.T. Tutte. doi:10.1016/j.jctb.2004.08.001.
  • [26] Imre Simon. Factorization forests of finite height. Theoretical Computer Science, 72(1):65–94, 1990. doi:10.1016/0304-3975(90)90047-L.
  • [27] Wolfgang Thomas. Languages, automata, and logic. In Grzegorz Rozenberg and Arto Salomaa, editors, Handbook of formal languages, pages 389–455. Springer, 1997. doi:10.1007/978-3-642-59136-5.
  • [28] Egon Wanke. k-nlc graphs and polynomial algorithms. Discrete Applied Mathematics, 54(2):251–266, 1994. doi:10.1016/0166-218X(94)90026-4.