Abstract 1 Introduction 2 Comparing the three proofs of the finitary HPT 3 Preliminaries 4 Characterization of + definability via the cops-and-robber game 5 Generalized Cai-Fürer-Immerman construction 6 Proof of the equi-rank finitary HPT 7 Open questions References

Equi-Rank Homomorphism Preservation Theorem on Finite Structures

Benjamin Rossman ORCID Duke University, Durham, NC, USA
Abstract

The Homomorphism Preservation Theorem (HPT) of classical model theory states that a first-order sentence is preserved under homomorphisms if, and only if, it is equivalent to an existential-positive sentence. This theorem remains valid when restricted to finite structures, as demonstrated by the author in [33, 34] via distinct model-theoretic and circuit-complexity based proofs. In this paper, we present a third (and significantly simpler) proof of the finitary HPT based on a generalized Cai-Fürer-Immerman construction. This method establishes a tight correspondence between syntactic parameters of a homomorphism-preserved sentence (quantifier rank, variable width, alternation height) and structural parameters of its minimal models (tree-width, tree-depth, decomposition height). Consequently, we prove a conjectured “equi-rank” version of the finitary HPT. In contrast, previous versions of the finitary HPT possess additional properties, but incur blow-ups in the quantifier rank of the equivalent existential-positive sentence.

Keywords and phrases:
finite model theory, preservation theorems, quantifier rank
Copyright and License:
[Uncaptioned image] © Benjamin Rossman; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Finite Model Theory
Acknowledgements:
I thank Deepanshu Kush, William He and Ken-ichi Kawarabayashi for stimulating conversions that prompted me to revisit the topic of this paper. Anuj Dawar provided helpful comments on a preliminary draft of this paper. I am grateful to the anonymous referees for numerous suggestions that improved the clarity of this paper. This paper was partially written during a visit to the National Institute of Informations in Tokyo, Japan.
Editors:
Jörg Endrullis and Sylvain Schmitz

1 Introduction

A first-order sentence φ is said to be preserved under homomorphisms if, for any model 𝔄φ and any structure 𝔅 such that there exists a homomorphism 𝔄𝔅, it holds that 𝔅φ. One class of first-order sentences that are always preserved under homomorphisms are the existential-positive sentences, which are built from atomic formulas (of the form x1=x2 and R(x1,,xr) where R is an r-ary relation symbol) via conjunction φ1φ2, disjunction φ1φ2, and existential quantification xφ (that is, without negation ¬φ or universal quantification xφ).111A sentence is a formula with no free variables. Although the definition of preserved under homomorphisms extends to formulas with free variables, we speak of sentences for simplicity sake. We further restrict attention to relational languages (without functions or constant symbols), even though most definitions and results in this paper extend to general first-order languages.

The Homomorphism Preservation Theorem (HPT) of classical model theory, attributed to Łoś, Lyndon and Tarski [30, 39, 31], states that existential-positive sentences are – up to logical equivalence – the only first-order sentences that are preserved under homomorphisms.222Here the semantic notions of logical equivalence and preserved under homomorphisms are with respect to all (finite or infinite) structures.

Theorem 1.1 (HPT).

A first-order sentence is preserved under homomorphisms if, and only if, it is equivalent to an existential-positive sentence.

The closely related Łoś-Tarski and Lyndon Preservation Theorems state that a first-order sentence that is preserved under embeddings (respectively: surjective homomorphisms) if, and only if, it is equivalent to an existential sentence (respectively: positive sentence). In each of these classical preservation theorems, the “if” direction follows directly from the semantics of first-order logic, while the “only if” direction was originally proved non-constructively using the Compactness Theorem.

A line of work in finite model theory initiated by Gurevich [20] studies the question of which theorems of classical model theory remain valid, and which become false, when restricted to finite structures. For example, the Compactness Theorem is easily seen to be false on finite structures. Counterexamples in [38, 20] witness the failure on finite structure of the Łoś-Tarski and Lyndon Theorems (on preservation under embeddings and surjective homomorphisms). In contrast, previous work of the author [33, Theorem 1.7] and [34, Theorem 6] showed that the classical HPT remains valid when restricted to finite structures.

Theorem 1.2 (HPT on finite structures).

Every first-order sentence that is preserved under homomorphisms on finite structures is equivalent on finite structures to an existential-positive sentence.

Articles [33, 34] provide two entirely different proofs of Theorem 1.2, which we discuss in §2.1 and §2.2. Unfortunately, both proofs incur large blow-ups from the quantifier rank r of a homomorphism-preserved first-order sentence φ to the quantifier rank r+ (r) of the equivalent existential-positive sentence φ+. The blow-up in [33] is non-elementary: r+ is a tower-of-exponentials of height r. (With respect to the length of φ and φ+, a non-elementary blow-up is necessary [33, Theorem 6.1].) The method of [34] improved the quantifier rank blow-up to merely polynomial: r+=O(r3logr).

At the same time, an additional result of [33, Theorem 1.6] showed the classical HPT (Theorem 1.1) requires no blow-up at all in quantifier rank:

Theorem 1.3 (Equi-rank HPT).

Every first-order sentence that is preserved under homomorphisms (on all structures) is equivalent to an existential-positive sentence with the same quantifier rank.

The proof of Theorem 1.3 avoids the non-constructive Compactness Theorem, using instead a method of +-saturated co-retracts (see the discussion in §2.1). It was left as an open question whether Theorem 1.3 remains valid on finite structures. The main result of the present paper answers this question in the affirmative, finally unifying the finitary and equi-rank versions of the HPT.

Theorem 1.4 (Equi-rank HPT on finite structures).

Every first-order sentence that is preserved under homomorphisms on finite structures is equivalent on finite structures to an existential-positive sentence with the same quantifier rank, variable width, and alternation height.

Theorem 1.4 is simultaneously tight with respect to three different parameters: quantifier rank, variable width, and alternation height (see §3.2 for definitions). The surprisingly simple proof utilizes a generalized Cai-Fürer-Immerman construction [6] on finite relational structures (see §2.3). In particular, we consider two CFI structures even and odd over a finite core (i.e., structure such that every homomorphism is an isomorphism). We show that

even and odd↛odd

where denotes the existence of a homomorphism (Lemma 5.5). Theorem 1.4 then follows from a characterization of existential-positive definability in terms of the minimal cores of a homomorphism-closed class of finite structures (Lemma 4.6).

Related work

Our proof of Theorem 1.4 combines a few standard techniques that appear in many prior works. Variants of the Cai-Fürer-Immerman construction have found numerous applications similar in nature to our main result. In particular, Fürer [16] and Neuen [32] considered very similar generalization of CFI structures as given in Definition 5.4, only with different applications in mind.

The correspondence between quantifier rank/variable width, tree-width/tree-depth, and the # of moves/# of cops parameters of the cops-and-robber game, has been developed in several works including [1, 3, 12, 15, 19]. Variants of Lemmas 4.6 can be found in many of these papers.

Anuj Dawar (personal communication) independently identified the “core” homomorphism property of CFI structures (Lemma 5.5) in the context of a different problem. It is perhaps surprising that the application of the CFI construction to prove the Homomorphism Preservation Theorem remained unrecognized for such an extended period.

Versions of the HPT relativized to various classes of structures have been investigated in [4, 5, 10, 21] (see also [11], which corrects some claims in articles [5, 10]). Versions of the HPT for different fragments, as well as extensions, of first-order logic have been studied in [2, 7, 14].

2 Comparing the three proofs of the finitary HPT

In this section, we discuss both previous proofs of the Homomorphism Preservation Theorem on finite structures (Theorem 1.2) from articles [33, 34]. We then give a brief overview of our new proof using the Cai-Fürer-Immerman construction.

2.1 First proof via +-indistinguishable co-retracts

The original proof of the finitary HPT actually establishes the following stronger result.

Theorem 2.1 ([33, Theorem 5.15]).

For every finite relational signature σ and integer r0, there exists an integer r+ (r) and an operation

𝔄𝔄^:{ σ-structures}{ σ-structures}

with the following properties:

  1. (I)

    𝔄^ is a co-retract of 𝔄 (i.e., 𝔄 is a substructure of 𝔄^ and there is a homomorphism 𝔄^𝔄 that fixes each element of 𝔄).

  2. (II)

    Whenever 𝔄 is finite, so is 𝔄^.

  3. (III)

    Whenever 𝔄 and 𝔅 satisfy the same existential-positive sentences of quantifier rank r+, their co-retracts 𝔄^ and 𝔅^ satisfy the same first-order sentences of quantifier rank r.

The finitary HPT follows straightforwardly from Theorem 2.1, although inheriting the same blow-up in quantifier rank from r to r+ (a tower-of-exponentials of height r). It is unknown whether every operation 𝔄𝔄^ satisfying properties (I), (II) and (III) requires a large blow-up from r to r+; the method of the present paper sheds no light on this question.

An additional result in [33, Theorem 4.11] shows that the optimal value r+=r can be achieved by sacrificing property (II), that is, allowing 𝔄^ to be infinite even when 𝔄 is finite. This yields the equi-rank version of the classical HPT (Theorem 1.3). In this version of the hat operation, 𝔄^ is an (infinite) +-saturated co-retract of 𝔄. The manner of “finitizing” this operation in [33] is responsible for the tower-of-exponentials blow-up in Theorem 2.1.

 Remark 2.2.

The method of +-saturated co-retracts was recently generalized by Abramsky and Reggio [2], though the lens of game comonads and arboreal categories. They give general conditions leading to equi-resource homomorphism preservation theorems, both with respect to fragments of first-order logic and relativized to classes of structures satisfying certain axioms. Our proof of Theorem 1.4 does not follow this template, but it would be interesting to know if a +-saturation proof is possible in the finite setting.

2.2 Second proof via 𝗔𝗖𝟬 lower bounds

Subsequent work of the author [34] provides an entirely different proof of the finitary HPT, with a merely polynomial blow-up in quantifier rank, based on lower bounds in circuit complexity.

Theorem 2.3 (Main result of [34]).

Let 𝒞 be a homomorphism-closed class of finite structures.

  1. (1)

    If membership in 𝒞 is decidable on structures of size n by non-uniform 𝖠𝖢𝟢 formulas of size O(nr), then 𝒞 is definable on finite structures (of all sizes) by a single existential-positive sentence of quantifier rank O(r3logr).

  2. (2)

    If membership in 𝒞 is decidable on structures of size n by non-uniform 𝖠𝖢𝟢 circuits of size O(ns), then 𝒞 is definable on finite structures (of all sizes) by a single existential-positive sentence of variable width O(slogs).

The proof of Theorem 2.3 relies on three different lower bounds in circuit complexity [27, 28, 35], in addition to a result in graph minor theory on the excluded-minor approximation of tree-depth [9, 25].333Part (1) of Theorem 2.3 is stated in [34] with a weaker polynomial bound O(r5logr). The improvement to O(r3logr) relies on a subsequent result of Czerwinski, Nadara, and Pilipczuk [9]. Part (2) of Theorem 2.3 is not explicitly stated in [34], but follows by a similar argument to part (1) using the 𝖠𝖢𝟢 circuit lower bound of Li, Razborov and the author [28]. We remark that Theorem 2.3 has an equivalent descriptive complexity formulation via the well-known correspondence [13, 24] between the non-uniform complexity class 𝖠𝖢𝟢 and the logic 𝖥𝖮[𝖠𝗋𝖻] (first-order logic with arbitrary background predicates).

Corollary 2.4.

Let 𝒞 be a homomorphism-closed class of finite structures.

  1. (1)

    If 𝒞 is definable on finite structures by an 𝖥𝖮[𝖠𝗋𝖻] sentence of quantifier rank r, then 𝒞 is definable on finite structures by an existential-positive sentence of quantifier rank O(r3logr).

  2. (2)

    If 𝒞 is definable on finite structures by an 𝖥𝖮[𝖠𝗋𝖻] sentence of quantifier width s, then 𝒞 is definable on finite structures by an existential-positive sentence of variable width O(slogs).

Corollary 2.4 strengthens Theorem 1.2 by expanding the hypothesis from first-order sentences to the more expressive class of 𝖥𝖮[𝖠𝗋𝖻] sentences, while the equivalent existential-positive sentences remain first-order (without background predicates). The results of the present paper do not directly improve Corollary 2.4, but do show that improvements would follow from strong enough lower bounds on the 𝖠𝖢𝟢 complexity of distinguishing “even” and “odd” CFI structures over any base graph. Recent size-depth tradeoffs for 𝖠𝖢𝟢-Frege refutations of Tseitin formulas [22, 17] might be relevant to this question.

2.3 New proof via the Cai-Fürer-Immerman construction

An influential article of Cai, Fürer and Immerman [6] introduced a construction of non-isomorphic simple graphs of order n that are indistinguishable by o(n)-variable counting logic (equivalently, by the o(n)-dimensional Weisfeiler-Leman algorithm). The general construction associates any 3-regular graph G with a pair of non-isomorphic graphs Geven and Godd, which are hard to distinguish when the “base graph” G is an expander. The CFI construction is closely related to 3-XOR-CNF formulas associated with G, studied by Tseitin [40] in the setting of proof complexity. The Tseitin formulas are a system of linear equations modulo 2, with a variable Xe for each edge and a constraint for each vertex v (either XeXfXg=0 or XeXfXg=1, where e,f,g are the edges incident to v). This system is satisfiable if, and only if, the number of inhomogeneous constraints is even. The CFI graphs Geven and Godd encode the two different (satisfiable and unsatisfiable) Tseitin formulas over G, corresponding to the 0-homomology classes of G over 2.

The CFI construction generalizes to arbitrary (non-3-regular, non-connected) simple graphs G, as well as to abelian coordinate groups other than 2. Generalizations of Tseitin formulas and the CFI construction have found numerous applications in finite model theory and proof complexity. In this paper we consider a natural version of the CFI construction on finite structures with a fixed relational signature (Definition 5.4). For any finite “base” structure 𝔄, this construction produces a pair of non-isomorphic finite structures 𝔄even and 𝔄odd. Like the original CFI graphs, these structures are indistinguishable by first-order sentences whose quantifier rank / number of variables is less than the tree-depth / tree-width of 𝔄 (Lemma 5.6).

Unlike some versions of the CFI construction (such as the original CFI graphs [6]), structures 𝔄even and 𝔄odd project homomorphically to the base structure 𝔄. In the key special case of a finite core (where every homomorphism is an isomorphism), we show that admits a homomorphism to even but not to odd (Lemma 5.5). Our main result, the equi-rank finitary HPT (Theorem 1.4), then follows by essentially well-known arguments.

3 Preliminaries

This section includes all relevant definitions pertaining to structures, homomorphisms, and first-order logic. See [29, 24] for additional background on finite model theory.

3.1 Structures and homomorphisms

Definition 3.1.

A (relational) signature is a set σ of relation symbols, each with an associated positive integer “arity”. A σ-structure 𝔄 consists a set A (called the universe of 𝔄) together with an interpretation R𝔄At for each t-ary relation symbol R in σ. A structure is finite if its signature and universe are both finite.

Definition 3.2 (Homomorphism and isomorphism).
  • For structures 𝔄,𝔅 with the same signature, a homomorphism h:𝔄𝔅 is a function from the universe of 𝔄 to the universe of 𝔅, which maps each tuple in each relation of 𝔄 to a tuple in the corresponding relation of 𝔅. An isomorphism is a homomorphism h which is a bijection and whose inverse h1 is a homomorphism.

  • Notation 𝔄𝔅 expresses that there exists a homomorphism from 𝔄 to 𝔅. We say that 𝔄 and 𝔅 are homomorphically equivalent, denoted 𝔄𝔅, if 𝔄𝔅 and 𝔅𝔄.

  • Notation 𝔄 denotes the class of all (finite or infinite) structures 𝔅 such that 𝔄𝔅.

Definition 3.3 (The core of a finite structure).
  • A structure is a core if every homomorphism h: is an isomorphism.

  • Every finite structure 𝔄 is homomorphically equivalent to a unique core (up to isomorphism), which we call “the” core of 𝔄 and denote by 𝖢𝗈𝗋𝖾(𝔄). 𝖢𝗈𝗋𝖾(𝔄) is isomorphic to an induced substructure of 𝔄, namely any minimal retract of 𝔄 [23].

Definition 3.4 (Gaifman graph).

The Gaifman graph of a structure 𝔄, denoted 𝖦𝖺𝗂𝖿(𝔄), is the simple graph with vertex set V(𝖦𝖺𝗂𝖿(𝔄))=A (the universe of 𝔄) and undirected edge set

E(𝖦𝖺𝗂𝖿(𝔄))={{v,w}(A2):v,w occur together in any tuple of any relation of 𝔄}.

3.2 First-order logic

Definitions in this subsection are with respect to an arbitrary fixed relational signature (i.e., a finite set of relation symbols, each with an associated positive integer “arity”).

Definition 3.5 (First-order formulas).

Formulas of first-order logic (denoted by φ,ψ,θ) are constructed from

  • atomic formulas x=y and Rx1xt (for a t-ary relation symbol R) via

  • Boolean connectives φψ and φψ and ¬φ and

  • quantifiers xφ and xφ.

(Here x,y,x1,,xt are arbitrary variable symbols.)

A formula is said to be:

  • a sentence if it contains no free variables (i.e., if every occurrence of every variable symbol is bounded by a quantifier),

  • positive if it contains no negations (¬),

  • existential-positive if it contains no negations (¬) or universal quantifiers (),

  • primitive-positive if it contains no negations (¬), universal quantifiers () or disjunction ().

In other words, primitive-positive formulas are constructed from atomic formulas via existential quantification () and conjunction () only; existential-positive formulas additionally allow disjunction ().

Definition 3.6 (Quantifier rank and variable width).

Two important parameters of first-order formulas are:

  • quantifier rank, defined as the maximum nesting depth of quantifiers, and

  • variable width, defined as the maximum number of free variables in any subformula.

Definition 3.7 (Alternation height).

A first-order formula φ has alternation height 0 iff it is quantifier-free (i.e., a Boolean combination of atomic formulas). For d1, φ has alternation height at most d iff it is a Boolean combination of finitely many first-order formulas ψ1,,ψm, each of the form x1xkθ or x1xkθ some k0 and θ with alternation height at most d1.

Example 3.8.

To illustrate various tradeoffs in parameters, we present four different primitive-positive sentences, all which define the class P9 where P9 is the directed path graph of order 9.

  1.  (a)

    quantifier rank 5, variable width 3, alternation height 3

    x0x8x4(x2(x1(Ex0x1Ex1x2)x3(Ex2x3Ex3x4))x6(x5(Ex4x5Ex5x6)x7(Ex6x7Ex7x8)))
  2.  (b)

    quantifier rank 4 (the minimum possible), variable width 3, alternation height 4

    x4(x2(x1(x0Ex0x1Ex1x2)x3(Ex2x3Ex3x4))x6(x5(Ex4x5Ex5x6)x7(Ex6x7x8Ex7x8)))
  3.  (c)

    quantifier rank 9, variable width 2 (the minimum possible), alternation height 8

    x0x1(Ex0x1x2(Ex1x2x3(Ex2x3x7(Ex6x7x8Ex7x8))))
  4.  (d)

    quantifier rank 9, variable width 9, alternation height 1 (the minimum possible)

    x0x1x2x7x8(Ex0x1(Ex1x2(Ex2x3(Ex6x7Ex7x8))))

4 Characterization of + definability via the cops-and-robber game

The main result of this section (Lemma 4.6) provides a useful characterization of the classes of structures that are definable by existential-positive sentences with a given quantifier rank r, variable width s, and alternation height d. This characterization uses the well-known cops-and-robber game, which is also a means of defining the graph parameters tree-width and tree-depth.

4.1 Cops-and-robber game

Let G=(V,E), E(V2), be a finite simple graph. Tree-width and tree-depth, denoted by 𝐭𝐰(G) and 𝐭𝐝(G), are well-studied graphs parameters that are usually defined in terms of tree-like decompositions of G. Below, we present an alternative definition of these parameters in terms of a two-player pursuit-evasion game. This “cops-and-robbers” characterization of 𝐭𝐰(G) and 𝐭𝐝(G) is better suited to our purposes in this paper.

Definition 4.1 (Cops-and-robber game).

For any d0, the height-d cops-and-robber game on a graph G is a pursuit-evasion between two players: a team of cops and a sole robber (each with complete knowledge of the other’s moves). The game is played in a sequence of d rounds as follows:

  • Initially, the robber positions himself on any vertex of his choice.

  • In round 1 of the game, any number of cops take up positions on their choice of vertices. The robber then moves to any vertex in the same connected component (bypassing any newly positioned cops that stand in the way). The game ends immediately only if the robber moves to a position guarded by a cop.

  • In round 2 of the game, any subset of the assigned cops remain on their stationed vertices, and any number of (reassigned or additional) cops take up new positions on their choice of vertices. The robber then moves to any vertex in that reachable via a path that avoids the stationary cops (but may bypass any newly (re)positioned cops).

  • The game proceeds in this manner for up to d rounds, ending immediately if the robber ever occupies the same vertex as a cop at the end of a round. This situation is a win for the cop team; otherwise the robber wins if not caught after d rounds.

The team of cops clearly have a winning strategy for any d1: in the first round, simply occupy all vertices in the connected component of the robber. The two questions that concern us are:

  • How many distinct cops are required to catch the robber?

  • How many distinct cop “moves” (i.e., instances of positioning a cop on a vertex) are required to catch the robber?

When the “height” of the game (i.e., number of rounds) is unbounded, the answers to these questions respectively characterize the tree-width and tree-depth of G [37, 18].

Taking into account the height d, we get two hierarchies of parameters

𝐭𝐰1(G)𝐭𝐰2(G) and 𝐭𝐝1(G)𝐭𝐝2(G)

defined as follows:

  • 𝐭𝐰d(G) is the maximum s0 such that the robber has a winning strategy in the height-d -move s-cops-and-robber game on G.

  • 𝐭𝐝d(G) is the minimum r1 such that the cops have a winning strategy in height-d r-move -cops-and-robber game.

Observe that

𝐭𝐰1(G)+1=𝐭𝐝1(G)=maximum # of vertices in a connected component of G.

Also note that 𝐭𝐰d(G)+1𝐭𝐝d(G) for all d. Finally, note that 𝐭𝐰|V(G)|(G)=𝐭𝐰(G) and 𝐭𝐝|V(G)|(G)=𝐭𝐝(G).

Example 4.2.

With respect to the path graph Pk of order k, it is well-known that 𝐭𝐰(Pk)=2 and 𝐭𝐝(Pk)=log2(k)+O(1). Moreover it is not hard to show that

𝐭𝐰d(Pk)=k1/d+O(1) and 𝐭𝐝d(Pk)=(d+o(d))k1/d

for all dlog2(k).

In the remainder of this paper, we are not interested in 𝐭𝐰d(G) and 𝐭𝐝d(G) per se, but rather in tradeoffs among all three parameters in the cops-and-robbers game: the numbers of cops, cop moves, and height. That is, for any triple of parameters r,s,d, which side has a winning strategy in the height-d r-move s-cops-and-robber game on G?

 Remark 4.3.

Tradeoffs between r and s (when d is unbounded) were recently studied in [15], and monotonicity of an optimal cops strategy in the r-move s-cops-and-robber game was established in [3]. In both of those articles, the r-move s-cops-and-robber game is called the “r-round s-cops-and-robber game”. Optimizing s with respect to fixed r characterizes a parameter called the depth-r tree-width of G.

In the present article, we use terminology “height-d” instead of “d-round” to avoid confusion with the previous terminology. We propose names height-d tree-width and height-d tree-depth for parameters 𝐭𝐰d(G) and 𝐭𝐝d(G).

4.2 + definability of homomorphism-closed classes

We shall now review a well-known characterization of the parameters (quantifier rank and variable width) required to define the class of structures by a primitive-positive sentence, for finite core . In fact, we slightly extend this characterization by including alternation height as a third parameter.

Lemma 4.4.

For any finite core and integer d,r,s0, the following statements are equivalent:

  1. (i)

    The class is definable by a primitive-positive sentence with quantifier rank r, variable width s, and alternation height d.

  2. (ii)

    The cops have a winning strategy in the height-d r-move s-cops-and-robber game on 𝖦𝖺𝗂𝖿().

Results very similar to Lemma 4.4, which consider only one or two of the parameters d,r,s, have appeared before in the literature [1, 3, 12, 15, 18, 19, 37]. Lemma 4.4 may be used to characterize the parameters of existential-positive sentences that define a given homomorphism-closed class of finite structures.

Definition 4.5 (Minimal cores of a homomorphism-closed class of finite structures).
  • A class of finite structures 𝒞 is homomorphism-closed if

    (𝔄𝒞 and 𝔄𝔅)𝔅𝒞

    for all finite structures 𝔄 and 𝔅.

  • A minimal core in 𝒞 is a core 𝒞 such that

    (𝔄𝒞 and 𝔄)𝔄

    for all finite structures 𝔄.

Note that 𝒞 is determined by its set of minimal cores: a finite structures 𝔄 belongs to 𝒞 if, and only if, there is a homomorphism to 𝔄 from at least one minimal core in 𝒞.

Lemma 4.6.

Let 𝒞 be a homomorphism-closed class of finite structures. For any d,r,s0, the following statements are equivalent:

  1. (i)

    𝒞 is definable on finite structures by an existential-positive sentence with quantifier rank r, variable width s, and alternation height d.

  2. (ii)

    The cops have a winning strategy in the height-d r-move s-cops-and-robber game on 𝖦𝖺𝗂𝖿(), for every minimal core in 𝒞.

Lemma 4.6 follows from Lemma 4.4 by standard arguments (see [34, Proposition 2.16]). The only minor subtlety in the proof is a reliance on the fact that, for any given finite relational signature and r0, there are only a finite number of non-isomorphic cores with tree-depth r [23]. This is required so that the disjunction of primitive-positive sentences defining , over the non-isomorphic minimal cores in 𝒞, constitutes a well-defined (finite length) existential-positive sentence.

5 Generalized Cai-Fürer-Immerman construction

In this section we present the generalized Cai-Fürer-Immerman construction discussed in §2.3 and establish its key properties given by Lemmas 5.5 and 5.6.

Notation 5.1.

Let 2 denote the group {0,1} with addition modulo 2.

We will make use of the following basic lemma of graph homology, stated here over the coefficient group 2.

Lemma 5.2.

For any graph G=(V,E), E(V2), and function ξ:V2, the following statements are equivalent:

  1. (i)

    uUξ(u)=0 for every connected component UV.

  2. (ii)

    ξ is a “1-boundary”, that is, there exists a function ε:E2 such that for every vV, we have eE:veε(e)=ξ(v).

Notation 5.3.

Let 𝔄 be a structure with Gaifman graph G=(A,E). For an element vA, let Ev, Nv and Nv respectively denote the incident-edge set, neighbor set and 1-neighborhood of v in G. That is,

Ev :={{v,w}:wA such that {v,w} is an edge in G},
Nv :={wA:{v,w} is an edge in G},
Nv :=Nv{v}.
Definition 5.4 (Generalized Cai-Fürer-Immerman structures 𝔄ξ).

For any relational structure 𝔄 with universe A and any function ξ:A2, we define a structure 𝔄ξ (with the same signature) as follows:

  • 𝔄ξ has universe

    Aξ :={v,α:vA and α:Nv2 such that α(v)=0 and wNvα(w)=ξ(v)}.

    (Here for readability sake we use a distinctive notation v,α for the ordered pair (v,α).)

  • For each t-ary relation RAt in 𝔄, the corresponding relation RξAξt in 𝔄ξ is defined by

    Rξ :={(v1,α1,,vt,αt)Aξt:(v1,,vt)R and αi(vj)=αj(vi) for all i,j{1,,t}}.

Note that the projection v,αv is a homomorphism 𝔄ξ𝔄. Also note that this homomorphism need not be surjective (for instance, if vA is an isolated vertex in 𝖦𝖺𝗂𝖿(𝔄) and ξ(v)=1).

Our first key lemma gives a necessary and sufficient condition for the existence of a homomorphism in the opposite direction in the special case that 𝔄 is a core.

Lemma 5.5.

Let be a finite core with Gaifman graph G=(C,E), and let ξ:C2. There exists a homomorphism ξ if, and only if, uUξ(u)=0 for each connected component U of G.

Proof of Lemma 5.5.

We first prove the “if” direction. Assume that uUξ(u)=0 for each connected component U of . By Lemma 5.2, there exists a function ε:E2 such that for all vC, we have

eEvε(e)=ξ(v).

For each vC, define αv:Nv2 by αv(v):=0 and αv(w):=ε({v,w}) for all wNv{v}. Note that v,αvCξ. The function h:vv,αv is the desired homomorphism ξ.

We now prove the “only if” direction. Assume that h is an arbitrary homomorphism ξ. Consider the projection homomorphism v,αv:ξ that maps v,α to v, and let f be the composition

f=(v,αv)h:.

Since is a core, f is an isomorphism. In particular, note that f restricts to a bijection from Nv to Nf(v) for each vC.

For each vC, we have h(v)=f(v),αv for some αv:Nf(v)2. Since h(v)Cξ, we have

αv(f(v))=0 and xNf(v)αv(x)=ξ(f(v)).

We now define α~v:Nv2 by

α~v(w):=αv(f(w)).

Using the fact that f maps Nv bijectively to Nf(v), we have

α~v(v)=αv(f(v))=0 and wNvα~v(w)=wNvαv(f(w))=xNf(v)αv(x)=ξ(f(v)).

Therefore, we have v,α~vCξ~ where ξ~:C2 is the function ξ~(v):=ξ(f(v)).

Let us next consider the function h~:CCξ~ defined by

h~(v):=v,α~v.

We claim that h~ is a homomorphism ξ~. (We prove this claim only in order to show that α~v(w)=α~w(v) for all {v,w}E.) To see why, consider any tuple (v1,,vt)R in any relation of . Since h is a homomorphism ξ, we have

(h(v1),,h(vr))=(f(v1),αv1,,f(vt),αvt)Rξ.

By definition of Rξ, for all i,j{1,,t}, we have αvi(f(vj))=αvj(f(vi)) and hence

α~vi(vj)=αvi(f(vj))=αvj(f(vi))=α~vj(vi).

So we see that (by definition of Rξ~)

(h~(v1),,h~(vt))=(v1,α~v1,,vt,α~vt)Rξ~.

This argument shows that h~ is a homomorphism ξ~ as claimed.

We now define a function ε:E2 by

ε({v,w}):=α~v(w).

This is well-defined, since (as established in previous paragraph) we have α~v(w)=α~w(v) for all {v,w}E (i.e., for all distinct v,wC that appear together in any tuple of any relation of ).

For all vC, we have

eEvε(e)=wNvε({v,w})=wNvα~v(w)=ξ(f(v))=ξ~(v).

By Lemma 5.2, it follows that uUξ~(u)=0 for each connected component U of C. Since is a core, f: restricts to a bijection on each connected component. We conclude that

uUξ(u)=uUξ(f(u))=uUξ~(u) =0

as required.

The second key lemma concerns the parameters of first-order sentences that distinguish any two structures in the class {𝔄ξ:ξ is a function from A to 2}.

Lemma 5.6.

Let 𝔄 be a finite structure with Gaifman graph G=(A,E). Assume that the robber has a winning strategy starting on vertex uA in the height-d r-move s-ccops-and-robber game on G. Further assume that ξ,ζ:A2 and ε:E2 are functions satisfying

ξ(v)+ζ(v)+eEvε(e) =𝟙[v=u] for all vA.

Then structures 𝔄ξ and 𝔄ζ are indistinguishable by first-order sentences with quantifier rank r, variable width s, and alternation height d.

We obtain Lemma 5.6 as the k=0 case of the following more general lemma, whose statement is suited for proof by induction on the alternation height d.

Lemma 5.7.

Let 𝔄 be a finite structure with Gaifman graph G=(A,E). Assume that the robber has a winning strategy in the height-d r-move s-ccops-and-robber game on G with k (s) cops initially positioned at vertices v1,,vkA and the robber initially positioned at vertex uA{v1,,vk}. Further assume that ξ,ζ:A2 and ε:E2 and αi,βi:Nvi2 are functions satisfying

v1,α1,,vk,αk Aξ,
v1,β1,,vk,βk Aζ,
αi(w)+βi(w)+ε({vi,w}) =0 for all i{1,,k} and wNvi,
ξ(v)+ζ(v)+eEvε(e) =𝟙[v=u] for all vA.

Then for every first-order formula φ(x1,,xk) with quantifier rank r, variable width s, and alternation height d, we have

𝔄ξφ(v1,α1,,vk,αk)𝔄ζφ(v1,β1,,vk,βk).

Proof.

We argue by induction on d. The base case d=0 is equivalent to showing that 𝔄ξ and 𝔄ζ satisfy the same quantifier-free formulas. Here it suffices to consider only the atomic formulas. That is, we must show

  • vi,αi=vj,αjvi,βi=vj,βj for all indices i,j{1,,k}, and

  • (vi1,αi1,,vit,αit)Rξ(vi1,βi1,,vit,βit)Rζ for every t-ary relation symbol R and indices i1,,it{1,,k}.

Both equivalences follow from our assumptions on ξ,ζ,αi,βi,ε. In particular, the second equivalence follows from the definition of relations Rξ,Rζ and the observation that

αi(vj)=αj(vi)αi(vj)+ε({vi,vj})=αj(vi)+ε({vi,vj})βi(vj)=βj(vi)

for all i,j{1,,k}.

For the induction step, assume that d1. By definition of having alternation height d, φ(x1,,xk) is a Boolean combination of finitely many first-order formulas ψ(xi1,,xij), each of the form

y1yθ(xi1,,xij,y1,,y) or y1yθ(xi1,,xij,y1,,y)

for some j,0 and 1i1<<ijk and first-order formula θ with quantifier rank (at most) r, variable width (at most) s, and alternation depth (at most) d1. Consider any such formula ψ(x1,,xj), without loss of generality of the form y1yθ(x1,,xj,y1,,y) where (i1,,ij)=(1,,j). It suffices to show that

𝔄ξψ(v1,α1,,vj,αj) 𝔄ζψ(v1,β1,,vj,βj).

We will prove the implication ; the reverse implication follows by a symmetric argument.

Assume that 𝔄ξψ(v1,α1,,vj,αj) and fix a choice of v^1,α^1,,v^,α^Aξ such that

𝔄ξθ(v1,α1,,vj,αj,v^1,α^1,,v^,α^).

In the remainder of this proof, we will show that there exist functions β^i:Nv^2 with v^i,β^iAζ (i{1,,}) such that

𝔄ζθ(v1,α1,,vj,αj,v^1,β^1,,v^,β^).

It then follows that 𝔄ζψ(v1,β1,,vj,βj), which establishes the required implication

𝔄ξψ(v1,α1,,vj,αj)𝔄ζψ(v1,β1,,vj,βj).

In order to define suitable functions β^i, we invoke the robber’s winning strategy in the height-d r-move s-ccops-and-robber game on G with cops starting at v1,,vk and the robber starting at u. Suppose that in the round 1 of the game, the first j cops remain at v1,,vj while the next cops redeploy to v^1,,v^. There exists u^V{v1,,vj,v^1,,v^} and a path u=p0,p1,,pm=u^ in G (with m0 and {pi1,pi}E for all 1im) such that {v1,,vj}{p0,,pm}= and the robber has a winning strategy in the d1-round r-move s-ccops-and-robber game on G with cops starting at v1,,vj,v^1,,v^ and the robber starting at u^.

Define ε^:E2 and β^i:Nv^i2 (i{1,,}) by

ε^(e) :=ε(e)+i=1m𝟙[e={pi1,pi}],
β^i(w) :={0if w=v^i,α^i(w)+ε^({v^i,w})if wNv^i.

We will next show that ξ,ζ,α1,,αj,α^1,,α^,β1,,βj,β^1,,β^ and e^ satisfy the conditions of the lemma with respect to v1,,vj,v^1,,v^,u^ and the first-order formula θ.

First, note that

v1,α1,,vj,αj,v^1,α^1,,v^,α^ Aξ.

Second, to establish that

v1,β1,,vj,βj,v^1,β^1,,v^,β^ Aζ,

we observe that for each i{1,,},

wNv^iβ^i(w) =wNv^i(α^i(w)+ε^({v^i,w}))
=wNv^i(α^i(w)+ε({v^i,w})+i=1m𝟙[{v^i,w}={pi1,pi}])
=ξ(v^i)+eEv^iε(e)+wNv^ii=1m𝟙[{v^i,w}={pi1,pi}]
=ζ(v^i)+𝟙[v^i=u]+𝟙[v^i=p0]+𝟙[v^i=u^]
=ζ(v^i)(since p0=u and u^{v1,,vj,v^1,,v^}).

Third, by definition of β^i, we have

α^i(w)+β^i(w)+ε^({v^i,w}) =0 for all i{1,,} and wNv^i.

Fourth and finally, for all vA, we have

ξ(v)+ζ(v)+eEvε^(e) =ξ(v)+ζ(v)+eEvε(e)+wNvi=1m𝟙[{v,w}={pi1,pi}]
=𝟙[v=u]+𝟙[v=p0]+𝟙[v=pm]
=𝟙[v=u~].

By the induction hypothesis applied to θ, we conclude that

𝔄ζθ(v1,α1,,vj,αj,v^1,β^1,,v^,β^),

finishing the proof.

6 Proof of the equi-rank finitary HPT

Proof of Theorem 1.4.

Let φ be a first-order sentence that is preserved under homomorphisms on finite structures. Let r, s and d be the quantifier rank, variable width and alternation height of φ.

Assume that φ has at least one finite model, since otherwise the theorem is trivial (allowing as a special primitive-positive sentence with no models). Consider any minimal core (with universe C) in the class of finite models of φ. By Lemma 4.6, it suffices to show that the cops have a winning strategy in the height-d r-move s-cops-and-robber game on 𝖦𝖺𝗂𝖿(), starting from an (adversarial) choice of initial position uC for the robber.

Let 0 be the CFI structure where 0 stands for the all-zero function C2, and let 1u be the CFI structure where 1u stands for the function C2 with value 1 at u and 0 elsewhere. Additionally, let ε be the all-zero function E2. Note that

0(v)+1u(v)+eEvε(e) =0+𝟙[v=u]+eEv0=𝟙[v=u] for all vA.

By Lemma 5.5, we have 0. Since φ is closed under homomorphisms on finite structures, it follows that 0φ. Lemma 5.5 also implies ↛1u. Since 1u, our assumption that is a minimal core in the class of finite models of φ implies that 1u⊧̸φ.

We have established that φ is a first-order sentence with quantifier rank r, variable width s and alternation height d, which distinguishes the pair of structures 0 and 1u. Therefore, by (the contrapositive of) Lemma 5.6, the cops have a winning strategy with the robber starting on uC in the height-d r-move s-cops-and-robber game on 𝖦𝖺𝗂𝖿().

By Lemma 4.6, we conclude that φ is equivalent on finite structures to an existential-positive sentence with quantifier rank r, variable width s, and alternation height d, as required.

7 Open questions

As discussed in §2, it remains an open question whether the quantifier-rank blow-up can be eliminated or significantly reduced in either Theorem 2.1 or 2.3 (the main results of [33, 34]).

Another interesting question is to investigate tradeoffs in Theorems 2.1 or 2.3 involving alternation depth d. There is a natural correspondence between primitive-positive sentences and monotone 𝖲𝖠𝖢𝟢 circuits (with unbounded gates and fan-in 2 gates). For any finite graph G, this correspondence gives the following upper bounds on the colored G-subgraph isomorphism problem (equivalent to the G-homomorphism problem when G is a core).

Proposition 7.1.

For any finite graph G, the colored G-subgraph isomorphism problem, as a sequence of monotone Boolean functions {0,1}|E(G)|n2{0,1}, is computable for all d1 by both

  • monotone 𝖲𝖠𝖢𝟢 formulas with -depth d and size n𝐭𝐝d(G)+O(1), and

  • monotone 𝖲𝖠𝖢𝟢 circuits with -depth d and size n𝐭𝐰d(G)+O(1).

In the arithmetic setting, the corresponding set-multilinear polynomials are computable by monotone arithmetic 𝖲𝖠𝖢𝟢 formulas and circuits with -depth d and size n𝐭𝐝d(G)+O(1) and n𝐭𝐰d(G)+O(1), respectively.

It would be interesting to establish lower bounds in circuit complexity that nearly match the size-depth tradeoffs of Proposition 7.1. Recent work of the author [36] takes a step in this direction by establishing nΩ(𝐭𝐝d(G)) and nΩ(𝐭𝐰d(G)) lower bounds in the case of path graphs G=Pk. With respect to monotone arithmetic circuits and formulas, even tighter n𝐭𝐝d(G)O(1) and n𝐭𝐰d(G)O(1) lower bounds for general graphs G might be possible using the technique of Komarath, Pandey and Rahul [26].

References

  • [1] Samson Abramsky, Anuj Dawar, and Pengming Wang. The pebbling comonad in finite model theory. In 2017 32nd Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), pages 1–12. IEEE, 2017. doi:10.1109/LICS.2017.8005129.
  • [2] Samson Abramsky and Luca Reggio. Arboreal categories and equi-resource homomorphism preservation theorems. Annals of Pure and Applied Logic, 175(6):103423, 2024. doi:10.1016/J.APAL.2024.103423.
  • [3] Isolde Adler and Eva Fluck. Monotonicity of the Cops and Robber Game for Bounded Depth Treewidth. In 49th International Symposium on Mathematical Foundations of Computer Science (MFCS 2024), volume 306, pages 6:1–6:18, 2024. doi:10.4230/LIPICS.MFCS.2024.6.
  • [4] Albert Atserias. On digraph coloring problems and treewidth duality. European Journal of Combinatorics, 29(4):796–820, 2008. doi:10.1016/J.EJC.2007.11.004.
  • [5] Albert Atserias, Anuj Dawar, and Phokion G. Kolaitis. On preservation under homomorphisms and unions of conjunctive queries. J. ACM, 53(2):208–237, 2006. doi:10.1145/1131342.1131344.
  • [6] Jin-Yi Cai, Martin Fürer, and Neil Immerman. An optimal lower bound on the number of variables for graph identification. Combinatorica, 12(4):389–410, 1992. doi:10.1007/BF01305232.
  • [7] James Carr. Homomorphism preservation theorems for many-valued structures. arXiv preprint arXiv:2403.00217, 2024.
  • [8] Hubie Chen. On the complexity of existential positive queries. ACM Transactions on Computational Logic (TOCL), 15(1):1–20, 2014. doi:10.1145/2559946.
  • [9] Wojciech Czerwinski, Wojciech Nadara, and Marcin Pilipczuk. Improved bounds for the excluded-minor approximation of treedepth. In 27th Annual European Symposium on Algorithms (ESA 2019). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019.
  • [10] Anuj Dawar. Homomorphism preservation on quasi-wide classes. Journal of Computer and System Sciences, 76(5):324–332, 2010. doi:10.1016/J.JCSS.2009.10.005.
  • [11] Anuj Dawar and Ioannis Eleftheriadis. Preservation theorems on sparse classes revisited. In Proceedings of the 49th International Symposium on Mathematical Foundations of Computer Science, 2024.
  • [12] Anuj Dawar, Tomáš Jakl, and Luca Reggio. Lovász-type theorems and game comonads. In 2021 36th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), pages 1–13. IEEE, 2021.
  • [13] Larry Denenberg, Yuri Gurevich, and Saharon Shelah. Definability by constant-depth polynomial-size circuits. Information and control, 70(2-3):216–240, 1986. doi:10.1016/S0019-9958(86)80006-7.
  • [14] Tomás Feder and Moshe Y. Vardi. Homomorphism closed vs. existential positive. In Proceedings of the 18th IEEE Symposium on Logic in Computer Science, pages 310–320, 2003. doi:10.1109/LICS.2003.1210071.
  • [15] Eva Fluck, Tim Seppelt, and Gian Luca Spitzer. Going deep and going wide: Counting logic and homomorphism indistinguishability over graphs of bounded treedepth and treewidth. In 32nd EACSL Annual Conference on Computer Science Logic (CSL 2024). Schloss-Dagstuhl-Leibniz Zentrum für Informatik, 2024.
  • [16] Martin Fürer. On the combinatorial power of the Weisfeiler-Lehman algorithm. In International Conference on Algorithms and Complexity, pages 260–271. Springer, 2017. doi:10.1007/978-3-319-57586-5_22.
  • [17] Nicola Galesi, Dmitry Itsykson, Artur Riazanov, and Anastasia Sofronova. Bounded-depth Frege complexity of Tseitin formulas for all graphs. Annals of Pure and Applied Logic, 174(1):103166, 2023. doi:10.1016/J.APAL.2022.103166.
  • [18] Archontia C Giannopoulou, Paul Hunter, and Dimitrios M Thilikos. LIFO-search: A min–max theorem and a searching game for cycle-rank and tree-depth. Discrete Applied Mathematics, 160(15):2089–2097, 2012. doi:10.1016/J.DAM.2012.03.015.
  • [19] Martin Grohe. The complexity of homomorphism and constraint satisfaction problems seen from the other side. Journal of the ACM, 54(1):1–24, 2007. doi:10.1145/1206035.1206036.
  • [20] Yuri Gurevich. Toward logic tailored for computational complexity. In M. M. Richter et al., editor, Computation and Proof Theory, pages 175–216. Springer Lecture Notes in Mathematics, 1984.
  • [21] Lucy Ham. Relativised homomorphism preservation at the finite level. Studia Logica, 105(4):
    761–786, 2017.
    doi:10.1007/S11225-017-9710-7.
  • [22] Johan Håstad. On small-depth Frege proofs for Tseitin for grids. Journal of the ACM (JACM), 68(1):1–31, 2020. doi:10.1145/3425606.
  • [23] Pavol Hell and Jaroslav Nešetřil. The core of a graph. Discrete Math., 109:117–126, 1992. doi:10.1016/0012-365X(92)90282-K.
  • [24] Neil Immerman. Descriptive Complexity Theory. Graduate Texts in Computer Science. Springer, New York, 1999.
  • [25] Ken-ichi Kawarabayashi and Benjamin Rossman. A polynomial excluded-minor characterization of treedepth. In 29th ACM-SIAM Symposium on Discrete Algorithms, pages 234–246, 2018.
  • [26] Balagopal Komarath, Anurag Pandey, and Chengot Sankaramenon Rahul. Monotone arithmetic complexity of graph homomorphism polynomials. Algorithmica, 85(9):2554–2579, 2023. doi:10.1007/S00453-023-01108-0.
  • [27] Deepanshu Kush and Benjamin Rossman. Tree-depth and the formula complexity of subgraph isomorphism. SIAM Journal on Computing, 52(1):273–325, 2023. doi:10.1137/20M1372925.
  • [28] Yuan Li, Alexander Razborov, and Benjamin Rossman. On the AC0 complexity of subgraph isomorphism. SIAM Journal on Computing, 46(3):936–971, 2017.
  • [29] Leonid Libkin. Elements of Finite Model Theory. Springer-Verlag, 2004.
  • [30] Jerzy Łoś. On the extending of models (I). Fundamenta Mathematicae, 42:38–54, 1955.
  • [31] Roger C. Lyndon. Properties preserved under homomorphism. Pacific J. Math., 9:129–142, 1959.
  • [32] Daniel Neuen. Homomorphism-distinguishing closedness for graphs of bounded tree-width. In 41st International Symposium on Theoretical Aspects of Computer Science (STACS 2024). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024.
  • [33] Benjamin Rossman. Homomorphism preservation theorems. Journal of the ACM, 55(3):15, 2008.
  • [34] Benjamin Rossman. An improved homomorphism preservation theorem from lower bounds in circuit complexity. In 8th Innovations in Theoretical Computer Science, volume 67 of LIPIcs, pages 27:1–17, 2017. doi:10.4230/LIPICS.ITCS.2017.27.
  • [35] Benjamin Rossman. Formulas versus circuits for small distance connectivity. SIAM Journal on Computing, 47(5):1986–2028, 2018. doi:10.1137/15M1027310.
  • [36] Benjamin Rossman. Formula size-depth tradeoffs for iterated sub-permutation matrix multiplication. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 1386–1395, 2024. doi:10.1145/3618260.3649628.
  • [37] Paul D Seymour and Robin Thomas. Graph searching and a min-max theorem for tree-width. Journal of Combinatorial Theory, Series B, 58(1):22–33, 1993. doi:10.1006/JCTB.1993.1027.
  • [38] William W. Tait. A counterexample to a conjecture of Scott and Suppes. J. Symbolic Logic, 24:15–16, 1959. doi:10.2307/2964569.
  • [39] Alfred Tarski. Contributions to the theory of models. I. In Indagationes Mathematicae (Proceedings), volume 57, pages 572–581. Elsevier BV, 1954.
  • [40] Grigori S Tseitin. On the complexity of derivation in propositional calculus. Automation of reasoning: 2: Classical papers on computational logic 1967–1970, pages 466–483, 1983.