Word Structures and Their Automatic Presentations
Abstract
We study automatic presentations of the structures , , , and their expansions by a unary predicate . Here is the successor function, is the undirected version of , and is the natural order. We call these structures word structures. Our goal is three-fold. First, we study the isomorphism problem for automatic word structures by focusing on the following three problems. The first problem asks to design an algorithm that, given an automatic structure , decides if is isomorphic to . The second asks to design an algorithm that, given two automatic presentations of and , where and are unary predicates, decides if these structures are isomorphic. The third problem investigates if there is an algorithm that, given two automatic presentations of and , decides whether . We show that these problems are undecidable. Next, we study intrinsic regularity of the function in the structure . We build an automatic presentation of in which is not regular. This implies that is not intrinsically regular in . For , let be the function that computes the distances between the consecutive elements of . We build automatic presentations of where can realise logarithmic, radical, intermediate, and exponential functions.
Keywords and phrases:
Automatic structures, the isomorphism problem, decidability, undecidability, regular relationsCopyright and License:
2012 ACM Subject Classification:
Theory of computation Automata over infinite objects ; Theory of computation LogicFunding:
The authors acknowledge the support of the NSFC grant 62172077, 62350710215 and Sichuan S&T Department grant 2025HJPJ0006.Editors:
Paweł Gawrychowski, Filip Mazowiecki, and Michał SkrzypczakSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
We investigate automatic presentations of the structures , , , , and , where , is the natural order, the function is the successor function, is the undirected version of (that is iff ), and is a unary predicate on . The first structure is the ordinal , the second is the successor structure, and the third structure is the infinite path graph . For the last three structures, without loss of generality, we postulate that and is infinite. Each of the last three structures with the predicate determines the infinite binary word , where if and otherwise. These binary words are full isomorphism invariants of , , and , i.e., if and only if . Thus, here is a definition:
Definition 1.
Call , , ordered words, successor words, and graph words, respectively. We also call any of these structures word structures.
Definition 2.
An automatic presentation of the ordered word structure is a triple , where:
-
and are both FA recognizable languages,
-
is a binary relation on recognized by two heads synchronous finite automata, and
-
The structure is isomorphic to .
Automatic presentations of and are defined similarly.
1.1 Automatic structures
A relational structure is automatic if the domain and the atomic relations (of arity ) are all finite automata recognizable. We assume the reader is familiar with the basics of finite automata, e.g., runs, acceptance, and recognizability[33]. For the reader, nevertheless, we explain finite automata recognizability of binary relations on the set of strings over the alphabet . So, let and be two strings over . Then we code the pair as the string so that is defined as follows: if , if , and if , where is a new symbol. The operation is called the convolution operation on strings. The string is a string over the finite alphabet .
Let be the language over obtained by applying the convolution operation a to pair of strings in . Call finite automata recognizable if is recognized by a finite automaton. One can view finite automata over as two-head synchronous automata over . The reader can consult standard papers on automatic structures for details and examples, e.g., [1, 28, 7, 32, 49, 31], although Definition 2 suffices for this paper. Finally, in this paper, we use the standard terminology and notations from the theory of finite automata.
The connection between automata, logic and structures traces back to the 1960s when Büchi proved the decidability of -the monadic second-order theory of -using automata [8]. M. Rabin extended this result, through the use of tree automata, by proving that the monadic second-order theory of the full infinite binary tree is decidable [47]. In computational group theory context, M. Thurston et al.[12] realized that geodesic words within Cannon’s group of hyperbolic isometries are finite automata recognizable and defined the concept of automatic groups, which was extended to Cayley automatic groups in [27] while retaining its tractable essences. These and related papers constitute a foundation of modern research on automata, logic, their interactions, and applications. Also, automatic structures have attracted much attention in the last few decades, see surveys by Grädel [16], Bárány, Rubin, and Grädel [2], F. Stephan [52], and Rubin [50, 36]. Several key factors motivate this study. One is that automatic structures exhibit nice algorithmic and model-theoretic properties. The theory of every automatic structure is decidable. Automatic structures are closed under interpretations in first-order logic [7]. The other aspects involve connections to algebra (e.g., Gromov’s theorem that finitely generated groups of polynomial growth are virtually nilpotent [17]) and additive combinatorics (e.g., Freiman’s theorem that sets with small sums are contained in generalized arithmetic progressions [13]). These connections were revealed in the characterization theorem of finitely generated finite automata presentable groups [46] [45] and the proof that the group has no automatic presentation [53].
The presentations use finite automata. There are also other presentations of structures: tree automatic presentations [22, 2, 21], computably enumerable presentations [29, 9, 15], co-computably enumerable presentations [39], semi-automatic presentations [24, 23], -automatic presentations [19, 26] and unary automatic presentations [30, 35], etc.
1.2 Background and contributions
We describe three, among many, research directions directly related to this work.
1.
One direction focuses on first-order (FO) theories and interpretations of automatic structures in FO-logic and its extensions. For instance, Hodgson [20], Khoussainov and Nerode [28], Blumensath and Grädel [7] proved that the first-order theory of any automatic structure is decidable. This is known as the decidability theorem for automatic structures.
This result was extended to FO-logic with “there exists many modulo ” quantifiers [38] and “there exist infinitely many” quantifier [7]. S. Rubin [50] studied extensions with Ramsey’s quantifiers. Blumensath and Grädel characterized automatic structures in terms of interpretations [6]. They proved the interpretability theorem that a structure is automatic iff it has an interpretation in the infinite binary tree with two successor relations, the prefix order and the equal length predicates [6]. Finding extensions of the FO-logic that preserve the decidability and interpretability theorems is one of the key topics in the area.
2.
The second direction focuses on the classical isomorphism problem. The problem asks to design an algorithm that, given two automatic structures, decides if the structures are isomorphic. The isomorphism problem pertains to describing the isomorphism invariants and extracting structure theorems. For instance, the Cantor-Bendixson ranks of automatic linear orders turned out to be a useful invariant. The study of Cantor-Bendixson ranks implied that an ordinal is automatic iff it is less than [10]. Using this, Khoussainov, Stephan, and Rubin proved that the isomorphism problem for automatic ordinals is decidable. Thomas and Oliver fully described finitely generated automatic groups [46]. Khoussainov, Nies, Rubin and Stephan characterized and solved the isomorphism problem for automatic Boolean algebras [34]. Grädel and Blumensath were the first to prove that the isomorphism problem for automatic structures is undecidable [7]. Khoussainov, Nies, Rubin, and Stephan proved that the isomorphism problem for automatic structures is -complete[34]. D. Kuske, M. Lohrey, and J. Liu proved that the isomorphism problems for automatic linearly ordered sets and automatic trees are undecidable, and -complete for automatic equivalence relations [40]. Moreover, B. Khoussainov and M. Ganardi show that the isomorphism problem for automatic equivalence relations on regular domains with polynomial growth is still undecidable[14].
3.
The third direction concerns the intrinsic regularity of relations. A relation in a structure is intrinsically regular if from any automatic presentation of we can extract a finite automaton that recognizes the relation in [38]. For instance, all first-order definable relations (with and without parameters) are intrinsically regular. For any first-order logic formula , automatic structure , the set of all such that is intrinsically regular [38]. In an automatic partially ordered set of bounded width, the set of all that belong to an infinite path is also intrinsically regular [37].
Let be an intrinsically regular relation in . Given a formula we can consider the expression (binding ) with the following semantics: So, intrinsically regular relations allow us to define logic that extends the first order logic with the generalised quantifier . Such extensions preserve the decidability and interpretability theorems for automatic structures. Thus, intrinsic regularity is intimately connected with the questions of extending the decidability and interpretability theorems.
To refute that is intrinsically regular in , we need to build an automatic presentation of in which is not regular. For instance, consider the -ary representation of integers, . These presentations make the structure automatic. In these presentations addition relation, the relations and are regular. Clearly, the relation is not intrinsically regular in because in all unary automatic presentations of the addition is not regular. However, proving that and are not intrinsically regular requires non-trivial arguments. Indeed, there are automatic presentations of in which the relations are not regular [38]. Given that the relations are intrinsically regular in [38], we conclude that the relation in is also not intrinsically regular. Moreover, there are presentations of in which is not regular but all are regular [38]. These presentations can be argued to be “exotic” as they show that regularity of the successor implies neither regularity of nor regularity of ; they also show that regularity of and does not imply regularity of . More examples of non-intrinsically regular relations are in [38] [43] [49]. These examples bring insights into the understanding of regular relations, their interactions within automatic presentations, and the exploration of extensions to decidability and interpretability theorems.
There are certainly other exciting directions of research in automatic structures. We mention the complexity-theoretic issues. For instance, it is important to find exact complexity-theoretic bounds on the first order theories of automatic structures. D. Kuske and M. Lohrey proved that the first order theory of automatic graphs of bounded degree is triple exponential [41]. Blumensath and Gradel proved that there are automatic structures whose first order theories are non-elementary [6]. Also, D. Berdinsky has investigated the existence of automatic presentations for various types of vertex transitive graphs [11] [4] .
Our contributions for this work are listed below.
1. The isomorphism problem for .
Our first isomorphism problem asks if we can decide that given an automatic structure is isomorphic to . Although not explicitly stated in the literature, the question is natural and has been known since 1996 [44]. Note that there is an algorithm that, given an automatic structure decides if satisfies the following two axioms: (a) is injective, and (b) there is exactly one element without -pre-image. the structure satisfies these two axioms. However, even if satisfies (a) and (b), this does not imply that is isomorphic to . We prove that the problem of deciding if an automatic structure is isomorphic to is undecidable. Our proof shows that this isomorphism problem is -complete. Our solution is based on coding the configuration graphs of total Turing machines (TMs) into automatic presentations of . Recall that a TM, whose inputs are strings over a given alphabet , is called total if for all the machine halts on by either accepting or rejecting . The set of all total Turing machines is a -complete set [25]. Our solution is based on a careful analysis and coding of the configuration graphs of reversible TMs. Lemma 6 provides the coding. Theorem 7 transforms the configuration graphs of total TMs into the structure . Both, the lemma and the theorem, involve non-trivial reasoning.
Now we explain why the isomorphism problem for is of particular importance. Let us compare the structure with automatic ordinal which is similar to . First, it is decidable if a given automatic structure is isomorphic to . Hence, the isomorphism problem for is decidable. Note that every automatic presentation of produces an automatic presentation of while the opposite is not true. Hence, it is natural to ask if one can extend the decidability result for to . There is model-theoretic and computability theoretic evidence towards a positive answer. Indeed, in the language, the structure is easier to describe than . Formally, the Scott rank of is smaller than the Scott rank of 111D. Scott describes every countable structure by an -sentence up to isomorphism [51]. The Scott rank measures the ordinal complexity of such sentences [18]. The higher the Scott rank of the structure , the harder it is to describe it in -language.. Third, the index set of is -complete222 The index set of structure is the class of all Turing machines computing the atomic diagram of . [48]. In contrast, the index set of is a -complete set. In summary, from a descriptional complexity viewpoint, is harder to describe than . Likewise, from the viewpoint of computability, the index set of is more complex than the one for . Yet, the isomorphism problem for automatic is decidable whereas the isomorphism problem for is not.
2. The isomorphism problem for automatic successor word structures.
Our second isomorphism problem asks if there exists an algorithm that, given two automatic successor word structures and , decides if and are isomorphic. F. Stephan poses the question in [52]. Just as our first question, this problem has remained unresolved since the start of the theory of automatic structures in 1996 .In Theorem 11, we prove that no algorithm exists that decides this problem.
Here are a few important comments about our solution. First, due to the previous result (that the isomorphism problem for is a -complete set), it is already undecidable if a given automatic structure is a successor word structure. Therefore, we want to know if there is an algorithm that, under the promise that given automatic structures and are successor words, decides if these structures are isomorphic. In case, when or is not a successor word structure, the algorithm can give any answer (not even terminate). In this sense, solving the problem is more complex than the one for because we need to consider all algorithms that (1) might not even halt on some inputs, but (2) halt on all inputs from the -complete set of automatic successor word structures. Second, we stress that the solution to the first problem does not, in any direct way, imply a solution to the second problem. We use two new techniques. The first technique, through Lemma 13, reduces the equality problem for TMs to the equality problem of the running times of TMs. The second technique, via Lemma 14, encodes the running times of pairs of TMs into automatic successor word structures . This transformation is a key to the proof. The transformation is not necessarily commutative, that is, the structures and could be non-isomorphic. These two techniques are new and non-trivial.
3. The agreement problem.
Let be an automatic ordered word structure. Let be the image of under the isomorphism from onto . Clearly, automatic ordered word structures and are isomorphic iff . It is unknown if the isomorphism problem for automatic ordered word structures is decidable. Here we attack the agreement problem, an approximation to the isomorphism problem. The automatic structures and agree on if and . If no exists on which the structures agree then we say that these structures disagree.
The agreement problem asks to design an algorithm that, given two automatic ordered word structures and , decides if these two structures agree on some . The agreement problem is equivalent to . We prove that the agreement problem is -complete. The proof utilises two techniques. The first technique non-trivially implements the transformation idea from Lemma 14 used in Theorem 11. Our second technique employs the methods from D. Kuske, J. Liu, and M. Lohrey who showed the undecidability of the isomorphism problem for automatic equivalence relations [40]. Note that automatic ordered word structures and are isomorphic if and only if (a) the structures and disagree and (b) the structures and disagree. Thus, the agreement problem can viewed as an approximation to the isomorphism problem for automatic ordered word structures.
4. Intrinsic regularity in .
One can prove that a unary relation in is intrinsically regular if and only if is finite or co-finite. The case for binary relations turns out to be more complex. The difficult case, for instance, is the successor relation . The difficulty of whether or not is intrinsically regular is attested by the following two facts. First, any automatic presentation of is an automatic presentation of . Second, all known automatic presentations of the structure preserve the regularity of . It is therefore natural to conjecture that the relation is an intrinsically regular relation in . Counter-intuitively, in Theorem 24 we build an automatic presentation of in which the successor relation is not regular. Our proof employs techniques from [38] and non-trivially adjusts them for building such an automatic presentation of . As discussed in the introduction, this result implies that we can not extend the decidability theorem for the using the generalised quantifier that corresponds to the relation .
5. Exotic automatic word structures.
Let be an automatic word structure with predicate . Let be the listing of all natural numbers in in increasing order, where . Define , where , the distance function realised by . The distance functions are full isomorphism invariants of word structures. Thus, the study of automatic word structures is equivalent to the study of the distance functions . From Theorem 7, one shows that for any total TM there is an automatic successor word structure whose distance function bounds from above the running time of . In contrast, the distance functions in automatic ordered words are bounded by exponential functions. See Corollaries 9 and 10. The interesting case, then, is the study of distance functions in automatic ordered word structures. In Section 6, we show that exponential, intermediate, polynomial, radical, and logarithmic functions can be realized as distance functions.
2 The isomorphism problem for
We prove that no algorithm exists that, given an automatic structure , decides if is isomorphic to . For the proof we recall standard terminology about Turing machines.
A Turing Machine (TM) is a tuple , where is the set of states, is the alphabet, is a set of transitions of the form , is the initial state and are the sets of accepting and rejecting states, respectively. Transitions will also be called instructions of . The transition is interpreted as when reads in state , moves to state , writes in the head position, and moves in the direction of in the tape. We assume that all our TMs are deterministic.
A configuration of is a string , where and . This indicates that the tape content is , is at state , and the head of is at the first symbol of the string . An initial configuration is , where . A configuration is halting if . For configurations and , we write if there is an instantaneous move of from to through one of the instructions. If occurs via instruction , we also denote this as . There are no transitions from halting configurations.
A run of the machine is a sequence of configurations , , , , such that for all we have . We say that is total if its run starting at any initial configuration halts (in a halting configuration). For a technical reason, we will assume that our TMs never stall, that is, for any non-halting configuration there is a such that . We will transform the TMs M into equivalent (reversible) Turing machines . The Turing machines , as opposed to , will be allowed to stall.
Definition 3.
A configuration is legal if it belongs to a run that starts at an initial configuration. Else, is illegal. A maximal run of a TM is called a chain.
Let be the directed graph consisting of all configurations of and the edge relation . This graph is automatic. We call this graph the configuration graph of . Thus, we can use graph-theoretic notations for configuration graphs. For every configuration , we have - the cardinality of all outgoing edges from . Similarly, - is the cardinality of all in-going edges into .
The non-stalling condition (mentioned above) guarantees that for all configurations , is a halting configuration if and only if -. Because our Turing machines are deterministic, for all configurations of we have -.
Definition 4.
A Turing Machine is called reversible if for all configurations we have -.
For technical reasons, we can extend the concepts above to multi-tape Turing machines . For instance, this can be done by coding the contents of the tapes of into strings, e.g., through the convolution operation (see Section 1.1). To explain this let us assume that is a two-tape TM. Let the pair represent the contents of the first and the second tapes, respectively, where is a state of . Note that both tapes have the same state of . We code this two-tape configuration as the string
We use this type of coding, based on the convolution operation, to represent each configuration in the configuration graphs of multi-tape Turing machines.
Turing Machines and are equivalent if for all we have accepts if and only if accepts , and rejects if and only if rejects . We briefly present the proof of the following known result [3] for the reader’s intuition, as it will be useful later.
Lemma 5.
There exists a transformation that, given a TM , outputs a reversible TM equivalent to .
Proof.
We outline the construction. Let be a Turing machine. We build the TM as follows. The configurations of are 2-tuples where is a configuration of and is a sequence of instructions in . The transitions of are defined as if and only if . Thus, simulates with instruction and records the instruction on the second tape.
The configuration graph satisfies the following important property:
-
Every chain is either finite or is isomorphic to . The graph contains no finite cycles.
Note that finite chains have the start and the last configurations. A configuration is the start (or the bottom) configuration in the chain if -. A configuration is the last (or the top) configuration if -.
Note that even if is total, the configuration graph of might contain infinite chains. We would like to strengthen the lemma above by modifying into a reversible equivalent TM such that totality of implies that every chain in is finite.
Lemma 6.
There is a transformation which, given a TM , outputs such that:
-
1.
is reversible.
-
2.
is equivalent to .
-
3.
Configuration graph of consists of finite chains if and only if is total.
Proof.
The configurations of TM are , where , , , and . We explain the roles of , , , and :
-
1.
The string is the instruction sequence executed so far (as in ) but the head of can be at any position.
-
2.
The string checks if is a consistent sequence of instructions. The head for is at the right end of .
-
3.
The symbol represents the direction of the head for . If , then the head moves to the right along , and to the left otherwise.
-
4.
If , then simulates for one step.
We represent configurations as where the underlying symbol indicates the head positions. Initial configurations of are of the form for some . Thus, says that the head position for the tape is at symbol . There is no need to specify the position of the head for because it is always at the right end of . Since the values of and are finite, these can be counted as states of . Therefore, is a -tape Turing Machine. Now we describe transitions of :
-
R1:
, .
Intuition: This mimics one computation step of . -
R2:
.
Intuition: Preparation to verify if is a valid sequence of instructions. -
R3:
, where we have .
Intuition: This rule attempts to validate the instruction sequence and records the validation history on tape . Note that copies a suffix of in reverse. -
R4:
, where and is initial configuration.
Intuition: Just like Rule 2, this is a preparation step to return the head for to the rightmost position. -
R5:
, where we have and
Intuition: This rule tries to reduce the size of and at the same time validates consistency between and .
Thus, R1 is the simulation rule, R2 and R4 are preparation rules, and R3 and R5 are validation rules. Furthermore, if on input halts in steps, then on input runs in time proportional to . One checks that the construction is correct.
Theorem 7.
The set of all automatic structures isomorphic to is -complete. Hence, the problem if an automatic structure is isomorphic to is undecidable.
Proof.
Let = Turing machine is total. This is a standard -complete set [25]. We reduce to the set is automatic and . Note that is a -set. Indeed if and only if is an injective function (this can be computably verified) and there exists a such that has no pre-image with respect to and for every there is an such that . This is a definition of .
Let be a Turing machine. Consider the equivalent Turing machine built in Lemma 6. Consider the configuration graph of all configurations of .
Let be chain in .
This chain has the first element, , such that every other configuration in is reachable from through a finite number of transitions . If is finite then it has its top element as well. The top element of is its final element (with no successor), where
and . We introduce the “reverse” copy of the chain, where is obtained from by renaming each character in by the new character . The following lemma utilizes two copies of to build an automatic structure which captures totality of any TM .
Lemma 8.
For any TM , there exists an automatic structure such that the structure is isomorphic to if and only if the configuration graph consists of only finite chains.
Let be a Turing machine. We construct the Turing machine as in Lemma 6. Based on we build the structure as follows:
-
1.
The domain is the set .
-
2.
The set is the union of the transitions and together with the following two sets of transitions:
-
(a)
Transitions such that is the top element of a chain in .
-
(b)
Transitions , where for some chain and is the length-lexicographic successor of among the set of all first elements of chains in .
-
(a)
One can depict the structure as follows. Let be the list of all chains of the graph ordered by the length-lexicographical order of their first elements . Amend this list as follows: . Starting from the first element of , The relation successively applies transition until the top element of is reached. Once the top element of is reached, the relation makes a transition to , the reverse of , and then reverses transitions by applying until the reverse of the first element of is reached. Then transits to the first element of , and repeats the process333One might suggest to simplify this description and propose that moves from the top of to the bottom of directly. This, indeed, removes the need for reverse configurations and the transitions . However, this simplification does not guarantee that the relation built in this way is regular..
It is clear that is an automatic graph. In this graph the in-degree of each element is at most . Also, the out-degree of each element is exactly . If the Turing machine is total, then every chain in the space is finite. Moreover, the configuration graph excludes cycles. Therefore, starting from the first element in reaches all elements in . If is not total, then there must exist an infinite chain in the directed graph . Let be the first such infinite chain. Hence, once is applied to the first element of , all the transitions in stay inside . Hence, no element in is reachable from the first element of . Therefore is isomorphic to if and only if is total. We proved the lemma, and hence the theorem.
2.1 Distance functions on automatic successor word structures
We apply the results of the previous section to build various types of distance functions for automatic successor word structures. Given an automatic word structure with predicate , let be the listing of all natural numbers in in increasing natural order, where . Recall that the function , where , is called the distance function realised by . Lemma 6 and Theorem 7 give us tools that code various distance functions in automatic successor word structures.
Let be a Turing machine with running time . Construct the Turing machine as in Lemma 6. Then for all we have , where is a fixed constant. Now consider the automatic successor word structure , where is constructed in Theorem 7, and . From the construction of and the unary predicate , it is not hard to see that , where is the length-lexicographic order of .
Corollary 9.
For any total Turing machine there exists an automatic successor word structure such that the distance function of bounds the running time .
Corollary 10.
There exist automatic successor word structures whose distance functions are non-elementary. (This corollary strengthens and generalizes the results in [5])
3 Automatic successor word structures
Our interest is in the set of all pairs such that both and are isomorphic automatic successor word structures. The question is this: Is there an algorithm that, given two automatic successor word structures, decides if these structures are isomorphic? This is a promise problem in the sense that we care about those input that are word automatic successor structures. If any of the inputs is not a word automatic structure, then we do not care about the output of the algorithm. See our contribution in the Introduction Section.
Now we prove that this isomorphism problem for automatic successor word structures is undecidable. Our proof uses the construction of the structures from Theorem 7.
Theorem 11.
The isomorphism problem for automatic successor word structures is undecidable.
Proof.
Our goal is to show that no algorithm exists that, given two automatic presentations of successor word structures, decides if these structures are isomorphic. In our proof, we use the operator defined in Lemma 6. By this lemma, we always assume that our Turing machines are reversible. Moreover, a Turing machine is total if and only if, in the configuration graph of , all chains are finite. Our proof also uses the following folklore fact from computability theory [25]:
Lemma 12.
No algorithm exists that given two total Turing machines and decides if is equivalent to .
We assume that the outputs of Turing machines are either accept or reject. Let be a total Turing machine (over an alphabet ). For each , let be the number of steps taken for to halt its computation on . Then the following lemma can be established upon Lemma 12:
Lemma 13.
No algorithm exists that given two total TMs and decides if .
Let and be Turing machines. For and , we build the structure using and as constructed in the proof of Theorem 7.
We assume that the domains and are disjoint. We build as follows.
-
1.
The domain is .
-
2.
The unary predicate is the set of all initial configurations of and .
-
3.
To build the binary relation we need a few notations. In the structures and , respectively, let us list all the chains ordered according to the length-lexicographic order on the first elements:
and .
In each of these lists consider the sub-sequences of chains whose first elements are legal configurations: and . For these sub-sequences, the set of first elements of the chains are regular. Denote these first elements as: and . Define the relation by:
-
[ Simulate on the same input as ]
-
[ Both and are simulated on same input.]
-
-
-
In other cases where no rule can be matched , the successor function will just follow , i.e.
-
Lemma 14.
is automatic. If and are total, then is a successor word structure.
Lemma 15.
Let and be total. Then iff .
If then and are isomorphic. Assume that . Let be the first element, in the length-lexicographic order, such that . Assume that is the initial configuration that corresponds to . Then in the structure , the elements and belong to the unary predicate. The distance between these two elements and equals . Similarly, the elements and belong to the unary predicate in . The distance between and in equals . Obviously, these two distances are not equal as . If there is an isomorphism from onto , then for the isomorphism we must have and . This is impossible because the distances between these pairs of elements are not equal.
4 The agreement problem
Recall that the ordered word structures are structures isomorphic to , where is the natural order and is a unary predicate. The isomorphism problem for automatic ordered word structures is a natural follow-up to the isomorphism problem for automatic successor word structures. V. Bárány was the first who asked and investigated the isomorphism problem for automatic ordered word structure [1][Question 5.9.3]. The techniques used in addressing the isomorphism problem for automatic successor word structures can not be applied in this case. One reason is that the distance functions in automatic ordered word structures are bounded by exponential functions while the function for automatic successor word structures can bound from above any given computable function (see Corollary 9). Another reason is that the set of all automatic ordered word structures is decidable as opposed to -completeness of the set of all automatic presentations of successor word structures.
Let be an automatic ordered word structure. Let be the image of in under the isomorphism from onto .
Proposition 16.
Two word structures and are isomorphic if and only if .
In this section we attack the following agreement problem.
Definition 17.
Automatic word structures and agree if the set is a non-empty set. Otherwise, we say that these structures disagree.
The agreement problem asks to design an algorithm that given automatic ordered word structures and decides if these two structures agree.
Theorem 18.
The agreement problem for automatic ordered word structures is -complete.
Proof.
Let . We set to be the set of all polynomials over in variables. We use the following theorem from [42] that solves Hilbert’s 10th problem.
Theorem 19.
For any -complete , there exist two polynomials and from such that if and only if for some we have .
From now on we fix which witnesses this theorem. This theorem implies -completeness of the following set: .
We define a computable bijection , where :
-
The function encodes tuples as words in ;
-
The function maps each word to its position in the length-lexicographic ordering of .
Using , given two polynomials , we define the following function:
Let and be the ranges of and , respectively. As the sets and depend on and which we write as and , respectively. Here is a lemma that connects the set with the sets and .
Lemma 20.
For , we have the following. The set is non-empty if and only if the polynomials and agree on some , that is, .
Consider the following structures: and . These can be viewed as ordered counterparts of and from Theorem 11.
Lemma 21.
For any given polynomials , the two structures and have automatic presentations.
Proof.
We use the ideas from D. Kuske et al. [40]. Let be an NFA over alphabet . For , let be the set of all accepting runs of on . Let be all accepting runs of . The following is proved in [40].
Lemma 22.
For any polynomial , we can build a non-deterministic finite automaton over the alphabet such that and for all , we have .
Let . Let and be the NFA constructed according to the lemma above. So, and . We can assume that the sets of states of these two automata are disjoint. Hence, .
Now we build an automatic presentation of . An automatic presentation of the structure is built similarly.
-
1.
The domain is the set . The domain is a regular set.
-
2.
To define the linear order on we use the relation . Clearly, the relation is regular. If , then we write . Given two elements (accepting runs) and from , we define the order as follows:
-
(a)
When :
-
If and , then .
-
For runs and of the same automaton, whenever .
-
-
(b)
If , the ordering is defined as .
It is not hard to see that the relation is regular.
-
(a)
Thus, we have built an automatic structure . This structure is isomorphic to the linearly ordered set via the bijective function defined by
Let be the image of the unary relation in under the isomorphism . Our goal is to show that is a regular set. If so, then we will have built the automatic presentation of the structure .
The remaining part is to prove the regularity of . We claim that is identical to the following set .
Indeed, the set collects every which are greater than any other run with the same label . Moreover, no two distinct runs in can share the same label . Thus, it’s sufficient to prove that for any input .
Note that can be partitioned into two part:
-
, whose cardinality is the sum .
-
which equals to due to maximality of .
The following facts hold for any input :
-
1.
.
-
2.
.
-
3.
We finally conclude that . Therefore, holds for all . Thus, with Lemma 20 and Lemma 21 in our hand, we have proved the theorem.
Corollary 23.
Given two automatic ordered word structures and , it’s -complete to decide if .
5 Is intrinsically regular in ?
Any automatic presentation of is an automatic presentation of . We prove that the reverse does not hold. This is counter-intuitive. Indeed, in a computability-theoretic setting, the computability of implies the computability of . Contrasting this, our result shows that it is not possible to build finite automata for from finite automata recognizing the relation . Our proof non-trivially adapts ideas from [38, Theorem 4.1, page 447].
Theorem 24.
There is an automatic presentation of structure in which the successor relation is not regular.
Proof.
The domain of the desired automatic structure is with . In our construction, we will use the following three auxiliary functions from [38]:
-
1.
Every binary string is uniquely decomposed into:
-
: bits of at even positions (indexing starts at ).
-
: bits of at odd positions
Thus, and its pair are equivalent representations.
-
-
2.
Let and . Thus, or and . Both and are interpreted as natural numbers written in the least-significant-bit-first binary form.
-
3.
Define as: .
Let be a binary string. Call the start-point if . Our fourth auxiliary function is the following:
-
(a)
If is such that then is the length-lexicographic successor of .
-
(b)
If is not a start-point, then .
-
(c)
If is a start-point and then , where and
-
(d)
If is a startpoint and then we have .
We informally explain . Think of as a parameter of input viewed as . Starting at , consecutively adds to . Once, runs through all elements in and comes back to , the value is incremented by .
For string , one can inductively check the fact that there is a unique such that . Based on this, we define function which, given the string , outputs the solution to the equation modulo . So, is the distance between and the start-point .
Set , which has the following property in our construction:
Lemma 25 ([38]).
The unary predicate is not regular.
Using Lemma 25, we now define the desired function :
-
1.
If , then is the successor of with respect to length-lexicographic ordering.
-
2.
If , then .
-
3.
If , then .
-
4.
If , then .
-
5.
If , then .
-
6.
If , then .
The function is a successor on the domain . Call the rules (2) and (4) above non-regular rules. Consider now the graph of : . The non-regular rules, by Lemma 25, guarantee that the successor function is not a regular relation. Thus, we have a non-automatic presentation of the successor structure.
Consider the edge relation defined by . We show that is regular. For this, consider the rules (3), (5), and (6). Call them regular rules as they are finite automata recognizable. The edge relation can be defined as follows. Consider a pair .
-
1.
If , then holds when .
-
2.
If , then holds when .
-
3.
If , then holds when .
-
4.
If , then holds when .
-
5.
If , then holds when .
The regularity of these rules depends on the regularity of the unary relation given by the regular rules (3), (5) and (6) in the definition of . More formally, this unary relation can be recognised as follows:
-
1.
if .
-
2.
if .
-
3.
if is a startpoint.
-
4.
if .
So, the relation is regular. The theorem is proved.
6 Distance functions on ordered words
The distance functions on automatic successor word structures can bound, by Corollary 9, any computable function. In contrast, the distance functions on automatic ordered word structures are bounded by an exponential function. Hence, one would like to study distance functions realised by automatic ordered word structures.
A radical function is where . A function is intermediate if asymptotically is strictly between and any polynomial with non-negative coefficients. An example is , where , .
Theorem 26.
Polynomials, exponents, radical, logarithmic, and intermediate functions can all be realised on automatic word ordered structures as distance functions.
7 versus
Let , where . In known automatic presentations of , the relation is regular. The unary predicate is definable in the extension of the FO-logic based on and with “there exists many mod ” quantifier . Hence, if is regular, then so is the set [38]. Known automatic presentations of (over alphabet ) in which is not regular[38] have the following two properties:
-
Sparseness Property: The function is polynomial.
-
Unboundedness Property: The length difference between the pairs with and is unbounded.
The Sparseness Property is used to show that , , are all regular. The Unboundedness property is the key to show that is not regular. Two questions arise. The first question asks if “sparseness” is necessary to show that the sets are regular. The second asks if the unboundedness property is necessary to show that is not regular in automatic presentations of the successor structure. The following theorem answers the first question.
Theorem 27.
There is an automatic presentation of such that
-
1.
The transitive closure of is not regular.
-
2.
For all integers the sets are regular.
To answer the second question, we have the next definition:
Definition 28.
A pair in an automatic presentation of is non-standard if and , where is the transitive closure of .
The following proposition is easy to prove.
Proposition 29.
For any presentation of , if the length difference between the components of non-standard pairs is unbounded, then is not automatic.
Now we answer the second question.
Theorem 30.
There is an automatic presentation of of with no non-standard pairs such that is not automatic but is automatic.
References
- [1] Vince Bárány. Automatic presentations of infinite structures. Phd thesis, RWTH Aachen University, Germany, 2007. URL: http://darwin.bth.rwth-aachen.de/opus3/volltexte/2007/2019/.
- [2] Vince Bárány, Erich Grädel, and Sasha Rubin. Automata-based presentations of infinite structures. In Javier Esparza, Christian Michaux, and Charles Steinhorn, editors, Finite and Algorithmic Model Theory, volume 379 of London Mathematical Society Lecture Note Series, pages 1–76. Cambridge University Press, 2011.
- [3] C. H. Bennett. Logical reversibility of computation. IBM Journal of Research and Development, 17(6):525–532, 1973. doi:10.1147/rd.176.0525.
- [4] Dmitry Berdinsky and Khoussainov Bakh. Cayley automatic representations of wreath products. International Journal of Foundations of Computer Science, 27(02):147–159, 2016. doi:10.1142/S0129054116400049.
- [5] Dmitry Berdinsky, Sanjay Jain, Bakh Khoussainov, and Frank Stephan. String compression in fa-presentable structures. Theoretical Computer Science, 947:113705, 2023. doi:10.1016/j.tcs.2023.113705.
- [6] Achim Blumensath and Erich Grädel. Automatic structures. In 15th Annual IEEE Symposium on Logic in Computer Science, Santa Barbara, California, USA, June 26-29, 2000, pages 51–62. IEEE Computer Society, 2000. doi:10.1109/LICS.2000.855755.
- [7] Achim Blumensath and Erich Grädel. Finite presentations of infinite structures: Automata and interpretations. Theory Comput. Syst., 37(6):641–674, 2004. doi:10.1007/s00224-004-1133-y.
- [8] J. Richard Büchi. Weak second-order arithmetic and finite automata. Mathematical Logic Quarterly, 6(1-6):66–92, 1960. doi:10.1002/malq.19600060105.
- [9] Cristian Calude, Peter Hertling, Bakh Khoussainov, and Yongge Wang. Recursively enumerable reals and chaitin omega numbers. Theor. Comput. Sci., 255(1-2):125–149, 2001. doi:10.1016/S0304-3975(99)00159-0.
- [10] Christian Delhommé. Automaticité des ordinaux et des graphes homogènes. Comptes Rendus Mathematique, 339(1):5–10, 2004. doi:10.1016/j.crma.2004.03.035.
- [11] Berdinsky Dmitry and Bakh Khoussainov. On automatic transitive graphs. In Arseny M. Shur and Mikhail V. Volkov, editors, Developments in Language Theory, volume 8633 of Lecture Notes in Computer Science, pages 1–12, Cham, 2014. Springer International Publishing. doi:10.1007/978-3-319-09698-8_1.
- [12] David B. A. Epstein, M. S. Paterson, J. W. Cannon, D. F. Holt, S. V. Levy, and W. P. Thurston. Word Processing in Groups. A. K. Peters, Ltd., USA, 1992.
- [13] G. A. Freĭman. Addition of finite sets. Sov. Math., Dokl., 5:1366–1370, 1964.
- [14] Moses Ganardi and Bakh Khoussainov. Automatic equivalence structures of polynomial growth. In Maribel Fernández and Anca Muscholl, editors, 28th EACSL Annual Conference on Computer Science Logic, CSL 2020, January 13-16, 2020, Barcelona, Spain, volume 152 of LIPIcs, pages 21:1–21:16. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPIcs.CSL.2020.21.
- [15] Alex Gavryushkin, Bakh Khoussainov, and Frank Stephan. Reducibilities among equivalence relations induced by recursively enumerable structures. Theor. Comput. Sci., 612:137–152, 2016. doi:10.1016/j.tcs.2015.11.042.
- [16] Erich Grädel. Automatic structures: Twenty years later. In Holger Hermanns, Lijun Zhang, Naoki Kobayashi, and Dale Miller, editors, LICS ’20: 35th Annual ACM/IEEE Symposium on Logic in Computer Science, Saarbrücken, Germany, July 8-11, 2020, pages 21–34. ACM, 2020. doi:10.1145/3373718.3394734.
- [17] Michael Gromov. Groups of polynomial growth and expanding maps (with an appendix by Jacques Tits). Publications Mathématiques de l’IHÉS, 53:53–78, 1981. doi:10.1007/BF02698687.
- [18] Matthew Harrison-Trainor. An introduction to the scott complexity of countable structures and a survey of recent results. Bull. Symb. Log., 28(1):71–103, November 2022. doi:10.1017/bsl.2021.62.
- [19] Greg Hjorth, Bakh Khoussainov, Antonio Montalbán, and André Nies. From automatic structures to borel structures. In Proceedings of the Twenty-Third Annual IEEE Symposium on Logic in Computer Science, LICS 2008, 24-27 June 2008, Pittsburgh, PA, USA, pages 431–441. IEEE, IEEE Computer Society, 2008. doi:10.1109/LICS.2008.28.
- [20] Bernard R. Hodgson. Théories décidables par automate fini. PhD thesis, Université de Montréal, 1976.
- [21] Sanjay Jain, Bakh Khoussainov, Philipp Schlicht, and Frank Stephan. Tree-automatic scattered linear orders. Theoretical Computer Science, 626:83–96, 2016. doi:10.1016/j.tcs.2016.02.008.
- [22] Sanjay Jain, Bakh Khoussainov, Philipp Schlicht, and Frank Stephan. The isomorphism problem for tree-automatic ordinals with addition. Information Processing Letters, 149:19–24, 2019. doi:10.1016/j.ipl.2019.05.004.
- [23] Sanjay Jain, Bakh Khoussainov, and Frank Stephan. Finitely generated semiautomatic groups. Comput., 7(2-3):273–287, 2018. doi:10.3233/COM-180089.
- [24] Sanjay Jain, Bakh Khoussainov, Frank Stephan, Dan Teng, and Siyuan Zou. Semiautomatic structures. Theory Comput. Syst., 61(4):1254–1287, 2017. doi:10.1007/s00224-017-9792-7.
- [25] Hartley Rogers Jr. Theory of recursive functions and effective computability (Reprint from 1967). MIT Press, Cambridge, MA, USA, 1987. URL: http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&tid=3182.
- [26] Lukasz Kaiser, Sasha Rubin, and Vince Bárány. Cardinality and counting quantifiers on omega-automatic structures. In Susanne Albers and Pascal Weil, editors, STACS 2008, 25th Annual Symposium on Theoretical Aspects of Computer Science, Bordeaux, France, February 21-23, 2008, Proceedings, volume 1 of LIPIcs, pages 385–396. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, Germany, 2008. doi:10.4230/LIPIcs.STACS.2008.1360.
- [27] Olga Kharlampovich, Bakh Khoussainov, and Alexei Miasnikov. From automatic structures to automatic groups. Groups, Geometry, and Dynamics, 8(1):157–198, 2014. doi:10.4171/GGD/221.
- [28] Bakh Khoussainov and Nerode Anil. Automatic presentations of structures. In Daniel Leivant, editor, Logic and Computational Complexity, pages 367–392, Berlin, Heidelberg, 1995. Springer Berlin Heidelberg. doi:10.1007/3-540-60178-3_93.
- [29] Bakh Khoussainov, Steffen Lempp, and Theodore A. Slaman. Computably enumerable algebras, their expansions, and isomorphisms. Int. J. Algebra Comput., 15(3):437–454, 2005. doi:10.1142/S0218196705002281.
- [30] Bakh Khoussainov, Jiamou Liu, and Mia Minnes. Unary automatic graphs: An algorithmic perspective. In Manindra Agrawal, Ding-Zhu Du, Zhenhua Duan, and Angsheng Li, editors, Theory and Applications of Models of Computation, 5th International Conference, TAMC 2008, Xi’an, China, April 25-29, 2008. Proceedings, volume 4978 of Lecture Notes in Computer Science, pages 542–553, Berlin, Heidelberg, 2008. Springer. doi:10.1007/978-3-540-79228-4_47.
- [31] Bakh Khoussainov and Mia Minnes. Three lectures on automatic structures. In Proceedings of Logic Colloquium, 2007.
- [32] Bakh Khoussainov and Mia Minnes. Model-theoretic complexity of automatic structures. Annals of Pure and Applied Logic, 161(3):416–426, 2009. Papers presented at the Symposium on Logical Foundations of Computer Science 2007. doi:10.1016/j.apal.2009.07.012.
- [33] Bakh Khoussainov and Anil Nerode. Automata Theory and its Applications, volume 21. Springer Science & Business Media, 2001. doi:10.1007/978-1-4612-0171-7.
- [34] Bakh Khoussainov, André Nies, Sasha Rubin, and Frank Stephan. Automatic structures: Richness and limitations. Log. Methods Comput. Sci., 3(2), 2007. doi:10.2168/LMCS-3(2:2)2007.
- [35] Bakh Khoussainov and Sasha Rubin. Graphs with automatic presentations over a unary alphabet. J. Autom. Lang. Comb., 6(4):467–480, April 2001. doi:10.25596/jalc-2001-467.
- [36] Bakh Khoussainov and Sasha Rubin. Automatic structures: overview and future directions. J. Autom. Lang. Comb., 8(2):287–301, April 2003. doi:10.25596/jalc-2003-287.
- [37] Bakh Khoussainov, Sasha Rubin, and Frank Stephan. On automatic partial orders. In Proceedings of the Eighteenth Annual IEEE Symposium on Logic in Computer Science (LICS 2003), pages 168–177. IEEE Computer Society Press, June 2003. doi:10.1109/LICS.2003.1210056.
- [38] Bakh Khoussainov, Sasha Rubin, and Frank Stephan. Definability and regularity in automatic structures. In Volker Diekert and Michel Habib, editors, STACS 2004, 21st Annual Symposium on Theoretical Aspects of Computer Science, Montpellier, France, March 25-27, 2004, Proceedings, volume 2996 of Lecture Notes in Computer Science, pages 440–451. Springer, 2004. doi:10.1007/978-3-540-24749-4_39.
- [39] Bakh Khoussainov, Theodore A. Slaman, and Pavel Semukhin. P-presentations of algebras. Arch. Math. Log., 45(6):769–781, 2006. doi:10.1007/s00153-006-0013-3.
- [40] Dietrich Kuske, Jiamou Liu, and Markus Lohrey. The isomorphism problem on classes of automatic structures with transitive relations. Transactions of the American Mathematical Society, 365(10):5103–5151, October 2013. doi:10.1090/S0002-9947-2013-05766-2.
- [41] Dietrich Kuske and Markus Lohrey. First-order and counting theories of omega-automatic structures. In Luca Aceto and Anna Ingólfsdóttir, editors, Foundations of Software Science and Computation Structures, 9th International Conference, FOSSACS 2006, Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2006, Vienna, Austria, March 25-31, 2006, Proceedings, volume 3921 of Lecture Notes in Computer Science, pages 322–336, Berlin, Heidelberg, 2006. Springer. doi:10.1007/11690634_22.
- [42] Yuri Vladimirovich Matiyasevich. Hilbert’s tenth problem. MIT press, 1993.
- [43] André Nies and Pavel Semukhin. Finite automata presentable abelian groups. In Sergei N. Artemov and Anil Nerode, editors, Logical Foundations of Computer Science, pages 422–436, Berlin, Heidelberg, 2007. Springer Berlin Heidelberg. doi:10.1007/978-3-540-72734-7_29.
- [44] André Nies, Frank Stephan, Markus Lohrey, Pavel Semukhin, Dimitry Berdinsky, and et al. Personal communications (over many years).
- [45] André Nies and Richard M. Thomas. Fa-presentable groups and rings. Journal of Algebra, 320(2):569–585, 2008. Computational Algebra. doi:10.1016/j.jalgebra.2007.04.015.
- [46] Graham P. Oliver and Richard M. Thomas. Automatic presentations for finitely generated groups. In Volker Diekert and Bruno Durand, editors, STACS 2005, 22nd Annual Symposium on Theoretical Aspects of Computer Science, Stuttgart, Germany, February 24-26, 2005, Proceedings, volume 3404 of Lecture Notes in Computer Science, pages 693–704. Springer, 2005. doi:10.1007/978-3-540-31856-9_57.
- [47] Michael O Rabin. Decidability of second-order theories and automata on infinite trees. Transactions of the american Mathematical Society, 141:1–35, 1969.
- [48] Downey Rodney and Melnikov Alexander. Computable Structure Theory: A Unified Approach. Springer, 2024.
- [49] Sasha Rubin. Automatic structures. PhD thesis, University of Auckland New Zealand, 2004.
- [50] Sasha Rubin. Automata presenting structures: A survey of the finite string case. The Bulletin of Symbolic Logic, 14(2):169–209, 2008. doi:10.2178/bsl/1208442827.
- [51] Dana Scott. Logic with denumerably long formulas and finite strings of quantifiers. Journal of Symbolic Logic, 36(1):1104–329, 1965. doi:10.2307/2271545.
- [52] Frank Stephan. Automatic structures — recent results and open questions. Journal of Physics: Conference Series, 622(1):012013, May 2015. doi:10.1088/1742-6596/622/1/012013.
- [53] Todor Tsankov. The additive group of the rationals does not have an automatic presentation. Journal of Symbolic Logic, 76(4):1341–1351, 2011. doi:10.2178/jsl/1318338853.
