Abstract 1 Introduction 2 Proof of the upper bounds 3 Conclusion References

A Proof of Shur’s Conjecture on the Growth of Power-Free Languages over Large Alphabets

Vuong Bui UET, Vietnam National University, Hanoi, 144 Xuan Thuy Street, Hanoi, 100000, Vietnam Matthieu Rosenfeld LIRMM, Université de Montpellier, CNRS, 161 Rue Ada, 34095, Montpellier, France
Abstract

We settle a conjecture of Shur on an estimation of the exponential growth rates of the languages of (nn1)-free words and (nn1)+-free words over large alphabets of size k with a correction of order O(1k2).

Keywords and phrases:
power-free languages, large alphabets, Shur’s conjecture, Dejean’s conjecture
Copyright and License:
[Uncaptioned image] © Vuong Bui and Matthieu Rosenfeld; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Mathematics of computing Combinatorics on words
Editors:
Paweł Gawrychowski, Filip Mazowiecki, and Michał Skrzypczak

1 Introduction

A square is a word of the form uu where u is a nonempty word. A square-free word is a word that does not contain a square as a factor. For instance, hotshots is a square, and minimize is a square-free word. In the seminal work [16], which is widely regarded as the starting point of combinatorics on words, Thue showed that there are infinite square-free words over the ternary alphabet as well as infinite cube-free words over the binary alphabet. This notion of powers and its generalizations have received a lot of attention.

One such generalization is the notion of fractional power. A word of the form w=xxxy where x is nonempty and y is a prefix of x is a power of exponent |w||x| and of period |x| (we also say that w is a (|w||x|)-power). A square is a 2-power, and a cube is a 3-power. For any real β>1, we say that the word w is β-free (resp. β+-free) if it contains no α-power as a factor with αβ (resp. α>β). These notions were introduced by Dejean who conjectured in 1972 that for any k5, there exists a (kk1)+-free infinite word over the k-ary alphabet, and proved that there exists no (kk1)-free infinite word over the same alphabet [5]. For k=3, she proved that there exists a (74)+ free infinite word over the ternary alphabet [5], and also conjectured that there exists a (75)+ free infinite word over the 4-ary alphabet, which was later proven by Pansiot [10]. Different authors solved Dejean’s conjecture for the cases k11 [9], k14 [7], k33 [1], k30 [2], until the remaining cases were finally proven 40 years after the initial conjecture [3, 11]. This led to another conjecture about the number of (kk1)+-free words over the k-ary alphabet. It is conjectured that for all k3, this language grows exponentially [8], and that as k goes to infinity, the growth rate converges to a constant whose first digits are 1.242 [3, 15]. For all but six cases, it has been proven that these languages grow exponentially [6, 17, 4]. The second part of the conjecture remains completely open.

Let (k,p) denote the languages of p-free words over an alphabet of size k. The growth rate α(k,p)=limn|n(k,p)|1/n, where n(k,p) is the set of words with length n in (k,p), is usually hard to estimate and requires a large amount of computation. The previously mentioned conjecture asserts that limkα(k,(kk1)+)1.24. Shur suggests an interesting problem: study the asymptotic order of α(k,p) when k for fixed p>1. For an overview of progress on the problem, we refer the reader to the intensive survey of Shur [14]. In particular, Shur [13] settled the problem when the degree of the power is at least 2 in Theorem 1 below.

We extend the strict total order < over the reals to the numbers of the form x+ in such a way that for all x<y, we have x<x+<y. That is, x+ is right after x in this order.

Theorem 1 (Shur, 2010).

If p2 is an integer and β is in [p+,p+1], then

α(k,β)={k1kp1+1kp1k2p2+O(1k2p1)if β[p+,p+12];k1kp1+1kp+O(1k2p1)if β[(p+12)+,p+1].

However, the problem for powers with degree less than 2 is still open and Shur [13] suggested the following conjecture.

Conjecture 2 (Shur, 2010).

For every integer n2, we have

α(k,(nn1)+) =k+2nn1k+O(1k2),
α(k,nn1) =k+1nn1k+O(1k2).

Note that for all k and for all x,y{x+:x}, if xy then (k,x)(k,y). This implies that α(k,x) is a non-decreasing function of x. Moreover, if Shur’s conjecture holds then for every integer n and k, we have

α(k,nn1)α(k,(n+1n)+) =1k+O(1k2), (1)
α(k,(nn1)+)α(k,nn1) =1+O(1k2). (2)

Shur’s conjecture implies that most of the jump between α(k,nn1) and α(k,n+1n) is located between α(k,nn1) and α(k,(nn1)+). It provides precise bounds on the asymptotic behavior of α(k,β) tight up to 1k for every β<2. In particular, if p{x+:x} is such that n+1n<p<nn1, then α(k,(n+1n)+)α(k,p)α(k,nn1), which implies

|α(k,p)(k+1nn1/2k)|12k+O(1k2). (3)

That is, this conjecture provides for all p a good estimate of the asymptotic behavior of α(k,p) as k goes to infinity. This conjecture implies other similar empirical facts that also hold for β>2 and illustrate the particular behavior of α(k,β) ((1) and (2) are respectively called small variation and big jump in [13]).

Using a counting argument, the second author has established the lower bound in [12].

Theorem 3 (Rosenfeld, 2021).

Let n2 be an integer, then the following holds

α(k,(nn1)+) k+2nn1k+O(1k2),
α(k,nn1) k+1nn1k+O(1k2).

Shur has actually confirmed the other direction of the inequality for n9 by a computer-assisted proof which settles the conjecture for these cases [13]. In this article, we settle Shur’s conjecture, by proving the upper bound for all n2.

Theorem 4.

For every integer n2, we have

α(k,(nn1)+) k+2nn1k+O(1k2), (4)
α(k,nn1) k+1nn1k+O(1k2). (5)

Shur’s proof for n9 considers a regular language that contains (k,nn1) (resp. (k,(nn1)+)) and uses standard tools from automata theory to upper bound the growth of this language. His proof technique allows him to work with all k, for one fixed n, by considering the automaton where states are taken up to isomorphism (that is, up to renaming of the letters). However, this technique needs to explicitly construct the automaton for each specific value n, which requires the use of computers. Proving the result for all n asks for a different approach. Moreover, Shur’s approach is to explicitly construct the automata that avoid powers up to a given length. The states of these automata correspond to suffixes of certain lengths, hence the size of the automata grows exponentially. The main idea behind our proof is also to construct a sequence of automata for all n that approximate the true automata. However, they must be simple enough to allow estimating the growth rates manually, and include all the words in the languages (to ensure that we obtain a proper upper bound), but do not include relatively too many forbidden words (to ensure that the upper bound on the growth rate is sharp).

2 Proof of the upper bounds

In this section, we settle Inequality (5) for (nn1)-free words. Inequality (4) for (nn1)+-free words can be proved similarly.

For any two integers ab, we write [a..b] for the set of integers {a,,b}, and we write ab for the word a(a+1)(a+2)(b1)b.

Fix n,k2. We denote by the language of words over k letters that avoid p-powers isomorphic to 1m1 and 12m12 for any p(nn1) and any m. Since contains all (nn1)-free words, the following theorem directly implies Inequality (5) of Shur’s conjecture. This section is devoted to the proof of this theorem.

Theorem 5.

The growth rate of the language is at most

k+1nn1k+O(1k2).

The following fact simply illustrates that two occurrences of the same letter cannot be too close, as it would cause the existence of a p-power isomorphic to 1m1 with pnn1.

Observation 6.

Any factor from of length n contains n different letters.

It directly implies that the growth of is at most k+1n. The term n1k requires considering the powers isomorphic to 12m12.

The idea is to construct an automaton whose language is a superset of . Shur noted in [13] that the size of the automaton recognizing is exponential in n and is really difficult to analyze. But by considering a slightly larger language, we can construct an automaton with a much smaller number of states that we will be able to analyze, but whose growth rate is really close to the original language (up to the O(1k2) term).

For this, we define the following states for any long enough word w (we require |w|2n1):

  • m for m[n..2n2]: if a suffix of w is isomorphic to m1m.

  • m^ for m[n..2n2]: if a suffix of w is isomorphic to x1m with x satisfying

    {x<mif m<2n2,xmif m=2n2.

We provide some intuition about this definition. The number m in the states m^,m stands for the length of the longest suffix containing only distinct letters, hence the letter in front of that suffix should appear in the suffix (the only exception111It might be more natural to write this state as 2n2^, but for the sake of notation it is more convenient to keep it as 2n2^. being 2n2^ as it actually covers all the lengths from 2n2 up to k). The states m and m^ differ based on whether this reoccurring letter is the last letter of the word. The states m are distinguished from the other state, since when m<2n2 the word ends with m1m and we know that the next letter cannot be 1 (since this would create a power of the form m1m1), that is, we know that at least one more letter is forbidden in the next position. The length of this power actually suggests the choice of the seemingly-arbitrary number 2n2.

 Remark 7.

By definition, the words in state n^ contain a forbidden factor. It is fine to consider them anyway because we only want an upper bound on the number of words, and introducing this “artificial” state creates some symmetries that simplify the statements and the proofs.

From here on, we assume

k2n2.
Observation 8.

Every word u with |u|2n1 is in exactly one of the previously defined states.

We now describe how appending a letter at the end of a word from can alter the state. In particular, we need T(s1,s2) such that for every states s1,s2 and for any w in state s1,

T(s1,s2)|{α𝒜:wα and is in state s2}|.

That is, there are at most T(s1,s2) letters α[1..k] such that wα is in and is in state s2.

In the following lemmas, we provide such a T. For convenience, we use the following notation:

succ(m^)=m+1^ if m<2n2,
succ(2n2^)=2n2^.
Lemma 9.

For m[n..2n2], we can set

  • T(m^,s)=1, for each state s{n,,m},

  • T(m^,succ(m^))=km,

  • for the other states s, T(m^,s)=0.

Proof.

If w is in state m^, then up to renaming of the letters, x1m is a suffix of w for some x<m. By Proposition 6, the next letter needs to be different from the last n1 letters, so it belongs to [1..k][mn+2..m]. If the next letter is in [1..mn+1], then the next state is m,m1,,n, respectively. If the next letter is in [m+1..k], then the next state is always succ(m^). The following lemma and its proof are almost identical to the ones above. The difference is due to the missing loop around the state m.

Lemma 10.

For m[n..2n2], we can set

  • T(m,s)=1, for each state s{n,,m1},

  • T(m,succ(m^))=km,

  • for the other states s, T(m,s)=0.

Proof.

If w is in state m, then up to renaming of the letters, m1m is a suffix of w. By Proposition 6, if wα is in , then α is different from the last n1 letters of w, so α[1..k][mn+2..m]. This time, we also have α1 since m1m1 is forbidden in . In particular, if the next letter is 2,,mn+1, then the next state is m1,,n, respectively. If the next letter is in [m+1..k], then the next state is succ(m^).

In the remainder of this section, we treat T as a square matrix indexed over the states. We abuse the notation and write Ts1,s2 for T(s1,s2). By the definition of T, for every w in state s1 we have

(Tm)s1,s2{u𝒜m:wu and is in state s2}|.

Since every long enough word from is in one of the states, we have the following upper bound on the growth of .

Proposition 11.

The growth rate of the language is at most the spectral radius of the matrix T.

Theorem 4 can then be reduced to the following theorem.

Theorem 12.

For any n, there exists a constant C such that for all large enough k, the spectral radius of T is at most

λ=k(n1)n1k+C/k2.

Proof.

For the proof we fix n, and we let λ=k(n1)n1k+C/k2, with C large enough as a function of n. We denote the coordinates of a vector v by vn^,,v2n2^, vn,,v2n2. We consider the vector x with

xm^ =1m(4nm3)2k2, (6)
xm =xm^1k. (7)

We first verify that x is positive when k is large enough. Indeed, for all values of n,k, we can prove that for every m,

xm^=1m(4nm3)2k21(2n2)(2n1)2(2n2)2=4n42n+14n4=142n3n114

when n2. We also need xm>0 for any m, which by (7) holds for k large enough.

We will also use the following direct consequences of (6) and (7):

xsucc(m^) =xm^2nm2k2,
11k xm.

The nonnegativity of T and Perron-Frobenius Theorem imply that an eigenvector v associated to the dominant eigenvalue ρ(T) is nonnegative. Since x is positive, it is non-orthogonal with v, and we have x=cv+r with c a positive real and r a nonnegative vector. This and the nonnegativity of all the entries implies that TnxTn(cv)Θ(ρn). It follows that ρ(T)=limnTnx1/n. Therefore, in order to deduce ρ(T)λ, it suffices to show that

Txλx,

where the inequality is coordinate-wise.

We now verify the inequality Txλx for each coordinate as follows.

For every m[n..2n2],

(Tx)m^= (km)xsucc(m^)+xn++xm
(km)(xm^2nm2k2)+(m+1n)(11k)
= (km)xm^+(m+1n)2nm2km+1nk+m(2nm2)k2
= (km)xm^+(m+1n)n1k+m(2nm2)k2
= (k+1nn1k+Ck2)xm^+(1xm^)(m+1nn1k)
xm^Ck2+m(2nm2)k2
(k+1nn1k+Ck2)xm^+m(4nm3)2k2(m+1n)
14Ck2+m(2nm2)k2
λxm^

for some large enough C that only depends on n.

We now take care of the coordinates m. By Lemma 9 and Lemma 10, we have for all m[n..2n2],

(Tx)m=(Tx)m^xm.

We have already proven that (Tx)m^λxm^, for all m[n..2n2], and the rest of the computation follows:

(Tx)m λxm^xm
= λ(xm+1k)(11km(4nm3)2k2)
= λxm+(kn+1n1k+Ck2)1k(11k)+m(4nm3)2k2
= λxmn2k+Ck3+m(4nm3)2n+22k2
λxm

where the last inequality holds for large enough k. This concludes our proof.

 Remark 13.

Although we do not really care about how large C and k should be in Theorem 4, one easily verifies that C=20n3 and k=5(n+6) work.

3 Conclusion

As mentioned in the introduction (see Equation (3)), our result provides for all p a good estimate for α(k,p) up to an error term 1/k+O(1/k2). When we only care about powers isomorphic to 11 and 1212, the difference between p-powers for pnn1 and for p>nn1 lies only in the range of . By varying the length , we might allow ourselves to address p-powers for a fraction p that may be of more complicated nature than nn1. So applying our approach for all p instead of only p of the form nn1 or (nn1)+ might be enough to replace this error term by O(1/k2).

On the other hand, as conjectured by Shur, and later by the second author, the behavior of the growth rate up to the O(1/k2) term is controlled by powers whose tail is of length 1 or 2. It seems natural to wonder whether increasing the precision up to the O(1/k3) term requires precisely the considerations of powers with tails of length 3. Shall the degree of the order of precision grow linearly with respect to the length of the tails?

References

  • [1] Arturo Carpi. On Dejean’s conjecture over large alphabets. Theoret. Comput. Sci., 385(1):137–151, October 2007. doi:10.1016/j.tcs.2007.06.001.
  • [2] James Currie and Narad Rampersad. Dejean’s conjecture holds for n30. Theoret. Comput. Sci., 410(30):2885–2888, August 2009. doi:10.1016/j.tcs.2009.01.026.
  • [3] James Currie and Narad Rampersad. A proof of Dejean’s conjecture. Math. Comput., 80(274):1063–1070, April 2011. doi:10.1090/S0025-5718-2010-02407-X.
  • [4] James D. Currie, Lucas Mol, and Narad Rampersad. The number of threshold words on n letters grows exponentially for every n27. Journal of Integer Sequences, 23, 2020. URL: https://cs.uwaterloo.ca/journals/JIS/VOL23/Mol/mol2.html.
  • [5] Françoise Dejean. Sur un théorème de Thue. J. Combin. Theory Ser. A, 13(1):90–99, 1972.
  • [6] Roman Kolpakov and Michael Rao. On the number of Dejean words over alphabets of 5, 6, 7, 8, 9 and 10 letters. Theoret. Comput. Sci., 412(46), 2011. doi:10.1016/J.TCS.2011.08.006.
  • [7] M. Mohammad-Noori and James D. Currie. Dejean’s conjecture and Sturmian words. Eur. J. Combin., 28(3):876–890, April 2007. doi:10.1016/j.ejc.2005.11.005.
  • [8] Pascal Ochem. A generator of morphisms for infinite words. RAIRO-Theor. Inf. Appl., 40(3):427–441, July 2006. doi:10.1051/ita:2006020.
  • [9] Jean Moulin Ollagnier. Proof of Dejean’s conjecture for alphabets with 5, 6, 7, 8, 9, 10 and 11 letters. Theoret. Comput. Sci., 95(2):187–205, March 1992. doi:10.1016/0304-3975(92)90264-G.
  • [10] Jean-Jacques Pansiot. A propos d’une conjecture de F. Dejean sur les répétitions dans les mots. Discrete Appl. Math., 7(3):297–311, March 1984. doi:10.1016/0166-218X(84)90006-4.
  • [11] Michael Rao. Last cases of Dejean’s conjecture. Theoret. Comput. Sci., 412(27):3010–3018, 2011. doi:10.1016/J.TCS.2010.06.020.
  • [12] Matthieu Rosenfeld. Lower-bounds on the growth of power-free languages over large alphabets. Theory of Computing Systems, 65:1110–1116, 2021. doi:10.1007/S00224-021-10040-1.
  • [13] Arseny M. Shur. Growth of power-free languages over large alphabets. In International Computer Science Symposium in Russia, pages 350–361. Springer, 2010. doi:10.1007/978-3-642-13182-0_35.
  • [14] Arseny M. Shur. Growth properties of power-free languages. Computer Science Review, 6(5-6):187–208, 2012. doi:10.1016/J.COSREV.2012.09.001.
  • [15] Arseny M. Shur and Irina A. Gorbunova. On the growth rates of complexity of threshold languages. RAIRO-Theor. Inf. Appl., 44(1):175–192, January 2010. doi:10.1051/ita/2010012.
  • [16] Axel Thue. Über unendliche Zeichenreihen. ’Norske Vid. Selsk. Skr. I. Mat. Nat. Kl. Christiania, 7:1–22, 1906.
  • [17] Igor N. Tunev and Arseny M. Shur. On Two Stronger Versions of Dejean’s Conjecture. In Mathematical Foundations of Computer Science 2012, pages 800–812. Springer, Berlin, Germany, 2012. doi:10.1007/978-3-642-32589-2_69.