A Proof of Shur’s Conjecture on the Growth of Power-Free Languages over Large Alphabets

Bui, Vuong; Rosenfeld, Matthieu

doi:10.4230/LIPIcs.MFCS.2025.32

A Proof of Shur’s Conjecture on the Growth of Power-Free Languages over Large Alphabets

Vuong Bui UET, Vietnam National University, Hanoi, 144 Xuan Thuy Street, Hanoi, 100000, Vietnam Matthieu Rosenfeld LIRMM, Université de Montpellier, CNRS, 161 Rue Ada, 34095, Montpellier, France

Abstract

We settle a conjecture of Shur on an estimation of the exponential growth rates of the languages of $\left(\frac{n}{n-1}\right)$ -free words and $\left(\frac{n}{n-1}\right)^{+}$ -free words over large alphabets of size $k$ with a correction of order $O\left(\frac{1}{k^{2}}\right)$ .

Keywords and phrases:

power-free languages, large alphabets, Shur’s conjecture, Dejean’s conjecture

Copyright and License:

2012 ACM Subject Classification:

Mathematics of computing

\rightarrow

Combinatorics on words

DOI:

10.4230/LIPIcs.MFCS.2025.32

Event:

50th International Symposium on Mathematical Foundations of Computer Science (MFCS 2025)

Editors:

Paweł Gawrychowski, Filip Mazowiecki, and Michał Skrzypczak

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

A square is a word of the form $u u$ where $u$ is a nonempty word. A square-free word is a word that does not contain a square as a factor. For instance, hotshots is a square, and minimize is a square-free word. In the seminal work [16], which is widely regarded as the starting point of combinatorics on words, Thue showed that there are infinite square-free words over the ternary alphabet as well as infinite cube-free words over the binary alphabet. This notion of powers and its generalizations have received a lot of attention.

One such generalization is the notion of fractional power. A word of the form $w=xx\ldots xy$ where $x$ is nonempty and $y$ is a prefix of $x$ is a power of exponent $\frac{|w|}{|x|}$ and of period $|x|$ (we also say that $w$ is a $\left(\frac{|w|}{|x|}\right)$ -power). A square is a $2$ -power, and a cube is a $3$ -power. For any real $\beta>1$ , we say that the word $w$ is $\beta$ -free (resp. $\beta^{+}$ -free) if it contains no $\alpha$ -power as a factor with $\alpha\geq\beta$ (resp. $\alpha>\beta$ ). These notions were introduced by Dejean who conjectured in 1972 that for any $k\geq 5$ , there exists a $\left(\frac{k}{k-1}\right)^{+}$ -free infinite word over the $k$ -ary alphabet, and proved that there exists no $\left(\frac{k}{k-1}\right)$ -free infinite word over the same alphabet [5]. For $k=3$ , she proved that there exists a $\left(\frac{7}{4}\right)^{+}$ free infinite word over the ternary alphabet [5], and also conjectured that there exists a $\left(\frac{7}{5}\right)^{+}$ free infinite word over the $4$ -ary alphabet, which was later proven by Pansiot [10]. Different authors solved Dejean’s conjecture for the cases $k\leq 11$ [9], $k\leq 14$ [7], $k\geq 33$ [1], $k\geq 30$ [2], until the remaining cases were finally proven 40 years after the initial conjecture [3, 11]. This led to another conjecture about the number of $\left(\frac{k}{k-1}\right)^{+}$ -free words over the $k$ -ary alphabet. It is conjectured that for all $k\geq 3$ , this language grows exponentially [8], and that as $k$ goes to infinity, the growth rate converges to a constant whose first digits are $1.242$ [3, 15]. For all but six cases, it has been proven that these languages grow exponentially [6, 17, 4]. The second part of the conjecture remains completely open.

Let $\mathcal{L}(k,p)$ denote the languages of $p$ -free words over an alphabet of size $k$ . The growth rate $\alpha(k,p)=\lim_{n\to\infty}{|\mathcal{L}_{n}(k,p)|^{1/n}}$ , where $\mathcal{L}_{n}(k,p)$ is the set of words with length $n$ in $\mathcal{L}(k,p)$ , is usually hard to estimate and requires a large amount of computation. The previously mentioned conjecture asserts that $\lim\limits_{k\to\infty}\alpha\left(k,\left(\frac{k}{k-1}\right)^{+}\right)% \approx 1.24$ . Shur suggests an interesting problem: study the asymptotic order of $\alpha(k,p)$ when $k\to\infty$ for fixed $p>1$ . For an overview of progress on the problem, we refer the reader to the intensive survey of Shur [14]. In particular, Shur [13] settled the problem when the degree of the power is at least $2$ in Theorem 1 below.

We extend the strict total order $<$ over the reals to the numbers of the form $x^{+}$ in such a way that for all $x<y$ , we have $x<x^{+}<y$ . That is, $x^{+}$ is right after $x$ in this order.

Theorem 1 (Shur, 2010).

If $p\geq 2$ is an integer and $\beta$ is in $[p^{+},p+1]$ , then

\alpha(k,\beta)=\begin{cases}k-\frac{1}{k^{p-1}}+\frac{1}{k^{p}}-\frac{1}{k^{2% p-2}}+O(\frac{1}{k^{2p-1}})&\text{if $\beta\in\left[p^{+},p+\frac{1}{2}\right]% $};\\ k-\frac{1}{k^{p-1}}+\frac{1}{k^{p}}+O(\frac{1}{k^{2p-1}})&\text{if $\beta\in% \left[\left(p+\frac{1}{2}\right)^{+},p+1\right]$}.\end{cases}

However, the problem for powers with degree less than $2$ is still open and Shur [13] suggested the following conjecture.

Conjecture 2 (Shur, 2010).

For every integer $n\geq 2$ , we have

	$\displaystyle\alpha\left(k,\left(\frac{n}{n-1}\right)^{+}\right)$	$\displaystyle=k+2-n-\frac{n-1}{k}+O\left(\frac{1}{k^{2}}\right),$
	$\displaystyle\alpha\left(k,\frac{n}{n-1}\right)$	$\displaystyle=k+1-n-\frac{n-1}{k}+O\left(\frac{1}{k^{2}}\right).$

Note that for all $k$ and for all $x,y\in\mathbb{R}\cup\{x^{+}:x\in\mathbb{R}\}$ , if $x\leq y$ then $\mathcal{L}(k,x)\subseteq\mathcal{L}(k,y)$ . This implies that $\alpha\left(k,x\right)$ is a non-decreasing function of $x$ . Moreover, if Shur’s conjecture holds then for every integer $n$ and $k$ , we have

	$\displaystyle\alpha\left(k,\frac{n}{n-1}\right)-\alpha\left(k,\left(\frac{n+1}% {n}\right)^{+}\right)$	$\displaystyle=\frac{1}{k}+O\left(\frac{1}{k^{2}}\right),$		(1)
	$\displaystyle\alpha\left(k,\left(\frac{n}{n-1}\right)^{+}\right)-\alpha\left(k% ,\frac{n}{n-1}\right)$	$\displaystyle=1+O\left(\frac{1}{k^{2}}\right).$		(2)

Shur’s conjecture implies that most of the jump between $\alpha\left(k,\frac{n}{n-1}\right)$ and $\alpha\left(k,\frac{n+1}{n}\right)$ is located between $\alpha\left(k,\frac{n}{n-1}\right)$ and $\alpha\left(k,\left(\frac{n}{n-1}\right)^{+}\right)$ . It provides precise bounds on the asymptotic behavior of $\alpha\left(k,\beta\right)$ tight up to $\frac{1}{k}$ for every $\beta<2$ . In particular, if $p\in\mathbb{R}\cup\{x^{+}:x\in\mathbb{R}\}$ is such that $\frac{n+1}{n}<p<\frac{n}{n-1}$ , then $\alpha\left(k,\left(\frac{n+1}{n}\right)^{+}\right)\leq\alpha\left(k,p\right)% \leq\alpha\left(k,\frac{n}{n-1}\right)$ , which implies

\left|\alpha\left(k,p\right)-\left(k+1-n-\frac{n-1/2}{k}\right)\right|\leq% \frac{1}{2k}+O\left(\frac{1}{k^{2}}\right).

(3)

That is, this conjecture provides for all $p$ a good estimate of the asymptotic behavior of $\alpha\left(k,p\right)$ as $k$ goes to infinity. This conjecture implies other similar empirical facts that also hold for $\beta>2$ and illustrate the particular behavior of $\alpha(k,\beta)$ ((1) and (2) are respectively called small variation and big jump in [13]).

Using a counting argument, the second author has established the lower bound in [12].

Theorem 3 (Rosenfeld, 2021).

Let $n\geq 2$ be an integer, then the following holds

	$\displaystyle\alpha\left(k,\left(\frac{n}{n-1}\right)^{+}\right)$	$\displaystyle\geq k+2-n-\frac{n-1}{k}+O\left(\frac{1}{k^{2}}\right),$
	$\displaystyle\alpha\left(k,\frac{n}{n-1}\right)$	$\displaystyle\geq k+1-n-\frac{n-1}{k}+O\left(\frac{1}{k^{2}}\right).$

Shur has actually confirmed the other direction of the inequality for $n\leq 9$ by a computer-assisted proof which settles the conjecture for these cases [13]. In this article, we settle Shur’s conjecture, by proving the upper bound for all $n\geq 2$ .

Theorem 4.

For every integer $n\geq 2$ , we have

	$\displaystyle\alpha\left(k,\left(\frac{n}{n-1}\right)^{+}\right)$	$\displaystyle\leq k+2-n-\frac{n-1}{k}+O\left(\frac{1}{k^{2}}\right),$		(4)
	$\displaystyle\alpha\left(k,\frac{n}{n-1}\right)$	$\displaystyle\leq k+1-n-\frac{n-1}{k}+O\left(\frac{1}{k^{2}}\right).$		(5)

Shur’s proof for $n\leq 9$ considers a regular language that contains $\mathcal{L}\left(k,\frac{n}{n-1}\right)$ (resp. $\mathcal{L}\left(k,\left(\frac{n}{n-1}\right)^{+}\right)$ ) and uses standard tools from automata theory to upper bound the growth of this language. His proof technique allows him to work with all $k$ , for one fixed $n$ , by considering the automaton where states are taken up to isomorphism (that is, up to renaming of the letters). However, this technique needs to explicitly construct the automaton for each specific value $n$ , which requires the use of computers. Proving the result for all $n$ asks for a different approach. Moreover, Shur’s approach is to explicitly construct the automata that avoid powers up to a given length. The states of these automata correspond to suffixes of certain lengths, hence the size of the automata grows exponentially. The main idea behind our proof is also to construct a sequence of automata for all $n$ that approximate the true automata. However, they must be simple enough to allow estimating the growth rates manually, and include all the words in the languages (to ensure that we obtain a proper upper bound), but do not include relatively too many forbidden words (to ensure that the upper bound on the growth rate is sharp).

2 Proof of the upper bounds

In this section, we settle Inequality (5) for $\left(\frac{n}{n-1}\right)$ -free words. Inequality (4) for $\left(\frac{n}{n-1}\right)^{+}$ -free words can be proved similarly.

For any two integers $a\leq b$ , we write $[a\mathrel{{.}\,{.}}\nobreak b]$ for the set of integers $\{a,\ldots,b\}$ , and we write $a\dots b$ for the word $a\cdot(a+1)\cdot(a+2)\dots(b-1)\cdot b$ .

Fix $n,k\geq 2$ . We denote by $\mathcal{L}$ the language of words over $k$ letters that avoid $p$ -powers isomorphic to $1\dots m1$ and $12\dots m12$ for any $p\geq\left(\frac{n}{n-1}\right)$ and any $m$ . Since $\mathcal{L}$ contains all $\left(\frac{n}{n-1}\right)$ -free words, the following theorem directly implies Inequality (5) of Shur’s conjecture. This section is devoted to the proof of this theorem.

Theorem 5.

The growth rate of the language $\mathcal{L}$ is at most

k+1-n-\frac{n-1}{k}+O\left(\frac{1}{k^{2}}\right)\,.

The following fact simply illustrates that two occurrences of the same letter cannot be too close, as it would cause the existence of a $p$ -power isomorphic to $1\dots m1$ with $p\geq\frac{n}{n-1}$ .

Observation 6.

Any factor from $\mathcal{L}$ of length $n$ contains $n$ different letters.

It directly implies that the growth of $\mathcal{L}$ is at most $k+1-n$ . The term $-\frac{n-1}{k}$ requires considering the powers isomorphic to $12\dots m12$ .

The idea is to construct an automaton whose language is a superset of $\mathcal{L}$ . Shur noted in [13] that the size of the automaton recognizing $\mathcal{L}$ is exponential in $n$ and is really difficult to analyze. But by considering a slightly larger language, we can construct an automaton with a much smaller number of states that we will be able to analyze, but whose growth rate is really close to the original language (up to the $O\left(\frac{1}{k^{2}}\right)$ term).

For this, we define the following states for any long enough word $w$ (we require $|w|\geq 2n-1$ ):

$\blacksquare$

$\boxed{{\scriptstyle m}}$ for $m\in[n\mathrel{{.}\,{.}}\nobreak 2n-2]$ : if a suffix of $w$ is isomorphic to $m1\dots m$ .
$\blacksquare$

$\widehat{m}$ for $m\in[n\mathrel{{.}\,{.}}\nobreak 2n-2]$ : if a suffix of $w$ is isomorphic to $x1\dots m$ with $x$ satisfying

$\begin{cases}x<m&\text{if $m<2n-2$},\\ x\neq m&\text{if $m=2n-2$}.\end{cases}$

We provide some intuition about this definition. The number $m$ in the states $\widehat{m},\boxed{{\scriptstyle m}}$ stands for the length of the longest suffix containing only distinct letters, hence the letter in front of that suffix should appear in the suffix (the only exception¹¹1It might be more natural to write this state as $\widehat{\geq 2n-2}$ , but for the sake of notation it is more convenient to keep it as $\widehat{2n-2}$ . being $\widehat{2n-2}$ as it actually covers all the lengths from $2n-2$ up to $k$ ). The states $\boxed{{\scriptstyle m}}$ and $\widehat{m}$ differ based on whether this reoccurring letter is the last letter of the word. The states $\boxed{{\scriptstyle m}}$ are distinguished from the other state, since when $m<2n-2$ the word ends with $m1\dots m$ and we know that the next letter cannot be $1$ (since this would create a power of the form $m1\ldots m1$ ), that is, we know that at least one more letter is forbidden in the next position. The length of this power actually suggests the choice of the seemingly-arbitrary number $2n-2$ .

$\blacktriangleright$ Remark 7.

By definition, the words in state $\hat{n}$ contain a forbidden factor. It is fine to consider them anyway because we only want an upper bound on the number of words, and introducing this “artificial” state creates some symmetries that simplify the statements and the proofs.

From here on, we assume

k\geq 2n-2.

Observation 8.

Every word $u\in\mathcal{L}$ with $|u|\geq 2n-1$ is in exactly one of the previously defined states.

We now describe how appending a letter at the end of a word from $\mathcal{L}$ can alter the state. In particular, we need $T(s_{1},s_{2})$ such that for every states $s_{1},s_{2}$ and for any $w\in\mathcal{L}$ in state $s_{1}$ ,

T(s_{1},s_{2})\geq|\left\{\alpha\in\mathcal{A}:w\alpha\in\mathcal{L}\text{ and% is in state }s_{2}\right\}|\,.

That is, there are at most $T(s_{1},s_{2})$ letters $\alpha\in[1\mathrel{{.}\,{.}}\nobreak k]$ such that $w\alpha$ is in $\mathcal{L}$ and is in state $s_{2}$ .

In the following lemmas, we provide such a $T$ . For convenience, we use the following notation:

	$\displaystyle\operatorname{succ}(\widehat{m})=\widehat{m+1}\qquad$	$\displaystyle\text{if $m<2n-2$},$
	$\displaystyle\operatorname{succ}(\widehat{2n-2})=\widehat{2n-2}.$

Lemma 9.

For $m\in[n\mathrel{{.}\,{.}}\nobreak 2n-2]$ , we can set

$\blacksquare$

$T\left(\widehat{m},s\right)=1$ , for each state $s\in\{\boxed{{\scriptstyle n}},\dots,\boxed{{\scriptstyle m}}\}$ ,
$\blacksquare$

$T\left(\widehat{m},\operatorname{succ}(\widehat{m})\right)=k-m$ ,
$\blacksquare$

for the other states $s$ , $T\left(\widehat{m},s\right)=0$ .

Proof.

If $w\in\mathcal{L}$ is in state $\widehat{m}$ , then up to renaming of the letters, $x1\dots m$ is a suffix of $w$ for some $x<m$ . By Proposition 6, the next letter needs to be different from the last $n-1$ letters, so it belongs to $[1\mathrel{{.}\,{.}}\nobreak k]\setminus[m-n+2\mathrel{{.}\,{.}}\nobreak m]$ . If the next letter is in $[1\mathrel{{.}\,{.}}\nobreak m-n+1]$ , then the next state is $\boxed{{\scriptstyle m}},\boxed{{\scriptstyle m-1}},\dots,\boxed{{\scriptstyle n}}$ , respectively. If the next letter is in $[m+1\mathrel{{.}\,{.}}\nobreak k]$ , then the next state is always $\operatorname{succ}(\widehat{m})$ . $\hfill\blacktriangleleft$ The following lemma and its proof are almost identical to the ones above. The difference is due to the missing loop around the state $\boxed{{\scriptstyle m}}$ .

Lemma 10.

For $m\in[n\mathrel{{.}\,{.}}\nobreak 2n-2]$ , we can set

$\blacksquare$

$T\left(\boxed{{\scriptstyle m}},s\right)=1$ , for each state $s\in\{\boxed{{\scriptstyle n}},\dots,\boxed{{\scriptstyle m-1}}\}$ ,
$\blacksquare$

$T\left(\boxed{{\scriptstyle m}},\operatorname{succ}(\widehat{m})\right)=k-m$ ,
$\blacksquare$

for the other states $s$ , $T\left(\boxed{{\scriptstyle m}},s\right)=0$ .

Proof.

If $w\in\mathcal{L}$ is in state $\boxed{{\scriptstyle m}}$ , then up to renaming of the letters, $m1\dots m$ is a suffix of $w$ . By Proposition 6, if $w\alpha$ is in $\mathcal{L}$ , then $\alpha$ is different from the last $n-1$ letters of $w$ , so $\alpha\in[1\mathrel{{.}\,{.}}\nobreak k]\setminus[m-n+2\mathrel{{.}\,{.}}% \nobreak m]$ . This time, we also have $\alpha\not=1$ since $m1\dots m1$ is forbidden in $\mathcal{L}$ . In particular, if the next letter is $2,\dots,m-n+1$ , then the next state is $\boxed{{\scriptstyle m-1}},\dots,\boxed{{\scriptstyle n}}$ , respectively. If the next letter is in $[m+1\mathrel{{.}\,{.}}\nobreak k]$ , then the next state is $\operatorname{succ}(\widehat{m})$ . $\hfill\blacktriangleleft$

In the remainder of this section, we treat $T$ as a square matrix indexed over the states. We abuse the notation and write $T_{s_{1},s_{2}}$ for $T(s_{1},s_{2})$ . By the definition of $T$ , for every $w\in\mathcal{L}$ in state $s_{1}$ we have

(T^{m})_{s_{1},s_{2}}\geq\left\{u\in\mathcal{A}^{m}:\text{$wu\in\mathcal{L}$ % and is in state $s_{2}$}\right\}|\,.

Since every long enough word from $\mathcal{L}$ is in one of the states, we have the following upper bound on the growth of $\mathcal{L}$ .

Proposition 11.

The growth rate of the language $\mathcal{L}$ is at most the spectral radius of the matrix $T$ .

Theorem 4 can then be reduced to the following theorem.

Theorem 12.

For any $n$ , there exists a constant $C$ such that for all large enough $k$ , the spectral radius of $T$ is at most

\lambda=k-(n-1)-\frac{n-1}{k}+C/k^{2}.

Proof.

For the proof we fix $n$ , and we let $\lambda=k-(n-1)-\frac{n-1}{k}+C/k^{2}$ , with $C$ large enough as a function of $n$ . We denote the coordinates of a vector $v$ by $v_{\widehat{n}},\dots,v_{\widehat{2n-2}}$ , $v_{\boxed{{\scriptstyle n}}},\dots,v_{\boxed{{\scriptstyle 2n-2}}}$ . We consider the vector $x$ with

	$\displaystyle x_{\widehat{m}}$	$\displaystyle=1-\frac{m(4n-m-3)}{2k^{2}},$		(6)
	$\displaystyle x_{\boxed{{\scriptstyle m}}}$	$\displaystyle=x_{\widehat{m}}-\frac{1}{k}.$		(7)

We first verify that $x$ is positive when $k$ is large enough. Indeed, for all values of $n, k$ , we can prove that for every $m$ ,

x_{\widehat{m}}=1-\frac{m(4n-m-3)}{2k^{2}}\geq 1-\frac{(2n-2)(2n-1)}{2(2n-2)^{% 2}}=\frac{4n-4-2n+1}{4n-4}=\frac{1}{4}\cdot\frac{2n-3}{n-1}\geq\frac{1}{4}

when $n\geq 2$ . We also need $x_{\boxed{{\scriptstyle m}}}>0$ for any $m$ , which by (7) holds for $k$ large enough.

We will also use the following direct consequences of (6) and (7):

	$\displaystyle x_{\operatorname{succ}(\widehat{m})}$	$\displaystyle=x_{\widehat{m}}-\frac{2n-m-2}{k^{2}},$
	$\displaystyle 1-\frac{1}{k}$	$\displaystyle\geq x_{\boxed{{\scriptstyle m}}}.$

The nonnegativity of $T$ and Perron-Frobenius Theorem imply that an eigenvector $v$ associated to the dominant eigenvalue $\rho(T)$ is nonnegative. Since $x$ is positive, it is non-orthogonal with $v$ , and we have $x=cv+r$ with $c$ a positive real and $r$ a nonnegative vector. This and the nonnegativity of all the entries implies that $\|T^{n}x\|\geq\|T^{n}(cv)\|\geq\Theta(\rho^{n})$ . It follows that $\rho(T)=\lim_{n\to\infty}\|T^{n}x\|^{1/n}$ . Therefore, in order to deduce $\rho(T)\leq\lambda$ , it suffices to show that

Tx\leq\lambda x\,,

where the inequality is coordinate-wise.

We now verify the inequality $Tx\leq\lambda x$ for each coordinate as follows.

For every $m\in[n\mathrel{{.}\,{.}}\nobreak 2n-2]$ ,

	$\displaystyle(Tx)_{\widehat{m}}=$	$\displaystyle(k-m)x_{\operatorname{succ}(\widehat{m})}+x_{\boxed{{\scriptstyle n% }}}+\dots+x_{\boxed{{\scriptstyle m}}}$
	$\displaystyle\leq$	$\displaystyle(k-m)\left(x_{\widehat{m}}-\frac{2n-m-2}{k^{2}}\right)+(m+1-n)% \left(1-\frac{1}{k}\right)$
	$\displaystyle=$	$\displaystyle(k-m)x_{\widehat{m}}+(m+1-n)-\frac{2n-m-2}{k}-\frac{m+1-n}{k}+% \frac{m(2n-m-2)}{k^{2}}$
	$\displaystyle=$	$\displaystyle(k-m)x_{\widehat{m}}+(m+1-n)-\frac{n-1}{k}+\frac{m(2n-m-2)}{k^{2}}$
	$\displaystyle=$	$\displaystyle\left(k+1-n-\frac{n-1}{k}+\frac{C}{k^{2}}\right)x_{\widehat{m}}+(% 1-x_{\widehat{m}})\left(m+1-n-\frac{n-1}{k}\right)$
		$\displaystyle\qquad-x_{\widehat{m}}\frac{C}{k^{2}}+\frac{m(2n-m-2)}{k^{2}}$
	$\displaystyle\leq$	$\displaystyle\left(k+1-n-\frac{n-1}{k}+\frac{C}{k^{2}}\right)x_{\widehat{m}}+% \frac{m(4n-m-3)}{2k^{2}}(m+1-n)$
		$\displaystyle\qquad-\frac{1}{4}\frac{C}{k^{2}}+\frac{m(2n-m-2)}{k^{2}}$
	$\displaystyle\leq$	$\displaystyle\lambda x_{\widehat{m}}$

for some large enough $C$ that only depends on $n$ .

We now take care of the coordinates $\boxed{{\scriptstyle m}}$ . By Lemma 9 and Lemma 10, we have for all $m\in[n\mathrel{{.}\,{.}}\nobreak 2n-2]$ ,

(Tx)_{\boxed{{\scriptstyle m}}}=(Tx)_{\widehat{m}}-x_{\boxed{{\scriptstyle m}}% }\,.

We have already proven that $(Tx)_{\widehat{m}}\leq\lambda x_{\widehat{m}}$ , for all $m\in[n\mathrel{{.}\,{.}}\nobreak 2n-2]$ , and the rest of the computation follows:

	$\displaystyle(Tx)_{\boxed{{\scriptstyle m}}}\leq$	$\displaystyle\lambda x_{\widehat{m}}-x_{\boxed{{\scriptstyle m}}}$
	$\displaystyle=$	$\displaystyle\lambda\left(x_{\boxed{{\scriptstyle m}}}+\frac{1}{k}\right)-% \left(1-\frac{1}{k}-\frac{m(4n-m-3)}{2k^{2}}\right)$
	$\displaystyle=$	$\displaystyle\lambda x_{\boxed{{\scriptstyle m}}}+\left(k-n+1-\frac{n-1}{k}+% \frac{C}{k^{2}}\right)\frac{1}{k}-\left(1-\frac{1}{k}\right)+\frac{m(4n-m-3)}{% 2k^{2}}$
	$\displaystyle=$	$\displaystyle\lambda x_{\boxed{{\scriptstyle m}}}-\frac{n-2}{k}+\frac{C}{k^{3}% }+\frac{m(4n-m-3)-2n+2}{2k^{2}}$
	$\displaystyle\leq$	$\displaystyle\lambda x_{\boxed{{\scriptstyle m}}}$

where the last inequality holds for large enough $k$ . This concludes our proof. $\hfill\blacktriangleleft$

$\blacktriangleright$ Remark 13.

Although we do not really care about how large $C$ and $k$ should be in Theorem 4, one easily verifies that $C=20n^{3}$ and $k=5(n+6)$ work.

3 Conclusion

As mentioned in the introduction (see Equation (3)), our result provides for all $p$ a good estimate for $\alpha(k,p)$ up to an error term $1/k+O(1/k^{2})$ . When we only care about powers isomorphic to $1\dots\ell 1$ and $12\dots\ell 12$ , the difference between $p$ -powers for $p\geq\frac{n}{n-1}$ and for $p>\frac{n}{n-1}$ lies only in the range of $\ell$ . By varying the length $\ell$ , we might allow ourselves to address $p$ -powers for a fraction $p$ that may be of more complicated nature than $\frac{n}{n-1}$ . So applying our approach for all $p$ instead of only $p$ of the form $\frac{n}{n-1}$ or $\left(\frac{n}{n-1}\right)^{+}$ might be enough to replace this error term by $O(1/k^{2})$ .

On the other hand, as conjectured by Shur, and later by the second author, the behavior of the growth rate up to the $O(1/k^{2})$ term is controlled by powers whose tail is of length $1$ or $2$ . It seems natural to wonder whether increasing the precision up to the $O(1/k^{3})$ term requires precisely the considerations of powers with tails of length $3$ . Shall the degree of the order of precision grow linearly with respect to the length of the tails?

References

[1] Arturo Carpi. On Dejean’s conjecture over large alphabets. Theoret. Comput. Sci., 385(1):137–151, October 2007. doi:10.1016/j.tcs.2007.06.001.
[2] James Currie and Narad Rampersad. Dejean’s conjecture holds for $n\geq 30$ . Theoret. Comput. Sci., 410(30):2885–2888, August 2009. doi:10.1016/j.tcs.2009.01.026.
[3] James Currie and Narad Rampersad. A proof of Dejean’s conjecture. Math. Comput., 80(274):1063–1070, April 2011. doi:10.1090/S0025-5718-2010-02407-X.
[4] James D. Currie, Lucas Mol, and Narad Rampersad. The number of threshold words on n letters grows exponentially for every $n\geq 27$ . Journal of Integer Sequences, 23, 2020. URL: https://cs.uwaterloo.ca/journals/JIS/VOL23/Mol/mol2.html.
[5] Françoise Dejean. Sur un théorème de Thue. J. Combin. Theory Ser. A, 13(1):90–99, 1972.
[6] Roman Kolpakov and Michael Rao. On the number of Dejean words over alphabets of 5, 6, 7, 8, 9 and 10 letters. Theoret. Comput. Sci., 412(46), 2011. doi:10.1016/J.TCS.2011.08.006.
[7] M. Mohammad-Noori and James D. Currie. Dejean’s conjecture and Sturmian words. Eur. J. Combin., 28(3):876–890, April 2007. doi:10.1016/j.ejc.2005.11.005.
[8] Pascal Ochem. A generator of morphisms for infinite words. RAIRO-Theor. Inf. Appl., 40(3):427–441, July 2006. doi:10.1051/ita:2006020.
[9] Jean Moulin Ollagnier. Proof of Dejean’s conjecture for alphabets with 5, 6, 7, 8, 9, 10 and 11 letters. Theoret. Comput. Sci., 95(2):187–205, March 1992. doi:10.1016/0304-3975(92)90264-G.
[10] Jean-Jacques Pansiot. A propos d’une conjecture de F. Dejean sur les répétitions dans les mots. Discrete Appl. Math., 7(3):297–311, March 1984. doi:10.1016/0166-218X(84)90006-4.
[11] Michael Rao. Last cases of Dejean’s conjecture. Theoret. Comput. Sci., 412(27):3010–3018, 2011. doi:10.1016/J.TCS.2010.06.020.
[12] Matthieu Rosenfeld. Lower-bounds on the growth of power-free languages over large alphabets. Theory of Computing Systems, 65:1110–1116, 2021. doi:10.1007/S00224-021-10040-1.
[13] Arseny M. Shur. Growth of power-free languages over large alphabets. In International Computer Science Symposium in Russia, pages 350–361. Springer, 2010. doi:10.1007/978-3-642-13182-0_35.
[14] Arseny M. Shur. Growth properties of power-free languages. Computer Science Review, 6(5-6):187–208, 2012. doi:10.1016/J.COSREV.2012.09.001.
[15] Arseny M. Shur and Irina A. Gorbunova. On the growth rates of complexity of threshold languages. RAIRO-Theor. Inf. Appl., 44(1):175–192, January 2010. doi:10.1051/ita/2010012.
[16] Axel Thue. Über unendliche Zeichenreihen. ’Norske Vid. Selsk. Skr. I. Mat. Nat. Kl. Christiania, 7:1–22, 1906.
[17] Igor N. Tunev and Arseny M. Shur. On Two Stronger Versions of Dejean’s Conjecture. In Mathematical Foundations of Computer Science 2012, pages 800–812. Springer, Berlin, Germany, 2012. doi:10.1007/978-3-642-32589-2_69.

[bib.bib1] [1] Arturo Carpi. On Dejean’s conjecture over large alphabets. Theoret. Comput. Sci., 385(1):137–151, October 2007. doi:10.1016/j.tcs.2007.06.001.

[bib.bib2] [2] James Currie and Narad Rampersad. Dejean’s conjecture holds for $n\geq 30$ . Theoret. Comput. Sci., 410(30):2885–2888, August 2009. doi:10.1016/j.tcs.2009.01.026.

[bib.bib3] [3] James Currie and Narad Rampersad. A proof of Dejean’s conjecture. Math. Comput., 80(274):1063–1070, April 2011. doi:10.1090/S0025-5718-2010-02407-X.

[bib.bib4] [4] James D. Currie, Lucas Mol, and Narad Rampersad. The number of threshold words on n letters grows exponentially for every $n\geq 27$ . Journal of Integer Sequences, 23, 2020. URL: https://cs.uwaterloo.ca/journals/JIS/VOL23/Mol/mol2.html.

[bib.bib5] [5] Françoise Dejean. Sur un théorème de Thue. J. Combin. Theory Ser. A, 13(1):90–99, 1972.

[bib.bib6] [6] Roman Kolpakov and Michael Rao. On the number of Dejean words over alphabets of 5, 6, 7, 8, 9 and 10 letters. Theoret. Comput. Sci., 412(46), 2011. doi:10.1016/J.TCS.2011.08.006.

[bib.bib7] [7] M. Mohammad-Noori and James D. Currie. Dejean’s conjecture and Sturmian words. Eur. J. Combin., 28(3):876–890, April 2007. doi:10.1016/j.ejc.2005.11.005.

[bib.bib8] [8] Pascal Ochem. A generator of morphisms for infinite words. RAIRO-Theor. Inf. Appl., 40(3):427–441, July 2006. doi:10.1051/ita:2006020.

[bib.bib9] [9] Jean Moulin Ollagnier. Proof of Dejean’s conjecture for alphabets with 5, 6, 7, 8, 9, 10 and 11 letters. Theoret. Comput. Sci., 95(2):187–205, March 1992. doi:10.1016/0304-3975(92)90264-G.

[bib.bib10] [10] Jean-Jacques Pansiot. A propos d’une conjecture de F. Dejean sur les répétitions dans les mots. Discrete Appl. Math., 7(3):297–311, March 1984. doi:10.1016/0166-218X(84)90006-4.

[bib.bib11] [11] Michael Rao. Last cases of Dejean’s conjecture. Theoret. Comput. Sci., 412(27):3010–3018, 2011. doi:10.1016/J.TCS.2010.06.020.

[bib.bib12] [12] Matthieu Rosenfeld. Lower-bounds on the growth of power-free languages over large alphabets. Theory of Computing Systems, 65:1110–1116, 2021. doi:10.1007/S00224-021-10040-1.

[bib.bib13] [13] Arseny M. Shur. Growth of power-free languages over large alphabets. In International Computer Science Symposium in Russia, pages 350–361. Springer, 2010. doi:10.1007/978-3-642-13182-0_35.

[bib.bib14] [14] Arseny M. Shur. Growth properties of power-free languages. Computer Science Review, 6(5-6):187–208, 2012. doi:10.1016/J.COSREV.2012.09.001.

[bib.bib15] [15] Arseny M. Shur and Irina A. Gorbunova. On the growth rates of complexity of threshold languages. RAIRO-Theor. Inf. Appl., 44(1):175–192, January 2010. doi:10.1051/ita/2010012.

[bib.bib16] [16] Axel Thue. Über unendliche Zeichenreihen. ’Norske Vid. Selsk. Skr. I. Mat. Nat. Kl. Christiania, 7:1–22, 1906.

[bib.bib17] [17] Igor N. Tunev and Arseny M. Shur. On Two Stronger Versions of Dejean’s Conjecture. In Mathematical Foundations of Computer Science 2012, pages 800–812. Springer, Berlin, Germany, 2012. doi:10.1007/978-3-642-32589-2_69.

A Proof of Shur’s Conjecture on the Growth of Power-Free Languages over Large Alphabets

Abstract

Keywords and phrases:

Copyright and License:

2012 ACM Subject Classification:

DOI:

Event:

Editors:

Series and Publisher:

1 Introduction

Theorem 1 (Shur, 2010).

Conjecture 2 (Shur, 2010).

Theorem 3 (Rosenfeld, 2021).

Theorem 4.

2 Proof of the upper bounds

Theorem 5.

Observation 6.

▶ Remark 7.

Observation 8.

Lemma 9.

Proof.

Lemma 10.

Proof.

Proposition 11.

Theorem 12.

Proof.

▶ Remark 13.

3 Conclusion

References

$\blacktriangleright$ Remark 7.

$\blacktriangleright$ Remark 13.