On Large Zeros of Linear Recurrence Sequences

Luca, Florian; Ouaknine, Joël; Worrell, James

doi:10.4230/LIPIcs.MFCS.2025.71

On Large Zeros of Linear Recurrence Sequences

Florian Luca

Mathematics Division, Stellenbosch University, South Africa
Max Planck Institute for Software Systems, Saarbrücken, Germany Joël Ouaknine

Max Planck Institute for Software Systems, Saarbrücken, Germany James Worrell

Department of Computer Science, Oxford University, UK

Abstract

The Skolem Problem asks to determine whether a given integer linear recurrence sequence (LRS) has a zero term. This problem, whose decidability has been open for many decades, arises across a wide range of topics in computer science, including loop termination, formal languages, automata theory, and probabilistic model checking, amongst many others.

In the present paper, we introduce a notion of “large” zeros of (non-degenerate) linear recurrence sequences, i.e., zeros occurring at an index larger than a sixth-fold exponential of the size of the data defining the given LRS. We establish two main results. First, we show that large zeros are very sparse: the set of positive integers that can possibly arise as large zeros of some LRS has null density. This in turn immediately yields a Universal Skolem Set of density one, answering a question left open in the literature. Second, we define an infinite set of prime numbers, termed “good”, having density one amongst all prime numbers, with the following property: for any large zero of a given LRS, there is an interval around the large zero together with an upper bound on the number of good primes possibly present in that interval. The bound in question is much lower than one would expect if good primes were distributed similarly as ordinary prime numbers, as per the Cramér model in number theory. We therefore conjecture that large zeros do not exist, which would entail decidability of the Skolem Problem.

Keywords and phrases:

Skolem Problem, linear recurrence sequences, decidability, Cramér conjecture

Funding:

Florian Luca: Supported by ERC grant DynAMiCs (101167561).

Joël Ouaknine: Also affiliated with Keble College, Oxford as emmy.network Fellow, and supported by ERC grant DynAMiCs (101167561) and DFG grant 389792660 as part of TRR 248.

James Worrell: Supported by UKRI Frontier Research Grant EP/X033813/1.

Copyright and License:

2012 ACM Subject Classification:

Mathematics of computing

\rightarrow

Discrete mathematics

DOI:

10.4230/LIPIcs.MFCS.2025.71

Event:

50th International Symposium on Mathematical Foundations of Computer Science (MFCS 2025)

Editors:

Paweł Gawrychowski, Filip Mazowiecki, and Michał Skrzypczak

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

An (integer) linear recurrence sequence (LRS) $\langle u_{n}\rangle_{n=0}^{\infty}$ is a sequence of integers satisfying a recurrence of the form

u_{n+k}=a_{1}u_{n+k-1}+\cdots+a_{k}u_{n}

(1)

where the coefficients $a_{1},\ldots,a_{k}$ are integers. The celebrated theorem of Skolem, Mahler, and Lech [28, 20, 15] describes the set of zero terms of such a recurrence:

Theorem 1.

Given an integer linear recurrence sequence $\langle u_{n}\rangle_{n=0}^{\infty}$ , the set $\{n\in\mathbb{N}:u_{n}=0\}$ is a union of finitely many arithmetic progressions together with a finite set.

The statement of Thm. 1 can be refined by considering the notion of non-degeneracy of an LRS. An LRS is non-degenerate if in its minimal recurrence no quotient of two distinct roots of the characteristic polynomial is a root of unity.¹¹1For basic definitions, facts, and properties concerning linear recurrence sequences, we refer the reader to standard texts such as [9, Chaps. 1 and 2], [14, Chap. 4], or [29, Chap. 4]. A given LRS can be effectively decomposed as the interleaving of finitely many non-degenerate sequences, some of which may be identically zero. The core of the Skolem-Mahler-Lech theorem is the fact that a non-zero non-degenerate linear recurrence sequence has finitely many zero terms. Unfortunately, all known proofs of this last result are ineffective: it is not known how to compute the finite set of zeros of a given non-degenerate linear recurrence sequence. It is readily seen that existence of a procedure to do so is equivalent to the existence of a procedure to determine whether an arbitrary given LRS has a zero term; the latter is known as the Skolem Problem. We refer to [4, Chap. 6] and [30, Chap. X] for expository accounts of the Skolem-Mahler-Lech theorem and discussion of the ineffectiveness of known proofs.

In computer science, the Skolem Problem lies at the heart of key decision problems in formal power series [26, 3], stochastic model checking [25], control theory [6, 10], and loop termination [24]. The problem is also closely related to membership problems on commutative matrix groups and semigroups, as considered in [7, 13]. We note that in several of the above-mentioned citations, the Skolem Problem is used as a reference benchmark to establish hardness of other open decision problems.

Decidability of the Skolem Problem is known only for certain special cases, based on the relative order of the absolute values of the characteristic roots. Say that a characteristic root $\lambda$ is dominant if its absolute value is maximal among all the characteristic roots. Decidability is known in case there are at most $3$ dominant characteristic roots, and also for recurrences of order at most $4$ [21, 31]. However for LRS of order $5$ it is not currently known how to decide the Skolem Problem. For a (highly restricted) subclass of LRS, the paper [1] obtains nearly matching complexity lower and upper bounds for the problem.

Some recent lines of research have succeeded in establishing conditional decidability of the Skolem Problem for simple LRS (i.e., LRS none of whose characteristic roots are repeated), assuming certain classical number-theoretic conjectures [16, 5]. Nevertheless, to the best of our knowledge, no putative algorithm has to date been proposed to solve the Skolem Problem in full generality.

A different approach was initiated in [18, 19, 17] via the notion of Universal Skolem Sets. An infinite, recursive set $\mathcal{S}\subseteq\mathbb{N}$ is a Universal Skolem Set if there is some algorithm which, given any LRS, determines whether or not the LRS has a zero in $\mathcal{S}$ . Decidability of the Skolem Problem is then of course equivalent to the assertion that $\mathbb{N}$ is itself a Universal Skolem Set. The authors of [18] succeded in exhibiting a sparse Universal Skolem Set, i.e., a set having null density, and left open the question of whether Universal Skolem Sets of strictly positive density, or even density one, could be constructed (the interest in high-density Universal Skolem Sets being that they approximate $\mathbb{N}$ more closely). The question was partially answered in [19], which presented a positive-density Universal Skolem Set, albeit restricted to simple LRS, and in [17], which exhibited a Universal Skolem Set of strictly positive density, and even established density $1$ subject to the Bateman-Horn conjecture in number theory.

In this paper we propose an explicit bound for the largest zero of a non-degenerate LRS in terms of the data describing the LRS. We call zeros that exceed this bound large zeros of the LRS. Evidently, decidability of the Skolem Problem would follow from a proof that large zeros do not exist. Using known upper bounds on the cardinality of the set of zeros of non-degenerate LRS, it is relatively straightforward to show that the set of integers arising as large zeros of some non-degenerate LRS has null density, which in turn yields a Universal Skolem Set of unconditional density one; this is the first of our two main contributions.

While a proof that large zeros do not exist currently seems well out of reach, we give a heuristic argument as to why this should nevertheless be expected. This argument is based on an analogue of the well-known Cramér conjecture on gaps between consecutive primes. This conjecture, originally formulated by Cramér in 1936 [8] and subsequently refined by various number theorists into its present form, asserts that, for some constant $\kappa>1$ , for every prime $p$ the distance to the next largest prime is at most $\kappa(\log p)^{2}$ . The conjecture is based on the heuristic that the sequence of prime numbers behaves similarly to a Poisson-like random process in which the probability of a number $x$ being prime is $1/\log x$ . The largest observed prime gaps are of the order of $0.5(\log p)^{2}$ [23], however the best known upper bound on prime gaps is $O(p^{0.525})$ , due to Baker, Harman, and Pintz [2], which is far from Cramér’s conjectured bound. Cramér himself proved that, under the Riemann hypothesis, prime gaps are bounded above by $O(p^{0.5}\log p)$ [8]. On the other hand, the best known lower bound is $\Omega(\log p\log\log p)$ , which is some way from the conjectured upper bound. We refer to [12] for a discussion of Cramér’s conjecture and its refinements.

Here we define a subset of so-called good primes based on divisibility properties of LRS. We show that the set of good primes has density one in the set of all primes, or in other words that, asymptotically speaking, almost all primes are good primes. We further show that if the Cramér conjecture applies also to gaps between consecutive good primes, then large zeros of LRS cannot exist. The proof of the latter result (our second main contribution) proceeds by establishing an upper bound on the number of good primes in the neighbourhood of a large zero that violates the conjectured upper bound on gaps between good primes. In other words, if good primes are distributed according to Cramér’s heuristic then large zeros cannot exist and the Skolem Problem is decidable.

2 Background

We will need some basic notions concerning algebraic numbers. All material can be found in [11]. Recall that a number field $\mathbb{K}$ is a subfield of $\mathbb{C}$ that is finite dimensional as a vector space over $\mathbb{Q}$ . We assume that $\mathbb{K}$ is a Galois extension of $\mathbb{Q}$ , that is, it arises as the splitting field of a polynomial with integer coefficients. All elements of $\mathbb{K}$ are algebraic over $\mathbb{Q}$ , that is, they arise as roots of polynomials with integer coefficients. Those elements that arise more specifically as roots of monic polynomials with integer coefficients are called algebraic integers. The algebraic integers in $\mathbb{K}$ form a subring, denoted $\mathcal{O}_{\mathbb{K}}$ .

For a number field $\mathbb{K}$ , we denote by $\mathrm{Gal}(\mathbb{K}/\mathbb{Q})$ the group of field automorphisms of $\mathbb{K}$ . Given $\alpha\in\mathbb{K}$ , the norm of $\alpha$ is defined by

\displaystyle\mathcal{N}_{\mathbb{K}/\mathbb{Q}}(\alpha)=\!\!\!\!\!\prod_{% \sigma\in\mathrm{Gal}(\mathbb{K}/\mathbb{Q})}\!\!\!\!\!\!\sigma(\alpha)\,.

The norm $\mathcal{N}_{\mathbb{K}/\mathbb{Q}}(\alpha)$ is rational for all $\alpha\in\mathbb{K}$ ; moreover $\mathcal{N}_{\mathbb{K}/\mathbb{Q}}(\alpha)=0$ iff $\alpha=0$ , and $\mathcal{N}_{\mathbb{K}/\mathbb{Q}}(\alpha)$ is an integer if $\alpha\in\mathcal{O}_{\mathbb{K}}$ . Clearly we have $|\mathcal{N}_{\mathbb{K}/\mathbb{Q}}(\alpha)|\leq M^{d_{\mathbb{K}}}$ , where $d_{\mathbb{K}}$ is the degree of $\mathbb{K}$ and

\displaystyle M:=\!\!\!\max_{\sigma\in\mathrm{Gal}(\mathbb{K}/\mathbb{Q})}|% \sigma(\alpha)|

is the house of $\alpha$ .

We recall that every ideal in $\mathcal{O}_{\mathbb{K}}$ can be written uniquely up to the order of its factors as the product of prime ideals. Given a rational prime $P\in\mathbb{Z}$ , we say that a prime ideal $\mathfrak{p}$ lies above $P$ if $\mathfrak{p}$ is a factor of $P\mathcal{O}_{\mathbb{K}}$ . In this case we have that $P\mid\mathcal{N}_{\mathbb{K}/\mathbb{Q}}(\alpha)$ for all $\alpha\in\mathfrak{p}$ .

Let $\mathfrak{p}$ be a prime ideal of ${\mathcal{O}}_{\mathbb{K}}$ lying above $P\in\mathbb{Z}$ . Recall that the Frobenius automorphism $\sigma\in{\rm Gal}({\mathbb{K}}/{\mathbb{Q}})$ corresponding to $\mathfrak{p}$ is such that $\sigma(\alpha)\equiv\alpha^{P}\bmod\mathfrak{p}$ for all $\alpha\in\mathcal{O}_{\mathbb{K}}$ ; in fact it is the unique Galois automorphism with this property.

3 Large Zeros and a Universal Skolem Set of Density One

For an LRS $\mathbf{u}=\langle u_{n}\rangle_{n=0}^{\infty}$ as in (1), define its size²²2Note that we consider here the magnitude of the numbers defining a given LRS (rather than their bit size as is more common in complexity theory). An alternative definition in terms of bit size would of course be possible, only requiring altering (2) into a seventh-fold exponential. to be

C_{\mathbf{u}}:=\max\{k,|a_{1}|,\ldots,|a_{k}|,|u_{0}|,\ldots,|u_{k-1}|,12\}\,.

Given a (partial) function $f:\mathbb{R}\rightarrow\mathbb{R}$ and a positive integer $\ell$ , let $f_{\ell}(x)=f\circ f\circ\cdots\circ f(x)$ , where the iteration is $\ell$ -fold (thus $f_{1}=f$ ). We say that $n$ is a zero of $\mathbf{u}$ if $u_{n}=0$ , and we say that it is a large zero if the inequality

n<2\exp_{6}(C_{\mathbf{u}})

(2)

fails.

As we argue later on, there are good reasons to expect that (2) holds for all zeros of all non-degenerate LRS, which in turn would establish decidability of the Skolem Problem.³³3The sixth-fold exponential in (2) has of course been chosen in order for our mathematical argument to go through. In actual fact, it is plausible to expect that a single exponential would suffice: as far as we are aware, there is currently no known construction of a family of non-degenerate LRS having zeros at indices of magnitude even a single exponential in terms of the size of the LRS, as defined above. Unfortunately, we are unable to prove this assertion. Nevertheless, we now show that large zeros are very sparse, i.e., have null density amongst the positive integers. To this end, let

{\mathcal{L}}:=\{n\in\mathbb{N}:{\text{\rm there exists a non-degenerate LRS}}% \leavevmode\nobreak\ {\text{\bf u}}\leavevmode\nobreak\ {\text{\rm such that}}% \leavevmode\nobreak\ u_{n}=0\leavevmode\nobreak\ {\text{\rm and}}\leavevmode% \nobreak\ \eqref{eq:large}\leavevmode\nobreak\ {\text{\rm fails}}\}\,.

Thus ${\mathcal{L}}$ is the set of large zeros of some non-degenerate LRS.

Theorem 2.

The set ${\mathcal{L}}$ has null density. In fact, writing ${\mathcal{L}}(X)={\mathcal{L}}\cap[0,X]$ , the inequality

\#{\mathcal{L}}(X)=O\left(\frac{X}{(\log X)^{B}}\right)

holds with any constant $B>0$ .

Proof.

Let $X>2\exp_{7}(1)$ be an integer, and put $C:=\lceil\log_{6}(X/2)\rceil$ . We say that an LRS $\mathbf{u}$ is small at level $X$ if it has size $C_{\mathbf{u}}\leq\log_{6}(X/2)$ . We now wish to count the number of large zeros in the interval $[0,X]$ . By definition, any such zero originates from an LRS $\mathbf{u}$ which is small at level $X$ , i.e., having size $C_{\mathbf{u}}\leq C$ . Let us count the number of such LRS. Its coefficients $a_{1},\ldots,a_{k}$ and initial values $u_{0},\ldots,u_{k-1}$ are all in $[-C,C]$ , an interval containing at most $2C+1<3C$ integers. Altogether for fixed $k$ there are at most $(3C)^{2k}\leq(3C)^{2C}$ $2k$ -tuples, and summing up over $k$ we derive an upper bound of $C(3C)^{2C}<C^{3C}$ distinct possible LRS of size at most $C$ .

On the other hand, Schmidt [27] proved that any LRS of order $k$ or less has at most $\exp_{3}(3k\log k)<\exp_{4}(C)$ zeros, using the fact here that $k$ is taken to be at most $C$ . Hence the total number of zeros emanating from such LRS in the interval $[0,X]$ is at most $\exp_{4}(C)C^{3C}$ , whence the inequality

\#{\mathcal{L}}(X)=O\left(\frac{X}{(\log X)^{B}}\right)

easily follows for any $B>0$ . $\hfill\blacktriangleleft$

Corollary 3.

The set $\mathcal{S}:=\mathbb{N}\setminus\mathcal{L}$ is a Universal Skolem Set of density one.

Proof.

It is clear that the set $\mathcal{L}$ is recursive, and hence that $\mathcal{S}$ is recursive as well.

Density one follows from Thm. 2, and universality follows from the fact that $\mathcal{S}$ , by definition, doesn’t contain any large zeros. Thus given any non-degenerate LRS $\mathbf{u}$ of size $C_{\mathbf{u}}$ , its only possible zeros in $\mathcal{S}$ can only lie in the interval $[0,2\exp_{6}(C_{\mathbf{u}})]$ , which can readily be checked. $\hfill\blacktriangleleft$

4 Bad Primes and Good Primes

We first define what it means for a prime number to be bad. As before, let $X>2\exp_{7}(1)$ be an integer, and recall than an LRS $\mathbf{u}$ is small at level $X$ if it has size $C_{\mathbf{u}}\leq\log_{6}(X/2)$ . Let us write $C:=C_{\mathbf{u}}$ (that is, we omit the dependence on $\mathbf{u}$ ). We can express the general term $u_{t}$ of $\mathbf{u}$ as

u_{t}=\sum_{i=1}^{s}Q_{i}(t)\alpha_{i}^{t}\,,

(3)

where $s\leq C$ and $\alpha_{1},\ldots,\alpha_{s}$ are the roots of the characteristic polynomial

x^{k}-a_{1}x^{k-1}-\cdots-a_{k}

of $\mathbf{u}$ and $Q_{1},\ldots,Q_{s}$ are univariate polynomials. Note that all characteristic roots are algebraic integers since the characteristic polynomial is monic and comprises exclusively coefficients in $\mathbb{Z}$ . Recall that if $\alpha_{i}$ has multiplicity $\mu_{i}$ as a characteristic root then $Q_{i}$ has degree at most $\mu_{i}-1$ . Let ${\mathbb{K}}:={\mathbb{Q}}(\alpha_{1},\ldots,\alpha_{s})$ . The coefficients of each $Q_{i}$ are in $\mathbb{K}$ and can straightforwardly be computed from the initial values $u_{0},\ldots,u_{k-1}$ of the sequence by solving a system of $k$ linear equations, thanks to (3). By Cramer’s determinant rule,⁴⁴4This rule is named after the 18th-century Genevan mathematician Gabriel Cramer, who is presumably unrelated to the 20th-century Swedish mathematician Harald Cramér, whose work plays an important role in motivating the present article. each of the coefficients of $Q_{i}$ is the quotient of an algebraic integer by the determinant $\Delta:=\mathrm{det}(M)$ of the matrix⁵⁵5The matrix $M$ has $s$ blocks, one for each characteristic root. For $\ell\in\{1,\ldots,s\}$ the $\ell$ -th block has dimension $k\times\mu_{\ell}$ and has $(i,j)$ -th element $(i-1)^{(j-1)}\alpha_{\ell}^{(i-1)}$ for $i\in\{1,\ldots,k\}$ and $j\in\{1,\ldots,\mu_{\ell}\}$ .

M:=\begin{bmatrix}1&\ldots&0&1&\ldots&0&1&\cdots\\ \alpha_{1}&\ldots&\alpha_{1}&\alpha_{2}&\ldots&\alpha_{s-1}&\alpha_{s}&\ldots% \\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots\\ \alpha_{1}^{k-1}\;&\ldots\;&(k-1)^{\mu_{1}-1}\alpha_{1}^{k-1}\;&\alpha_{2}^{k-% 1}\;&\ldots\;&(k-1)^{\mu_{s}-1}\alpha_{s-1}^{k-1}\;&\alpha_{s}^{k-1}&\ldots% \end{bmatrix}\,.

By the Cauchy root bound we have $|\alpha_{i}|\leq 1+C$ for $i\in\{1,\ldots,s\}$ . It follows that the squared Euclidean norm of each column vector above is at most

\displaystyle{k(k-1)^{2(k-1)}(1+C)^{2k}}<k^{2k}(1+C)^{2k}\,.

Thus, by the Hadamard inequality,

|\Delta|^{2}<(k^{2k}(1+C)^{2k})^{k}=(k(1+C))^{2k^{2}}\,.

The determinant $\Delta$ is in general, of course, a complex number. Note however that any Galois automorphism $\sigma\in\mathrm{Gal}(\mathbb{K}/\mathbb{Q})$ will permute the characteristic roots, and thus when applied to $M$ will have the effect of permuting its columns. As a result, for any such $\sigma$ , $\sigma(\Delta)=\pm\Delta$ , and therefore the quantity $\Delta^{2}$ is stable under Galois automorphisms. We conclude that $\Delta^{2}$ must be a rational number, and since it is also by construction an algebraic integer,⁶⁶6Note that every entry of $M$ is an algebraic integer. we must have $\Delta^{2}\in\mathbb{Z}$ .

Let us now consider the LRS $\mathbf{v}:=\Delta^{2}\mathbf{u}$ , noting that $\mathbf{u}$ and $\mathbf{v}$ share the same zeros. Writing

v_{t}=\sum_{i=1}^{s}P_{i}(t)\alpha_{i}^{t}\,,

we observe that all the coefficients of each of the $P_{i}$ are algebraic integers. We therefore have, for each $1\leq i\leq s$ ,

P_{i}(t)=\Delta^{2}Q_{i}(t)=\sum_{j=0}^{\mu_{i}-1}c_{i,j}t^{j}\,.

We wish to estimate the size of each $c_{i,j}\in{\mathcal{O}}_{\mathbb{K}}$ . From our earlier calculation via Cramer’s determinant rule, noting that $|u_{0}|,\ldots,|u_{k-1}|$ are all bounded above by $C\leq 1+C$ , and invoking the Hadamard inequality once more, we conclude that the house of each $c_{i,j}$ is bounded above by

|\Delta|(k^{k}(1+C)^{k})^{k}<(k(1+C))^{2k^{2}}<(1+C)^{4C^{2}}<C^{C^{3}}\,.

Let $\sigma\in\Sigma_{s}$ be any permutation of the first $s$ integers and let

\beta_{i}:=\alpha_{\sigma(i)}\quad{\text{\rm for}}\quad i=1,\ldots,s\,.

For some nonnegative integer $m$ consider the algebraic integer

v_{m,\sigma}=\sum_{i=1}^{s}P_{i}(m)\beta_{i}\alpha_{i}^{m}\,.

(4)

Definition 4.

We say that $P\in[X,2X]$ is bad, if there exists an LRS $\mathbf{u}$ which is small at level $X$ , a permutation $\sigma\in\Sigma_{s}$ , and an integer $m\in[0,X^{1/4}]$ , such that

$\blacksquare$

The algebraic integer $v_{m,\sigma}$ defined in (4) above is non-zero, and
$\blacksquare$

$P$ is a prime factor of ${\mathcal{N}}_{{\mathbb{K}}/{\mathbb{Q}}}(v_{m,\sigma})$ .

Let ${\mathcal{P}}_{\text{\rm bad}}(X)$ be the set of bad primes in $[X,2X]$ .

Proposition 5.

We have

\#{\mathcal{P}}_{\text{\rm bad}}(X)<X^{2/3}

for all $X>X_{0}$ , where $X_{0}$ is some effective absolute constant.

Proof.

In order to estimate the size of ${\mathcal{P}}_{\text{\rm bad}}(X)$ , we first need to find out:

1.

How many such expressions (4) are there?
2.

How large are they?

For (1), recall from the proof of Thm. 2 that there are at most $C^{3C}$ distinct possible LRS of size at most $C$ . This in turn is an upper bound on the number of $s$ -tuples $((Q_{i},\alpha_{i}))_{i=1}^{s}$ . We must then multiply this quantity with the number of possible permutations of the characteristic roots, which is at most $C!<C^{C}$ . There are therefore at most $C^{4C}$ linear recurrence sequences $\mathbf{w}=\langle w_{m}\rangle_{m=0}^{\infty}$ whose $m$ -th term is given by

w_{m}=\sum_{i=1}^{s}P_{i}(m)\beta_{i}\alpha_{i}^{m}\quad{\text{\rm for% \leavevmode\nobreak\ all}}\quad m\geq 0\,.

This answers (1). As for (2), recall that the coefficients of $P_{i}$ are of size at most $C^{C^{3}}$ . There are at most $C$ terms, the largest monomial involved in $P_{i}(m)$ is at most $m^{C}<X^{C}$ and the largest root has magnitude at most $1+C<2C$ . Thus each individual term is of absolute value at most

	$\displaystyle C^{C^{3}+1}(2C)X^{C}(2C)^{X^{1/4}}$	$\displaystyle=\exp\left((C^{3}+1)\log C+\log(2C)+C\log X+X^{1/4}\log(2C)\right)$
		$\displaystyle<\exp(X^{0.26})$

for $X>X_{0}$ , since $C$ is tiny in comparison to $X$ . Hence the norm of the number shown in (4) is of size at most

\exp(C!X^{0.26})<\exp(X^{0.27})\quad{\text{\rm for}}\quad X>X_{0}\,,

since the degree of $\mathbb{K}$ is at most $C!$ (as $\mathbb{K}$ is the splitting field of a polynomial of degree at most $C$ ). Moreover, as noted earlier, there are at most $C^{4C}$ such expressions. Thus a bad prime $P$ divides an integer which is a product of such numbers and is of size at most

\exp(C^{4C}X^{0.27})<\exp(X^{0.28})\quad{\text{\rm for}}\quad X>X_{0}\,.

Therefore the number of possible choices for $P$ is at most $X^{0.28}$ . Since the number of choices for $m$ is at most $X^{0.25}$ , we conclude that, for $X>X_{0}$ , the cardinality of $\mathcal{P}_{\text{\rm bad}}(X)$ is at most

X^{0.25+0.28}<X^{2/3}\,,

as required. $\hfill\blacktriangleleft$

Finally, let us write $\mathcal{P}=\{p_{1},p_{2},\ldots\}$ to denote the set of prime numbers, enumerated in increasing order, and let $\mathcal{P}_{\mathrm{good}}:=\mathcal{P}\setminus\mathcal{P}_{\mathrm{bad}}=\{% g_{1},g_{2},\ldots\}$ denote the set of good primes, again enumerated in increasing order. Note that, by Prop. 5 along with the prime number theorem, the set of bad primes has null density amongst the prime numbers. This in turn entails that good primes have density one amongst all prime numbers.

5 The Cramér Argument

In this section we present a heuristic argument supporting the assertion that large zeros do not exist, or in other words that the set $\mathcal{L}$ is empty. The strategy is as follows. Assuming that good primes are distributed roughly similarly as ordinary primes, according to the Cramér model in number theory, one would expect that Cramér’s conjecture on gaps between primes applies also to good primes. More precisely, this conjecture postulates the existence of precise upper bounds on the largest possible gap between consecutive primes, and is predicated on the heuristic that the primes behave as a set of randomly distributed integers with asymptotic density conforming to the prime number theorem. However we show that around any large zero of an LRS there is an interval and an upper bound on the number of good primes in the interval that together contradict the above Cramér-type conjecture on gaps between good primes. We therefore surmise that large zeros do not exist.

Recall that $\mathcal{P}=\{p_{1},p_{2},\ldots\}$ denotes the set of prime numbers, enumerated in increasing order, and likewise let us write $\mathcal{P}_{\mathrm{good}}:=\{g_{1},g_{2},\ldots\}$ to denote an enumeration of the set of good primes in increasing order.

Conjecture 6 (Cramér-Granville).

For some $\kappa>1$ ,

\limsup_{j\rightarrow\infty}\frac{p_{j+1}-p_{j}}{\log^{2}p_{j}}=\kappa\,.

Cramér initially suggested that the constant $\kappa$ in Conjecture 6 might be 1 [8], but several decades later, building on substantial developments in the field, Granville produced evidence that $\kappa\geq 2e^{-\gamma}\approx 1.1229\ldots$ , where $\gamma$ is the Euler–Mascheroni constant [12]. There is in any event considerable computational evidence in support of the Cramér-Granville conjecture [23, 22].

As noted earlier, thanks to Prop. 5 and the prime number theorem, good primes have density one amongst all prime numbers:

\lim_{X\rightarrow\infty}\frac{\#\left(\mathcal{P}_{\mathrm{good}}\cap[0,X]% \right)}{\#\left(\mathcal{P}\cap[0,X]\right)}=1\,.

In other words, asymptotically speaking, almost all primes are good primes. Accordingly, it seems reasonable to suppose that good primes should behave similarly to ordinary primes, or at least should exhibit similar “statistical” properties. We therefore formulate:

Conjecture 7.

For some $\eta>1$ ,

\limsup_{j\rightarrow\infty}\frac{g_{j+1}-g_{j}}{\log^{2}g_{j}}=\eta\,.

We now have the following result.

Theorem 8.

Conjecture 7 implies that large zeros of LRS do not exist; or more precisely, that $\mathcal{L}$ is a finite set.

Proof.

Conjecture 7 can be reformulated as follows: there exist $\eta>1$ and $n_{0}\in\mathbb{N}$ such that, for all $n\geq n_{0}$ , the interval

[n-\eta(\log n)^{2},n]

always contains some good prime. In turn, this implies that the interval $[n-\eta(\log n)^{3},n]$ must contain at least $\log n$ distinct good primes for $n$ sufficiently large (say $n\geq n_{1}\geq\max\{n_{0},2\exp_{7}(1)\}$ ).

Thus let $n\geq n_{1}$ , put $C:=\log_{6}(n/2)$ , and suppose that there is some LRS $\mathbf{u}$ with $C_{\mathbf{u}}\leq C$ such that $u_{n}=0$ – in other words, $n$ is a large zero of $\mathbf{u}$ . Write $n=P+m$ , where $P\in[n-\eta(\log n)^{3},n]$ is a good prime and $0\leq m<\eta(\log n)^{3}<n^{1/4}$ . As in the previous section, let $\alpha_{1},\ldots,\alpha_{s}$ be the characteristic roots of $\mathbf{u}$ , put $\mathbb{K}:=\mathbb{Q}(\alpha_{1},\ldots,\alpha_{s})$ , and let $\Delta^{2}$ be the smallest positive integer such that, writing $\mathbf{v}:=\Delta^{2}\mathbf{u}$ , every term of $v_{t}$ of $\mathbf{v}$ has a representation as an exponential polynomial

v_{t}=\sum_{i=1}^{s}P_{i}(t)\alpha_{i}^{t}

in which all polynomials $P_{i}$ have algebraic-integer coefficients.

Since $u_{n}=v_{n}=0$ , we get

0=\sum_{i=1}^{s}P_{i}(P+m)\alpha_{i}^{P+m}\,.

We now reduce the above equation modulo $\mathfrak{p}$ , where $\mathfrak{p}$ is some prime ideal of ${\mathcal{O}}_{\mathbb{K}}$ dividing $P$ , from which we deduce that $P$ divides

{\mathcal{N}}_{{\mathbb{K}}/{\mathbb{Q}}}\left(\sum_{i=1}^{s}P_{i}(m)\beta_{i}% \alpha_{i}^{m}\right)\,,

(5)

where each $\beta_{i}=\sigma(\alpha_{i})$ is obtained from applying the Frobenius automorphism induced by $\mathfrak{p}$ in ${\mathbb{K}}$ to $\alpha_{i}$ . If the above expression (5) were non-zero, we would have to conclude that $P\in{\mathcal{P}}_{\text{\rm bad}}(n/2)$ , contradicting our choice of $P$ . Thus the expression (5) is zero.

Let us count how many expressions of the form (5) can vanish. More precisely, consider the (complex-valued) LRS $\mathbf{w}=\langle w_{j}\rangle_{j=0}^{\infty}$ whose $j$ -th term is given by

w_{j}=\sum_{i=1}^{s}P_{i}(j)\beta_{i}\alpha_{i}^{j}\quad{\text{\rm for% \leavevmode\nobreak\ all}}\quad j\geq 0\,,

and whose order is at most $C$ . Schmidt [27] proves that the number of distinct positive integers $m$ such that $w_{m}=0$ is at most

\exp_{3}(3C\log C)<\exp_{4}(C)\,.

Of course, given $\mathbf{u}$ , the $s$ -tuple $(\beta_{1},\ldots,\beta_{s})$ can be chosen in at most $s!<C^{C}$ ways. Thus the total number of possible zeros for expression (5) is at most $C^{C}\exp_{4}(C)<\exp_{5}(C)$ . Since distinct choices of $P$ give rise to distinct such zeros,⁷⁷7Recall that $n=P+m$ , and thus distinct choices of $P$ entail distinct values of $m$ . and (as noted earlier) there are at least $\log n$ possible choices for $P$ , we conclude that

\log n<\exp_{5}(C)\,,

or equivalently $n<\exp_{6}(C)=n/2$ , a contradiction. It thus follows, as claimed, that Conjecture 7 prohibits the existence of large zeros of LRS that are greater than the absolute constant $n_{1}$ . $\hfill\blacktriangleleft$

Thanks to Thm. 8, Conjecture 7 implies the existence of an algorithm to solve the Skolem Problem, as follows. Given an LRS $\mathbf{u}$ , first decompose $\mathbf{u}$ into finitely many non-degenerate LRS, and check that none of these is identically zero. Next, for each sub-LRS $\mathbf{v}$ of size $C_{\mathbf{v}}$ , simply search for a zero up to index $2\exp_{6}(C_{\mathbf{v}})$ .⁸⁸8Technically speaking, the algorithm should examine all terms up to index $\max\{2\exp_{6}(C_{\mathbf{v}}),n_{1}\}$ , where $n_{1}$ is the absolute constant appearing in the proof of Thm. 8. The existence of $n_{1}$ is implied by Conjecture 7, but its effectivity would depend on the effectivity of Conjecture 7. If at the end of this process no zero has been found for any of the LRS, return that $\mathbf{u}$ has no zeros.

References

[1] S. Akshay, N. Balaji, A. Murhekar, R. Varma, and N. Vyas. Near-optimal complexity bounds for fragments of the Skolem problem. In STACS, volume 154 of LIPIcs, pages 37:1–37:18. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPICS.STACS.2020.37.
[2] R. C. Baker, G. Harman, and J. Pintz. The difference between consecutive primes, ii. Proceedings of the London Mathematical Society, 83(3):532–562, 2001.
[3] D. Beauquier, A. M. Rabinovich, and A. Slissenko. A logic of probability with decidable model checking. J. Log. Comput., 16(4), 2006. doi:10.1093/LOGCOM/EXL004.
[4] J. Berstel and C. Reutenauer. Noncommutative Rational Series with Applications. Cambridge University Press, 2010.
[5] Y. Bilu, F. Luca, J. Nieuwveld, J. Ouaknine, D. Purser, and J. Worrell. Skolem meets Schanuel. In MFCS, volume 241 of LIPIcs, pages 20:1–20:15. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.MFCS.2022.20.
[6] V. Blondel and J. Tsitsiklis. A survey of computational complexity results in systems and control. Automatica, 36(9):1249–1274, 2000. doi:10.1016/S0005-1098(00)00050-9.
[7] J.-Y. Cai, R. J. Lipton, and Y. Zalcstein. The complexity of the A B C problem. SIAM J. Comput., 29(6), 2000. doi:10.1137/S0097539794276853.
[8] H. Cramér. On the order of magnitude of the difference between consecutive prime numbers. Acta arithmetica, 2:23–46, 1936.
[9] G. Everest, A. van der Poorten, I. Shparlinski, and T. Ward. Recurrence Sequences. American Mathematical Society, 2003.
[10] N. Fijalkow, J. Ouaknine, A. Pouly, J. Sousa Pinto, and J. Worrell. On the decidability of reachability in linear time-invariant systems. In HSCC, pages 77–86. ACM, 2019. doi:10.1145/3302504.3311796.
[11] A. Fröhlich and M. J. Taylor. Algebraic Number Theory, volume 27 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, 1993.
[12] A. Granville. Harald Cramér and the distribution of prime numbers. Scandinavian Actuarial Journal, 1995(1):12–28, 1995.
[13] R. Kannan and R. J. Lipton. Polynomial-time algorithm for the orbit problem. JACM, 33(4), 1986. doi:10.1145/6490.6496.
[14] M. Kauers and P. Paule. The Concrete Tetrahedron — Symbolic Sums, Recurrence Equations, Generating Functions, Asymptotic Estimates. Texts & Monographs in Symbolic Computation. Springer, 2011.
[15] C. Lech. A note on recurring series. Ark. Mat., 2, 1953.
[16] R. Lipton, F. Luca, J. Nieuwveld, J. Ouaknine, D. Purser, and J. Worrell. On the Skolem problem and the Skolem conjecture. In LICS, pages 5:1–5:9. ACM, 2022. doi:10.1145/3531130.3533328.
[17] F. Luca, J. Maynard, A. Noubissie, J. Ouaknine, and J. Worrell. Skolem meets bateman-horn. CoRR, abs/2308.01152, 2023. doi:10.48550/arXiv.2308.01152.
[18] F. Luca, J. Ouaknine, and J. Worrell. Universal Skolem sets. In LICS, pages 1–6. IEEE, 2021. doi:10.1109/LICS52264.2021.9470513.
[19] F. Luca, J. Ouaknine, and J. Worrell. A universal Skolem set of positive lower density. In MFCS, volume 241 of LIPIcs, pages 73:1–73:12. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.MFCS.2022.73.
[20] K. Mahler. Eine arithmetische Eigenschaft der Taylor Koeffizienten rationaler Funktionen. Proc. Akad. Wet. Amsterdam, 38, 1935.
[21] M. Mignotte, T. Shorey, and R. Tijdeman. The distance between terms of an algebraic recurrence sequence. J. für die reine und angewandte Math., 349, 1984.
[22] T. R. Nicely. New maximal prime gaps and first occurrences. Math. Comput., 68(227):1311–1315, 1999. doi:10.1090/S0025-5718-99-01065-0.
[23] A. Odlyzko, M. Rubinstein, and M. Wolf. Jumping champions. Experimental Mathematics, 8(2):107–118, 1999. doi:10.1080/10586458.1999.10504393.
[24] J. Ouaknine and J. Worrell. On linear recurrence sequences and loop termination. ACM SIGLOG News, 2(2):4–13, 2015. doi:10.1145/2766189.2766191.
[25] J. Piribauer and C. Baier. On Skolem-hardness and saturation points in Markov decision processes. In ICALP, volume 168 of LIPIcs, pages 138:1–138:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPICS.ICALP.2020.138.
[26] G. Rozenberg and A. Salomaa. Cornerstones of Undecidability. Prentice Hall, 1994.
[27] W. M. Schmidt. The zero multiplicity of linear recurrence sequences. Acta Math., 182:243–282, 1999.
[28] T. Skolem. Ein Verfahren zur Behandlung gewisser exponentialer Gleichungen. In Comptes rendus du congrès des mathématiciens scandinaves, 1934.
[29] R. P. Stanley. Enumerative combinatorics. Cambridge studies in advanced mathematics, Volume 1, 2nd Edition, 2011.
[30] T. Tao. Structure and randomness: pages from year one of a mathematical blog. American Mathematical Society, 2008.
[31] N. K. Vereshchagin. The problem of appearance of a zero in a linear recurrence sequence (in Russian). Mat. Zametki, 38(2), 1985.

[bib.bib1] [1] S. Akshay, N. Balaji, A. Murhekar, R. Varma, and N. Vyas. Near-optimal complexity bounds for fragments of the Skolem problem. In STACS, volume 154 of LIPIcs, pages 37:1–37:18. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPICS.STACS.2020.37.

[bib.bib2] [2] R. C. Baker, G. Harman, and J. Pintz. The difference between consecutive primes, ii. Proceedings of the London Mathematical Society, 83(3):532–562, 2001.

[bib.bib3] [3] D. Beauquier, A. M. Rabinovich, and A. Slissenko. A logic of probability with decidable model checking. J. Log. Comput., 16(4), 2006. doi:10.1093/LOGCOM/EXL004.

[bib.bib4] [4] J. Berstel and C. Reutenauer. Noncommutative Rational Series with Applications. Cambridge University Press, 2010.

[bib.bib5] [5] Y. Bilu, F. Luca, J. Nieuwveld, J. Ouaknine, D. Purser, and J. Worrell. Skolem meets Schanuel. In MFCS, volume 241 of LIPIcs, pages 20:1–20:15. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.MFCS.2022.20.

[bib.bib6] [6] V. Blondel and J. Tsitsiklis. A survey of computational complexity results in systems and control. Automatica, 36(9):1249–1274, 2000. doi:10.1016/S0005-1098(00)00050-9.

[bib.bib7] [7] J.-Y. Cai, R. J. Lipton, and Y. Zalcstein. The complexity of the A B C problem. SIAM J. Comput., 29(6), 2000. doi:10.1137/S0097539794276853.

[bib.bib8] [8] H. Cramér. On the order of magnitude of the difference between consecutive prime numbers. Acta arithmetica, 2:23–46, 1936.

[bib.bib9] [9] G. Everest, A. van der Poorten, I. Shparlinski, and T. Ward. Recurrence Sequences. American Mathematical Society, 2003.

[bib.bib10] [10] N. Fijalkow, J. Ouaknine, A. Pouly, J. Sousa Pinto, and J. Worrell. On the decidability of reachability in linear time-invariant systems. In HSCC, pages 77–86. ACM, 2019. doi:10.1145/3302504.3311796.

[bib.bib11] [11] A. Fröhlich and M. J. Taylor. Algebraic Number Theory, volume 27 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, 1993.

[bib.bib12] [12] A. Granville. Harald Cramér and the distribution of prime numbers. Scandinavian Actuarial Journal, 1995(1):12–28, 1995.

[bib.bib13] [13] R. Kannan and R. J. Lipton. Polynomial-time algorithm for the orbit problem. JACM, 33(4), 1986. doi:10.1145/6490.6496.

[bib.bib14] [14] M. Kauers and P. Paule. The Concrete Tetrahedron — Symbolic Sums, Recurrence Equations, Generating Functions, Asymptotic Estimates. Texts & Monographs in Symbolic Computation. Springer, 2011.

[bib.bib15] [15] C. Lech. A note on recurring series. Ark. Mat., 2, 1953.

[bib.bib16] [16] R. Lipton, F. Luca, J. Nieuwveld, J. Ouaknine, D. Purser, and J. Worrell. On the Skolem problem and the Skolem conjecture. In LICS, pages 5:1–5:9. ACM, 2022. doi:10.1145/3531130.3533328.

[bib.bib17] [17] F. Luca, J. Maynard, A. Noubissie, J. Ouaknine, and J. Worrell. Skolem meets bateman-horn. CoRR, abs/2308.01152, 2023. doi:10.48550/arXiv.2308.01152.

[bib.bib18] [18] F. Luca, J. Ouaknine, and J. Worrell. Universal Skolem sets. In LICS, pages 1–6. IEEE, 2021. doi:10.1109/LICS52264.2021.9470513.

[bib.bib19] [19] F. Luca, J. Ouaknine, and J. Worrell. A universal Skolem set of positive lower density. In MFCS, volume 241 of LIPIcs, pages 73:1–73:12. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.MFCS.2022.73.

[bib.bib20] [20] K. Mahler. Eine arithmetische Eigenschaft der Taylor Koeffizienten rationaler Funktionen. Proc. Akad. Wet. Amsterdam, 38, 1935.

[bib.bib21] [21] M. Mignotte, T. Shorey, and R. Tijdeman. The distance between terms of an algebraic recurrence sequence. J. für die reine und angewandte Math., 349, 1984.

[bib.bib22] [22] T. R. Nicely. New maximal prime gaps and first occurrences. Math. Comput., 68(227):1311–1315, 1999. doi:10.1090/S0025-5718-99-01065-0.

[bib.bib23] [23] A. Odlyzko, M. Rubinstein, and M. Wolf. Jumping champions. Experimental Mathematics, 8(2):107–118, 1999. doi:10.1080/10586458.1999.10504393.

[bib.bib24] [24] J. Ouaknine and J. Worrell. On linear recurrence sequences and loop termination. ACM SIGLOG News, 2(2):4–13, 2015. doi:10.1145/2766189.2766191.

[bib.bib25] [25] J. Piribauer and C. Baier. On Skolem-hardness and saturation points in Markov decision processes. In ICALP, volume 168 of LIPIcs, pages 138:1–138:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPICS.ICALP.2020.138.

[bib.bib26] [26] G. Rozenberg and A. Salomaa. Cornerstones of Undecidability. Prentice Hall, 1994.

[bib.bib27] [27] W. M. Schmidt. The zero multiplicity of linear recurrence sequences. Acta Math., 182:243–282, 1999.

[bib.bib28] [28] T. Skolem. Ein Verfahren zur Behandlung gewisser exponentialer Gleichungen. In Comptes rendus du congrès des mathématiciens scandinaves, 1934.

[bib.bib29] [29] R. P. Stanley. Enumerative combinatorics. Cambridge studies in advanced mathematics, Volume 1, 2nd Edition, 2011.

[bib.bib30] [30] T. Tao. Structure and randomness: pages from year one of a mathematical blog. American Mathematical Society, 2008.

[bib.bib31] [31] N. K. Vereshchagin. The problem of appearance of a zero in a linear recurrence sequence (in Russian). Mat. Zametki, 38(2), 1985.