Rough numbers between consecutive primes

What's new 2025-08-11

First things first: due to an abrupt suspension of NSF funding to my home university of UCLA, the Institute of Pure and Applied Mathematics (which had been preliminarily approved for a five-year NSF grant to run the institute) is currently fundraising to ensure continuity of operations during the suspension, with a goal of raising $500,000. Donations can be made at this page. As incoming Director of Special Projects at IPAM, I am grateful for the support (both moral and financial) that we have already received in the last few days, but we are still short of our fundraising goal.

Back to math. Ayla Gafni and I have just uploaded to the arXiv the paper “Rough numbers between consecutive primes“. In this paper we resolve a question of Erdös concerning rough numbers between consecutive gaps, and with the assistance of modern sieve theory calculations, we in fact obtain quite precise asymptotics for the problem. (As a side note, this research was supported by my personal NSF grant which is also currently suspended; I am grateful to recent donations to my own research fund which have helped me complete this research.)

Define a prime gap to be an interval {(p_n, p_{n+1})} between consecutive primes. We say that a prime gap contains a rough number if there is an integer {m \in (p_n,p_{n+1})} whose least prime factor is at least the length {p_{n+1}-p_n} of the gap. For instance, the prime gap {(3,5)} contains the rough number {4}, but the prime gap {(7,11)} does not (all integers between {7} and {11} have a prime factor less than {4}). The first few {n} for which the {n^\mathrm{th}} prime gap contains a rough number are

\displaystyle  2, 3, 5, 7, 10, 13, 15, 17, 20, \dots.

Numerically, the proportion of {n} for which the {n^\mathrm{th}} prime gap does not contain a rough number decays slowly as {n} increases:

Erdös initially thought that all but finitely many prime gaps should contain a rough number, but changed his mind, as per the following quote:

…I am now sure that this is not true and I “almost” have a counterexample. Pillai and Szekeres observed that for every {t \leq 16}, a set of {t} consecutive integers always contains one which is relatively prime to the others. This is false for {t = 17}, the smallest counterexample being {2184, 2185, \dots, 2200}. Consider now the two arithmetic progressions {2183 + d \cdot 2 \cdot 3 \cdot 5 \cdot 7 \cdot 11 \cdot 13} and {2201 + d \cdot 2 \cdot 3 \cdot 5 \cdot 7 \cdot 11 \cdot 13}. There certainly will be infinitely many values of {d} for which the progressions simultaneously represent primes; this follows at once from hypothesis H of Schinzel, but cannot at present be proved. These primes are consecutive and give the required counterexample. I expect that this situation is rather exceptional and that the integers {k} for which there is no {m} satisfying {p_k < m < p_{k+1}} and {p(m) > p_{k+1} - p_k} have density {0}.

In fact Erdös’s observation can be made simpler: any pair of cousin primes {p_{n+1}=p_n+4} for {p_n > 3} (of which {(7,11)} is the first example) will produce a prime gap that does not contain any rough numbers.

The latter question of Erdös is listed as problem #682 on Thomas Bloom’s Erdös problems website. In this paper we answer Erdös’s question, and in fact give a rather precise bound for the number of counterexamples:

Theorem 1 (Erdos #682) For {X>2}, let {N(X)} be the number of prime gaps {(p_n, p_{n+1})} with {p_n \in [X,2X]} that do not contain a rough number. Then

\displaystyle  N(X) \ll \frac{X}{\log^2 X}. \ \ \ \ \ (1)

Assuming the Dickson–Hardy–Littlewood prime tuples conjecture, we can improve this to

\displaystyle  N(X) \sim c \frac{X}{\log^2 X} \ \ \ \ \ (2)

for some (explicitly describable) constant {c>0}.

In fact we believe that {c \approx 2.8}, although the formula we have to compute {c} converges very slowly. This is (weakly) supported by numerical evidence:

While many questions about prime gaps remain open, the theory of rough numbers is much better understood, thanks to modern sieve theoretic tools such as the fundamental lemma of sieve theory. The main idea is to frame the problem in terms of counting the number of rough numbers in short intervals {[x,x+H]}, where {x} ranges in some dyadic interval {[X,2X]} and {H} is a much smaller quantity, such as {H = \log^\alpha X} for some {0 < \alpha < 1}. Here, one has to tweak the definition of “rough” to mean “no prime factors less than {z}” for some intermediate {z} (e.g., {z = \exp(\log^\beta X)} for some {0 < \beta < \alpha} turns out to be a reasonable choice). These problems are very analogous to the extremely well studied problem of counting primes in short intervals, but one can make more progress without needing powerful conjectures such as the Hardy–Littlewood prime tuples conjecture. In particular, because of the fundamental lemma of sieve theory, one can compute the mean and variance (i.e., the first two moments) of such counts to high accuracy, using in particular some calculations on the mean values of singular series that go back at least to the work of Montgomery from 1970. This second moment analysis turns out to be enough (after optimizing all the parameters) to answer Erdös’s problem with a weaker bound

\displaystyle  N(X) \ll \frac{X}{\log^{4/3-o(1)} X}.

To do better, we need to work with higher moments. The fundamental lemma also works in this setting; one now needs precise asymptotics for the mean value of singular series of {k}-tuples, but this was fortunately worked out (in more or less exactly the format we needed) by Montgomery and Soundararajan in 2004. Their focus was establishing a central limit theorem for the distribution of primes in short intervals (conditional on the prime tuples conjecture), but their analysis can be adapted to show (unconditionally) good concentration of measure results for rough numbers in short intervals. A direct application of their estimates improves the upper bound on {N(X)} to

\displaystyle  N(X) \ll \frac{X}{\log^{2-o(1)} X}

and some more careful tweaking of parameters allows one to remove the {o(1)} error. This latter analysis reveals that in fact the dominant contribution to {N(X)} will come with prime gaps of bounded length, of which our understanding is still relatively poor (it was only in 2014 that Yitang Zhang famously showed that infinitely many such gaps exist). At this point we finally have to resort to (a Dickson-type form of) the prime tuples conjecture to get the asymptotic (2).