Joni Teräväinen and myself have just uploaded to the arXiv our preprint “Quantitative bounds for Gowers uniformity of the Möbius and von Mangoldt functions“. This paper makes quantitative the Gowers uniformity estimates on the Möbius function
and the von Mangoldt function
.
To discuss the results we first discuss the situation of the Möbius function, which is technically simpler in some (though not all) ways. We assume familiarity with Gowers norms and standard notations around these norms, such as the averaging notation
and the exponential notation
. The prime number theorem in qualitative form asserts that
![\displaystyle \mathop{\bf E}_{n \in [N]} \mu(n) = o(1)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%5Cmu%28n%29+%3D+o%281%29&bg=ffffff&fg=000000&s=0&c=20201002)
as

. With Vinogradov-Korobov error term, the prime number theorem is strengthened to
![\displaystyle \mathop{\bf E}_{n \in [N]} \mu(n) \ll \exp( - c \log^{3/5} N (\log \log N)^{-1/5} );](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%5Cmu%28n%29+%5Cll+%5Cexp%28+-+c+%5Clog%5E%7B3%2F5%7D+N+%28%5Clog+%5Clog+N%29%5E%7B-1%2F5%7D+%29%3B&bg=ffffff&fg=000000&s=0&c=20201002)
we refer to such decay bounds (With

type factors) as
pseudopolynomial decay. Equivalently, we obtain pseudopolynomial decay of Gowers

seminorm of

:
![\displaystyle \| \mu \|_{U^1([N])} \ll \exp( - c \log^{3/5} N (\log \log N)^{-1/5} ).](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+%5C%7C_%7BU%5E1%28%5BN%5D%29%7D+%5Cll+%5Cexp%28+-+c+%5Clog%5E%7B3%2F5%7D+N+%28%5Clog+%5Clog+N%29%5E%7B-1%2F5%7D+%29.&bg=ffffff&fg=000000&s=0&c=20201002)
As is well known, the Riemann hypothesis would be equivalent to an upgrade of this estimate to polynomial decay of the form
![\displaystyle \| \mu \|_{U^1([N])} \ll_\varepsilon N^{-1/2+\varepsilon}](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+%5C%7C_%7BU%5E1%28%5BN%5D%29%7D+%5Cll_%5Cvarepsilon+N%5E%7B-1%2F2%2B%5Cvarepsilon%7D&bg=ffffff&fg=000000&s=0&c=20201002)
for any

.
Once one restricts to arithmetic progressions, the situation gets worse: the Siegel-Walfisz theorem gives the bound
![\displaystyle \| \mu 1_{a \hbox{ mod } q}\|_{U^1([N])} \ll_A \log^{-A} N \ \ \ \ \ (1)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+1_%7Ba+%5Chbox%7B+mod+%7D+q%7D%5C%7C_%7BU%5E1%28%5BN%5D%29%7D+%5Cll_A+%5Clog%5E%7B-A%7D+N+%5C+%5C+%5C+%5C+%5C+%281%29&bg=ffffff&fg=000000&s=0&c=20201002)
for any residue class

and any

, but with the catch that the implied constant is
ineffective in

. This ineffectivity cannot be removed without further progress on the notorious Siegel zero problem.
In 1937, Davenport was able to show the discorrelation estimate
![\displaystyle \mathop{\bf E}_{n \in [N]} \mu(n) e(-\alpha n) \ll_A \log^{-A} N](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%5Cmu%28n%29+e%28-%5Calpha+n%29+%5Cll_A+%5Clog%5E%7B-A%7D+N&bg=ffffff&fg=000000&s=0&c=20201002)
for any

uniformly in

, which leads (by standard Fourier arguments) to the Fourier uniformity estimate
![\displaystyle \| \mu \|_{U^2([N])} \ll_A \log^{-A} N.](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+%5C%7C_%7BU%5E2%28%5BN%5D%29%7D+%5Cll_A+%5Clog%5E%7B-A%7D+N.&bg=ffffff&fg=000000&s=0&c=20201002)
Again, the implied constant is ineffective. If one insists on effective constants, the best bound currently available is
![\displaystyle \| \mu \|_{U^2([N])} \ll \log^{-c} N \ \ \ \ \ (2)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+%5C%7C_%7BU%5E2%28%5BN%5D%29%7D+%5Cll+%5Clog%5E%7B-c%7D+N+%5C+%5C+%5C+%5C+%5C+%282%29&bg=ffffff&fg=000000&s=0&c=20201002)
for some small effective constant

.
For the situation with the
norm the previously known results were much weaker. Ben Green and I showed that
![\displaystyle \mathop{\bf E}_{n \in [N]} \mu(n) \overline{F}(g(n) \Gamma) \ll_{A,F,G/\Gamma} \log^{-A} N \ \ \ \ \ (3)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%5Cmu%28n%29+%5Coverline%7BF%7D%28g%28n%29+%5CGamma%29+%5Cll_%7BA%2CF%2CG%2F%5CGamma%7D+%5Clog%5E%7B-A%7D+N+%5C+%5C+%5C+%5C+%5C+%283%29&bg=ffffff&fg=000000&s=0&c=20201002)
uniformly for any

, any degree two (filtered) nilmanifold

, any polynomial sequence

, and any Lipschitz function

; again, the implied constants are ineffective. On the other hand, in a
separate paper of Ben Green and myself, we established the following inverse theorem: if for instance we knew that
![\displaystyle \| \mu \|_{U^3([N])} \geq \delta](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+%5C%7C_%7BU%5E3%28%5BN%5D%29%7D+%5Cgeq+%5Cdelta&bg=ffffff&fg=000000&s=0&c=20201002)
for some

, then there exists a degree two nilmanifold

of dimension

, complexity

, a polynomial sequence

, and Lipschitz function

of Lipschitz constant

such that
![\displaystyle \mathop{\bf E}_{n \in [N]} \mu(n) \overline{F}(g(n) \Gamma) \gg \exp(-\delta^{-O(1)}).](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%5Cmu%28n%29+%5Coverline%7BF%7D%28g%28n%29+%5CGamma%29+%5Cgg+%5Cexp%28-%5Cdelta%5E%7B-O%281%29%7D%29.&bg=ffffff&fg=000000&s=0&c=20201002)
Putting the two assertions together and comparing all the dependencies on parameters, one can establish the qualitative decay bound
![\displaystyle \| \mu \|_{U^3([N])} = o(1).](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+%5C%7C_%7BU%5E3%28%5BN%5D%29%7D+%3D+o%281%29.&bg=ffffff&fg=000000&s=0&c=20201002)
However the decay rate

produced by this argument is
completely ineffective: obtaining a bound on when this

quantity dips below a given threshold

depends on the implied constant in
(3) for some

whose dimension depends on

, and the dependence on

obtained in this fashion is ineffective in the face of a Siegel zero.
For higher norms
, the situation is even worse, because the quantitative inverse theory for these norms is poorer, and indeed it was only with the recent work of Manners that any such bound is available at all (at least for
). Basically, Manners establishes if
![\displaystyle \| \mu \|_{U^k([N])} \geq \delta](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+%5C%7C_%7BU%5Ek%28%5BN%5D%29%7D+%5Cgeq+%5Cdelta&bg=ffffff&fg=000000&s=0&c=20201002)
then there exists a degree

nilmanifold

of dimension

, complexity

, a polynomial sequence

, and Lipschitz function

of Lipschitz constant

such that
![\displaystyle \mathop{\bf E}_{n \in [N]} \mu(n) \overline{F}(g(n) \Gamma) \gg \exp\exp(-\delta^{-O(1)}).](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%5Cmu%28n%29+%5Coverline%7BF%7D%28g%28n%29+%5CGamma%29+%5Cgg+%5Cexp%5Cexp%28-%5Cdelta%5E%7B-O%281%29%7D%29.&bg=ffffff&fg=000000&s=0&c=20201002)
(We allow all implied constants to depend on

.) Meanwhile, the bound
(3) was extended to arbitrary nilmanifolds
by Ben and myself. Again, the two results when concatenated give the qualitative decay
![\displaystyle \| \mu \|_{U^k([N])} = o(1)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+%5C%7C_%7BU%5Ek%28%5BN%5D%29%7D+%3D+o%281%29&bg=ffffff&fg=000000&s=0&c=20201002)
but the decay rate is completely ineffective.
Our first result gives an effective decay bound:
Theorem 1 For any
, we have
for some
. The implied constants are effective.
This is off by a logarithm from the best effective bound (2) in the
case. In the
case there is some hope to remove this logarithm based on the improved quantitative inverse theory currently available in this case, but there is a technical obstruction to doing so which we will discuss later in this post. For
the above bound is the best one could hope to achieve purely using the quantitative inverse theory of Manners.
We have analogues of all the above results for the von Mangoldt function
. Here a complication arises that
does not have mean close to zero, and one has to subtract off some suitable approximant
to
before one would expect good Gowers norms bounds. For the prime number theorem one can just use the approximant
, giving
![\displaystyle \| \Lambda - 1 \|_{U^1([N])} \ll \exp( - c \log^{3/5} N (\log \log N)^{-1/5} )](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5CLambda+-+1+%5C%7C_%7BU%5E1%28%5BN%5D%29%7D+%5Cll+%5Cexp%28+-+c+%5Clog%5E%7B3%2F5%7D+N+%28%5Clog+%5Clog+N%29%5E%7B-1%2F5%7D+%29&bg=ffffff&fg=000000&s=0&c=20201002)
but even for the prime number theorem in arithmetic progressions one needs a more accurate approximant. In our paper it is convenient to use the “Cramér approximant”

where

and

is the quasipolynomial quantity

Then one can show from the Siegel-Walfisz theorem and standard bilinear sum methods that
![\displaystyle \mathop{\bf E}_{n \in [N]} (\Lambda - \Lambda_{\hbox{Cram\'er}}(n)) e(-\alpha n) \ll_A \log^{-A} N](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%28%5CLambda+-+%5CLambda_%7B%5Chbox%7BCram%5C%27er%7D%7D%28n%29%29+e%28-%5Calpha+n%29+%5Cll_A+%5Clog%5E%7B-A%7D+N&bg=ffffff&fg=000000&s=0&c=20201002)
and
![\displaystyle \| \Lambda - \Lambda_{\hbox{Cram\'er}}\|_{U^2([N])} \ll_A \log^{-A} N](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5CLambda+-+%5CLambda_%7B%5Chbox%7BCram%5C%27er%7D%7D%5C%7C_%7BU%5E2%28%5BN%5D%29%7D+%5Cll_A+%5Clog%5E%7B-A%7D+N&bg=ffffff&fg=000000&s=0&c=20201002)
for all

and

(with an ineffective dependence on

), again regaining effectivity if

is replaced by a sufficiently small constant

. All the previously stated discorrelation and Gowers uniformity results for

then have analogues for

, and our main result is similarly analogous:
Theorem 2 For any
, we have
for some
. The implied constants are effective.
By standard methods, this result also gives quantitative asymptotics for counting solutions to various systems of linear equations in primes, with error terms that gain a factor of
with respect to the main term.
We now discuss the methods of proof, focusing first on the case of the Möbius function. Suppose first that there is no “Siegel zero”, by which we mean a quadratic character
of some conductor
with a zero
with
for some small absolute constant
. In this case the Siegel-Walfisz bound (1) improves to a quasipolynomial bound
![\displaystyle \| \mu 1_{a \hbox{ mod } q}\|_{U^1([N])} \ll \exp(-\log^c N). \ \ \ \ \ (5)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%5Cmu+1_%7Ba+%5Chbox%7B+mod+%7D+q%7D%5C%7C_%7BU%5E1%28%5BN%5D%29%7D+%5Cll+%5Cexp%28-%5Clog%5Ec+N%29.+%5C+%5C+%5C+%5C+%5C+%285%29&bg=ffffff&fg=000000&s=0&c=20201002)
To establish Theorem
1 in this case, it suffices by Manners’ inverse theorem to establish the polylogarithmic bound
![\displaystyle \mathop{\bf E}_{n \in [N]} \mu(n) \overline{F}(g(n) \Gamma) \ll \exp(-\log^c N) \ \ \ \ \ (6)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%5Cmu%28n%29+%5Coverline%7BF%7D%28g%28n%29+%5CGamma%29+%5Cll+%5Cexp%28-%5Clog%5Ec+N%29+%5C+%5C+%5C+%5C+%5C+%286%29&bg=ffffff&fg=000000&s=0&c=20201002)
for all degree

nilmanifolds

of dimension

and complexity

, all polynomial sequences

, and all Lipschitz functions

of norm

. If the nilmanifold

had bounded dimension, then one could repeat the arguments
of Ben and myself more or less verbatim to establish this claim from
(5), which relied on the quantitative equidistribution theory on nilmanifolds developed in a
separate paper of Ben and myself. Unfortunately, in the latter paper the dependence of the quantitative bounds on the dimension

was not explicitly given. In an appendix to the current paper, we go through that paper to account for this dependence, showing that all exponents depend at most doubly exponentially in the dimension

, which is barely sufficient to handle the dimension of

that arises here.
Now suppose we have a Siegel zero
. In this case the bound (5) will not hold in general, and hence also (6) will not hold either. Here, the usual way out (while still maintaining effective estimates) is to approximate
not by
, but rather by a more complicated approximant
that takes the Siegel zero into account, and in particular is such that one has the (effective) pseudopolynomial bound
![\displaystyle \| (\mu - \mu_{\hbox{Siegel}}) 1_{a \hbox{ mod } q}\|_{U^1([N])} \ll \exp(-\log^c N) \ \ \ \ \ (7)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%28%5Cmu+-+%5Cmu_%7B%5Chbox%7BSiegel%7D%7D%29+1_%7Ba+%5Chbox%7B+mod+%7D+q%7D%5C%7C_%7BU%5E1%28%5BN%5D%29%7D+%5Cll+%5Cexp%28-%5Clog%5Ec+N%29+%5C+%5C+%5C+%5C+%5C+%287%29&bg=ffffff&fg=000000&s=0&c=20201002)
for all residue classes

. The Siegel approximant to

is actually a little bit complicated, and to our knowledge the first appearance of this sort of approximant only appears as late as this 2010
paper of Germán and Katai. Our version of this approximant is defined as the multiplicative function such that

when

, and

when

is coprime to all primes

, and

is a normalising constant given by the formula

(this constant ends up being of size

and plays only a minor role in the analysis). This is a rather complicated formula, but it seems to be virtually the only choice of approximant that allows for bounds such as
(7) to hold. (This is the one aspect of the problem where the von Mangoldt theory is simpler than the Möbius theory, as in the former one only needs to work with very rough numbers for which one does not need to make any special accommodations for the behavior at small primes when introducing the Siegel correction term.) With this starting point it is then possible to repeat the analysis of my previous papers with Ben and obtain the pseudopolynomial discorrelation bound
![\displaystyle \mathop{\bf E}_{n \in [N]} (\mu - \mu_{\hbox{Siegel}})(n) \overline{F}(g(n) \Gamma) \ll \exp(-\log^c N)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%28%5Cmu+-+%5Cmu_%7B%5Chbox%7BSiegel%7D%7D%29%28n%29+%5Coverline%7BF%7D%28g%28n%29+%5CGamma%29+%5Cll+%5Cexp%28-%5Clog%5Ec+N%29+&bg=ffffff&fg=000000&s=0&c=20201002)
for

as before, which when combined with Manners’ inverse theorem gives the doubly logarithmic bound
![\displaystyle \| \mu - \mu_{\hbox{Siegel}} \|_{U^k([N])} \ll (\log\log N)^{-c_k}.](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%7C+%5Cmu+-+%5Cmu_%7B%5Chbox%7BSiegel%7D%7D+%5C%7C_%7BU%5Ek%28%5BN%5D%29%7D+%5Cll+%28%5Clog%5Clog+N%29%5E%7B-c_k%7D.&bg=ffffff&fg=000000&s=0&c=20201002)
Meanwhile, a direct sieve-theoretic computation ends up giving the singly logarithmic bound
![\displaystyle \| \mu_{\hbox{Siegel}} \|_{U^k([N])} \ll \log^{-c_k} N](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%7C+%5Cmu_%7B%5Chbox%7BSiegel%7D%7D+%5C%7C_%7BU%5Ek%28%5BN%5D%29%7D+%5Cll+%5Clog%5E%7B-c_k%7D+N&bg=ffffff&fg=000000&s=0&c=20201002)
(indeed, there is a good chance that one could improve the bounds even further, though it is not helpful for this current argument to do so). Theorem
1 then follows from the triangle inequality for the Gowers norm. It is interesting that the Siegel approximant

seems to play a rather essential component in the proof, even if it is absent in the final statement. We note that this approximant seems to be a useful tool to explore the “illusory world” of the Siegel zero further; see for instance the
recent paper of Chinis for some work in this direction.
For the analogous problem with the von Mangoldt function (assuming a Siegel zero for sake of discussion), the approximant
is simpler; we ended up using

which allows one to state the standard prime number theorem in arithmetic progressions with classical error term and Siegel zero term compactly as
![\displaystyle \| (\Lambda - \Lambda_{\hbox{Siegel}}) 1_{a \hbox{ mod } q}\|_{U^1([N])} \ll \exp(-\log^c N).](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%7C+%28%5CLambda+-+%5CLambda_%7B%5Chbox%7BSiegel%7D%7D%29+1_%7Ba+%5Chbox%7B+mod+%7D+q%7D%5C%7C_%7BU%5E1%28%5BN%5D%29%7D+%5Cll+%5Cexp%28-%5Clog%5Ec+N%29.&bg=ffffff&fg=000000&s=0&c=20201002)
Routine modifications of previous arguments also give
![\displaystyle \mathop{\bf E}_{n \in [N]} (\Lambda - \Lambda_{\hbox{Siegel}})(n) \overline{F}(g(n) \Gamma) \ll \exp(-\log^c N) \ \ \ \ \ (8)](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+E%7D_%7Bn+%5Cin+%5BN%5D%7D+%28%5CLambda+-+%5CLambda_%7B%5Chbox%7BSiegel%7D%7D%29%28n%29+%5Coverline%7BF%7D%28g%28n%29+%5CGamma%29+%5Cll+%5Cexp%28-%5Clog%5Ec+N%29+%5C+%5C+%5C+%5C+%5C+%288%29&bg=ffffff&fg=000000&s=0&c=20201002)
and
![\displaystyle \| \Lambda_{\hbox{Siegel}} \|_{U^k([N])} \ll \log^{-c_k} N.](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%7C+%5CLambda_%7B%5Chbox%7BSiegel%7D%7D+%5C%7C_%7BU%5Ek%28%5BN%5D%29%7D+%5Cll+%5Clog%5E%7B-c_k%7D+N.&bg=ffffff&fg=000000&s=0&c=20201002)
The one tricky new step is getting from the discorrelation estimate
(8) to the Gowers uniformity estimate
![\displaystyle \| \Lambda - \Lambda_{\hbox{Siegel}} \|_{U^k([N])} \ll (\log\log N)^{-c_k}.](https://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%7C+%5CLambda+-+%5CLambda_%7B%5Chbox%7BSiegel%7D%7D+%5C%7C_%7BU%5Ek%28%5BN%5D%29%7D+%5Cll+%28%5Clog%5Clog+N%29%5E%7B-c_k%7D.&bg=ffffff&fg=000000&s=0&c=20201002)
One cannot directly apply Manners’ inverse theorem here because

and

are unbounded. There is a standard tool for getting around this issue, now known as the
dense model theorem, which is the standard engine powering the
transference principle from theorems about bounded functions to theorems about certain types of unbounded functions. However the quantitative versions of the dense model theorem in the literature are expensive and would basically weaken the doubly logarithmic gain here to a triply logarithmic one. Instead, we bypass the dense model theorem and directly transfer the inverse theorem for bounded functions to an inverse theorem for unbounded functions by using the
densification approach to transference introduced
by Conlon, Fox, and Zhao. This technique turns out to be quantitatively quite efficient (the dependencies of the main parameters in the transference are polynomial in nature), and also has the technical advantage of avoiding the somewhat tricky “correlation condition” present in early transference results which are also not beneficial for quantitative bounds.