246B, Notes 4: The Riemann zeta function and the prime number theorem
What's new 2021-02-12
One of the great classical triumphs of complex analysis was in providing the first proof (by Hadamard and de Vallée Poussin in 1896) of arguably the most important theorem in analytic number theory, the prime number theorem:
Theorem 1 (Prime number theorem) Letdenote the number of primes less than a given real number
. Then
(or in asymptotic notation,
as
).
(Actually, it turns out to be slightly more natural to replace the approximation in the prime number theorem by the more precise approximation
of the logarithmic integral, but we will not stress this point here.)
The complex-analytic proof of this theorem hinges on the study of a key meromorphic function related to the prime numbers, the Riemann zeta function . Initially, it is only defined on the half-plane
:
Definition 2 (Riemann zeta function, preliminary definition) Letbe such that
. Then we define
Note that the series is locally uniformly convergent in the half-plane , so in particular
is holomorphic on this region. In previous notes we have already evaluated some special values of this function:
The Riemann zeta function has several remarkable properties, some of which we summarise here:
Theorem 3 (Basic properties of the Riemann zeta function)
- (i) (Euler product formula) For any
with
, we have
where the product is absolutely convergent and is over the prime numbers
.
- (ii) (Trivial zero-free region)
has no zeroes in the region
.
- (iii) (Meromorphic continuation)
has a unique meromorphic continuation to the complex plane (which by abuse of notation we also call
), with a simple pole at
and no other poles. Furthermore, the Riemann xi function
is an entire function of order
(after removing all singularities). The function
is an entire function of order one after removing the singularity at
.
- (iv) (Functional equation) After applying the meromorphic continuation from (iii), we have
for all
(excluding poles). Equivalently, we have
for all
. (The equivalence between the (5) and (6) is a routine consequence of theEuler reflection formula and the Legendre duplication formula, see Exercises 26 and 31 of Notes 1.)
Proof: We just prove (i) and (ii) for now, leaving (iii) and (iv) for later sections.
The claim (i) is an encoding of the fundamental theorem of arithmetic, which asserts that every natural number is uniquely representable as a product
over primes, where the
are natural numbers, all but finitely many of which are zero. Writing this representation as
, we see that
The claim (ii) is immediate from (i) since the Euler product is absolutely convergent and all terms are non-zero.
We remark that by sending to
in Theorem 3(i) we conclude that
The meromorphic continuation (iii) of the zeta function is initially surprising, but can be interpreted either as a manifestation of the extremely regular spacing of the natural numbers occurring in the sum (1), or as a consequence of various integral representations of
(or slight modifications thereof). We will focus in this set of notes on a particular representation of
as essentially the Mellin transform of the theta function
that briefly appeared in previous notes, and the functional equation (iv) can then be viewed as a consequence of the modularity of that theta function. This in turn was established using the Poisson summation formula, so one can view the functional equation as ultimately being a manifestation of Poisson summation. (For a direct proof of the functional equation via Poisson summation, see these notes.)
Henceforth we work with the meromorphic continuation of . The functional equation (iv), when combined with special values of
such as (2), gives some additional values of
outside of its initial domain
, most famously
From Theorem 3 and the non-vanishing nature of , we see that
has simple zeroes (known as trivial zeroes) at the negative even integers
, and all other zeroes (the non-trivial zeroes) inside the critical strip
. (The non-trivial zeroes are conjectured to all be simple, but this is hopelessly far from being proven at present.) As we shall see shortly, these latter zeroes turn out to be closely related to the distribution of the primes. The functional equation tells us that if
is a non-trivial zero then so is
; also, we have the identity
Conjecture 4 (Riemann hypothesis) All the non-trivial zeroes oflie on the critical line
.
This conjecture would have many implications in analytic number theory, particularly with regard to the distribution of the primes. Of course, it is far from proven at present, but the partial results we have towards this conjecture are still sufficient to establish results such as the prime number theorem.
Return now to the original region where . To take more advantage of the Euler product formula (3), we take complex logarithms to conclude that
The series and
that show up in the above formulae are examples of Dirichlet series, which are a convenient device to transform various sequences of arithmetic interest into holomorphic or meromorphic functions. Here are some more examples:
Exercise 5 (Standard Dirichlet series) Letbe a complex number with
.
- (i) Show that
.
- (ii) Show that
, where
is the divisor function of
(the number of divisors of
).
- (iii) Show that
, where
is the Möbius function, defined to equal
when
is the product of
distinct primes for some
, and
otherwise.
- (iv) Show that
, where
is the Liouville function, defined to equal
when
is the product of
(not necessarily distinct) primes for some
.
- (v) Show that
, where
is the holomorphic branch of the logarithm that is real for
, and with the convention that
vanishes for
.
- (vi) Use the fundamental theorem of arithmetic to show that the von Mangoldt function is the unique function
such that
for every positive integer
. Use this and (i) to provide an alternate proof of the identity (8). Thus we see that (8) is really just another encoding of the fundamental theorem of arithmetic.
Given the appearance of the von Mangoldt function , it is natural to reformulate the prime number theorem in terms of this function:
Theorem 6 (Prime number theorem, von Mangoldt form) One has(or in asymptotic notation,
as
).
Let us see how Theorem 6 implies Theorem 1. Firstly, for any , we can write
Exercise 7 Show that Theorem 1 conversely implies Theorem 6.
The alternate form (8) of the Euler product identity connects the primes (represented here via proxy by the von Mangoldt function) with the logarithmic derivative of the zeta function, and can be used as a starting point for describing further relationships between and the primes. Most famously, we shall see later in these notes that it leads to the remarkably precise Riemann-von Mangoldt explicit formula:
Theorem 8 (Riemann-von Mangoldt explicit formula) For any non-integer, we have
where
ranges over the non-trivial zeroes of
with imaginary part in
. Furthermore, the convergence of the limit is locally uniform in
.
Actually, it turns out that this formula is in some sense too precise; in applications it is often more convenient to work with smoothed variants of this formula in which the sum on the left-hand side is smoothed out, but the contribution of zeroes with large imaginary part is damped; see Exercise 22. Nevertheless, this formula clearly illustrates how the non-trivial zeroes of the zeta function influence the primes. Indeed, if one formally differentiates the above formula in
, one is led to the (quite nonrigorous) approximation
Comparing Theorem 8 with Theorem 6, it is natural to suspect that the key step in the proof of the latter is to establish the following slight but important extension of Theorem 3(ii), which can be viewed as a very small step towards the Riemann hypothesis:
Theorem 9 (Slight enlargement of zero-free region) There are no zeroes ofon the line
.
It is not quite immediate to see how Theorem 6 follows from Theorem 8 and Theorem 9, but we will demonstrate it below the fold.
Although Theorem 9 only seems like a slight improvement of Theorem 3(ii), proving it is surprisingly non-trivial. The basic idea is the following: if there was a zero at , then there would also be a different zero at
(note
cannot vanish due to the pole at
), and then the approximation (9) becomes
In fact, Theorem 9 is basically equivalent to the prime number theorem:
Exercise 10 For the purposes of this exercise, assume Theorem 6, but do not assume Theorem 9. For any non-zero real, show that
as
, where
denotes a quantity that goes to zero as
after being multiplied by
. Use this to derive Theorem 9.
This equivalence can help explain why the prime number theorem is remarkably non-trivial to prove, and why the Riemann zeta function has to be either explicitly or implicitly involved in the proof.
This post is only intended as the briefest of introduction to complex-analytic methods in analytic number theory; also, we have not chosen the shortest route to the prime number theorem, electing instead to travel in directions that particularly showcase the complex-analytic results introduced in this course. For some further discussion see this previous set of lecture notes, particularly Notes 2 and Supplement 3 (with much of the material in this post drawn from the latter).
— 1. Meromorphic continuation and functional equation —
We now focus on understanding the meromorphic continuation of , as well as the functional equation that that continuation satisfies. The arguments here date back to Riemann’s original paper on the zeta function. The general strategy is to relate the zeta function
for
to some sort of integral involving the parameter
, which is manipulated in such a way that the integral makes sense for values of
outside of the halfplane
, and can thus be used to define the zeta function meromorphically in such a region. Often the Gamma function
is involved in the relationship between the zeta function and integral. There are many such ways to connect
to an integral; we present some of the more classical ones here.
One way to motivate the meromorphic continuation is to look at the continuous analogue
Exercise 11 Using the trapezoid rule, show that for anyin the region
with
, there exists a unique complex number
for which one has the asymptotic
for any natural number
, where
. Use this to extend the Riemann zeta function meromorphically to the region
. Conclude in particular that
and
.
Exercise 12 Obtain the refinement
to the trapezoid rule when
are integers and
is continuously three times differentiable. Then show that for any
in the region
with
, there exists a unique complex number
for which one has the asymptotic
for any natural number
, where
. Use this to extend the Riemann zeta function meromorphically to the region
. Conclude in particular that
.
One can keep going in this fashion using the Euler-Maclaurin formula (see this previous blog post) to extend the range of meromorphic continuation to the rest of the complex plane. However, we will now proceed in a different fashion, using the theta function
that made an appearance in previous notes, and try to transform this function into the zeta function. We will only need this function for imaginary valuesWe will attempt to apply the Mellin transform (Exercise 11 from Notes 2) to this function; formally, we have
Now we exploit the modular identity (12) to improve the convergence of this formula. The convergence of is much better near
than near
, so we use (13) to split
It remains to establish that is of order
. From (11) we have
so from the triangle inequality
Exercise 13 (Alternate derivation of meromorphic continuation and functional equation)
- (i) Establish the identity
whenever
.
- (ii) Establish the identity
whenever
,
is not an integer,
,
where
is the branch of the logarithm with real part in
, and
is the contour consisting of the line segment
, the semicircle
, and the line segment
.
- (iii) Use (ii) to meromorphically continue
to the entire complex plane
.
- (iv) By shifting the contour
to the contour
for a large natural number
and applying the residue theorem, show that
again using the branch
of the logarithm to define
.
- (v) Establish the functional equation (5).
Exercise 14 Use the formulafrom Exercise 12, together with the functional equation, to give yet another proof of the identity
.
Exercise 15 (Relation between zeta function and Bernoulli numbers)
- (i) For any complex number
with
, use the Poisson summation formula (Proposition 3(v) from Notes 2) to establish the identity
- (ii) For
as above and sufficiently small, show that
Conclude that
for any natural number
, where the Bernoulli numbers
are defined through the Taylor expansion
Thus for instance
,
, and so forth.
- (iii) Show that
for any odd natural number
. (This identity can also be deduced from the Euler-Maclaurin formula, which generalises the approach in Exercise 12; see this previous post.)
- (iv) Use (14) and the residue theorem (now working inside the contour
, rather than outside) to give an alternate proof of (15).
Exercise 16 (Convexity bounds)It is possible to improve the bounds (iii) in the region
- (i) Establish the bounds
for any
and
with
.
- (ii) Establish the bounds
for any
and
with
. (Hint: use the functional equation.)
- (iii) Establish the bounds
for any
and
with
. (Hint: use the Phragmén-Lindelöf principle, Exercise 19 from Notes 2, after dealing somehow with the pole at
.)
; such improvements are known as subconvexity estimates. For instance, it is currently known that
for any
and
, a result of Bourgain; the Lindelöf hypothesis asserts that this bound in fact holds for all
, although this remains unproven (it is however a consequence of the Riemann hypothesis).
Exercise 17 (Riemann-von Mangoldt formula) Show that for any, the number of zeroes of
in the rectangle
is equal to
. (Hint: apply the argument principle to
evaluated at a rectangle
for some
that is chosen so that the horizontal edges of the rectangle do not come too close to any of the zeroes (cf. the selection of the radii
in the proof of the Hadamard factorisation theorem in Notes 1), and use the functional equation and Stirling’s formula to control the asymptotics for the horizontal edges.)
We remark that the error term
, due to von Mangoldt in 1905, has not been significantly improved despite over a century of effort. Even assuming the Riemann hypothesis, the error has only been reduced very slightly to
(a result of Littlewood from 1924).
Remark 18 Thanks to the functional equation and Rouche’s theorem, it is possible to numerically verify the Riemann hypothesis in any finite portionof the critical strip, so long as the zeroes in that strip are all simple. Indeed, if there was a zero
off of the critical line
, then an application of the argument principle (and Rouche’s theorem) in some small contour around
but avoiding the critical line would be capable of numerically determining that there was a zero off of the line. Similarly, for each simple zero
on the critical line, applying the argument principle for some small contour around that zero and symmetric around the critical line would numerically verify that there was exactly one zero within that contour, which by the functional equation would then have to lie exactly on that line. (In practice, more efficient methods are used to numerically verify the Riemann hypothesis over large finite portions of the strip, but we will not detail them here.)
— 2. The explicit formula —
We now prove Riemann-von Mangoldt explicit formula. Since is a non-trivial entire function of order
, with zeroes at the non-trivial zeroes of
(the trivial zeroes having been cancelled out by the Gamma function), we see from the Hadamard factorisation theorem (in the form of Exercise 35 from Notes 1) that
One can compute the values of explicitly:
Exercise 19 By inspecting both sides of (16) as, show that
, and hence
.
Jensen’s formula tells us that the number of non-trivial zeroes of in a disk
is at most
for any
and
. One can obtain a local version:
Exercise 20 (Local bound on zeroes)
- (i) Establish the upper bound
whenever
and
with
. (Hint: use (10). More precise bounds are available with more effort, but will not be needed here.)
- (ii) Establish the bounds
uniformly in
. (Hint: use the Euler product.)
- (iii) Show that for any
, the number of non-trivial zeroes with imaginary part in
is
. (Hint: use Jensen’s formula and the functional equation.)
- (iv) For
,
, and
, with
not a zero of
, show that
(Hint: use Exercise 9 of Notes 1.)
Meanwhile, from Perron’s formula (Exercise 12 of Notes 2) and (8) we see that for any non-integer , we have
Exercise 21 (Riemann-von Mangoldt explicit formula) Letand
. Establish the following bounds:
(Hint: for (i)-(iii), shift the contour
- (i)
.
- (ii)
.
- (iii) For any positive integer
, we have
- (iv) For any non-trivial zero
, we have
- (v) We have
.
- (vi) We have
.
to
for an
that gets sent to infinity, and using the residue theorem. The same argument works for (iv) except when
is really close to
, in which case a detour to the contour may be called for. For (vi), use Exercise 20 and partition the zeroes depending on what unit interval
falls into.)
- (viii) Using the above estimates, conclude Theorem 8.
The explicit formula in Theorem 8 is completely exact, but turns out to be a little bit inconvenient for applications because it involves all the zeroes , and the series involving them converges very slowly (indeed the convergence is not even absolute). In practice it is preferable to work with a smoothed version of the formula. Here is one such smoothing:
Exercise 22 (Smoothed explicit formula)
- (i) Let
be a smooth function compactly supported on
. Show that
is entire and obeys the bound
(say) for some
, all
, and all
.
- (ii) With
as in (i), establish the identity
with the summations being absolutely convergent by applying the Fourier inversion formula to
, shifting the contour to frequencies
for some
, applying (8), and then shifting the contour again (using Exercise 20 and (i) to justify the contour shifting).
- (iii) Show that
whenever
is a smooth function, compactly supported in
, with the summation being absolutely convergent.
- (iv) Explain why (iii) is formally consistent with Theorem 8 when applied to the non-smooth function
.
— 3. Extending the zero free region, and the prime number theorem —
We now show how Theorem 9 implies Theorem 6. Let be parameters to be chosen later. We will apply Exercise 22 to a function
which equals one on
, is supported on
, and obeys the derivative estimates
Exercise 23 Assuming the Riemann hypothesis, show thatfor any
and
, and that
for any
and
. Conversely, show that either of these two estimates are equivalent to the Riemann hypothesis. (Hint: find a holomorphic continuation of
to the region
in a manner similar to how
was first holomorphically continued to the region
).
It remains to prove Theorem 9. The claim is clear for thanks to the simple pole of
at
, so we may assume
. Suppose for contradiction that there was a zero of
at
, thus
Exercise 24 Establish the inequalityfor any
and
.
Remark 25 There are a number of ways to improve Theorem 9 that move a little closer in the direction of the Riemann hypothesis. Firstly, there are a number of zero-free regions for the Riemann zeta function known that give lower bounds for(and in particular preclude the existence of zeros) a small amount inside the critical strip, and can be used to improve the error term in the prime number theorem; for instance, the classical zero-free region shows that there are no zeroes in the region
for some sufficiently small absolute constant
, and lets one improve the
error term in Theorem 6 to
(with a corresponding improvement in Theorem 1, provided that one replaces
with the logarithmic integral
). A further improvement in the zero free region and in the prime number theorem error term was subsequently given by Vinogradov. We also mention a number of important zero density estimates which provide non-trivial upper bounds for the number of zeroes in other, somewhat larger regions of the critical strip; the bounds are not strong enough to completely exclude zeroes as is the case with zero-free regions, but can at least limit the collective influence of such zeroes. For more discussion of these topics, see the various lecture notes to this previous course.