247B, Notes 1: Restriction theory
What's new 2020-03-30
This set of notes focuses on the restriction problem in Fourier analysis. Introduced by Elias Stein in the 1970s, the restriction problem is a key model problem for understanding more general oscillatory integral operators, and which has turned out to be connected to many questions in geometric measure theory, harmonic analysis, combinatorics, number theory, and PDE. Only partial results on the problem are known, but these partial results have already proven to be very useful or influential in many applications.
We work in a Euclidean space . Recall that
is the space of
-power integrable functions
, quotiented out by almost everywhere equivalence, with the usual modifications when
. If
then the Fourier transform
will be defined in this course by the formula
From the dominated convergence theorem we see that is a continuous function; from the Riemann-Lebesgue lemma we see that it goes to zero at infinity. Thus
lies in the space
of continuous functions that go to zero at infinity, which is a subspace of
. Indeed, from the triangle inequality it is obvious that
If , then Plancherel’s theorem tells us that we have the identity
Because of this, there is a unique way to extend the Fourier transform from
to
, in such a way that it becomes a unitary map from
to itself. By abuse of notation we continue to denote this extension of the Fourier transform by
. Strictly speaking, this extension is no longer defined in a pointwise sense by the formula (1) (indeed, the integral on the RHS ceases to be absolutely integrable once
leaves
; we will return to the (surprisingly difficult) question of whether pointwise convergence continues to hold (at least in an almost everywhere sense) later in this course, when we discuss Carleson’s theorem. On the other hand, the formula (1) remains valid in the sense of distributions, and in practice most of the identities and inequalities one can show about the Fourier transform of “nice” functions (e.g., functions in
, or in the Schwartz class
, or test function class
) can be extended to functions in “rough” function spaces such as
by standard limiting arguments.
By (20), (3), and the Riesz-Thorin interpolation theorem, we also obtain the Hausdorff-Young inequality
for all and
, where
is the dual exponent to
, defined by the usual formula
. (One can improve this inequality by a constant factor, with the optimal constant worked out by Beckner, but the focus in these notes will not be on optimal constants.) As a consequence, the Fourier transform can also be uniquely extended as a continuous linear map from
. (The situation with
is much worse; see below the fold.)
The restriction problem asks, for a given exponent and a subset
of
, whether it is possible to meaningfully restrict the Fourier transform
of a function
to the set
. If the set
has positive Lebesgue measure, then the answer is yes, since
lies in
and therefore has a meaningful restriction to
even though functions in
are only defined up to sets of measure zero. But what if
has measure zero? If
, then
is continuous and therefore can be meaningfully restricted to any set
. At the other extreme, if
and
is an arbitrary function in
, then by Plancherel’s theorem,
is also an arbitrary function in
, and thus has no well-defined restriction to any set
of measure zero.
It was observed by Stein (as reported in the Ph.D. thesis of Charlie Fefferman) that for certain measure zero subsets of
, such as the sphere
, one can obtain meaningful restrictions of the Fourier transforms of functions
for certain
between
and
, thus demonstrating that the Fourier transform of such functions retains more structure than a typical element of
:
Theorem 1 (Preliminary
restriction theorem) If
and
, then one has the estimate
for all Schwartz functions
, where
denotes surface measure on the sphere
. In particular, the restriction
can be meaningfully defined by continuous linear extension to an element of
.
Proof: Fix . We expand out
From (1) and Fubini’s theorem, the right-hand side may be expanded as
where the inverse Fourier transform of the measure
is defined by the formula
In other words, we have the identity
Since the sphere have bounded measure, we have from the triangle inequality that
Also, from the method of stationary phase (as covered in the previous class 247A), or Bessel function asymptotics, we have the decay
for any (note that the bound already follows from (6) unless
). We remark that the exponent
here can be seen geometrically from the following considerations. For
, the phase
on the sphere is stationary at the two antipodal points
of the sphere, and constant on the tangent hyperplanes to the sphere at these points. The wavelength of this phase is proportional to
, so the phase would be approximately stationary on a cap formed by intersecting the sphere with a
neighbourhood of the tangent hyperplane to one of the stationary points. As the sphere is tangent to first order at these points, this cap will have diameter
in the directions of the
-dimensional tangent space, so the cap will have surface measure
, which leads to the prediction (7). We combine (6), (7) into the unified estimate
where the “Japanese bracket” is defined as
. Since
lies in
precisely when
, we conclude that
Applying Young’s convolution inequality, we conclude (after some arithmetic) that
whenever , and the claim now follows from (5) and Hölder’s inequality.
Remark 2 By using the Hardy-Littlewood-Sobolev inequality in place of Young’s convolution inequality, one can also establish this result for
.
Motivated by this result, given any Radon measure on
and any exponents
, we use
to denote the claim that the restriction estimate
for all Schwartz functions ; if
is a
-dimensional submanifold of
(possibly with boundary), we write
for
where
is the
-dimensional surface measure on
. Thus, for instance, we trivially always have
, while Theorem 1 asserts that
holds whenever
. We will not give a comprehensive survey of restriction theory in these notes, but instead focus on some model results that showcase some of the basic techniques in the field. (I have a more detailed survey on this topic from 2003, but it is somewhat out of date.)
— 1. Necessary conditions —
It is relatively easy to find necessary conditions for a restriction estimate to hold, as one simply needs to test the estimate (9) against a suitable family of examples. We begin with the simplest case
. The Hausdorff-Young inequality (4) tells us that we have the restriction estimate
whenever
. These are the only restriction estimates available:
Proposition 3 (Restriction to
) Suppose that
are such that
holds. Then
and
.
We first establish the necessity of the duality condition . This is easily shown, but we will demonstrate it in three slightly different ways in order to illustrate different perspectives. The first perspective is from scale invariance. Suppose that the estimate
holds, thus one has
for all Schwartz functions . For any scaling factor
, we define the scaled version
of
by the formula
Applying (10) with replaced by
, we then have
From change of variables, we have
and from the definition of Fourier transform and further change of variables we have
so that
combining all these estimates and rearranging, we conclude that
If is non-zero, then by sending
either to zero or infinity we conclude that
for all
, which is absurd. Thus we must have the necessary condition
, or equivalently that
.
We now establish the same necessary condition from the perspective of dimensional analysis, which one can view as an abstraction of scale invariance arguments. We give the spatial variable a unit of length. It is not so important what units we assign to the range of the function
(it will cancel out of both sides), but let us make it dimensionless for sake of discussion. Then the
norm
will have the units of , because integration against
-dimensional Lebesgue measure will have the units of
(note this conclusion can also be justified in the limiting case
). For similar reasons, the Fourier transform
will have the units of ; also, the frequency variable
must have the units of
in order to make the exponent
appearing in the exponential dimensionless. As such, the norm
has units . In order for the estimate (10) to be dimensionally consistent, we must therefore have
, or equivalently that
.
Finally, we establish the necessary condition once again using the example of a rescaled bump function, which is basically the same as the first approach but with
replaced by a bump function. We will argue at a slightly heuristic level, but it is not difficult to make the arguments below rigorous and we leave this as an exercise to the reader. Given a length scale
, let
be a bump function adapted to the ball
of radius
around the origin, thus
where
is some fixed test function
supported on
. As long as
is non-zero, the norm
is comparable to
(up to constant factors that can depend on
but are independent of
). The uncertainty principle then predicts that the Fourier transform
will be concentrated in the dual ball
, and within this ball (or perhaps a slightly smaller version of this ball)
would be expected to be of size comparable to
(the phase
does not vary enough to cause significant cancellation). From this we expect
to be comparable in size to
. If (10) held, we would then have
for all , which is only possible if
, or equivalently
.
Now we turn to the other necessary condition . Here one does not use scaling considerations; instead, it is more convenient to work with randomised examples. A useful tool in this regard is Khintchine’s inequality, which encodes the square root cancellation heuristic that a sum
of numbers or functions
with randomised signs (or phases) should have magnitude roughly comparable to the square function
.
Lemma 4 (Khintchine’s inequality) Let
, and let
be independent random variables that each take the values
with an equal probability of
.
- (i) For any complex numbers
, one has
- (ii) For any functions
on a measure space
, one has
Proof: We begin with (i). By taking real and imaginary parts we may assume without loss of generality that the are all real, then by normalisation it suffices to show the upper bound
for all , whenever
are real numbers with
.
When the upper and lower bounds follow by direct calculation (in fact we have equality
in this case). By Hölder’s inequality, this yields the upper bound for
and the lower bound for
. To handle the remaining cases of (11) it is convenient to use the exponential moment method. Let
be an arbitrary threshold, and consider the upper tail probability
For any , we see from Markov’s inequality that this quantity is less than or equal to
The expectation here can be computed to equal
By comparing power series we see that for any real
, hence by the normalisation
we see that
If we set we conclude that
since the random variable is symmetric around the origin, we conclude that
From the Fubini-Tonelli theorem we have
and this then gives the upper bound (11) for any . The claim (12) for
then follows from this, Hölder’s inequality (applied in reverse), and the fact that (12) was already established for
.
To prove (ii), observe from (i) that for every one has
integrating in and applying the Fubini-Tonelli theorem, we obtain the claim.
Exercise 5
- (i) How does the implied constant in (11) depend on
in the limit
if one analyses the above argument more carefully?
- (ii) Establish (11) for the case of even integers
by direct expansion of the left-hand side and some combinatorial calculation. How does the dependence of the implied constant in (11) on
compare with (i) if one does this?
- (iii) Establish a matching lower bound (up to absolute constants) for the implied constant in (11).
Now we show that the estimate (10) fails in the large regime
, even when
. Here, the idea is to have
“spread out” in physical space (in order to keep the
norm low), and also having
somewhat spread out in frequency space (in order to prevent the
norm from dropping too much). We use the probabilistic method (constructing a random counterexample rather than a deterministic one) in order to exploit Khintchine’s inequality. Let
be a non-zero bump function supported on (say) the unit ball
, and consider a (random) function of the form
where are the random signs from Lemma 4, and
are sufficiently separated points in
(all we need for this construction is that
for all
). Then the summands here have disjoint supports and
(note that the signs have no effect on the magnitude of
). If (10) were true, this would give the (deterministic) bound
On the other hand, the Fourier transform of is
so by Khintchine’s inequality
The phases can be deleted, and
is not identically zero, so one arrives at
Comparing this with (13) and sending , we obtain a contradiction if
. This completes the proof of Proposition 3.
Exercise 6 Find a deterministic construction that explains why the estimate (10) fails when
and
.
Exercise 7 (Marcinkiewicz-Zygmund theorem) Let
be measure spaces, let
, and suppose
is a bounded linear operator with operator norm
. Show that
for any at most countable index set
and any functions
. Informally, this result asserts that if a linear operator
is bounded from scalar-valued
functions to scalar-valued
functions, then it is automatically bounded from vector-valued
funct
Exercise 8 Let
be a bounded open subset of
, and let
. Show that
holds if and only if
and
. (Note: in order to use either the scale invariance argument or the dimensional analysis argument to get the condition
, one should replace
with something like a ball
of some radius
, and allow the estimates to depend on
.)
Now we study the restriction problem for two model hypersurfaces:
- (i) The paraboloid
equipped with the measure
induced from Lebesgue measure
in the horizontal variables
, thus
(note this is not the same as surface measure on
, although it is mutually absolutely continuous with this measure).
- (ii) The sphere
.
These two hypersurfaces differ from each other in one important respect: the paraboloid is non-compact, while the sphere is compact. Aside from that, though, they behave very similarly; they are both conic hypersurfaces with everywhere positive curvature. Furthermore, they are also very highly symmetric surfaces. The sphere of course enjoys the rotation symmetry under the orthogonal group . At first glance the paraboloid
only enjoys symmetry under the smaller orthogonal group
that rotates the
variable (leaving the final coordinate
unchanged), but it also has a family of Galilean symmetries
for any , which preserves
(and also can be seen to preserve the measure
, since the horizontal variable
is simply translated by
). Furthermore, the paraboloid also enjoys a parabolic scaling symmetry
for any , for which the sphere does not have an exact analogue (though morally Taylor expansion suggests that the sphere “behaves like” the paraboloid at small scales. The following exercise exploits these symmetries:
- (i) Let
be a bounded non-empty open subset of
, and let
. Show that
holds if and only if
holds.
- (ii) Let
be bounded non-empty open subsets of
(endowed with the restriction of
to
), and let
. Show that
holds if and only if
holds.
- (iii) Suppose that
are such that
holds. Show that
. (Hint: Any of the three methods of scale invariance, dimensional analysis, or rescaled bump functions will work here.)
- (iv) Suppose that
are such that
holds. Show that
. (Hint: The same three methods still work, but some will be easier to pull off than others.)
- (v) Suppose that
are such that
holds for some bounded non-empty open subset
of
, and that
. Conclude that
holds.
- (vi) Suppose that
are such that
holds, and that
. Conclude that
holds.
Exercise 10 (No non-trivial restriction estimates for flat hypersurfaces) Let
be an open non-empty subset of a hyperplane in
, and let
. Show that
can only hold when
.
To obtain a further necessary condition on the restriction estimates or
holding, it is convenient to dualise the restriction estimate to an extension estimate.
Exercise 11 (Duality) Let
be a Radon measure on
, let
, and let
. Show that the following claims are equivalent:
This gives a further necessary condition as follows. Suppose for instance that holds; then by the above exercise, one has
for all . In particular,
. However, we have the following stationary phase computation:
for all
and some non-zero constants
depending only on
. Conclude that the estimate
can only hold if
.
Exercise 13 Show that the estimate
can only hold if
. (Hint: one can explicitly test (15) when
is a gaussian; the fact that gaussians are not, strictly speaking, compactly supported can be dealt with by a limiting argument.)
It is conjectured that the necessary conditions claimed above are sufficient. Namely, we have
Conjecture 14 (Restriction conjecture for the sphere) Let
. Then we have
whenever
and
.
Conjecture 15 (Restriction conjecture for the paraboloid) Let
. Then we have
whenever
and
.
It is also conjectured that Conjecture 14 holds if one replaces the sphere by any bounded open non-empty subset of the paraboloid
.
The current status of these conjectures is that they are fully solved in the two-dimensional case (as we will see later in these notes) and partially resolved in higher dimensions. For instance, in
one of the strongest results currently is due to Hong Wang, who established
for
a bounded open non-empty subset of
when
(conjecturally this should hold for all
); for higher dimensions see this paper of Hickman and Rogers for the most recent results.
We close this section with an important connection between the restriction conjecture and another conjecture known as the Kakeya maximal function conjecture. To describe this connection, we first give an alternate derivation of the necessary condition in Conjecture 14, using a basic example known as the Knapp example (as described for instance in this article of Strichartz).
Let be a spherical cap in
of some small radius
, thus
for some
. Let
be a bump function adapted to this cap, say
where
is a fixed non-zero bump function supported on
. We refer to
as a Knapp example at frequency
(and spatially centred at the origin). The cap
(or any slightly smaller version of
) has surface measure
, thus
for any . We then apply the extension operator to
:
The integrand is only non-vanishing if ; since also from the cosine rule we have
we also have . Thus, if
lies in the tube
for a sufficiently small absolute constant , then the phase
has real part
. If we set
to be non-negative and not identically zero, and note that , we conclude that
for . Since the tube
has dimensions
, its volume is
and thus
for any . By Exercise 11, we thus see that if the estimate
holds, then
for all small ; sending
to zero, we conclude that
or equivalently that , recovering the second necessary condition in Conjecture 14.
- (i) By considering a random superposition of Knapp examples located at different frequencies
, and using Khintchine’s inequality, recover the first necessary condition
of Conjecture 14.
- (ii) Suppose that
holds for some
. Establish the estimate
whenever
is a collection of
tubes – that is to say, sets of the form
whose directions
are
-separated (thus
for any two distinct
), and the
are arbitrary real numbers.
- (iii) Establish claims (i) and (ii) with the sphere
replaced by a bounded non-empty open subset of the paraboloid
.
Using this exercise, we can show that restriction estimates imply assertions about the dimension of Kakeya sets (also known as Besicovitch sets.
Exercise 17 (Restriction implies Kakeya) Assume that either Conjecture 14 or Conjecture 15 holds. Define a Kakeya set to be a compact subset
of
that contains a unit line segment in every direction (thus for every
, there exists a line segment
for some
that is contained in
. Show that for any
, the
-neighbourhood of
has Lebesgue measure
for any
. (This is equivalent to the assertion that
has Minkowski dimension
.) It is also possible to show that the restriction conjecture implies that all Kakeya sets have Hausdorff dimension
, but this is trickier; see this paper of Bourgain. (This can be viewed as a challenge problem for those students who are familiar with the concept of Hausdorff dimension.)
The Kakeya conjecture asserts that all Kakeya sets in have Minkowski and Hausdorff dimension equal to
. As with the restriction conjecture, this is known to be true in two dimensions (as was first proven by Davies), but only partial results are known in higher dimensions. For instance, in three dimensions, Kakeya sets are known to have (upper) Minkowski dimension at least
for some absolute constant
(a result of Katz, Laba, and myself), and also more recently for Hausdorff dimension (a result of Katz and Zahl). For the latest results in higher dimensions, see these papers of Hickman-Rogers-Zhang and Zahl.
Much of the modern progress on the restriction conjecture has come from trying to reverse the implication in Exercise 17, and use known partial results towards the Kakeya conjecture (or its relatives) to obtain restriction estimates. We will not give the latest arguments in this direction here, but give an illustrative example (in the multilinear setting) at the end of this set of notes.
— 2. theory —
One of the best understood cases of the restriction conjecture is the case. Note that Conjecture 14 asserts that
holds whenever
, and Conjecture 15 asserts that
holds when
. Theorem 1 already gave a partial result in this direction. Now we establish the full range of the
restriction conjecture, due to Tomas and Stein:
Theorem 18 (Tomas-Stein restriction theorem) Let
. Then
holds for all
, and
holds for
.
The exponent is sometimes referred to in the literature as the Tomas-Stein exponent; though the dual exponent
is also referred to by this name.
We first establish the restriction estimate in the non-endpoint case
by an interpolation method. Fix
. By the identity (5) and Hölder’s inequality, it suffices to establish the inequality
We use the standard technique of dyadic decomposition. Let be a bump function supported on
that equals
on
. Then one has the telescoping series
where is a bump function supported on the annulus
. We can then decompose the convolution kernel
as
so by the triangle inequality it will suffice to establish the bounds
for all and some constant
depending only on
.
The function is smooth and compactly supported, so (18) is immediate from Young’s inequality (note that
when
). So it remains to prove (19). Firstly, we recall from (7) (or (8)) that the kernel
is of magnitude
. Thus by Young’s inequality we have
We now complement this with an estimate. The Fourier transform of
can be computed as
for any , and hence by the triangle inequality and the rapid decay of the Schwartz function
we have
By dyadic decomposition we then have
From elementary geometry we have
(basically because the sphere is -dimensional), and then on summing the geometric series we conclude that
From Plancherel’s theorem we conclude that
Applying either the Marcinkiewicz interpolation theorem (or the Riesz-Thorin interpolation theorem) to (20) and (21), we conclude (after some arithmetic) the required estimate (18) with
which is indeed positive when .
At the endpoint the above argument does not quite work; we obtain a decent bound for each dyadic component
of
, but then we have trouble getting a good bound for the sum. The original argument of Stein got around this problem by using complex interpolation instead of dyadic decomposition, embedding
in an analytic family of functions. We present here another approach, which is now popular in PDE applications; the basic inputs (namely, an
to
estimate similar to (20), an
estimate similar to (21), and an interpolation) are the same, but we employ the additional tool of Hardy-Littlewood-Sobolev fractional integration to recover the endpoint.
We turn to the details. Set . We write
as
and parameterise the frequency variable
by
with
, thus for instance
. (One can think of
as a “time” variable that we will give a privileged role in the physical domain
.) We split the spatial variable
similarly. Let
be a non-negative bump function localised to a small neighbourhood of the north pole
of
. By Exercise 9 it will suffice to show that
for . Squaring as before, it suffices to show that
For each , let
denote the function
, and let
denote the function
Then we have
where on the right-hand side the convolution is now over rather than
. By the Fubini-Tonelli theorem and Minkowski’s inequality, we thus have
From Exercise 12 we have the bounds
leading to the dispersive estimate
for any (the claim is vacuous when
vanishes). On the other hand, the
-dimensional Fourier transform of
can be computed as
which is bounded by , hence by Plancherel we have the energy estimate
Interpolating, we conclude after some arithmetic that
Applying the one-dimensional Hardy-Littlewood-Sobolev inequality we conclude (after some more arithmetic) that
and the claim follows.
This latter argument can be adapted for the paraboloid, which in turn leads to some very useful estimates for the Schrödinger equation:
Exercise 19 (Strichartz estimates for the Schrödinger equation) Let
.
- (i) By modifying the above arguments, establish the restriction estimate
.
- (ii) Let
, and let
denote the function
(This is the solution to the Schrödinger equation
with initial data
.) Establish the Strichartz estimate
- (iii) More generally, with the hypotheses as in (ii), establish the bound
whenever
are exponents obeying the scaling condition
. (The endpoint case
of this estimate is also available when
, using a more sophisticated interpolation argument; see this paper of Keel and myself.)
The Strichartz estimates in the above exercise were for the linear Schrödinger equation, but Strichartz estimates can also be established by the same method (namely, interpolating between energy and dispersive estimates) for other linear dispersive equations, such as the linear wave equation . Such Strichartz estimates are a fundamental tool in the modern analysis of nonlinear dispersive equations, as they often allow one to view such nonlinear equations as perturbations of linear ones. The topic is too vast to survey in these notes, but see for instance my monograph on this topic.
— 3. Bilinear estimates —
A restriction estimate such as
or its equivalent dual form \begin{equation \| (g d\mu)^\vee \|_{L^{p’}(R^d)} \lesssim_{p,q,d,\mu} \| g\|_{L^{q’}(R^d,\mu)} are linear estimates, asserting the boundedness of either a restriction operator (where
denotes the support of
) or an extension operator
. In the last ten or twenty years, it has been realised that one should also consider bilinear or multilinear versions of the extension estimate, both as stepping stones towards making progress on the linear estimate, and also as being of independent interest and application.
In this section we will show how the consideration of bilinear extension estimates can be used to resolve the restriction conjecture for the circle (i.e., the case of Conjecture 14):
Theorem 20 (Restriction conjecture for
) One has
whenever
and
.
Note from Exercise 9(vi) that this theorem also implies the case of Conjecture 15. This case of the restriction conjecture was first established by Zygmund; Zygmund’s proof is shorter than the one given here (relying on the Hausdorff-Young inequality (4)), but the arguments here have broader applicability, in particular they are also useful in higher-dimensional settings.
To prove this conjecture, it suffices to verify it at the endpoint , since from Hölder’s inequality the norm
is essentially non-decreasing in
, where
is arclength measure on
. By Exercise 9, we may replace
here by (say) the first quadrant
of the circle, where
is the map
; we let
be the arc length measure on that quadrant. (This reduction is technically convenient to avoid having to deal with antipodal points with parallel tangents a little later in the argument.)
By (3) and relabeling, it suffices to show that
whenever ,
, and
(we drop the requirement that
is smooth, in order to apply rough cutoffs shortly), and arcs such as
are always understood to be endowed with arclength measure.
We now bilinearise this estimate. It is clear that the estimate (22) is equivalent to
for any , since (23) follows from (22) and Hölder’s inequality, and (22) follows from (23) by setting
.
Right now, the two functions and
are both allowed to occupy the entirety of the arc
. However, one can get better estimates if one separates the functions
to lie in transverse sub-arcs
of
(where by “transverse” we mean that there is some non-zero separation between the normal vectors of
and the normal vectors of
. The key estimate is
Proposition 21 (Bilinear
estimate) Let
be subintervals of
such that
. Then we have
for an
, where
denotes the arclength measure on
.
Proof: To avoid some very minor technicalities involving convolutions of measures, let us approximate the arclength measures . Observe that we have
in the sense of distributions, where is the annular region
Thus we have the pointwise bound
and
and similarly for . Hence by monotone convergence it suffices to show that
for sufficiently small . By Plancherel’s theorem, it thus suffices to show that
for , if
is sufficiently small. From Young’s inequality one has
so by interpolation it suffices to show that
But this follows from the pointwise bound
for sufficiently small , whose proof we leave as an exercise.
Exercise 22 Establish (24).
Remark 23 Higher-dimensional bilinear
estimates, involving more complicated manifolds than arcs, play an important role in the modern theory of nonlinear dispersive equations, especially when combined with the formalism of dispersive variants of Sobolev spaces known as
spaces, introduced (and independently \href). See for instance this book of mine for further discussion.
From the triangle inequality we have
so by complex interpolation (which works perfectly well for bilinear operators) we have
for any . The estimate (25) begins to look rather similar to (23), and we can deduce (23) from (25) as follows. Firstly, it is convenient to use Marcinkiewicz interpolation (using the fact that we have an open range of
) to reduce (22) to proving a restricted estimate
for any measurable subset of the circle, so to prove (23) it suffices to show that
We can view the expression as a two-dimensional integral
- It is a known theorem (first conjectured by Klainerman and Machedon) that one has the bilinear restriction theorem
whenever ,
, disjoint compact subsets
of
, and functions
, where
denotes the measure given by the integral
(The range is known to be sharp for (28) except possibly for the endpoint
, which remains open currently.) Assuming this result, show that Conjecture 15 holds for all
. (Hint: one repeats the above arguments, but at one point one will be faced with estimating a bilinear expression involving two “close” regions
, which could be very large or very small. The hypothesis (28) does not specify how the implied constants depend on the size or location of
, but one can obtain such a dependence by exploiting the translation and Galilean symmetries of the paraboloid.)
— 4. Multilinear estimates —
We now turn to multilinear (or more precisely, -linear) Kakeya and restriction estimates, where we happen to have nearly optimal estimates. For instance, we have the following estimate (cf. (17)), first established by Bennett, Carbery, and myself:
Theorem 24 (Multilinear Kakeya estimate) Let
, let
be sufficiently small, and let
. Suppose that
are collections of
tubes such that each tube
in
is oriented within
of the basis vector
. Then we have
Exercise 25 Assuming Theorem 24, obtain an estimate for
for any
in terms of
and
, and use examples to show that this estimate is optimal in the sense that the exponents for
and
can only be improved by epsilon factors at best.
In the two-dimensional case the estimate is easily established with no epsilon loss. Indeed, in this case we can expand the left-hand side of (29) as
But if is a
rectangle oriented near
, and
is a
-rectangle oriented near
, then
is comparable with
, and the claim follows.
The epsilon loss was removed in general dimension by Guth, using the polynomial method. We will not give that argument here, but instead give a simpler proof of Theorem 24, also due to Guth, and based primarily on the method of induction on scales. We first treat the case when , that is when all the tubes in each family
are completely parallel:
Exercise 26
Proof: For each , let
be a collection of
tubes oriented within
of
. Our objective is to show that
Let be a small constant depending only on
. We partition
into cubes
of sidelenth
, then the left-hand side of (32) can be decomposed as
Clearly we can restrict the inner sum to those tubes that actually intersect
. For
small enough, the intersection of
with
is contained in a
tube oriented within
of
; such a tube can be viewed as a rescaling by
of a
tube, also oriented within
of
. From (30) and rescaling we conclude that
Now let be the
tube with the same central axis and center of mass as
. For
small enough, if
then
equals
on all of
, and hence
Combining all these estimates, we can bound the left-hand side of (32) by
But by (30) and rescaling we have
and the claim follows.
Now let , and let
be sufficiently large depending on
. If
is sufficiently small depending on
, then from (31) we have the claim
whenever . On the other hand, from Proposition 26 we see (for
large enough) that if (33) holds in some range
with
then it also holds in the larger range
. By induction we then have (33) for all
. Combining this with (31), we have shown that
for all , whenever
is sufficiently small depending on
. This is almost what we need to prove Theorem 24, except that we are requiring
to be small depending on
as well as
, whereas Theorem 24 only requires
to be sufficiently small depending on
and not
. We can overcome this (at the cost of worsening the implied constants by an
-dependent factor) by the triangle inequality and exploiting affine invariance (somewhat in the spirit of Exercise 9). Namely, suppose that
and
is only assumed to be small depending on
but not on
. By what we have previously established, we have
whenever the tubes lie within
of
, where
is a quantity that is sufficiently small depending on
. Now we apply a linear transformation to both sides, and also modify
slightly, and conclude that for any
within
of
, we still have the bound (34) if the
are assumed to lie within
(say) of
instead of
. On the other hand, by compactness (or more precisely, total boundedness), we can find
directions
that lie within
of
, such that any other direction that lies within
of
lies within
of one of the
. Applying the (quasi-)triangle inequality for
, we conclude that
whenever the direction of are merely assumed to lie within
of
. This concludes the proof of Theorem 24.
Exercise 27 By optimising the parameters in the above argument, refine the estimate in Theorem 24 slightly to
for any
.
We can use the multilinear Kakeya estimate to prove a multilinear restriction (or more precisely, multilinear extension) estimate:
Theorem 28 (Multilinear restriction estimate) Let
, let
be sufficiently small, and let
. Suppose that
are open subsets of
that lie within
of the basis vector
. Then we have
Exercise 29 By modifying the arguments used to prove Exercise 16(ii), show that Theorem 28 implies Theorem 24.
Exercise 30 Assuming Theorem 28, obtain for each
and
an estimate of the form
whenever
,
, and
and some exponent
, and use examples to show that the exponent
you obtain is best possible.
Remark 31 In the
case, this result with no epsilon loss follows from Proposition 21. It is an open question whether the epsilon can be removed in higher dimensions; see this recent paper of mine for some progress in this direction.
To prove Theorem 28, we again turn to induction on scales; the argument here is a corrected version of one from this paper of Bennett, Carbery and myself, which first appeared in this paper of Bennett. Fix , and let
be sufficiently small. For technical reasons it is convenient to replace the subsets
of the sphere by annuli. More precisely, for each
, let
denote the best constant in the inequality
whenever , where
is the annular cap
Because we have restricted both the Fourier and spatial domains to be compactly supported it is clear that is finite for each
, thus
Exercise 32 Show that (39) implies Theorem 28. (Hint: starting with
, multiply
by a suitable weight function that is large on
and has Fourier transform supported on
, and write this as
for a suitable
. Then obtain
estimates on
.)
To establish (39), the key estimate is
Proposition 33 (Induction on scales) For any
, one has
where the multilinear Kakeya constant
was defined in (30).
Suppose we can establish this claim. Applying Theorem 24, we conclude that if for a sufficiently large
depending on
, one has
From (38) one has
for all and some
, and then an easy induction then shows that (40) holds for all
, giving the claim.
It remains to prove Proposition 33. We will rely on the wave packet decomposition. Informally, this decomposes into a sum of “wave packets” that is approximately of the form
where ranges over
-tubes in
oriented in various directions
oriented near
, and the coefficients
obey an
type estimate
(This decomposition is inaccurate in a number of technical ways, for instance the sharp cutoff should be replaced by something smoother, but we ignore these issues for the sake of this informal discussion.) Heuristically speaking, (41) is asserting that
behaves like a superposition of various (translated) Knapp examples (16) with
.
Let us informally indicate why we would expect the wave packet decomposition to hold, and then why it should imply something like Proposition 33. Geometrically, the annular cap behaves like the union of essentially disjoint
-disks
, each centred at some point
on the unit sphere that is close to
, and oriented normal to the direction
. Thus
should behave like the sum of the components
. By the uncertainty principle, each such component
should behave like a constant multiple of the plane wave
on each translate of the dual region
to
, which is a
tube oriented in the direction
. By Plancherel’s theorem, the total
norm of
should equal
. Thus we expect to have a decomposition roughly of the form
where is a collection of parallel and boundedly overlapping
tubes
oriented in the direction
, and the
are coefficients with
Summing over and collecting powers of
, we (heuristically) obtain the wave packet decomposition (41) with bound (42).
Now we informally explain why the decomposition (41) (and attendant bound (42)) should yield Proposition 33. Our task is to show that
for . We may as well normalise
. Applying the wave packet decomposition, one expects to have an approximation of the form
and the are essentially distinct
tubes oriented within
of
. We cover
by balls
of radius
. On each such ball, the cutoffs
are morally constant, and so
From the uncertainty principle, the trigonometric polynomial behaves on
like the inverse Fourier transform
of a function
supported on
with
and hence by (37) we expect the expression (45) to be bounded by
which is also morally
Averaging in , we thus expect the left-hand side of (43) to be
Applying a rescaled (and weighted) version of Theorem 24, this is bounded by
and the claim now follows from (44).
Now we begin the rigorous argument. We need to prove (43), and we normalise . By Fubini’s theorem we have
Let be a fixed Schwartz function that is bounded away from zero on
and has Fourier transform supported on
, thus the function
is bounded away from zero on
and has Fourier transform supported on
. In particular we have
We can write
and observe that has Fourier transform supported in
. Thus
Thus it remains to establish the bound
We cover by a collection of
disks
, each one centered at an element
that lies within
of
, and is oriented with normal
, with the
separated from each other by
. A partition of unity then lets us write
where each
with
The functions then have bounded overlapping supports in the sense that every
is contained in at most
of these supports. Hence
By Plancherel’s theorem the right-hand side is at most
This is morally bounded by
so one has morally bounded the left-hand side of (46) by
In practice, due to the rapid decay of , one has to add some additional terms involving some translates of the balls
, but these can be handled by the same method as the one given below and we omit this technicality for brevity. We can write
, where
is a Schwartz function adapted to a slight dilate of
whose inverse Fourier transform
is a bump function adapted to a
tube
oriented along
through the origin, and
with
This gives a reproducing-type formula
which by Cauchy-Schwarz (or Jensen’s inequality) gives the pointwise bound
By enlarging slightly, we then have
for all , hence
We have thus bounded the left-hand side of (46) by
which we can rearrange as
Using a rescaled version of (30) (and viewing the convolution here as a limit of Riemann sums) we can bound this by
which by (48), (47) is bounded by
giving (46) as desired.