The counter-intuitive behaviour of high-dimensional spaces
Wildon's Weblog 2020-04-19
This post is an extended version of a talk I last gave a few years ago on an extended Summer visit to Bristol; three weeks into lockdown seems like a good time to write it up for a wider audience. The overall thesis is that it’s hard to have a good intuition for really high-dimensional spaces, and that the reason is that, asked to picture such a space, most of it come far closer to something like then the more accurate
. Some details are left to the (highly optional) exercise sheet (with solutions) that accompanies this post. Sources are acknowledged at the end of the post.
For the moment, I’ve only typed up the section on Euclidean spaces.
As a warm up for my talk, I invited to audience to order the following cardinals:
Of course they are already correctly ordered by size. From the perspective of ‘effective computation’, I claim that and
are indistinguishable and ‘tiny’,
and
are indistinguishable and ‘huge’ and
sits somewhere in the middle. To give some hint of the most surprising part of this, interpret
as the Cantor set
, so
is the set of subsets of
. Computationally definable subsets of
are then computable predicates on
(i.e. functions
), and since
is compact, it is computable whether two such predicates are equal. In contrast, there is no algorithm that, given two predicates on
, will run for a finite length of time and then return the correct answer.
Euclidean space and unit balls.
Let be the solid
-dimensional unit ball and let
be the
-dimensional surface of
.
House prices in Sphereland
Imagine a hypothetical ‘Sphereland’ whose inhabitants live uniformly distributed on the surface . (For instance,
is the one-point compactification of Flatland.) With a god’s eye view you are at the origin, and survey the world. At what values of the
th coordinate are most of the inhabitants to be found?
For example, the diagram below shows the case with two regions of equal height shaded.
It is a remarkable fact, going all the way back to Archimedes, that the surface areas of the red and blue regions are equal: the reduction in cross-section as we go upwards is exactly compensated by the shallower slope. For a calculus proof, observe that in the usual Cartesian coordinate system, we can parametrize the part of the surface with and
by
. Choose
. Then since
is orthogonal to the gradient of the sphere at
, to first order, if we increase the height
from
to
then we must decrease the second coordinate
from $y$ to $y-z/y k$. This is shown in the diagram below.
Hence the norm squared of the marked line segment tangent to the surface is
Hence, as Archimedes knew, the area of the region between the lines of latitude of height between
and
is (to first order in
) the product of
and the circumference of the line of latitude at height
. It is therefore
which is independent of . As a small check, integrating over $altex z$ from
to
we get
for the surface area of the sphere; as expected this is also the surface area of the enclosing cylinder of radius
and height
.
This cancellation in the displayed formula above is special to the case . For instance, when
, by the argument just given, the arclength at height
is proportional to
. Therefore in the one-point compactification of Lineland (a place one might feel is already insular enough), from the god’s perspective (considering slices with varying
), most of the inhabitants are near the north and south poles (
), and almost no-one lives on the equation (
).
More generally, the surface area of is given by integrating the product of the arclength
and the surface area of the ‘latitude crosssection’ at height
. The latter is the
-dimensional sphere of radius
. By dimensional analysis, its surface area is
for some constant
. Hence the density of Sphereland inhabitants at height
is proportional to
. (A complete proof of this using only elementary calculus is given in Exercise 1 of the problem sheet.) In particular, we see that
is large, almost all the density is at the equator
. From the god’s eye perspective, in high-dimensional Sphereland, everyone lives on the equator, or a tiny distance from it. To emphasise this point, here are the probability density functions for small values of
.
This seems deeply unintuitive to me. There is, after all, nothing intrinsic about the -coordinate. Indeed, the god can pick any hyperplane in
through the origin, and get a similar conclusion. In particular, if a random inhabitant of Spaceland is at
then it is almost certain that most of the
are very small.
Concentration of measure
One could make this precise by continuing with differential geometry, but Henry Cohn’s answer to this Mathoverflow question on concentration of measure gives an elegant alternative approach. Let be independent identically distributed normal random variables each with mean
and variance
. Then
, and so, by the Weak Law of Large Numbers, the probability that
tends to
as
. Therefore we can, with small error, regard
as a uniformly chosen point on
. By Markov’s inequality, the probability that
(or equivalently,
) is at most
, and the probability that
(or equivalently,
) is at most
. Since
as , we see that with high probability, a random Sphereland inhabitant has all its coordinates in
. I think this makes the counter-intuitive conclusion of the previous subsection even starker.
Volume of the unit ball
The ‘length of tangent line times area of cross-section’ argument says that if is the surface area of
then
A quick consultation of Mathematica to evaluate the integral shows that
It follows easily by induction that . Since
and
, the volume
of
satisfies
In particular, from for $t \in \mathbb{N}$ we get
. Hence
(is there a quick way to see this?), and the proportion of the cube
occupied by the sphere
is
Clearly this proportion tends to , very quickly. I find this somewhat surprising, since my mental picture of a sphere is as a bulging convex object that should somehow `fill out’ an enclosing cube. This is clearly completely misconceived.
Sources
The results on volumes of spheres and balls are cobbled together from several MathOverflow questions and answers, in particular Joseph O’Rourke’s question asking for an intuitive explanation of the concentration of measure phenomenon, and S. Carnahan’s answer to a question on the volume of the -dimensional unit ball. The coding theory results go back to Shannon, and can be found in many textbooks, for example Van Lint’s Introduction to coding theory. The diagrams below were drawn using TikZ with the help of these macros due to Tomasz M. Trzeciak. The material about computability is either very basic or comes from a blog post by Martín Escardó (on Andrej Bauer’s blog) giving a wonderfully accessible account of Infinite sets that admit fast exhaustive search, published in the proceedings of Logic in Computer Science 2007.