Equidistribution of Syracuse random variables and density of Collatz preimages
What's new 2020-01-26
Define the Collatz map on the natural numbers
by setting
to equal
when
is odd and
when
is even, and let
denote the forward Collatz orbit of
. The notorious Collatz conjecture asserts that
for all
. Equivalently, if we define the backwards Collatz orbit
to be all the natural numbers
that encounter
in their forward Collatz orbit, then the Collatz conjecture asserts that
. As a partial result towards this latter statement, Krasikov and Lagarias in 2003 established the bound
for all and
. (This improved upon previous values of
obtained by Applegate and Lagarias in 1995,
by Applegate and Lagarias in 1995 by a different method,
by Wirsching in 1993,
by Krasikov in 1989, and some
by Crandall in 1978.) This is still the largest value of
for which (1) has been established. Of course, the Collatz conjecture would imply that we can take
equal to
, which is the assertion that a positive density set of natural numbers obeys the Collatz conjecture. This is not yet established, although the results in my previous paper do at least imply that a positive density set of natural numbers iterates to an (explicitly computable) bounded set, so in principle the
case of (1) could now be verified by an (enormous) finite computation in which one verifies that every number in this explicit bounded set iterates to
. In this post I would like to record a possible alternate route to this problem that depends on the distribution of a certain family of random variables that appeared in my previous paper, that I called Syracuse random variables.
Definition 1 (Syracuse random variables) For any natural number
, a Syracuse random variable
on the cyclic group
is defined as a random variable of the form
where
are independent copies of a geometric random variable
on the natural numbers with mean
, thus
} for
. In (2) the arithmetic is performed in the ring
.
Thus for instance
and so forth. One could also view as the mod
reduction of a
-adic random variable
The probability density function of the Syracuse random variable can be explicitly computed by a recursive formula (see Lemma 1.12 of my previous paper). For instance, when
,
is equal to
for
respectively, while when
,
is equal to
when respectively.
The relationship of these random variables to the Collatz problem can be explained as follows. Let denote the odd natural numbers, and define the Syracuse map
by
where the –valuation
is the number of times
divides
. We can define the forward orbit
and backward orbit
of the Syracuse map as before. It is not difficult to then see that the Collatz conjecture is equivalent to the assertion
, and that the assertion (1) for a given
is equivalent to the assertion
for all , where
is now understood to range over odd natural numbers. A brief calculation then shows that for any odd natural number
and natural number
, one has
where the natural numbers are defined by the formula
so in particular
Heuristically, one expects the -valuation
of a typical odd number
to be approximately distributed according to the geometric distribution
, so one therefore expects the residue class
to be distributed approximately according to the random variable
.
The Syracuse random variables will always avoid multiples of three (this reflects the fact that
is never a multiple of three), but attains any non-multiple of three in
with positive probability. For any natural number
, set
Equivalently, is the greatest quantity for which we have the inequality
for all integers not divisible by three, where
is the set of all tuples
for which
Thus for instance ,
, and
. On the other hand, since all the probabilities
sum to
as
ranges over the non-multiples of
, we have the trivial lower bound
There is also an easy submultiplicativity result:
Lemma 2 For any natural numbers
, we have
Proof: Let be an integer not divisible by
, then by (4) we have
If we let denote the set of tuples
that can be formed from the tuples in
by deleting the final component
from each tuple, then we have
with an integer not divisible by three. By definition of
and a relabeling, we then have
for all . For such tuples we then have
so that . Since
for each , the claim follows.
From this lemma we see that for some absolute constant
. Heuristically, we expect the Syracuse random variables to be somewhat approximately equidistributed amongst the multiples of
(in Proposition 1.4 of my previous paper I prove a fine scale mixing result that supports this heuristic). As a consequence it is natural to conjecture that
. I cannot prove this, but I can show that this conjecture would imply that we can take the exponent
in (1), (3) arbitrarily close to one:
Proposition 3 Suppose that
(that is to say,
as
). Then
as
, or equivalently
I prove this proposition below the fold. A variant of the argument shows that for any value of , (1), (3) holds whenever
, where
is an explicitly computable function with
as
. In principle, one could then improve the Krasikov-Lagarias result
by getting a sufficiently good lower bound on
, which is in principle achievable numerically (note for instance that Lemma 2 implies the bound
for any
, since
for any
).
— 1. Proof of proposition —
Assume . Let
be sufficiently small, and let
be sufficiently large depending on
. We first establish the following proposition, that shows that elements in a certain residue class have a lot of Syracuse preimages:
Proposition 4 There exists a residue class of
with the property that for all integers
in this class, and all non-negative integers
, there exist natural numbers
with
and
and at least
tuples
obeying the additional properties
Proof: We begin with the base case . By (4) and the hypothesis
, we see that
for all integers not divisible by
. Let
denote the tuples
in
that obey the additional regularity hypotheses
for all ,note that this implies in particular the
case of (7). From the Chernoff inequality (noting that the geometric random variable
has mean
) and the union bound we have
for an absolute constant (where we use the periodicity of
in
to define
for
by abuse of notation). Hence by the pigeonhole principle we can find a residue class
not divisible by
such that
and hence by the triangle inequality we have
for all in this residue class.
Henceforth is assumed to be an element of this residue class. For
, we see from (8)
hence by the pigeonhole principle there exists (so in particular
) such that
so the number of summands here is at least . This establishes the base case
.
Now suppose inductively that , and that the claim has already been proven for
. By induction hypothesis, there exists natural numbers
with
(which in particular imply that ) and at least
tuples
obeying the additional properties
and (7) for all .
For each tuple (10), we may write (as in the proof of Lemma 2)
for some integers . We claim that these integers lie in distinct residue classes modulo
where
Indeed, suppose that for two tuples
,
of the above form. Then
(where we now invert in the ring
), or equivalently
By (11), (7), all the summands on the left-hand side are natural numbers of size , hence the sum also has this size; similarly for the right-hand side. From the estimates of
, we thus see that both sides are natural numbers between
and
, by hypothesis on
. Thus we may remove the modular constraint and conclude that
and then a routine induction (see Lemma 6.2 of my paper) shows that . This establishes the claim.
As a corollary, we see that every residue class modulo contains
of the at most. Since there were at least
tuples
to begin with, we may therefore forbid up to
residue classes modulo
, and still have
surviving tuples
with the property that
avoids all the forbidden classes.
Let be one of the tuples (10). By the hypothesis
, we have
Let denote the set of tuples
with the additional property
for all , then by the Chernoff bound we have
for some absolute constant . Thus, by the Markov inequality, by forbidding up to
classes, we may ensure that
and hence
We thus have
where run over all tuples with
being one of the previously surviving tuples, and
. By (11) we may rearrange this a little as
By construction, we have
for any tuple in the above sum, hence by the pigeonhole principle we may find an integer
In particular the number of summands is at least . Also observe from (13), (12) that
so in particular
It is a routine matter to verify that all tuples in this sum lie in and obeys the requirements (6), (7), closing the induction hypothesis.
Corollary 5 For all
in the residue class from the previous proposition, and all
, we have
In particular, we have
as
.
Proof: For every tuple in the previous proposition, we have
for some integer . As before, all these integers
are distinct, and have magnitude
From construction we also have , so that
. The number of tuples is at least
which can be computed from the properties of to be of size at least
. This gives the first claim, and the second claim follows by taking
to be the first integer for which
.
To conclude the proof of Proposition 3, it thus suffices to show that
Lemma 6 Every residue class
has a non-trivial intersection with
.
Indeed, if we let be the residue class from the preceding propositions, and use this lemma to produce an element
of
that lies in this class, then from the inclusion
we obtain (3) with
, and then on sending
to zero we obtain the claim.
Proof: An easy induction (based on first establishing that for all natural numbers
) shows that the powers of two modulo
occupy every residue class not divisible by
. From this we can locate an integer
in
of the form
. Since
, the claim follows.
We remark that the same argument in fact shows (assuming of course) that
for any natural number not divisible by three.