Equidistribution of Syracuse random variables and density of Collatz preimages
What's new 2020-01-26
Define the Collatz map on the natural numbers by setting to equal when is odd and when is even, and let denote the forward Collatz orbit of . The notorious Collatz conjecture asserts that for all . Equivalently, if we define the backwards Collatz orbit to be all the natural numbers that encounter in their forward Collatz orbit, then the Collatz conjecture asserts that . As a partial result towards this latter statement, Krasikov and Lagarias in 2003 established the bound
for all and . (This improved upon previous values of obtained by Applegate and Lagarias in 1995, by Applegate and Lagarias in 1995 by a different method, by Wirsching in 1993, by Krasikov in 1989, and some by Crandall in 1978.) This is still the largest value of for which (1) has been established. Of course, the Collatz conjecture would imply that we can take equal to , which is the assertion that a positive density set of natural numbers obeys the Collatz conjecture. This is not yet established, although the results in my previous paper do at least imply that a positive density set of natural numbers iterates to an (explicitly computable) bounded set, so in principle the case of (1) could now be verified by an (enormous) finite computation in which one verifies that every number in this explicit bounded set iterates to . In this post I would like to record a possible alternate route to this problem that depends on the distribution of a certain family of random variables that appeared in my previous paper, that I called Syracuse random variables.
Definition 1 (Syracuse random variables) For any natural number , a Syracuse random variable on the cyclic group is defined as a random variable of the form
where are independent copies of a geometric random variable on the natural numbers with mean , thus
} for . In (2) the arithmetic is performed in the ring .
Thus for instance
and so forth. One could also view as the mod reduction of a -adic random variable
The probability density function of the Syracuse random variable can be explicitly computed by a recursive formula (see Lemma 1.12 of my previous paper). For instance, when , is equal to for respectively, while when , is equal to
when respectively.
The relationship of these random variables to the Collatz problem can be explained as follows. Let denote the odd natural numbers, and define the Syracuse map by
where the –valuation is the number of times divides . We can define the forward orbit and backward orbit of the Syracuse map as before. It is not difficult to then see that the Collatz conjecture is equivalent to the assertion , and that the assertion (1) for a given is equivalent to the assertion
for all , where is now understood to range over odd natural numbers. A brief calculation then shows that for any odd natural number and natural number , one has
where the natural numbers are defined by the formula
so in particular
Heuristically, one expects the -valuation of a typical odd number to be approximately distributed according to the geometric distribution , so one therefore expects the residue class to be distributed approximately according to the random variable .
The Syracuse random variables will always avoid multiples of three (this reflects the fact that is never a multiple of three), but attains any non-multiple of three in with positive probability. For any natural number , set
Equivalently, is the greatest quantity for which we have the inequality
for all integers not divisible by three, where is the set of all tuples for which
Thus for instance , , and . On the other hand, since all the probabilities sum to as ranges over the non-multiples of , we have the trivial lower bound
There is also an easy submultiplicativity result:
Proof: Let be an integer not divisible by , then by (4) we have
If we let denote the set of tuples that can be formed from the tuples in by deleting the final component from each tuple, then we have
with an integer not divisible by three. By definition of and a relabeling, we then have
for all . For such tuples we then have
so that . Since
for each , the claim follows.
From this lemma we see that for some absolute constant . Heuristically, we expect the Syracuse random variables to be somewhat approximately equidistributed amongst the multiples of (in Proposition 1.4 of my previous paper I prove a fine scale mixing result that supports this heuristic). As a consequence it is natural to conjecture that . I cannot prove this, but I can show that this conjecture would imply that we can take the exponent in (1), (3) arbitrarily close to one:
Proposition 3 Suppose that (that is to say, as ). Then
as , or equivalently
I prove this proposition below the fold. A variant of the argument shows that for any value of , (1), (3) holds whenever , where is an explicitly computable function with as . In principle, one could then improve the Krasikov-Lagarias result by getting a sufficiently good lower bound on , which is in principle achievable numerically (note for instance that Lemma 2 implies the bound for any , since for any ).
— 1. Proof of proposition —
Assume . Let be sufficiently small, and let be sufficiently large depending on . We first establish the following proposition, that shows that elements in a certain residue class have a lot of Syracuse preimages:
Proposition 4 There exists a residue class of with the property that for all integers in this class, and all non-negative integers , there exist natural numbers with
and
and at least tuples
obeying the additional properties
Proof: We begin with the base case . By (4) and the hypothesis , we see that
for all integers not divisible by . Let denote the tuples in that obey the additional regularity hypotheses
for all ,note that this implies in particular the case of (7). From the Chernoff inequality (noting that the geometric random variable has mean ) and the union bound we have
for an absolute constant (where we use the periodicity of in to define for by abuse of notation). Hence by the pigeonhole principle we can find a residue class not divisible by such that
and hence by the triangle inequality we have
for all in this residue class.
Henceforth is assumed to be an element of this residue class. For , we see from (8)
hence by the pigeonhole principle there exists (so in particular ) such that
so the number of summands here is at least . This establishes the base case .
Now suppose inductively that , and that the claim has already been proven for . By induction hypothesis, there exists natural numbers with
(which in particular imply that ) and at least tuples
obeying the additional properties
and (7) for all .
For each tuple (10), we may write (as in the proof of Lemma 2)
for some integers . We claim that these integers lie in distinct residue classes modulo where
Indeed, suppose that for two tuples , of the above form. Then
(where we now invert in the ring ), or equivalently
By (11), (7), all the summands on the left-hand side are natural numbers of size , hence the sum also has this size; similarly for the right-hand side. From the estimates of , we thus see that both sides are natural numbers between and , by hypothesis on . Thus we may remove the modular constraint and conclude that
and then a routine induction (see Lemma 6.2 of my paper) shows that . This establishes the claim.
As a corollary, we see that every residue class modulo contains
of the at most. Since there were at least tuples to begin with, we may therefore forbid up to residue classes modulo , and still have surviving tuples with the property that avoids all the forbidden classes.
Let be one of the tuples (10). By the hypothesis , we have
Let denote the set of tuples with the additional property
for all , then by the Chernoff bound we have
for some absolute constant . Thus, by the Markov inequality, by forbidding up to classes, we may ensure that
and hence
We thus have
where run over all tuples with being one of the previously surviving tuples, and . By (11) we may rearrange this a little as
By construction, we have
for any tuple in the above sum, hence by the pigeonhole principle we may find an integer
In particular the number of summands is at least . Also observe from (13), (12) that
so in particular
It is a routine matter to verify that all tuples in this sum lie in and obeys the requirements (6), (7), closing the induction hypothesis.
Corollary 5 For all in the residue class from the previous proposition, and all , we have
In particular, we have
as .
Proof: For every tuple in the previous proposition, we have
for some integer . As before, all these integers are distinct, and have magnitude
From construction we also have , so that . The number of tuples is at least
which can be computed from the properties of to be of size at least . This gives the first claim, and the second claim follows by taking to be the first integer for which .
To conclude the proof of Proposition 3, it thus suffices to show that
Lemma 6 Every residue class has a non-trivial intersection with .
Indeed, if we let be the residue class from the preceding propositions, and use this lemma to produce an element of that lies in this class, then from the inclusion we obtain (3) with , and then on sending to zero we obtain the claim.
Proof: An easy induction (based on first establishing that for all natural numbers ) shows that the powers of two modulo occupy every residue class not divisible by . From this we can locate an integer in of the form . Since , the claim follows.
We remark that the same argument in fact shows (assuming of course) that
for any natural number not divisible by three.