A result of Bui–Pratt–Zaharescu, and Erdös problem #437
What's new 2024-08-09
The following problem was posed by Erdös and Graham (and is listed as problem #437 on the Erdös problems website):
Problem 1 Let be integers. How many of the partial products , , , can be squares? Is it true that, for any , there can be more than squares?
If one lets denote the maximal number of squares amongst such partial products, it was observed in the paper of Erdös and Graham that the bound is “trivial” (no proof was provided, but one can for instance argue using the fact that the number of integer solutions to hyperelliptic equations of the form for fixed is quite sparse, and in fact finite for thanks to Siegel’s theorem), and the problem then asks if .
It turns out that this problem was essentially solved (though not explicitly) by a recently published paper of Bui, Pratt, and Zaharescu, who studied a closely related quantity introduced by Erdös, Graham, and Selfridge (see also Problem B30 of Guy’s book), defined for any natural number as the least natural number such that some subset of , when multiplied together with , produced a square. Among the several results proven about in that paper was the following:
Theorem 2 (Bui–Pratt–Zaharescu, Theorem 1.2) For sufficiently large, there exist integers such that .
The arguments were in fact quite elementary, with the main tool being the theory of smooth numbers (the theory of hyperelliptic equations is used elsewhere in the paper, but not for this particular result).
If one uses this result as a “black box”, then an easy greedy algorithm argument gives the lower bound
but with a small amount of additional work, one can modify the proof of the theorem to give a slightly better bound:Theorem 3 (Bounds for ) As , we have the lower bound and the upper bound In particular, for any , one has for sufficiently large .
The purpose of this blog post is to record this modification of the argument, which is short enough to present immediately. For a large , let denote the quantity
We call a natural number -smooth if all of its prime factors are at most . From a result of Hildebrand (or the older results of de Bruijn), we know that the number of -smooth numbers less than or equal to is Let be the number of primes up to . From the prime number theorem we haveTo prove the lower bound on , which is a variant of Theorem 2. The key observation is that given any -smooth numbers , some non-trivial subcollection of them will multiply to a square. This is essentially Lemma 4.2 of Bui–Pratt–Zaharescu, but for the convenience of the reader we give a full proof here. Consider the multiplicative homomorphism defined by
where is the prime and is the number of times divides . The vectors lie in a -dimensional vector space over , and thus are linearly dependent. Thus there exists a non-trivial collection of these vectors that sums to zero, which implies that the corresponding elements of the sequence multiply to a square.From (1), (2) we can find sequences of -smooth numbers in , with each sequence being to the right of the previous sequence. By the above observation, each sequence contains some non-trivial subcollection that multiplies to a square. Concatenating all these subsequences together, we obtain a single sequence with at least partial products multiplying to a square, giving the desired lower bound on .
Next, we prove the upper bound on . Suppose that a sequence has partial products that are squares for some . Then we have a square for all (with the convention ). The key observation (essentially Lemma 3.4 of Bui–Pratt–Zaharescu) is that, for each , one of the following must hold:
- (i) At least one of the is -smooth.
- (ii) At least one of the is divisible by for some prime .
- (iii) .
From (1) we see that the number of for which (i) occurs is at most . From the union bound we see that the number of for which (ii) occurs is at most
Finally, from the pigeonhole principle we see that the number of for which (iii) occurs is also at most Thus one has , as desired. This completes the proof.The upper bound arguments seem more crude to the author than the lower bound arguments, so I conjecture that the lower bound is in fact the truth: .