A Quest For Simple Hard Statements
Gödel’s Lost Letter and P=NP 2018-08-02
Made-to-order statements that are not so simple
Harvey Friedman is a long-standing friend who is a world expert on proofs and the power of various logics. This part of mathematics is tightly connected to complexity theory. This first mention of his work on this blog was a problem about the rationals that says nothing of “logic” or “proofs.”
Today Ken and I would like to introduce some of Harvey’s work in progress.
Harvey’s work is a kind of quest. Recall a quest is a a long or arduous search for something. Ever since his 1967 PhD thesis, Harvey has been trying to find simple statements that are immensely hard to prove. This might seem to be a strange quest: Why look for hard statements; why not look for easy statements? Indeed. Since Kurt Gödel’s seminal work, logicians have been interested in finding natural statements that hide great complexity underneath. The measure of complexity is that the statements are true but cannot be proved in certain prominent logical theories.
Wikipedia has this to say about the famous theorem of Jeff Paris and Leo Harrington (with input from Laurence Kirby) that Peano Arithmetic (PA) cannot prove a certain strengthening of the finite Ramsey theorem:
This was the first “natural” example of a true statement about the integers that could be stated in the language of arithmetic, but not proved in Peano arithmetic; it was already known that such statements existed by Gödel’s first incompleteness theorem.
See also this for history and related examples including earlier work by Reuben Goodstein that was developed further by Kirby and Paris.
What if we can find statements that (i) use arguably simpler notions than arithmetic, yet (ii) are unprovable in systems such as ZFC set theory that are vastly stronger than PA? They might require systems beyond ZFC, such as adding plausible “large cardinal” axioms, to prove. The foundational goal of Harvey’s quest is to demonstrate that realms of higher set theory long regarded as wildly infinitistic and intangible connect algorithmically to concrete and intimately familiar combinatorial ideas. He does also mention motives in complexity, such as in this recent popular talk, and we will develop this aspect here.
Unprovable Yet Simple
Introductory theory courses show how to define sets that are computably enumerable but not decidable. The complement is then definable by a formula whose only unbounded quantifiers over numbers are universal. Such formulas and the resulting sentences for any fixed value are said to be . For instance, the set of numbers coding Turing machines that accept no inputs is defined by saying that for all and , is not a valid accepting computation of the machine on input . For every effective theory , be it PA or ZFC or stronger, the set of cases that can prove is computably enumerable. Thus cannot equal . Presuming does not prove any false cases, this makes a proper subset of . Every (and there are infinitely many) yields a sentence that is true but not provable in .
Indeed, one can construct particular values via Gödel’s diagonalization mechanism. The resulting sentence is simple in terms of quantifiers, but the “” is definitely not simple. It bakes all the complexity of diagonalizing over proofs into the statement. Gödel further showed one can use a sentence expressing the consistency of , but this too references and proofs explicitly.
The Paris-Harrington statements are natural, but they are not —that is, they have the form “(for all )(there exists )” where is first-order with bounded quantifiers only. They embody the idea that functions giving for these quantifiers must grow infinitely often faster than any function PA can prove to be computable. The statement is likewise . Some time back we wrote a post on independence and complexity that also covered work by Harvey with Florian Pelupessy related to Paris-Harrington. In 1991, Shai Ben-David and Shai Halevi proved a two-way connection to circuit complexity that applies when an effective first-order theory is augmented to a theory by adding all true sentences over its language as axioms. They cited as folklore a theorem that and generally have the same sets of provably computable functions and said in consequence:
[I]f is a natural mathematical statement for which “ is independent of PA” is provable in any of the known methods for establishing such results, then is independent of PA as well. [Likewise for ZFC and any similar and its corresponding .]
Well this is not in Harvey’s quest either. Here are several objections: Statements of Paris-Harrington type cannot be brought much below . The theory is not effective. Adding true numerical sentences may be benign for provable growth rates but not for and other statements we want to prove. Although it is often kosher to assume particular statements like (forms of) the Riemann Hypothesis, allowing arbitrary ones begs the idea of proof. The situation is that either is stronger than PA and the functions grow so insanely as to strain naturalness, or is weaker but then the independence is weak too.
What we all want is to show independence from strong theories for simple and natural statements that while not literally in form demonstrably have no more power. Then we may hope to apply his results to understand the hardness of statements like at the ground level. According to Ben-David and Halevi, this will require ideas and constructions beyond “the known methods.” Harvey’s ideas begin by blending questions about the integers into questions about the rationals that leverage the latter’s dense order properties. Let’s turn to look at them now.
Theories on the Rationals
Consider the rationals . How hard are statements about the rationals? It depends.
If we allow first order sentences that involve addition and multiplication over the rationals, then Julia Robinson proved, long ago, that the integers can be defined in this first order theory. This immediately implies the general unprovability results discussed above. We have covered this topic before here.
What is surprising, perhaps, is that there are complex statements that use only the order properties of the rationals. However, if we restrict ourselves to the first order theory of the rationals under the usual ordering , then it is long known that:
Theorem 1 The first order theory is decidable.
Harvey’s ideas use mildly second-order statements about the rationals—with little or no arithmetic besides the relation—to get his hard statements. This may be surprising or may not—it depends on your viewpoint.
Michael Rabin further showed that the second-order monadic theory of is decidable (see also this). So we need more than quantifiers over sets of rationals. If one allows an existential quantifier over triples, then it does not take much else to write down the rules for relations and to behave like addition and multiplication and use the quantifier to assert their existence, whereupon undecidability follows by Robinson’s result. It seems to us that this paper gives a ready way to show the undecidability also for pairs.
Harvey uses predicates defining sets of tuples of rationals and a single quantifier over them. He eliminates that quantifier by expressing the formula body using satisfiability in first-order predicate calculus with equality, obtaining the schematic form
where uses only first-order quantifiers to talk about finite sets of -tuples and is computable from . The satisfaction relation is equivalent to the negation of a derivable contradiction. Hence is semantically equivalent to a sentence . The point is that whereas would encode proofs or computations, is simple and free of navel-gazing.
Some Key Concepts
Harvey’s work is based on the standard notion of order equivalence:
Definition 2 Say is order equivalent to if
for all indices and . We use to denote this.
Theorem 3 Order equivalence on -tuples is an equivalence relation and there are only a polynomial in equivalence classes.
Of course, if the tuples are restricted to distinct rationals then there are exactly equivalence classes. The exact numbers of classes for each when members can be equal are addressed in this paper.
Now we can define the key binary relation on sets of -tuples of rationals. It uses order equivalence on -tuples obtained by flattening elements of and :
Definition 4 Let be fixed. Then provided
Harvey calls this emulation and the equivalence he calls duplication. The notions naturally extend to and for fixed , when they are called –emulation and –duplication. In applications, will be finite while can be huge, but in writing we still regard as being constrained by .
The third key concept is a kind of invariance that is customizable. Its calibrations may determine the level of hardness of the statements obtained. They require identifying some elements of a -tuple as being integers. Here is an example:
For all sequences of nonnegative rationals, each less than the integer where ,
The case says that for all ,
.
A stronger constraint is
whenever . Here stand for integers that are not distinct and like are not required to be in nondescending order. Harvey calls these various notions of being stable.
The final core ingredient is the usual notion of being maximal with respect to some property: has the property but no proper superset of does. It will suffice to consider sets obtained by adding one element to . Now we can exemplify his statements in simple terms:
For every finite set of -tuples of rationals there is a set that is maximal with regard to and is stable.
There are similar statements for duplication and other variants of emulation. Their definitive treatment will come in papers after #106 and #107 on Harvey’s manuscripts page.
The Goal
The above concepts are all basic. They figure into many areas of combinatorial mathematics, for instance regarding preferences and rankings. None talks about Gödel numbers or Turing machines or proofs. The goal is to establish and classify instances of this phenomenon:
There are simple true statements involving little more than and that require immensely strong set theories to prove.
Here “true” needs some explanation. If ordinary set theory cannot prove a statement , with what confidence can we say is true? The answer is that the aforementioned large cardinal axioms which Harvey uses to prove are widely accepted. We note this 2014 post by Bill Gasarch on how Fermat’s Last Theorem was originally proved using theory derived from the axiom asserting the existence of a strongly inaccessible cardinal. This is equivalent to having a Grothendieck universe, by which Alexander Grothendieck proved statements used by Andrew Wiles in his proof. Colin McLarty later showed how to remove this dependency and prove Fermat in a fragment of Ernst Zermelo’s set theory—but the relevance of to “real mathematics” remains.
So the upshot will be that ZFC plus some will prove but ZFC alone cannot (unless ZFC is inconsistent). It is possible for ZFC to be consistent whereas ZFC is inconsistent. All the known large cardinal axioms have these relationships to ZFC:
- ZFC can prove that implies the consistency of ZFC.
- ZFC can prove that the consistency of ZFC implies the consistency of ZFC .
The final—and long—step needed in Harvey’s proofs is to get point 1 with in place of . This exemplifies a reversal step in his program of Reverse Mathematics. From this, the above desired conclusion about ZFC not proving follows. By point 2, ZFC cannot prove either. Nevertheless, we can accept as true. This is intended to rest not only on heuristic arguments in favor of the particular employed, as exemplified for Fermat, but on evidence that can be programmed.
Harvey’s ultimate goal will be realized when the effect of is shown to be algorithmic. Here is a facile analogy. Suppose we build an infinite set by a simple greedy algorithm. We get a “simple path” of finite sets that grow successively larger: each has cardinality . We can view the process as structured by the ordinal being the limit of as grows. The idea is that more-ramified search structures can be “governed” by higher ordinals and patterns of descents from them. The end product will nevertheless always be a countable set of tuples and the evidence of progress will always be a longer finite path. Greedy may give only maximality, whereas the large cardinal axiom implies the existence of an infinite with all stipulated properties. Though it may provide no guidance on building longer and longer finite paths, it implies their existence so that computers can always find a longer path by a finite search. The concrete import of this and further motivations are described in this Nautilus article from last year.
Open Problems
Open problems are most engaging when easy to state: the simple attracts and the complex repels. Simplicity allows them to be stated also in the direction that most of us believe. Yet we may think they are true but don’t know how to prove them. The whole point of Harvey’s quest is to create more milestones that are simple but hard to prove statements. How much will it help to have such statements?