Sunday, May 24, 2026

Two Erdős Problems on Points in the Plane and AI

In a 1946 paper in the American Mathematical Monthly, Paul Erdős posed the Erdős Distinct Distance Problem and the Erdős Unit Distance Problem.

--------------------------------------------------------------------

THE ERDŐS DISTINCT DISTANCE PROBLEM

A nice high school math competition problem, if it were not fairly well known, is:

Show that for all sets of \(n\) points in the plane, there are at least \(0.5\sqrt{n}\) distinct distances.


A bit harder is to show that there exist \(n\) points in the plane where the number of distinct distances is \(O(\frac{n}{(\log n)^{1/2}})\). The points are the grid points of a \(\sqrt{n}\times\sqrt{n}\) integer grid. 

ADDED LATER:  The set of points is \(\{ (x,y) \in Z\times Z   \colon    1\le x,y\le \sqrt{n}   \} \)

Let \(g(n)\) be the minimum number such that, for every set of \(n\) points in the plane,
there are \(g(n)\) distinct distances. The above results (of Erdős) show that

\( \Omega(n^{1/2}) \le g(n) \le O(\frac{n}{(\log n)^{1/2}}) \)

Erdős made no conjecture about \(g(n)\) in that 1946 paper. However, later papers attribute to him one of two conjectures:

1) \( (\forall \epsilon)[g(n)\ge n^{1-\epsilon}]\)

or

2) \(g(n) = \Omega(\frac{n}{(\log n)^{1/2}})\)

Those papers incorrectly point to the 1946 paper for where he made those conjectures.  I suspect he made those conjectures later in talks or compilations of open problems.

This was a great problem in that there were many papers making progress on it, each one having new ideas. The current best result is that \(g(n) \ge \Omega(\frac{n}{\log n})\).  This result was obtained by Larry Guth and Netz Katz in 2011 and is, by far, the largest leap forward in progress on the problem.  Brass-Moser-Pach (2005) and later Pach-Sharir (2009) observed that existing techniques could never prove \(g(n) \ge \Omega(n^{8/9+\epsilon})\).  I've also heard the observation credited to Rusza. Guth and Katz invented (discovered?) new techniques to obtain their breakthrough.

See my webpage of all papers on the problem here or the Wikipedia page here

Takeaway: The Erdős Distinct Distance Problem saw decades of steady human progress and was eventually almost completely resolved without computer assistance.  I would not have emphasized without computer assistance one year ago.  Maybe not even one week ago.

----------------------------------------------

THE ERDŐS UNIT DISTANCE PROBLEM

Let S be a set of \(n\) points in the plane. Some of the distances between points of S may be 1. How many?  What is the maximum number of unit distances that can occur? Denote this by \(f(n)\). Erdős showed

\( n^{1+\Omega(1/\log\log n)} \le f(n) \le O(n^{3/2}) \).

The lower bound is obtained by using the grid points of a scaled version of the \(\sqrt{n}\times\sqrt{n}\) integer grid. See a writeup by ChatGPT,   here.  We note for later that this can be phrased as using the points in the Gaussian integers, \(Z[i]\).  

In the 1946 paper Erdős conjectured that, for all \(\epsilon\),  \(f(n) = O(n^{1+\epsilon})\).

Joel Spencer, Endre Szemerédi, and William Trotter in 1984 showed \(f(n) = O(n^{4/3})\).

Aside from the Spencer-Szemerédi-Trotter bound, there has been relatively little progress on either the upper or lower bound.

----------------------------------

CONTRASTING THE TWO PROBLEMS

Contrast: The Erdős Distinct Distance problem has seen extensive progress and a large literature. By contrast, although many papers were written on the Erdős unit distance problem, there was little improvement. That changed on May 20, 2026.

------------------------------------------

OPENAI PRODUCES A COUNTEREXAMPLE TO THE ERDŐS CONJECTURE

OpenAI recently produced a counterexample to Erdős's conjecture.  More precisely, OpenAI proved the following:

There exists \(\epsilon>0 \) and a sequence of point  sets \(P_i \subseteq R^2\) such that, letting \(|P_i|=n_i\):

a) \(n_i \rightarrow\infty\), and

b) for all \(i\), the number of unit distances in \(P_i\) is at least \(n_i^{1+\epsilon}\).


The  OpenAI announcement is here. The technical paper is here


Note that the author is OpenAI. There are no human co-authors.  However,(1)  Lije Chen used an internal OpenAI model and (2) Mark Selke and Metaab Sawhney verified correctness. (For a comment on the lack of human authors, see Sebastien Bubeck's comment below.) The \(\epsilon\) is not given though one could go through the paper and find it. It would be very small. 

After the initial AI-generated argument (or AI-assisted?) human mathematicians streamlined and clarified the proof. The result is a paper by Noga Alon, Thomas Bloom, W.T. Gowers, Daniel Litt, Will Sawin, Arul Shankar, Jacob Tsimermann, Victor Wang, and Melanie Matchett Wood  that explains the history, the proof, and their reactions to it. That paper is here. They obtain \(\epsilon \sim 6\times 10^{-38}\). 

Recall that Erdős's lower bound was obtained by using points in \(Z[i]\). The new result used more complicated number fields. Indeed, the degree of the number fields used goes to infinity.  For more details see the Alon et al. proof.  

Will Sawin wrote a paper where he obtained \(\epsilon=0.014114\). Quoting his paper:

To do this, we make explicit and sharpen every step of the argument. Interestingly, this rarely makes the argument much more complex. The total length of this paper is essentially the same as the original OpenAI writeup, though longer than the simplified version prepared by human authors [Alon et a.].

His paper is here

-------------------------------------------

 MY POINTS

1) The Erdős's Unit Distance problem is a well known and interesting conjecture, so this is not picking out some obscure problem.

2) AI-generated or AI-assisted? The announcement by OpenAI, and the Alon et al. paper, indicate that the proof is AI-generated.

Fellow blogger Gary Marcus disagrees and argues that it is AI-assisted.  See his post here

ADDED LATER: An interview with Gary Marcus is here.


I think this is a hard question, and perhaps there is more of a continuum.  However, I trust Alon et al., so I will go with AI-generated.

3) Humans were still needed to verify and  clean up the proof. In the future, we will all be grad students, verifying and cleaning up what AI outputs.

4) The final proof was readable. One concern about AI proofs is that they would be unreadable and hard to verify. That was not the case here.

5) The ideas needed for the solution already existed; however:

a) The right combination was hard to find.

b) The relevant techniques used, algebraic number theory, are not standard tools in combinatorial geometry.

c) It was widely believed that Erdős's conjecture was true.

d) Producing a counterexample seems less impressive than proving the theorem.  It would be of interest to see if OpenAI could prove the Guth-Katz result; however, that would not work now since Guth-Katz is already out there for OpenAI to see. Perhaps OpenAI should try to improve Guth-Katz. 


6) In the short term, this result and what it portends, is good: math problems we care about will be resolved with the help of AI, perhaps solely by AI. But in the long run we may lose the ability---or at least the patience---to do the proofs ourselves. Note that I am assuming that AI will have future successes---see item 10-ONE for a counterthought.

7) AI seems to be good at combining known concepts. Is it good at coming up with new ones? Are humans? The distinction between combining known ideas and coming up with new ones is very thin.

8) Two contrary lessons:

a) You should know many fields of mathematics so that you can use ideas from one field in another, like the AI did.

b) You should know some field of math really well so you may do something new in it that current AI could not have done.

9) If an AI produces a new treatment for cancer that is cheaper and better than what is known I do not think we will care that an AI did it (though we will have humans check it). Is mathematics similar? Do we care that an AI did it?  Medicine primarily values outcomes; Mathematics also values understanding. Does reading a proof that AI did give the same understanding as coming up with it yourself?  Can AI help with that as well?


10) I suggest two futures:

ONE: While this AI-generated (or AI-assisted) result is impressive, it will be a rare occurrence. This result was actually a counterexample. The needed math was known. The result was interesting. This is a perfect storm that we might not see again for a while.

TWO: Even before the AI revolution, when I came up with a math problem I wanted solved, I would seek help, perhaps too early. My curiosity far exceeds my ego.  Since AI makes it easy to get help, my fear is that eventually we will all be Bill Gasarch---scary.


19 comments:

  1. Replies
    1. This is the real Bill Gasarch. That last comment was not by me, but I agree: Scary!

      Delete
  2. 1. Who is so audacious and misrepresented to call it Author: OpenAI?
    I am referring to the one they uploaded on Arxiv. Note that they had not specified anyone human names regarding provers/readers.
    2. what is a \sqrt{n} by \sqrt{n} integer grid? can you motivate this please with a simple example? reason for using it and also how to use it

    ReplyDelete
    Replies
    1. 1) OpenAI are the ones who posted a paper with Author: OpenAI which I pointed to. Perhaps Chen, Selke, and Sawhney should have been authors.
      2) I have added to the two times I used the integer grid what I meant by it. Thanks for forcing me to make this a more informative post.

      Delete
    2. Gasarch's point 1 is wrong. This is truly an openai-wide achievement. There is a lot that goes into creating a model that can produce such a result. Singling out humans at the end of the funnel is simply inappropriate.

      Delete
    3. Since Sebastian Bubeck WORKS at OpenAI on AI, I defer to his wisdom.

      Delete
    4. @sebastian, howdy, would you be so kind to reach out to Theresa (Openai) and kindly connect with her on some of my inquiries? Thanks. Also, it would seem far more transparent to list individuals involved in this endeavor ... rather than just state "experts" were consulted. I thought that there's a degree of transparency involved here after all that OpenAi strives to maintain.

      Delete
  3. Computers have been helpful in doing math since at least the proof of the four color theorem.

    ReplyDelete
  4. so i would be interested to get more clarity in everyone who consulted for openai and helped on that preprint. no names were mentioned at all.

    ReplyDelete
  5. to be able to evaluate that kind of works well, you need to be able to reproduce it, and details of their setup.

    typically they spend massive amount of top engineering and researcher time to set up these things, they intervene in the process at least initially, they spend massive amount of compute and time, ...

    in other words we are not yet close to generic scalable systems.

    but yes, the progress is impressive.

    ReplyDelete
  6. so you are saying it wasn't just the "prompt" they pretended to have written to get the answer? or let me say, it was that prompt but finalize that prompt in exactly that form, took a lot of time since any other form would have yielded incorrect output?

    ReplyDelete
    Replies
    1. yes.

      these systems they do a lot of manual trial and error when setting them up, they keep changing them and asking new things to it that they think might be useful, etc. etc.

      it is not a system that a human gives it a problem and it returns a solution without human intervention. and often the system is configured for that particular program. and the problem selection itself is a human process as well, they pick problems manually that they think they might be able to solve.

      was the system told to prove the conjecture or was it told to find a counterexample is also interesting to know.

      so at this point, these systems are definitely useful, and they are becoming more useful, but the claims of AI doing things, always be skeptical about such claims, particularly when it is coming from a company that likes to overhype and tries to raise massive amount of investment and has many high profile former executives and board members say under oath at a court that it has dishonest and manipulative leadership and is trying to do a trillion dollar IPO later this year.

      so putting AI as the name of the author for this result is a great deceptive marketing.

      Delete
    2. The exact (AI-generated) prompt is given in the paper. It asks to resolve the conjecture, either in the affirmative or negative. The paper claims that "the problem was solved in a completely automated fashion." This suggests that they didn't repeatedly rewrite the prompt. (But I'm sure they tried to solve other open math problems and this is the most notable success.)

      Delete
    3. That is deceptive. It is certain they have given the system more than one prompt multiple times and that they have "tuned" the system based on failed ones until they got the successful one.

      This is how these experiments are actually done. These take a lot of time and resources and money to run. They are not going to let these systems run without monitoring what they do and intervening and modifying the system when they think it is not on the right track.

      They try to make it look like the standard system and model, the same one that we have access to, solved it. That is definitely not the case.

      Delete
  7. Regarding formulation. Are we talking about n randomly selected points on the 2-D Euclidean plane? how are they sampled and what distribution are we assuming? As far as the current formulation of the high school competition problem, would you consider it complete or incomplete to the point that it cannot be processed without further clarification?

    ReplyDelete
    Replies
    1. Erdos Distinct Distance Problem: (a) show that FOR ALL setsof n points in the plane there exists \ge BLAH distinct distances, (b) show that THERE EXISTS n points in the plane with \le BLAH distinct distances.

      Erdos Unit Distance Problem: (a) show that FOR ALL sets of n points in the plane there are \le BLAH unit distances, (b) show that THERE EXISTS n points in the plane with \ge BLAH unit distances (this is the one that we are discussing).

      The Erdos Distinct Distance Problem could have a random version: take n points in the unit square at random, what is the expected number of distinct distances. I suspect this is known.

      The Erdos Unit Distance Problem you can't use the unit square, or perhaps you could but pick a different distance. Or make the distance a parameter: Let 0\le d\le 1. If you have n random points in the unit sequare, what is the expected number of pairs with distance d. I suspect this is also known. If not then just ask ChatGPT or Claude.

      Delete
    2. @Gasarch, thanks! I didn't look into the Erdoes problem , but surprised that he didn't look at it this way. Back in the day, we'd say "google it" now we say "Chatgpt it" ... or "claude it", vastly superior 'media' in terms of falsely persuading someone that something is known
      whether it holds or not. In any case, i didn't see you mention "Gemini it".
      I can sense some eerie background of where things are headed ... for us ...

      Delete
  8. I'm confused a bit. Does recent ai result actually say anything about Distinct-distances problem? Or did you add it as a contrast that ai solved stuff humans had no progress on?

    You introduced it, and then never used it when talking about ai. Or was that the plan? Use it only as contrast against the unit-distance problem, not the ai usage? It just feels like it was a setup, and there's payoff about ai, but setup isn't connected to payoff.

    Or maybe I'm biased in that everyone talks about ai part of the news and title has "2 problems and ai", but this blog is mainly about problems...

    ReplyDelete
  9. YES it was there just for contrast. I did have a MILD reference to it later when I wonder how AI would have done on it before the Guth-Katz result but YES you are right that it is not an AI thing.

    ReplyDelete