What source did you provide? All I see is that this was a general purpose model trained for this task. Many people believe that represents the best use of this technology: bespoke models specifically and exhaustively trained on curated data to perform well-considered and defined tasks.You joking? They are the ones that did it!
The guy I quoted confidently wrote that 'they used a specialised model' - he provided no source and yet you swallowed that whole.
I provide the only source you can trust and you question it.
If you don't believe OpenAI, why even believe that a model did this? Maybe a mathematician did it and they just took credit for it - in that case, this whole article by Ars is based on a press release.
Conspiracy theorists are truly nut cases...
I strongly suspect there is a reason why it sounds like spam marketing fluff.On the one hand, this sounds a lot like the old spam post you used to see on poorly-moderated fora along the lines of "my friend quit her job to work from home, and now the makes over $5,000 a month using this one simple trick!"
I was finding interesting the dude named after Lenin's confidence that he was among the enlightened and therefore suitable to cast judgment on the perspectives of others on how society's resources are best employed.Dude named after Lenin posts about his desire to "get rid of people". If you're not trolling, then I'd strongly suggest some introspection.
Let's see if you have the ability to read your own words and apply introspection.It's a spectrum of abilities and it's very jagged - a genius physicist or surgeon might struggle to understand basic things only slightly adjacent to their field.
I'll subscribe to this newsletter, with a note to the editor that it is less that I object to using these tools to do those things, it is misrepresenting what using those tools means AND that our choice to do so without consideration affects who we are.... I'm not anti-AI. I'm anti-abuse of AI.
This seems to me like a perfect use case for AI. It extends human capabilities and human knowledge. No one was able to solve this problem until they used AI as a tool to help do it.
What I object to is using AI to do things that humans not only can do but which are expressions of what it means to be human.
Hey, it's Zen. Enlightenment will come to us when we embraceYou're taking as a given that these LLMs think like us, that they are indistinguishable from us, and that they have our capabilities and better.
And you cannot consider why people are dismissing your ideas? You've created a godhead and you can't even recognize it.
Now THAT's poking the bear.The real question is if AI recommends pineapple pizza because pineapple pizza is the best pizza
Acknowledged and glossed over even in detailed discussions with the major players, the plan has always been:Economics calls this the law of diminishing marginal return.
...
Extracting further improvements from the existing paradigm is going to get exponentially more expensive if they can't figure out a new avenue. They've already reached diminishing returns on training and recall phases of the process. I'm not sure where they're going to find more juice to squeeze.
Think about that. "After it was announced" other models could get there if pointed in the right direction. Future models will have that knowledge in their training data. Incremental improvement is to be expected if not required just to keep pace--which is why training of models shouldn't be considered one-time start-up outlay. It is an ongoing (and opaque) component of operating cost, measured over lifecycles that are pretty darn short.Check the last paragraph:
"At the same time, we haven’t fully explored what current models can achieve in math. Soon after OpenAI’s announcement, University of Michigan postdoc Xiao Ma found that GPT-5.5 was also able to prove Erdős wrong if given a small hint. If a generally available model could disprove this famous conjecture and no one noticed, what other discoveries could happen today that no one has thought to try?"
It's not that much of a leap to think that the next gen general model can do that without a hint.
Is this before or after we spend eight pages of comments regurgitating debates in other threads about what definition of "understanding" we start with before people who really want us to accept that "AI" is Just Like Real Folks move the goalposts on us?Do I need to adjust my understanding of decidability, or are the LLM salesmen just obfuscating things with handwaving and "do it in an AI model"?
I asked what source YOU provided. Are you referring to saying "RTFA"? The article AND the links in it all say it's an unreleased general purpose model, but say nothing else, and I suggest you are likely over-stating how generic this model and its training may have been. Very few rational people are going to accept that this one time OpenAI is giving us all the information.You didn't look up the thread?
Two people put this without any sources:
Disproof and proof are different things. The article and the ones you link to explain this.OpenAI said it.
Proof: Planar Point Sets with Many Unit Distances
Models chain of though (abridged): Rewritten Chain of Thought for the Solution to the Unit Distance Problem
Remarks by mathematicians: Remarks on the Disproof of the Unit Distance Conjecture
Dunno about you, but I had a lot of math and many of the questions involved filling in a blank of some sort.First it was autocomplete.
Pretty amazing that autocomplete could solve a mathematical problem which has stumped mathematicians for 80 years.
Somewhere in there is a reference to a table provided by OpenAI indicating it was a fifty/fifty process, which I read to mean on multiple trials of an unspecified n about half the time it found this answer, and half the time it didn't. Also unspecified is who did the initial validation before declaring Eureka! and sending this out to the mathematicians whose evaluations were subsequently part of their PR campaign. In the comments of those mathematicians were references to buzzes in that community over the model's apparent success before it went wide.Agreed. The reaction to, in essence, a million monkeys smashing the keys on typewriters 24/7 putting out a paragraph from Romeo and Juliet word for word is a bit over the top imo, especially considering what it is costing us to have those monkeys typing away 24/7.
It's weird that they don't mention anywhere how many tries this took (I'm willing to bet it wasn't a one-shot).
I am going to stun people who want to put me in a box of their design and say something positive about these tools: They are excellent at identifying possible connections that represent a true synthesis, in the spirit of what Johansson called the Medici Effect.Great write-up.
Not to lessen the accomplishments of the engineers (and presumably mathematicians) who created this internal OpenAI model, but there is a key phrase in your article that sticks out: "But it didn’t pioneer any genuinely new techniques. "
You are right, of course. And this to me indicates a distinct possibility as it relates to these LLMs that many people are so enamored with. It's possible that it actually cannot create anything genuinely new, not now, and maybe not ever - just mashups of what it has been trained on. I
You won't even look at what YOU wrote. You said you provided a source. Which source? I found none offered by you in this thread other than the "RTFA" comment. The "FA" does not go as far as to say what you assert, either.Are you missing the poster I was responding to made a confident statement about the type of llm without providing any source?
Boy you guys are really a cult now aren't you? You won't even try to look at anyone in-group...
A "cope." Noted for future reference.This reads as cope. Assuming it was 50/50, it still accomplished the end goal and served as an incredibly useful tool for the humans working on the problem. These models have not plateaued and clever people will continue to use them to advance science and math. Just as humans have always used tools.
This is why I equate these comments to anti-vaxxers. The mRNA vaccines have side effects: "Not perfect, ban it!!" "Scary new tech made by evil pharma lizard people!". Meanwhile an mRNA vaccine just brought us to striking distance of curing pancreatic cancer. We are just scratching the surface with generalized LLMs. Maybe it leads nowhere, but there is enough evidence now that makes it worth exploring.
That's not at all what that article was saying. It was saying when provided with assumptions identified as false, the models were prone to using the false assumptions in forming their output without regard for their lack of validity.Wasn't there just an article about how AI didn't change it's stance when presented with new evidence? And commenters like you were using it as evidence of your view? And now the same people are in denial of this well written article, even going so far as to pull an unrelated analogy from 18 years ago? So funny.
https://meincmagazine.com/ai/2026/05/...en-after-explicit-warnings-that-theyre-false/
There are two kinds of people in this world: Those that insist on putting everybody into boxes.It's happening slowly, and bifurcating into those interested in learning and those that autocomplete the comment box with nonsense they hear in their 'information' bubble.
Agreed!I applaud Ars for doing a deep dive on one of the many fascinating stories showing the usefulness of these tools in augmenting human progress.
According to the article, "maximum":I wonder what the token budget actually was?
... an OpenAI chart revealed that even with the maximum token budget, the internal model solves the problem only half of the time.