Has Gemini surpassed ChatGPT? We put the AI models to the test.

AdamM · Jan 21, 2026

coopster said:
"I have to say I was surprised to see ChatGPT say that I joined Ars Technica in 2007. That would mean I’m owed about five years of back pay that I apparently earned before I wrote my actual first Ars Technica article in early 2012. ChatGPT also hallucinated a new subtitle for my book"

It's beyond absurd how much people trust these things when even a slightly critical look at the output shows how often it is wrong.

I don't love these prompts, but at the same time I hate all the "LLM tests" that they fully cheat and train on, so I don't have a better solution. All the stupid bullet points....

These are better used as an assistant rather than a brain replacer. If one were to trust it unquestioningly in every aspect, one would have a bad time.

If I had a reason to write a short biography of someone, having the structure laid out and then quickly proofreading and fact-checking it would still be a bit quicker than writing the whole thing from the ground up. I would hopefully do enough cursory research to quickly spot things that warrant further investigation.

Is it good enough to take someone's job unsupervised? No. Can it speed up some tedious tasks? Sure.

AdamM · Jan 21, 2026

dropadrop said:
I’m curious to understand the beginning of the article where it mentions the justification for using free versions. Do we actually know what models Siri would be using? I would expect that’s purely up to negotiations, and if Apple were willing to pay enough they could probably even get something custom?

Also, how much do we know about what Apple will do with it? I’ve always assumed they would not be aiming at creating a clone of the existing chat apps but rather turning Siri into something useful?

Purely conjecture here, but it seems unlikely that Google would offer something that cannibalizes its paid offerings, especially if Apple is only paying 1 billion/yr. So one could theorize that it will be equivalent to Google's free tier.

I imagine they'll also roll out an option to let Siri send queries to Google if users want to use their Gemini accounts, similar to the current ChatGPT arrangement.

Overall, I don't see this being much different from the arrangements DuckDuckGo has to run models privately.
