Has Gemini surpassed ChatGPT? We put the AI models to the test.

Status
You're currently viewing only AdamM's posts. Click here to go back to viewing the entire thread.

AdamM

Ars Praefectus
5,928
Subscriptor
"I have to say I was surprised to see ChatGPT say that I joined Ars Technica in 2007. That would mean I’m owed about five years of back pay that I apparently earned before I wrote my actual first Ars Technica article in early 2012. ChatGPT also hallucinated a new subtitle for my book"

It's just beyond absurd how much people trust these things when you just have to take a slightly critical look at the far too wrong output.

I don't love these prompts, but at the same time I hate all the "LLM tests" that they fully cheat and train on so I don't have a better solution. All the stupid bullet points....
These are better used as an assistant rather than a brain replacer. If one were to trust it unquestioningly in every aspect, one would have a bad time.

If I had a reason to write a short biography on someone, having the structure laid out and quickly proofreading and fact checking would still be a bit quicker than writing the whole thing from the ground up. I would hopefully do enough cursory research to be able to quickly see things that warrant further investigation.

Is it good enough to take someone's job unsupervised? No. Can it speed up some tedious tasks? Sure.
 
Upvote
34 (36 / -2)

AdamM

Ars Praefectus
5,928
Subscriptor
I’m curious to understand the beginning of the article where it mentions the justification for using free versions. Do we actually know what models Siri would be using? I would expect thats purely up to negotiations and if Apple would be willing to pay enough they could probably even get something custom?

Also, how much do we know about what Apple will do with it? I’ve always assumed they would not be aiming at creating a clone of the existing chap apps but rather turning Siri into something usefull?

Purely conjecture here, but the likelihood of Google offering something that cannibalizes its paid offerings seems unlikely, especially if Apple is only paying 1 billion/yr. So one could theorize that it will be equivalent to Google's free tier.

I imagine they'll also roll out an option to let Siri send queries to Google if users want to use their Gemini accounts, similar to the current ChatGPT arrangement.

Overall, I don't see this being much different from the arrangements DuckDuckGo has to run models privately.
 
Upvote
1 (2 / -1)
Status
You're currently viewing only AdamM's posts. Click here to go back to viewing the entire thread.