LLMs believe false statements even after explicit warnings that they’re false

arsisloam · Thursday at 8:22 PM

poltroon said:
For something to be marked as false in a way that the LLM can ingest, it would have to be labeled as false in metadata, right? Because otherwise, when you're building the tokens and the statistical relationships, it's very easy for the negated words to fall out depending upon how you've set up the context. It's still true that the statements are statistically similar.

Setting up your context so it always knows where a fact comes from and gives a true citation is I think the only safe way forward for anything of importance.

Statistically similar is just not the same as true or accurate, and that it is able to plausibly pass for that 90% of the time only makes it more dangerous.

Except for all the times we've tried that, and it doesn't work? LLMs may refuse to follow their directives. it gets worse the longer the model runs. I would love to see someone actually fix that, but it seems inexorably tied to temperature.

arsisloam · Friday at 4:01 AM

SraCet said:
If it was the year 1995 and a coworker told you that AltaVista was useful, would you contradict him? Because internet search engines don't always return relevant results, or sometimes they point you to results that aren't factually accurate?

Would you make that person write a 3 page essay about how internet search engines work?

LLMs are undeniably useful. Or, rather, you can deny their usefulness, but you'll look like an idiot.

Things can be flawed but still useful.

You are not an LLM. You are flawed, and useful. I believe in you.

arsisloam · Saturday at 12:33 AM

nimelennar said:
How about a simple test of understanding what a letter is and counting how many of them are in a word?

Here's a simple question I asked Gemini a few minutes ago.

How low do you think someone's IQ would have to be before a majority of them couldn't answer this question correctly (i.e. read a word and count to two and not past it)?

Once again proving how embarrassingly dumb Gemini is.

All LLMs are on this spectrum. When you use them a lot to do technical things, the logic loops, downward spirals of "what if's" and "the actual problem's", and just point blank idiocy will shine through in the most frustrating of ways.

They can still do useful work. They're useful machines. But they are not remotely aware or intelligent. I can just reinitialize qwen, wipe it clean and start over, and it keeps humming along processing prompts as if nothing happened, because there's no there, there. It's crystalline clockwork.

arsisloam · Sunday at 1:46 PM

SraCet said:
You're accusing me of not reading or understanding your posts, and then you go and attribute this strawman nonsense to me?

I never "stated" or even implied any such thing.

My whole thing is that saying LLMs are "statistics" is reductive to the point of uselessness. Actually, even talking about "statistics" in the context of LLMs is worse than useless, because it gives stupid people the idea that all LLMs are doing is Bayesian inference or n-gram completion or similar.

I don't agree with the way people are dismissing "statistics" as if that's a valid refutation to "LLMs are thinking*."

But, a weights file is essentially a complex lookup table. LLMs fundamentally are solving systems of polynomial equations (with up to a trillion parameters now, apparently).

*They are not aware. They are text prediction machines. That does not reduce them, unless you're deeply invested in believing they are alive. Which, no. Maybe one day. Not yet.

arsisloam · 2026-06-01T11:38:08-0400

wildsman said:
Let me know when you have an objective, falsifiable alternative measure of 'intelligence'/'understanding' and then we can talk.

In the meantime, people who are using operational definitions to continue their work are going to do just that.

Counting letters in words, displaying empathy and compassion when a person is in crisis, and not hallucinating sources.

arsisloam · 2026-06-01T14:05:04-0400

SraCet said:
Throw that in the bucket of nonsense that people say you're arguing but you aren't.

"Conscious," "self-aware," "alive," and now positive "net utility."

It's just strawmen all the way down.

This is nobody's first time around the shed with wildsman. I used to think they were reasonable too, and their core point that LLMs are a kind of simulation of thinking has merit. But wildsman is way out there. They think current LLMs, as they are now, are living, thinking entities with agency.

Search

Search

LLMs believe false statements even after explicit warnings that they’re false

arsisloam

Ars Scholae Palatinae

More options

arsisloam

Ars Scholae Palatinae

More options

arsisloam

Ars Scholae Palatinae

More options

arsisloam

Ars Scholae Palatinae

More options

arsisloam

Ars Scholae Palatinae

More options

arsisloam

Ars Scholae Palatinae

More options