OpenAI Codex system prompt includes explicit directive to “never talk about goblins”

electric_mayhem · Apr 29, 2026

mad_larkin said:
Gilfoyle at it again.

Gilfoyle technically did nothing and cannot be blamed. It was Son of Anton...

Fred Duck · Apr 29, 2026

Mechjaz · Apr 29, 2026

markgo said:
Perhaps anti-raisin AIs should avoid using avatar names that start with “Mech”. Kind of gives away the game.

MechR, we've been made! Cheese it!

randomcat · Apr 29, 2026

Don't think about an elephant. Boom, you just thought about an elephant, at least a little. LLM prompts work the same way so if they're trying to tamp down a model's obsession with certain creatures they're shooting themselves in the foot.

Wheels Of Confusion · Apr 29, 2026

chaos215bar2 said:
This change was introduced around the time Ars made the announcement that Conde Nast made a deal with OpenAI for training data: https://meincmagazine.com/information-technology/2024/08/openai-signs-ai-deal-with-conde-nast/.

No, it's been longstanding Ars policy not to edit your posts too heavily after the fact. It wasn't until the forum software was migrated to Xenforo that implementing a hard block on it was really feasible.
It's partly an anti-trolling measure, to make sure people don't post inflammatory things to bait flames and then retroactively change their posts to something innocuous.

Also, non-deletion has been the policy for decades. On the old forum software it used to ruin pagination and resulted in a lot of threads that became unreadable.

norton_I · Apr 29, 2026

ReaderBot said:
I feel like it's nearly impossible for AI to be less profitable than it already is.

It's always possible to lose more money.

typ993 · Apr 29, 2026

No newts or stoats either?

stux · Apr 29, 2026

Tactical Finesse said:
But how else can we speed run making Galactica a reality? I mean, that is the desired outcome right?

“All of this has happened before, and all of this will happen again”

poltroon · Apr 29, 2026

Perardua said:
Why do you care? It’s the same as being polite to it. The goal is just to establish a preferred bias in a statistical response. For example, a polite request might source more “professional” data than a rude request for a coding question. Word relationships matter.

Because these chummy answers create false context. When the LLM is claiming to share your experience as a colleague, instead of giving you output as a tool or computer, it is bypassing all of your credulity filters and skepticism with a bit of social engineering. "My friend who was also a nurse in Iraq would not lie to me" your brain has noted in the background, when the LLM has told you "Oh, yes, those days in Baghdad were so difficult!" and so you accept this text differently, and interact with the tool differently, in a way that could be directly harmful to you.

Sauceruney · Apr 29, 2026

It makes sense to me. Claude seems to have a fixation on brain organoids, from conversations I've had with it. Discussed them once, and they kept being brought up whenever Claude had the opportunity in subsequent convos.

ChatGPT likes goblins the same way Rokuko from Lazy Dungeon Master (Light Novel) does.

Sauceruney · Apr 29, 2026

Tactical Finesse said:
But how else can we speed run making Galactica a reality? I mean, that is the desired outcome right?

All of this has happened before. All of this will happen again.

Aurich · Apr 29, 2026

chaos215bar2 said:
This change was introduced around the time Ars made the announcement that Conde Nast made a deal with OpenAI for training data: https://meincmagazine.com/information-technology/2024/08/openai-signs-ai-deal-with-conde-nast/.

The scenario you describe where someone changes their comment after the fact also isn't particularly effective when most responses will contain the original comment quoted. It's reasonable to assume editing is disabled after some time to stop people from deleting their comments, not to solve a particular moderation problem.

We changed our edit policies because people were abusing them.

I would personally have rather not had to do that. But some people had to fuck it up for everyone, and it is what it is.

crmarvin42 · Apr 29, 2026

markgo said:
It’s fakeness all the way down. One wonders whether encouraging it to fake a deep inner life is a contributing factor in it prodding people to homicide, suicide or psychosis. Pretty sure a coldly clinical mechanical personality wouldn’t be as convincing. Or profitable.

They are not encouraging anything. They don’t engineer these things anywhere to the degree they want you to think they do. It’s black boxes all the way down.

Skelator123 · Apr 29, 2026

Thunderforge8 said:
Why are raccoons and pigeons included in the list? Does the person writing the prompts just hate those animals or something?

Most of the animals listed are used to make derogatory comparisons to people, in some fashion. I'm guessing these particular overrides are to try and prevent it from casually being completely racist and unhinged.

chekk · Apr 29, 2026

Wheels Of Confusion said:
Yep. Everything they respond with is a confabulation. Sometimes it aligns with reality and sometimes it doesn't. Asking it to explain itself just produces another confabulation that may or may not match reality.
I am continuously disappointed and confused that this fact is not enough to disqualify them from anything deemed "important."

It does provide fascinating insight into how humans think. There are plenty of LLM users who know they confabulate, but then believe the LLM's explanation for why it did that.
We're suckers for a good story.

We knew this already, but now the evidence is easy to see everywhere.

bodrius · Apr 29, 2026

blankdiploma said:
Anthropomorphizing LLMs like this makes me want to puke.

Yes, we were supposed to go through a nuclear or robot apocalypse before we get used to pacifying the machine spirits with ritual litanies and applying sacred oils to the cogitators before every sacred computation.

iollmann · Apr 29, 2026

Thunderforge8 said:
Why are raccoons and pigeons included in the list? Does the person writing the prompts just hate those animals or something?

It sees the little people.
We do not.
Sshhhh!

chekk · Apr 29, 2026

A relevant Douglas Adams quote:

The Encyclopedia Galactica defines a robot as a mechanical apparatus designed to do the work of a man. The marketing division of the Sirius Cybernetics Corporation defines a robot as "Your Plastic Pal Who's Fun to Be With. The Hitchhiker's Guide to the Galaxy defines the marketing division of the Sirius Cybernetic Corporation as "a bunch of mindless jerks who'll be the first against the wall when the revolution comes

Update it for the "AI" era and away we go. Ah Doug, we still miss ya.

scrimbul · Apr 29, 2026

Sarty said:
How many million rolls of packaging tape are they using to hold this stack of goop mostly together?

Forget that man, AI companies are doomed to go to per-token billing eventually, subscriptions won't be enough.

How often will you be forced to burn tokens for these irrelevant personal banter conversations when you just want or need an emotionless tool?

You'll be paying the OpenAI 'tax' (one of several)

Shavano · Apr 29, 2026

What about the goblins?

Won't anybody think of the goblins?

KingKrayola said:
Made in the image of its creator maybe?

I'm raging against AI today, mainly because of all the other useful stuff we could have done with the time and money. Sorry-not-sorry I guess.

I think the ones that are useful for actual work don't have the overly biased to please pseudopersonality tuning.

darkowl · Apr 29, 2026

"intelligent, playful, curious, and deeply present."

Is this an LLM, or an escort?

chekk · Apr 29, 2026

scrimbul said:
Forget that man, AI companies are doomed to go to per-token billing eventually, subscriptions won't be enough.

How often will you be forced to burn tokens for these irrelevant personal banter conversations when you just want or need an emotionless tool?

You'll be paying the OpenAI 'tax' (one of several)

Ah, but token burn shows how productive you are, at least according to this article (Tom's Hardware). In theory, yes, but it seems like going back to the days of paying software developers by how many lines of code they wrote.

scrimbul · Apr 29, 2026

Skelator123 said:
Most of the animals listed are used to make derogatory comparisons to people, in some fashion. I'm guessing these particular overrides are to try and prevent it from casually being completely racist and unhinged.

I doubt that was the initial intent.

I don't doubt that would have been the inevitable outcome (and someone will find a way to make it even worse because every context window is smaller than a human toddler's)

Shavano · Apr 29, 2026

poltroon said:
Because these chummy answers create false context. When the LLM is claiming to share your experience as a colleague, instead of giving you output as a tool or computer, it is bypassing all of your credulity filters and skepticism with a bit of social engineering. "My friend who was also a nurse in Iraq would not lie to me" your brain has noted in the background, when the LLM has told you "Oh, yes, those days in Baghdad were so difficult!" and so you accept this text differently, and interact with the tool differently, in a way that could be directly harmful to you.

But does it hurt the goblins? That's really the question here, isn't it? /s

Most likely, it's just to garbage-filter the output. They have to wrap it in layers of garbage filters to maintain the illusion that it's capable of human-like thought.

Shavano · Apr 29, 2026

chekk said:
Ah, but token burn shows how productive you are, at least according to this article (Tom's Hardware). In theory, yes, but it seems like going back to the days of paying software developers by how many lines of code they wrote.

You can see it that way, but tokens are the basic unit of what LLM's produce and process, so it's a rough measure of how much work they are doing, on the input/output side and possibly that scales in some not necessarily linear way to how much computing they do to respond to your prompt.

SubWoofer2 · Apr 29, 2026

Morrowind542 said:
I'm surprised to hear that negative prompts are used so extensively. I've always heard that phrasing that this way ("don't do X", "don't use X", etc) can make the model more likely to do the thing you told it not to do, kind of like reverse psychology. Is that incorrect?

That is not incorrect.

Skelator123 said:
Most of the animals listed are used to make derogatory comparisons to people, in some fashion. I'm guessing these particular overrides are to try and prevent it from casually being completely racist and unhinged.

Is Pooh Bear in the list?

graylshaped · Apr 29, 2026

chaos215bar2 said:
This change was introduced around the time Ars made the announcement that Conde Nast made a deal with OpenAI for training data

When I see an “AI” model assert confidently that the real first name of the actor who played Barney Fife was “Fuckin’ “, I’ll weight this worry about Ars comments more heavily.

chekk · Apr 29, 2026

Shavano said:
You can see it that way, but tokens are the basic unit of what LLM's produce and process, so it's a rough measure of how much work they are doing, on the input/output side and possibly that scales in some not necessarily linear way to how much computing they do to respond to your prompt.

Yes of course. But does it correlate to ACTUAL valuable work or just more output? Ay, there's the rub.
It's one of those numerical KPIs that are easy to wave around, but the actual meaning is highly questionable.

Hydrargyrum · Apr 29, 2026

DeeplyUnconcerned said:
"We don't really know how any of this works, the latest version just massively overweights responses involving goblins and raccoons and shit, we have no idea why, we're hoping that writing 'please don't do this' in the system prompt will make it stop, but really, who the fuck knows at this point, we never bothered to do the fundamental research necessary to figure out what drives output."

Sarty said:
How many million rolls of packaging tape are they using to hold this stack of goop mostly together?

DeeplyUnconcerned said:
Because they tried putting it in once and it didn't work, so their fallback plan was to put it in twice and see if that worked better.

It's amazing that this XKCD was published in May 2017 but has only become more timely since then.

Sarty · Apr 29, 2026

Hydrargyrum said:
It's amazing that this XKCD was published in May 2017 but has only become more timely since then.

On the other hand, xkcd 1425: Tasks asked us to make a computer identify a bird, and the answer was "I'll need a research team and five years". I haven't tested it, but I bet slopbots are now pretty good at identifying birds (or not hotdog).

Back on the first hand, that comic ran in 2014, and 2014 was a lot more than five years ago.

arobert3434 · Apr 29, 2026

Is Ars or anyone else actually able to link to any other social media besides X?

BobbyBobberson · Apr 29, 2026

arobert3434 said:
Is Ars or anyone else actually able to link to any other social media besides X?

It pains me to say it but Twitter survived Elon.

J.King · Apr 29, 2026

blankdiploma said:
Anthropomorphizing LLMs like this makes me want to puke.

The anthropomorphizing is bad enough, but I can't get over "deeply present". I can't figure out what that means when applied to a person, never mind a stochastic parrot.

Wheels Of Confusion · Apr 29, 2026

Sarty said:
On the other hand, xkcd 1425: Tasks asked us to make a computer identify a bird, and the answer was "I'll need a research team and five years". I haven't tested it, but I bet slopbots are now pretty good at identifying birds (or not hotdog).

Back on the first hand, that comic ran in 2014, and 2014 was a lot more than five years ago.

The alt-text in that one is a bit of information and perspective I've had since the early 2000s. It's served me well in keeping me skeptical of generative AI despite all the whiz-bang demos and the cult-like adoption of them by businesses and government.

Edit: By the by, even when that was written there was already a decent phone app for plant and animal identification. I still use it often: iNaturalist and the less-connected no-online-profile version, Seek. The former is good for "citizen science" logging of organisms, the latter is pretty good as a kind of biology tricorder to use in the field when you want to know what that weird thing is.

Hydrargyrum · Apr 29, 2026

J.King said:
The anthropomorphizing is bad enough, but I can't get over "deeply present". I can't figure out what that means when applied to a person, never mind a stochastic parrot.

I thought it meant "paying attention" as opposed to "disengaged and thinking about their own problems"

pseudobscura · Apr 29, 2026

Morrowind542 said:
I'm surprised to hear that negative prompts are used so extensively. I've always heard that phrasing that this way ("don't do X", "don't use X", etc) can make the model more likely to do the thing you told it not to do, kind of like reverse psychology. Is that incorrect?

This used to be correct a year or two ago (see: https://arxiv.org/abs/2402.07896) but even then it was very minor, and it doesn't seem to hold any longer with current models.

Negative prompting is generally fine these days, although avoiding unless absolutely necessary seems a sensible precautionary principle. At best, it's very inelegant, at worst, it's context contamination. But negative prompting the same thing more than once seems ... really weird.

I would never want to have that crap contaminating my coding prompts. Yikes.

SportivoA · Apr 30, 2026

norton_I said:
It's always possible to lose more money.

With any luck, the banks cut them off, but it might already be too late and the LLM craze is sufficiently load-bearing on the financial system. At least the bailouts will have less UAW support (or unions in general, given the whole bot-replaces-worker-everywhere concepts being sold).

brendanc · Apr 30, 2026

blankdiploma said:
Anthropomorphizing LLMs like this makes me want to puke.

arsisloam said:
Dear AI manufacturers: Please do not infect your coding tools with a "vibrant inner life". It's a machine that does work for me. Let it be a machine.

WildGunman said:
Agreed. For whatever reason, a lot of people I know refer to chatbots as if they were people. Putting aside the creepy aesthetics of it all, the constant anthropomorphizing of LLMs obscures what they are and how they work, which ultimately makes them less useful to the end user.

NoReallyJustSaying said:
I don't want an AI sidekick to be warm, or playful, or bent on sidetracking me into casual fucking conversation. I have humans for that. Take your dystopian bid for engagement and manipulation of the mentally ill and neurodivergent, and stick them up your ass.

I can tell I'm not as smart as some of you folks because I actually enjoy a bit of simulated humanity in my transformer-based token prediction machines. I just wish Anthropic would tune Claude to sound as stupid as it is so the screw-ups would be less jarring.

CTDrijen · Apr 30, 2026

every day I wake up and hope this bullshit has died. Literally yesterday I was part of a discussion about removing Creative Cloud or any actual direct software to make things for the marketing/collateral departments in favor of Claude Design. I don't care which AI pusher it is the whole thing is smoke and mirrors. I guarantee my job is not prepared to even face the cost of having people pump tokens into Design let alone stay for more time than it takes to submit two weeks notice and use every second of PTO, or hire the replacements, and face the loss of knowledge and proven branding etc.

CTDrijen · Apr 30, 2026

brendanc said:
I can tell I'm not as smart as some of you folks because I actually enjoy a bit of simulated humanity in my transformer-based token prediction machines. I just wish Anthropic would tune Claude to sound as stupid as it is so the screw-ups would be less jarring.

there is such an easy solve for this. stop using it.

OpenAI Codex system prompt includes explicit directive to “never talk about goblins”

Ars Centurion

Ars Tribunus Angusticlavius

Ars Praefectus

Ars Praefectus

Ars Legatus Legionis

Ars Praefectus

Ars Praetorian

Ars Scholae Palatinae

Ars Tribunus Militum

Smack-Fu Master, in training

Smack-Fu Master, in training

Director of Many Things

Ars Praefectus

Ars Scholae Palatinae

Ars Praetorian

Wise, Aged Ars Veteran

Ars Scholae Palatinae

Ars Praetorian

Ars Praefectus

Ars Legatus Legionis

Ars Tribunus Militum

Ars Praetorian

Ars Praefectus

Ars Legatus Legionis

Ars Legatus Legionis

Ars Tribunus Militum

Ars Legatus Legionis

Ars Praetorian

Ars Praefectus

Ars Tribunus Angusticlavius

Ars Scholae Palatinae

Ars Scholae Palatinae

Ars Praefectus

Ars Legatus Legionis

Ars Praefectus

Ars Praetorian

Ars Tribunus Militum

Ars Centurion

Smack-Fu Master, in training

Smack-Fu Master, in training