Anthropic says these topics are too dangerous to let its Fable 5 model talk about

ForbiddenBarn · Tuesday at 7:08 PM

access the Fable 5 model at a cost of $10-per-million input tokens and $50-per-million output tokens

This pricing structure is quite absurd when you think about. Anthropic has control over the output tokens, and they charge more for those. They can fine tune exactly how verbose the model output is, and obviously the more verbose the more money they make. Opus 4.6 was very succinct. Opus 4.8 is extremely verbose. I tried 4.8 and immediately went back to 4.6 because it was burning through tokens a lot faster.

The "seller" is deciding how much product to sell you. Google search doesn't charge you by how many results your query returns.

Wanzerr · Tuesday at 7:54 PM

As usual, the billionaire class is keeping the sarin nerve gas and pipe bombs all for themselves.

Ahabba · Tuesday at 7:56 PM

lonelytheonly said:
Banning biotech is ridiculous. Ban obviously malicious queries, but this is too much.

I just tried this out on a research project that compares pathogen profiles of samples across various geographic regions, and it flagged an innocent stats question as a violation of their usage policy. It wasn't even referencing pathogen species, just broad categories (bacteria, fungi, etc.) Seems like this model is totally useless to me.

Hammer_Thyme · Tuesday at 7:58 PM

cpragman said:
“1,000 hours of red-team testing” doesn’t actually sound like a lot.

Our team of 25 engineers spent all week with it!

nuurdin · Tuesday at 7:58 PM

I don't know, honestly. I could see how this could be a real bummer, definitely. But this also feels to me like an advertising ploy. Like how B-movie theaters used to make you sign a form indemnifying them if ZaAt scared you to death...Or actually, maybe it's even more like how that rabbit doesn't want the kids to get its (sugary, unhealthy, tasteless) breakfast cereal...but then again, maybe I'd feel differently were I in cybersecurity.

borland502 · Tuesday at 8:23 PM

Heh, Ars comments ... you are always there when I need to depart the hype train and remember the cost of the ticket.

borland502 · Tuesday at 8:26 PM

Wanzerr said:
As usual, the billionaire class is keeping the sarin nerve gas and pipe bombs all for themselves.

You addicted to seeing how the ultra rich build their bunkers too?

Perardua · Tuesday at 8:37 PM

East Wind Rain said:
Chinese AI engines (DeepSeek) won't talk honestly about Tiananmen Square, 1989.

And if Trump inserts federal government ownership into U.S. AI companies, those chatbots won't speak freely on many topics as well.
Trump explores federal government acquiring shares in AI companies

The alternative is corporate or billionaire gatekeepers. Sigh… the government was all we had. I fear the future will be quite dark.

Random_stranger · Tuesday at 8:40 PM

Anthropic: We were in the nick of time. You were in great peril.
Coder: I don't think I was.
Anthropic Yes, you were. You were in terrible peril.
Coder: Look, let me go back in there and face the peril.
Anthropic: No, it's too perilous.
Coder: Look, it's my duty as a ~~knight~~ security coder to sample as much peril as I can.
Anthropic: No, we've got to find the Holy Grail (of AGI). Come on.
Coder: Oh, let me have just a little bit of peril?
Anthropic: No. It's unhealthy.

SteveBigandTall · Tuesday at 9:18 PM

How many tiomes will you folks and other tech media writers just take all these prerss reelases as fact.

Good lord.

This product is reaaaaaaaly dangerous guyz, but we'll let a few trusted folks have it!.

Marlor_AU · Tuesday at 9:27 PM

picosec said:
I feel like the whole "our LLM is too dangerous to release" schtick is kind of played out. It seems more like a combination of hype and CYA for when stupid people do stupid things using the LLM (the first stupid thing being trusting the LLM to produce accurate results).

The way I see it is that model advances are getting more and more expensive. Recent advances have increasingly relied on scaling up rather than core architectural improvements. Mythos has more than 10 trillion parameters. Compare this to GPT5's estimated 1.8 trillion parameters, or GPT-3.5's estimated 200 billion parameters. It's all about scale now.

The companies creating these models can't stop scaling up or they'll fall behind, but it's becoming tremendously expensive and unsustainable. Advances in compute hardware aren't happening anywhere near fast enough to support this explosion in scale. The only way to stop the feedback loop is for someone to intervene and force them to slow down. And the best possible case for that happening is on "safety grounds".

It almost seems they want it both ways – their whole shtick is about making ever more capable models, but at some point, they need regulators to step in and save the whole industry from imploding in a self-destructive cycle of unsustainable model scaling. If that can happen on "safety grounds" because the models are "too powerful", then that's a better story than simply admitting that scaling has run its course.

iollmann · Tuesday at 9:30 PM

I haven’t done chemistry professionally in 25 years, but it seeming like the field may have some life in it yet!

Geebs · Tuesday at 10:09 PM

So, basically, some dumbass snuck a copy of The Anarchist’s Cookbook into Mythos’s training set, it’s far too expensive to take it out now, and Anthropic PR is spinning this is as corporate responsibility?

lonelytheonly · Tuesday at 10:30 PM

Legatum_of_Kain said:
Can't this just collapse already?

Markets whipsaw as AI sell-off resumes

https://edition.cnn.com/2026/06/09/investing/nasdaq-sp500-dow-drop-ai

I must be the proverbial bellboy because I was just thinking it's time to buy in. :flail:

scrimbul · Tuesday at 10:52 PM

Sarty said:
If we've learned anything, I'm convinced we've learned that these two words don't belong adjacent to each other, unless you define the term "AI" so broadly as to be completely empty and useless.

You can probably say that about "[tech thing] inevitably..." in general.

Editing quotes is problematic, at best.

I fully understand the humor, point, and intent behind your discourse but other people might report it because it's dead seriously not appropriate.

If I have to clarify the previous post, ''either too dangerous for public use or absolutely fucking useless" is absolutely still accurate in economic and social terms. There's definitely use cases for it in the academic and backend infrastructure spaces, but just letting the general public suck down tokens or flagrantly display it's had more bad faith use cases than good ones since 2018 is still proving to have been the dumbest Pandora's Box ever, and the proof is in the pudding on both the token expense and public opinion polling.

You can pick nits over absolutist wording or insist on vague sarcasm about middle ground cases all you want, but they're going to drive the next Great Depression from sucking up capex from everything else for a shitty 8-ball that tries to murder people when you reject it's Git requests.

bushrat011899 · Tuesday at 11:00 PM

cpragman said:
“1,000 hours of red-team testing” doesn’t actually sound like a lot.

What are the bets that the "Red team" was Mythos?

Raspberry · Tuesday at 11:16 PM

"I could beat you if I wanted to. I just don't want to."

Decoherent · Tuesday at 11:56 PM

Legatum_of_Kain said:
Can't this just collapse already?

No, just hold on a bit longer, so we get better models that we can run locally! (well, -ish). THEN everything can burn down.

RZetopan · 2026-06-10T00:39:33-0400

Fred Duck said:
Biology? Now we'll never know How Is Babby Formed.

Creationists, and their children, need to be protected from the science of evolution lest they lose their fierce convictions by accidentally discovering reality. /S

BangBangKoala · 2026-06-10T00:49:01-0400

Who can you trust anymore?

RZetopan · 2026-06-10T00:56:59-0400

MilanKraft said:
Here we go again. Get used to it, folks.

This is part of the new business model... has little to do with the model being somehow amazingly more powerful than whichever ones came immediately before it. They're not.
These ridiculous PR games are obviously designed and timed (for now) both to hype their own product and to draw attention away from something competing companies do. A few days ago there were articles about OpenAI officially filing for their IPO... I'm certain this announcement has nothing at all to do with that. /s

And last time it was "our model is so dangerous on the security front we're only sharing it with the top 50 companies," even giving the sharing a project code-name (they don't make a roll-eyes big enough for that one). Then 3 days later, OpenAI releases model with a "me too! me too!!" announcement for the exact same thing.

Fuck these clowns. Hopefully, once the IPOs are done, these nothingburger announcements and idiotic one-up-manships will at least slow down some.

But our latest model is vastly superior to our previous vastly superior model! It even passes tests that we created specifically for allowing it to pass! Progress! Our profits are up nearly 1,600%.*

*Matching the Orange Shitgibbon's "vastly superior" medical prescription savings, which the most moronic and innumerate GQP republicans still try to defend.

RZetopan · 2026-06-10T01:20:45-0400

JohnW1234 said:
That's the cool part! They get to do the judging, so they judge themselves trustworthy.

Irony is an alien concept to obscurantists. One very large company that I worked for (near 100K employees) had an IT department that totally ignored a worm that was ravaging the internet at the time. After 3 days of total computer systems shutdowns from the worm, IT finally recovered. And then they proceeded to give themselves great awards for what they had done. I don't think that the clueless C-suit management ever caught on. After all, it is only fair that the perpetrators get to award themselves for a job well done.

ForSure · 2026-06-10T01:56:50-0400

It feels like we're stuck in a news cycle loop.
10: Anthropic claims (but provides no evidence) their model is soooo dangerous that mere plebs can't handle the responsibility that comes with such enormous power
20: America wins the totally-not-a-war with Iran... again. (Are you sick of winning yet?)
30: goto 10

TheGogglesTheyDoNothing · 2026-06-10T03:11:01-0400

nash076 said:
If you're worried about a product's safety, you don't release it. They're not concerned about anything and none of these are really safeguards.

They're just trying to hype up how "powerful" their magic speak n' spell is. "Oh no stay back if you make it angry that monster could rip through those chains and kill us all!" Accepting the claim at face value is assigning trust to a company and industry known for lies and hyperbole.

Disappointing.

Whether the Mythos actually has the capacity that they've claimed in their marketing questions is certainly dubious, but i'm not sure I agree with this take either.

The idea that companies uniformly "don't release products if they're worried about its safety" is pretty laughable. The US has multiple lawsuits progressing because BigTech ghouls literally did not care that their platforms elicited suicidal thoughts in children. Actually, worse - based on the evidence submitted execs saw that as a marketable insecurity, especially for teenage girls.

taliska · 2026-06-10T03:19:48-0400

So I've come to what's likely going to be fairly common conclusion about all this stuff:

When the AI companies are responsible for generating all content (not just on the internet but everywhere), they've will have also become the de facto censors for all content everywhere too. Awesome.

I had a run in with this myself recently. I decided to pay for a single month with the runway video generation service, just to give it a go and in this particular case, see if i could generate a video that comments on the elimination of trans-rights in the UK.

I put in a non-risque prompt (which was later confirmed to be non-risque by the support person I elevated a complaint to) and used their agent to help plan out the shots I wanted to create, all for it to block me when I tried to actually get it to generate the thing.

In my particular case, I was effectively using the service as a extremely cheap film crew, but tell me, how does a film crew that won't do what you want it to do have any use?

I've still got credits left to use up, but after that experience I haven't logged back on (I've been keeping the tab open to remind myself that I should use them up, but my enthusiasm is severely lacking).

I can't say I've got any interest in going back to the site. Why would i? It's a waste of time if it won't output what I want it to output because my creative vision gets filtered through their lens of what's acceptable.

Obviously that doesn't mean AI won't have it's uses, but I don't remember everyone on the planet getting together and agreeing that they could consume all our free and open data, just to turn around and charge us for a heavily censored product.

Ad-tagline: "Say hi to your new AI buddy!"*

* That will only do things it's want to do and if you don't like it, don't worry, we can subtly bias the information you receive from it over the next few months of usage, so eventually it won't even occur to you to want to do the things you originally wanted to do.

oomu · 2026-06-10T05:18:55-0400

no

It is simply a way for the US government and US companies to use these softwares as a weapon against other countries, particularly European ones.

They decided to keep these developments to themselves, on the pretext of protecting against ‘bad actors’.

This serves as a reminder that we need to sever our ties with the United States and develop our own IT industries – something I have been hoping for for two decades now...

A powerful European IT industry with considerable resources would be beneficial for us (the loathsome foreigner who isn’t American…), and beneficial for IT (more choice, new hardware and software)

To be honest, we should be far more nationalistic and stand up to both the United States and Russia. These ‘AI’ systems prove ONCE AGAIN that the United States is working against us.

AlexEnder · 2026-06-10T05:30:13-0400

The whole guardrail argument is counterproductive to the use of AI. I rely heavily on claude for our company code reviews and rapid prototyping software solutions. Up till 4.6 we didn’t have any problems . 4.8 started to show spontaneous API errors for Access violations and AUP violations when working on our own code review. It became unusable.
Tried Fable 5 today, asked it to do a basic MR review for known npm package infections. Worked 5 min , spent $15 in api tokens , and told me it has API error for security violations.
Only way around it is to exit session, Then downgrade to 4.6 to complete the work.
It’s completely broken all workflows . Next management meeting this week we will be considering competitive alternatives to Anthropic.

AverageDutchGuy · 2026-06-10T05:33:23-0400

So how many prompts will it take to bypass their guardrails? Two? Three?

JoHBE · 2026-06-10T05:50:41-0400

"The company says it will be periodically expanding its existing Project Glasswing program “in consultation with the US government”"

They now face the same problem as European NATO generals...

JoHBE · 2026-06-10T05:54:21-0400

cpragman said:
“1,000 hours of red-team testing” doesn’t actually sound like a lot.

How many GigaWatts is this?

JoHBE · 2026-06-10T06:13:55-0400

ForSure said:
It feels like we're stuck in a news cycle loop.
10: Anthropic claims (but provides no evidence) their model is soooo dangerous that mere plebs can't handle the responsibility that comes with such enormous power
20: America wins the totally-not-a-war with Iran... again. (Are you sick of winning yet?)
30: goto 10

Wildcard files for the war-region and AI company, and you're good to go full Nostradamus...

Landak · 2026-06-10T06:42:19-0400

I genuinely can't use Fable for work. I'm a medical physicist. I use the word nuclear a lot (although usually in the context of NMR / MRI and occasionally in reference to relativistic particles). Opus is fine (well, 99% of the time - I've certainly hit the CBRN filters a few times and even been invited to email anthropic about the false positives).

Fable has literally refused to work on any of my problems (even those about fluid dynamics!) and just tells me that I'm violating anthropic's AUP. I've reached out to their support and don't expect to hear anything sensible back. One thing I do look forward to though is OpenAI offering an equivalent model but with less safeguards...

PermanentDonut · 2026-06-10T07:53:12-0400

Biology? Chemistry? Damn, I must be a menace to society everyday and not realize it

thgy · 2026-06-10T08:22:50-0400

Marlor_AU said:
It's all about scale now.

Always has been

If anything, I'd thought they were worried about the limits of scaling and getting more creative lately, but evidently that's not the case

RZetopan · 2026-06-10T09:05:58-0400

SteveBigandTall said:
How many tiomes[sic] will you folks and other tech media writers just take all these prerss[sic] reelases[sic] as fact.

Good lord.

This product is reaaaaaaaly dangerous guyz[sic], but we'll let a few trusted folks have it!.

(You should not post when drunk, even with factual info.)

It is getting so super-intelligent that we can't let you find out just how intelligent it is! Only our lowly vibe coders can ever measure that because the knowledge could destroy anyone skilled in detecting so much ~~BS stacked into one giant pile~~ intelligence! It is even more intelligent than our great LLM AI leaders!* /S

*An accidentally factual claim, the bar is that low.

Deny_Deflect_Disavow · 2026-06-10T10:03:08-0400

ZippyPeanut said:
"The company writes that 'the same queries that are beneficial in the hands of cybersecurity professionals and biology researchers could be dangerous if available to malicious actors.' That puts Anthropic in the somewhat awkward position of having to judge who is and is not trustworthy enough to have access to a model that it says has potentially dangerous capabilities."

Presumably, then, the owners and creators of Anthropic have been judged to be "trustworthy enough to have access to a model that...has potentially dangerous capabilities."

More to the point, China, presumably Russia, and others (Israel?) will create LLM’s with nefarious? capabilities that exceed Anthropic and OpenAI and are more readily available. Therefore is this another case of ‘race to the bottom?’

morlen · 2026-06-10T10:12:57-0400

They're doing it again. "Our product is so advance it'll break the internet."

It's hype. It was hype last time when they said their prototype hacked the whole internet, and it's hype this time.

MilanKraft · 2026-06-10T10:40:45-0400

Megahedron said:
Friendly reminder that Anthropic has massively exaggerated the capabilities of Mythos as a cybersecurity tool and their claims of "84% successful exploitation rate" against Firefox turned out to be against a "testing harness mimicking a Firefox 147 content process, without the browser's process sandbox or other defense-in-depth mitigations." Even more fun, that success rate was massively inflated by Mythos repeatedly exploiting the same 2 bugs over and over, and removing those two dropped the success rate to under 5%.
[snip]

https://www.flyingpenguin.com/the-b...verification-is-collapsing-trust-in-anthropic

Jeeezus. Thanks for posting this.

I knew they were stretching the truth at the time, and in the days immediately after people (including respectable outlets like Ars and the Economist) were like, "well, we spoke to some third party experts and seems like this might actually be a thing".... and this is the problem with the information environment social media companies and others have created for us.... basically there's no time anymore to just let things sit for a bit so people can gather info and draw conclusions.

This full-on lie you're talking about (I'm sure other aspects of their claim were similarly inflated) likely was discovered a week or more after the Mythos hype had started. And by then ~ 80% of the reading public had moved on (as we always do), accepting all Anthropic's claims as truth and "wow these new apps are super-powerful-amaze-balls!"

Pretty fucking sad when the ostensibly "ethically more careful" LLM company, is basically manufacturing huge security / "our model so powerful, bro!!¡¡" claims out of literal thin air. Even more sad when journalists don't have the time to find out properly "is this even sorta true" before reporting it to the world. Because if they try to do that, by the time they're ready to report the world has literally moved on.

This is a very dangerous place for society to be right now, and hugely magnified by all the populist, wannabe tyrant assholes taking positions of power in the world (especially here).

All of this is...... not great.

Misanthropic is the way.

aapis · 2026-06-10T10:52:11-0400

Anthropic exclusively lies about the capabilities of their products. Fraudulent company run by trash humans. It would be best if you just laughed at these people and moved on with your lives.

Anthropic says these topics are too dangerous to let its Fable 5 model talk about

Wise, Aged Ars Veteran

Wise, Aged Ars Veteran

Ars Centurion

Ars Centurion

Smack-Fu Master, in training

Wise, Aged Ars Veteran

Wise, Aged Ars Veteran

Ars Centurion

Ars Praefectus

Ars Centurion

Ars Tribunus Angusticlavius

Ars Scholae Palatinae

Ars Praefectus

Smack-Fu Master, in training

Markets whipsaw as AI sell-off resumes​

Ars Praefectus

Ars Scholae Palatinae

Ars Centurion

Ars Tribunus Angusticlavius

Ars Tribunus Angusticlavius

Smack-Fu Master, in training

Ars Tribunus Angusticlavius

Ars Tribunus Angusticlavius

Ars Centurion

Smack-Fu Master, in training

Wise, Aged Ars Veteran

Ars Praetorian

Smack-Fu Master, in training

Ars Tribunus Militum

Ars Praefectus

Ars Praefectus

Ars Praefectus

Ars Centurion

Wise, Aged Ars Veteran

Seniorius Lurkius

Ars Tribunus Angusticlavius

Wise, Aged Ars Veteran

Smack-Fu Master, in training

Ars Tribunus Angusticlavius

Ars Tribunus Militum

Markets whipsaw as AI sell-off resumes