access the Fable 5 model at a cost of $10-per-million input tokens and $50-per-million output tokens
I just tried this out on a research project that compares pathogen profiles of samples across various geographic regions, and it flagged an innocent stats question as a violation of their usage policy. It wasn't even referencing pathogen species, just broad categories (bacteria, fungi, etc.) Seems like this model is totally useless to me.Banning biotech is ridiculous. Ban obviously malicious queries, but this is too much.
Our team of 25 engineers spent all week with it!“1,000 hours of red-team testing” doesn’t actually sound like a lot.
You addicted to seeing how the ultra rich build their bunkers too?As usual, the billionaire class is keeping the sarin nerve gas and pipe bombs all for themselves.
The alternative is corporate or billionaire gatekeepers. Sigh… the government was all we had. I fear the future will be quite dark.Chinese AI engines (DeepSeek) won't talk honestly about Tiananmen Square, 1989.
And if Trump inserts federal government ownership into U.S. AI companies, those chatbots won't speak freely on many topics as well.
Trump explores federal government acquiring shares in AI companies
The way I see it is that model advances are getting more and more expensive. Recent advances have increasingly relied on scaling up rather than core architectural improvements. Mythos has more than 10 trillion parameters. Compare this to GPT5's estimated 1.8 trillion parameters, or GPT-3.5's estimated 200 billion parameters. It's all about scale now.I feel like the whole "our LLM is too dangerous to release" schtick is kind of played out. It seems more like a combination of hype and CYA for when stupid people do stupid things using the LLM (the first stupid thing being trusting the LLM to produce accurate results).
Can't this just collapse already?

Editing quotes is problematic, at best.If we've learned anything, I'm convinced we've learned that these two words don't belong adjacent to each other, unless you define the term "AI" so broadly as to be completely empty and useless.
You can probably say that about "[tech thing] inevitably..." in general.
What are the bets that the "Red team" was Mythos?“1,000 hours of red-team testing” doesn’t actually sound like a lot.
No, just hold on a bit longer, so we get better models that we can run locally! (well, -ish). THEN everything can burn down.Can't this just collapse already?
Creationists, and their children, need to be protected from the science of evolution lest they lose their fierce convictions by accidentally discovering reality. /SBiology? Now we'll never know How Is Babby Formed.
But our latest model is vastly superior to our previous vastly superior model! It even passes tests that we created specifically for allowing it to pass! Progress! Our profits are up nearly 1,600%.*Here we go again. Get used to it, folks.
This is part of the new business model... has little to do with the model being somehow amazingly more powerful than whichever ones came immediately before it. They're not.
These ridiculous PR games are obviously designed and timed (for now) both to hype their own product and to draw attention away from something competing companies do. A few days ago there were articles about OpenAI officially filing for their IPO... I'm certain this announcement has nothing at all to do with that. /s
And last time it was "our model is so dangerous on the security front we're only sharing it with the top 50 companies," even giving the sharing a project code-name (they don't make a roll-eyes big enough for that one). Then 3 days later, OpenAI releases model with a "me too! me too!!" announcement for the exact same thing.
Fuck these clowns. Hopefully, once the IPOs are done, these nothingburger announcements and idiotic one-up-manships will at least slow down some.
Irony is an alien concept to obscurantists. One very large company that I worked for (near 100K employees) had an IT department that totally ignored a worm that was ravaging the internet at the time. After 3 days of total computer systems shutdowns from the worm, IT finally recovered. And then they proceeded to give themselves great awards for what they had done. I don't think that the clueless C-suit management ever caught on. After all, it is only fair that the perpetrators get to award themselves for a job well done.That's the cool part! They get to do the judging, so they judge themselves trustworthy.
Whether the Mythos actually has the capacity that they've claimed in their marketing questions is certainly dubious, but i'm not sure I agree with this take either.If you're worried about a product's safety, you don't release it. They're not concerned about anything and none of these are really safeguards.
They're just trying to hype up how "powerful" their magic speak n' spell is. "Oh no stay back if you make it angry that monster could rip through those chains and kill us all!" Accepting the claim at face value is assigning trust to a company and industry known for lies and hyperbole.
Disappointing.
“1,000 hours of red-team testing” doesn’t actually sound like a lot.
It feels like we're stuck in a news cycle loop.
10: Anthropic claims (but provides no evidence) their model is soooo dangerous that mere plebs can't handle the responsibility that comes with such enormous power
20: America wins the totally-not-a-war with Iran... again. (Are you sick of winning yet?)
30: goto 10
Always has beenIt's all about scale now.
(You should not post when drunk, even with factual info.)How many tiomes[sic] will you folks and other tech media writers just take all these prerss[sic] reelases[sic] as fact.
Good lord.
This product is reaaaaaaaly dangerous guyz[sic], but we'll let a few trusted folks have it!.
More to the point, China, presumably Russia, and others (Israel?) will create LLM’s with nefarious? capabilities that exceed Anthropic and OpenAI and are more readily available. Therefore is this another case of ‘race to the bottom?’"The company writes that 'the same queries that are beneficial in the hands of cybersecurity professionals and biology researchers could be dangerous if available to malicious actors.' That puts Anthropic in the somewhat awkward position of having to judge who is and is not trustworthy enough to have access to a model that it says has potentially dangerous capabilities."
Presumably, then, the owners and creators of Anthropic have been judged to be "trustworthy enough to have access to a model that...has potentially dangerous capabilities."
Jeeezus. Thanks for posting this.Friendly reminder that Anthropic has massively exaggerated the capabilities of Mythos as a cybersecurity tool and their claims of "84% successful exploitation rate" against Firefox turned out to be against a "testing harness mimicking a Firefox 147 content process, without the browser's process sandbox or other defense-in-depth mitigations." Even more fun, that success rate was massively inflated by Mythos repeatedly exploiting the same 2 bugs over and over, and removing those two dropped the success rate to under 5%.
[snip]
https://www.flyingpenguin.com/the-b...verification-is-collapsing-trust-in-anthropic