Anthropic releases Claude Fable 5, its most powerful AI: model security fears and risks

Credit: Anthropic.

In the past few hours, Anthropic has made Claude Fable 5 accessible to the general public, the first public version of its flagship model from the Mythos family, until now reserved for a few selected partners due to concerns related to cybersecurity. The one taken by the Amodei brothers’ startup represents a significant step: Fable 5 is the most capable model that Anthropic has ever distributed on a large scale, with superior performance in almost all the main artificial intelligence benchmarks, from software engineering to scientific analysis, from computer vision to complex reasoning on documents.

But the launch comes under very specific conditions: in high-risk sectors such as cybersecurity, biology and chemistry, the model does not respond directly and automatically refers to Claude Opus 4.8, its predecessor. At the same time, Anthropic also distributed Claude Mythos 5, the version of the same model with some of these restrictions relaxed, to a limited group of approved organizations. Overall, this is a release that opens up new possibilities, but also raises important questions about how to handle such powerful technologies.

Fable 5 is available via the Anthropic API and Enterprise plans, with a free access window for the Pro, Max, Team and Enterprise plans until June 22, 2026. From the following day it will be necessary to purchase usage credits, at a cost of 10 dollars per million input tokens and a whopping 50 dollars per million output tokens, double that of Opus 4.8. The high price alone could limit massive use.

The capabilities of Fable 5: It’s an entire development team

In terms of capabilities, the results are notable. Stripe, the popular platform that allows individuals and businesses to send and receive payments online, previewed the model and reported that Fable 5 compressed months of engineering work into just a few days: on a code base of 50 million lines in the Ruby language, the model completed a migration in one day that would have taken an entire team over two months. Even in visual capabilities, where AI models must “see” and interpret images, the leap is evident: Fable 5 managed to complete the video game Pokémon FireRed using only screenshots of the game, without maps or additional information, a task that previous versions couldn’t accomplish even with support tools.

Looking at the cold benchmark numbers, all the potential of the model emerges. Taking the SWE-Bench Pro test as a reference, for example, Fable 5 reaches a score that is just over 80%, clearly distancing both Claude Opus 4.8 (Anthropic’s most advanced AI model so far, stuck at around 67%), and the equivalent models from OpenAI and Google. The advantage also tends to widen as the tasks become more complex and prolonged.

The Mythos 5 variant displays even more advanced scientific capabilities. In drug design, the model matched or surpassed experienced human researchers and identified nine protein targets, of which 14 produced viable candidates for new drug design (still under study). Another example that makes the power of the model clear: in just a week Mythos 5 conducted genomic research on millions of cells belonging to 138 animal species in almost total autonomy, producing results that surpassed a model published in the scientific journal Sciencedespite being a hundred times smaller.

Proteins designed by Mythos 5 to study immunity, cell growth, neurodegeneration and muscle diseases. Credit: Anthropic.

Will Anthropic’s guardrails be enough?

However, these exciting data are accompanied by doubts and fears regarding the safety of the model. Days ago Anthropic launched a public appeal for the main AI laboratories to coordinate a brake on the development of frontier models, i.e. the most advanced artificial intelligence systems available, warning that this technology could soon achieve the ability to self-improve without human intervention. The subsequent announcement relating to the availability of Fable 5 has for this reason surprised everyone a bit, users and professionals. Also because Anthropic, in announcing the news, did not hide the security fears linked to this technology, and in fact admitted:

Releasing such a capable model comes with risks. Without security measures, Fable 5 features in areas such as cybersecurity could be misused to cause serious harm.

However, Anthropic reassured that it had conducted over 1,000 hours of stress tests without finding “universal jailbreaks”, i.e. methods to systematically bypass security filters. However, recognizing that new attacks may still emerge, Anthropic has introduced a mandatory 30-day data retention policy for all traffic generated by the model, even for customers who previously had “zero retention” agreements. The data, Anthropic specifies, will not be used to train other models, but only to «defend against complex and new attacks, including new jailbreaks» and for «identify and reduce false positives».