Artificial Intelligence

Anthropic Launches Claude Fable 5: Mythos-Class AI With Cybersecurity Guardrails

The AI giant also announced that Project Glasswing partners are being given access to the upgraded Mythos 5.

Eduard Kovacs

Published

4 days ago

Claude security

Anthropic on Tuesday announced the general availability of Claude Fable 5, a powerful Mythos-class AI model engineered with new safeguards that specifically restrict its use in high-risk domains, including cybersecurity.

The AI giant says this marks the first time a model of this capability class has been deemed safe enough for widespread public and developer access.

While Fable 5 demonstrates good performance — surpassing prior models in software engineering, knowledge work, vision, and long-running tasks — the company prioritized safety by implementing targeted blocks.

In sensitive areas such as cybersecurity and biology, the model automatically falls back to the less capable Claude Opus 4.8 to prevent potential misuse. Early usage data indicates that at least 95% of sessions run entirely on Fable 5’s capabilities without triggering any fallback.

“The uplift from Mythos-level capabilities is valuable to many adversaries — for instance, those who could financially gain from cyberattacks — and we therefore expect them to be motivated to try to circumvent our safety measures,” Anthropic noted.

The company emphasized the rigor of its safety measures. It conducted extensive internal red-teaming of its classifiers, followed by an external bug bounty program spanning over 1,000 hours that yielded no universal jailbreaks.

Advertisement. Scroll to continue reading.

Independent external red-teaming also failed to uncover critical bypasses, underscoring the robustness of the safeguards against adversarial attempts to achieve restricted outputs.

Project Glasswing partners gain upgraded Mythos 5

Anthropic also announced on Tuesday that trusted users, including its cybersecurity partners in Project Glasswing, are being upgraded from Claude Mythos Preview to Claude Mythos 5.

The company plans to gradually expand this high-privilege access through a structured trusted-access program.

Anthropic announced recently that it’s expanding Project Glasswing to add roughly 150 new organizations.

The AI giant has not listed the new additions, but several cybersecurity and tech companies have since announced their participation in the project, including Dragos, Tenable, TrendAI (Trend Micro), Netskope, BeyondTrust, Rubrik, BT, Intercontinental Exchange, and Hitachi.

Both Fable 5 and Mythos 5 are priced at $10 per million input tokens and $50 per million output tokens, with the former available immediately via the Claude API for developers.

In this article:AI, Anthropic, Claude, Claude Fable, Featured, Mythos

Artificial Intelligence

Anthropic Says It Has Taken Its Latest AI Models Offline to Comply With New Export Controls

Anthropic takes Fable 5 and Mythos 5 offline to comply with a directive from the Trump administration to prevent use by foreign nationals.

Associated Press16 hours ago

Artificial Intelligence

Industry Reactions to Claude Fable 5: Feedback Friday

Industry professionals comment on various aspects of Fable 5, including dual-use capabilities, safeguards, and tiered access.

Eduard Kovacs1 day ago

Artificial Intelligence

Anthropic Disputes Fable 5 AI Jailbreak

An AI hacker claims to have achieved a prompt-based jailbreak shortly after Fable 5’s launch, but Anthropic says it’s not a real jailbreak.

Eduard Kovacs2 days ago

Incident Response

Alert Fatigue Is Becoming a Security Threat of Its Own

As alert volumes outpace human capacity, organizations are turning to AI, automation, and deeper context to separate real threats from the noise.

Kevin Townsend2 days ago

Application Security

After AI Reaches Production: 12 Ways Security Teams Can Take Control

Security teams need more than visibility into AI applications, they need a repeatable framework for monitoring, investigating, and defending them in production.

Joshua Goldfarb3 days ago

Vulnerabilities

OpenSSL Patches High-Severity Vulnerability Found With AI

A total of 18 vulnerabilities have been patched in the latest OpenSSL releases, including many that were potentially discovered by AI.

Eduard Kovacs4 days ago

Artificial Intelligence

Claude Mythos Turns N-Days Into N-Hours With Rapid Exploit Creation

Public LLM models with safeguards turned off can also build working exploits, increasing patch gap risks.

Ionut Arghire4 days ago

Application Security

New Platform Uses Cryptographic Invisibility to Protect AI-Built Applications

Atsign’s AI Architect applies cryptographic protections to agentic software development, aiming to prevent attackers from exploiting vulnerabilities by making application identities effectively invisible.

Kevin Townsend4 days ago

SecurityWeek

Artificial Intelligence

Anthropic Launches Claude Fable 5: Mythos-Class AI With Cybersecurity Guardrails

Project Glasswing partners gain upgraded Mythos 5

Related Content

Artificial Intelligence

Anthropic Says It Has Taken Its Latest AI Models Offline to Comply With New Export Controls

Artificial Intelligence

Industry Reactions to Claude Fable 5: Feedback Friday

Artificial Intelligence

Anthropic Disputes Fable 5 AI Jailbreak

Incident Response

Alert Fatigue Is Becoming a Security Threat of Its Own

Application Security

After AI Reaches Production: 12 Ways Security Teams Can Take Control

Vulnerabilities

OpenSSL Patches High-Severity Vulnerability Found With AI

Artificial Intelligence

Claude Mythos Turns N-Days Into N-Hours With Rapid Exploit Creation

Application Security

New Platform Uses Cryptographic Invisibility to Protect AI-Built Applications