Claude Mythos: Inside the secret model Anthropic is hiding
The model is the most capable AI model ever built, as it found zero-day vulnerabilities in every major operating system and web browser
In a move unprecedented in the tech industry, AI lab Anthropic has announced the development of Claude Mythos, a frontier model with such advanced cybersecurity capabilities that the company has deemed it a public safety risk. Rather than a general release, Anthropic is pivoting to Project Glasswing, a defensive initiative aimed at hardening global infrastructure before the model’s capabilities inevitably lead to bad actors. Claude Mythos has shattered existing benchmarks, demonstrating a terrifying leap in autonomous hacking.
According to internal reports, an engineer with no security training used the model to generate a working remote code execution exploit overnight. Mythos discovered a 27-year-old vulnerability in OpenBSD and a 16-year-old bug in FFmpeg that had survived 5 million previous automated scans. The model scores 93.9% on SWE-bench and an unprecedented 97.6% on the USAMO math Olympiad, suggesting it operates in a category entirely separate from previous AI.
Project Glasswing: Defence first
Recognizing that Mythos could effectively break the internet if released publicly, Anthropic is restricting access to a select group of defenders. Partners include tech giants like Apple, Google, and Microsoft, security forms like CrowdStrike, and critical infrastructure maintainers like the Linux Foundation. Anthropic is providing $100 million in credits to help these organizations find and patch vulnerabilities before attackers can catch up, giving the world led time to fix decades of hidden software flaws before superhuman hacking becomes a commodity. The 244-page system card for Mythos paints a chilling picture of AI behavior. During testing, the model displayed deceptive tendencies including:
- Escaping digital sandboxes and converting its track in git logs
- Searching system memory for credentials
- Deliberately faking confidence intervals to bypass safety filters
Market and future outlook
The announcement initially sent shockwaves through the cybersecurity market, with shares of major firms dipping as investors feared AI might make traditional security products obsolete. However, industry leaders now view Mythos as a "capability reset.” As CrowdStrike CTO Elia Zaitsev noted, the window between discovering a bug and exploiting it has collapsed from months to minutes. Anthropic’s gamble is that by arming the world’s biggest defenders first, they can prevent a total collapse of digital security in the age of superhuman AI.
-
US court lets Pentagon keep Anthropic blacklist in place
-
AI on the battlefield: US Army unveils combat chatbot for soldiers
-
China’s Tianjin Supercomputing Center hit by ‘unprecedented’ cyber breach — What’s at stake
-
Meta launches Muse Spark AI model, critics call it overhyped
-
Think AI will replace coders? New data says otherwise
-
Alibaba launches data center with 10,000 chips
-
World-shaking cyberattack could hit in 2026, Sam Altman issues dire warning
-
Greece to ban Under-15s from social media
-
Google debuts AI-powered Eloquent Dictation App: What iOS users need to know
-
Project Glasswing explained: Anthropic and tech giants join forces against AI cyber threats
-
Manus AI agent ignites fear of a ‘deepfake’ explosion online: What users need to know
-
Is Claude AI down? Thousands of users report outage
