Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits capabilities for AI researchers and developers
A paragraph buried in Fable 5's 319-page system card revealed the model would silently downgrade its responses for certain AI development work — without telling users.
Opening excerpt (first ~120 words) tap to expand
When Anthropic made its first Mythos-tier model available to the general public yesterday, called Claude Fable 5, Fortune reported it was a “considerable step” for the lab, coming just over a week after the company confidentially filed for IPO paperwork. It had initially deemed Mythos-class models too dangerous to release, citing their significantly enhanced ability to identify software vulnerabilities, but said it was now confident new guardrails in Claude Fable 5 are enough to ensure these dangerous skills don’t fall into the wrong hands.Recommended Video Just hours after the model’s release, however, major backlash from AI researchers, developers and policy experts began brewing on social media.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Fortune.