Our evaluation of OpenAI's GPT-5.5 cyber capabilities

Simon Willison· Apr 30, 2026 · 11:03 PM UTC ·1 min read · 0 reactions · 0 comments · 8 views

via

Simon Willison's Weblog

Our evaluation of OpenAI's GPT-5.5 cyber capabilities The UK's AI Security Institute previously evaluated Claude Mythos : now they've evaluated GPT-5.5 for finding security vulnerability and found it to be comparable to Mythos, but unlike Mythos it's generally available right now. Tags: ai , openai , generative-ai , llms , anthropic , claude , ai-security-research , gpt

Original article

Simon Willison's Weblog · Simon Willison

Read full at Simon Willison's Weblog →

Opening excerpt (first ~120 words) tap to expand

Our evaluation of OpenAI's GPT-5.5 cyber capabilities. The UK's AI Security Institute previously evaluated Claude Mythos: now they've evaluated GPT-5.5 for finding security vulnerability and found it to be comparable to Mythos, but unlike Mythos it's generally available right now.

Excerpt limited to ~120 words for fair-use compliance. The full article is at Simon Willison's Weblog.

Anonymous · no account needed

Discussion

0 comments

Our evaluation of OpenAI's GPT-5.5 cyber capabilities

Discussion

More from Simon Willison's Weblog