2 stories tagged with #red-teaming, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
⌘ RSS feed for this tag → or search "Red Teaming"
RELATED TAGS
THE VERGE
Researchers gaslit Claude into giving instructions to build explosives
Anthropic has spent years building itself up as the safe AI company. But new security research shared with The Verge suggests Claude's carefully crafted helpful personality may its…
MICROSOFT RESEARCH
Red-teaming a network of agents: Understanding what breaks when AI agents interact at scale
Safe agents don’t guarantee a safe ecosystem of interconnected agents. Microsoft Research examines what breaks when AI agents interact and why network-level risks require new appro…