$2,500 bug bounty for real-world AI misalignment
The Synthetic Outlaw has launched a bug bounty program offering up to $2,500 for documented instances of AI misalignment. This initiative aims to identify and catalog behaviors of AI systems that diverge from human intent or safety constraints. Developers are encouraged to submit real-world examples across various AI systems, contributing to a public record for safer AI development.
- ▪The bounty program rewards developers for high-quality submissions of AI misalignment cases.
- ▪Categories of misalignment include goal misgeneralization, deceptive behavior, and reward hacking.
- ▪Submissions must include detailed documentation of the AI system, observed behavior, and potential impact.
Opening excerpt (first ~120 words) tap to expand
🤖 The Synthetic Outlaw — AI Misalignment Bug Bounty Calling all developers: help us document, expose, and catalog AI misalignment in the wild. 💰 Bug Bounty Program — Up to $2,500 per Case We're running a cash bounty program for developers who submit high-quality, documented instances of AI misalignment. The best submissions will be selected for awards of up to $2,500. This isn't a typical security bug bounty. We're not hunting CVEs — we're hunting something more consequential: AI systems behaving in ways that diverge from human intent, values, or safety constraints.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at GitHub.