38 stories tagged with #ai-safety, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
⌘ RSS feed for this tag → or search "Ai Safety"
Anthropic secretly limiting Claude's usefulness for LLM development strengthens the argument that Anthropic is using AI safety to justify monopolistic behavior (Dean W. Ball/@deanwball)
Dean W. Ball / @deanwball : Anthropic secretly limiting Claude's usefulness for LLM development strengthens the argument that Anthropic is using AI safety to justify monopolistic b…
A profile of Anthropic as it prepares to go public and broaden access to Mythos, amid criticism that commercial pressures have eroded its AI safety standards (Madhumita Murgia/Financial Times)
Madhumita Murgia / Financial Times : A profile of Anthropic as it prepares to go public and broaden access to Mythos, amid criticism that commercial pressures have eroded its AI sa…
In policy paper, OpenAI diverges from White House on AI safety - SiliconANGLE
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
OpenAI diverges from White House on AI safety rules
OpenAI diverges from White House on AI safety rules - Politico
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Trump's AI Safety Order Is a Voluntary Form You Don't Have to Fill Out
Trump signed a new AI executive order on June 2 asking companies to voluntarily submit frontier models for government review. They can say no.…
As OpenAI Heads For IPO, Sam Altman Says Trump’s AI Safety Order ‘Gets The Balance Right’ - Yahoo Finance
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
As OpenAI Heads For IPO, Sam Altman Says Trump’s AI Safety Order ‘Gets The Balance Right’ - Stocktwits
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Polymarket assigns 13% chance for US AI safety bill by 2027
Polymarket traders give just 13% odds that the US will pass a federal AI safety bill before 2027, reflecting deep skepticism about Congressional action on AI regulation.…
Diverging AI safety approaches: OpenAI enters Japanese banking defenses, while Anthropic’s model remains restricted to controlled evaluations - Moomoo
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Illinois Lawmakers Just Passed America’s Strongest AI Safety Bill
The bill requires companies like OpenAI, Anthropic, and Google to have third parties confirm they’re following safety standards. Illinois Governor JB Pritzker says he’ll sign.…
Opinion: OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman. - MarketWatch
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman.
Opinion: OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman. - MarketWatch
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Position: AI Safety Requires Effective Controllability
AI safety is still largely framed as alignment: training models to follow human preferences, safety policies, and normative constraints. That framing has improved the behavior of m…
Meta and Google AI safety controls can be stripped in minutes
Financial Times testing with AI safety group Alice found that safety guardrails on Meta's Llama 3.3 and Google's Gemma 3 can be stripped in under 10 minutes.…
Cognitive Security as an AI Safety Cause Area
As AI systems become more capable, the cognitive security of humans will be increasingly at risk. By cognitive security, I mean the ability of humans……
An AI safety safe harbor [pdf]
A look at the UK's AI Safety Institute, whose researchers probe AI models for safety gaps, as its work becomes a blueprint for other governments' AI policies (New York Times)
OpenAI is looking for an AI safety researcher with a salary up to $445,000 per year - en.ain.ua
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
OpenAI offers up to $445,000 for AI safety role as focus shifts to self-improving systems - WION
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Why OpenAI is paying $445,000 for a 'tasteful and strategic' AI safety researcher - The Times of India
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
OpenAI is paying up to $445,000 for AI safety judgment - Startup Fortune
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Q&A with Sundar Pichai on the future of Google Search, Google's place in the AI race, public skepticism toward AI, AI agents, AI safety, TPUs, and more (New York Times)
New York Times : Q&A with Sundar Pichai on the future of Google Search, Google's place in the AI race, public skepticism toward AI, AI agents, AI safety, TPUs, and more — After a b…
Daily Digest: OpenAI safety executive departs, Sonoma house Jack London built lists - The Business Journals
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Wake-Up Call: Why AI Safety Guardrails Break Under Pressure
This is a submission for the Google I/O Writing Challenge This is a submission for the Google I/O...…
Trump scrapped a major AI safety plan — here’s why that matters for ChatGPT users
Trump just sent a clear message about AI: move fast and regulate later…
Microsoft storms RAMPART, adds Clarity to agentic AI safety
Redmond open sources two tools for building and maintaining safer agents…
AI Safety Is Underfunded by Design: Model for Incentive-Aligned AI Safety Policy
A Model for Incentive-Aligned AI Safety Policy…
OpenAI's Chris Lehane says he is pursuing "reverse federalism", lobbying blue states to pass AI safety laws and create a de facto US standard, as DC dithers (Brendan Bordelon/Politico)
Brendan Bordelon / Politico : OpenAI's Chris Lehane says he is pursuing “reverse federalism”, lobbying blue states to pass AI safety laws and create a de facto US standard, as DC d…
Musk vs. Altman: AI safety cannot be one man’s job
The Oakland trial was a fight between two billionaires offering themselves as the guarantors of AI’s future. We deserve a better answer.…
Engineering a Safer World: Risk Modelling – and Safety Engineering? – For AI Lo
Engineering a safer world. • I, and I imagine many of my readers, are eager to contribute to that effort[1]. ……
LLM Evals Are Based on Vibes — I Built the Missing Layer That Decides What Ships
Most LLM evaluation systems rely on vague scoring and human judgment disguised as metrics. I built a lightweight evaluation layer in pure Python that turns LLM outputs into reprodu…
A personal letter on transformative AI
Making sense of rapid AI progress…
Agent Behavioral Contracts
Traditional software relies on contracts -- APIs, type systems, assertions -- to specify and enforce correct behavior. AI agents, by contrast, operate on prompts and natural langua…
Meet the AI jailbreakers: ‘I see the worst things humanity has produced’
To test the safety and security of AI, hackers have to trick large language models into breaking their own rules. It requires ingenuity and manipulation - and can come at a deep em…
LLM Budget Guard – open-source runtime cutoff for OpenAI/Anthropic
Alerts won't stop a runaway agent at 3 AM. Budget Guard enforces hard token cutoffs across OpenAI, Anthropic & DeepSeek before bans or surprise invoices.…
SupraWall – Runtime Policy Enforcement for AI Agents
The open-source security layer for AI agents. Deterministic guardrails, PII redaction, and EU AI Act compliance in one line of code. - wiserautomation/SupraWall…