#ai-safety — Tagged Stories

Every story in the WeSearch catalog tagged with #ai-safety, chronological, with view counts. Subscribe to the per-tag RSS feed to follow this topic in your reader of choice.

51 stories tagged with #ai-safety, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.

⌘ RSS feed for this tag → or search "Ai Safety"

RELATED TAGS

#ai4 #cybersecurity3 #open-source2 #autonomous-agents2 #hugging-face2 #openai2 #cost-management1 #llm-operations1 #compliance1 #runtime-policy1 #jailbreaking1 #mental-health1

THE VERGE

We’re running out of reasons to ignore AI safety

In the aftermath of OpenAI’s attack on Hugging Face, experts say it’s time for everyone to take security far more seriously.…

5 views · Wed, 29 Jul 2026 11:00:00 GMT

#running #reasons #ignore

ABC NEWS (AUSTRALIA)

AI just had its Sarah Connor moment. Is Australia ready?

Far-fetched scenarios about artificial intelligence became more realistic last week when an OpenAI model hacked another company.…

5 views · Tue, 28 Jul 2026 03:35:10 GMT

#australia #cybersecurity

GOOGLE NEWS

Tech giants launch open AI safety plan following breach - NBC Bay Area

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

5 views · Tue, 28 Jul 2026 02:28:47 GMT

GOOGLE NEWS

Nvidia, SpaceX, Microsoft launch AI safety initiative as OpenAI cyberattack fallout continues - oodaloop.com

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

14 views · Mon, 27 Jul 2026 14:43:52 GMT

EDUCATEDGUESSWORK

What policy makers need to know about AI safety and security

9 views · Mon, 27 Jul 2026 14:03:08 GMT

#what #policy #makers

GOOGLE NEWS

Nvidia Forms AI Safety Alliance Following OpenAI Cyberattack - PYMNTS.com

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

7 views · Mon, 27 Jul 2026 14:01:33 GMT

TECHMEME

Nvidia launches the Open Secure AI Alliance, a coalition with Hugging Face and others to develop and share tools for AI safety and cybersecurity (Jaspreet Singh/Reuters)

Jaspreet Singh / Reuters : Nvidia launches the Open Secure AI Alliance, a coalition with Hugging Face and others to develop and share tools for AI safety and cybersecurity — Nvidia…

12 views · Mon, 27 Jul 2026 12:15:01 GMT

CNBC — TOP

Nvidia, SpaceX, Microsoft launch AI safety initiative as OpenAI cyber attack fallout continues

Microsoft, SpaceX, Palantir, alongside dozens of other tech companies from the U.S. and Europe, have joined the Open Secure AI Alliance.…

12 views · Mon, 27 Jul 2026 11:04:48 GMT

#nvidia #spacex #microsoft

NVIDIA BLOG

Industry Leaders Unite in Open Secure AI Alliance for AI Safety and Security

NVIDIA and founding members form new alliance to build and share open tools that promote responsible use of and trust in AI.…

9 views · Mon, 27 Jul 2026 09:00:07 GMT

#industry #leaders #unite

FORTUNE

AI safety experts say OpenAI’s rogue models may mean the company has already blown past its own internal red lines

Experts says the this month's Hugging Face hack likely triggered a risk threshold that OpenAI's own policies say require it to halt model development.…

11 views · Sat, 25 Jul 2026 17:05:14 GMT

#safety #experts #openai

GOOGLE NEWS

OpenAI President Endorses Musk’s Proposal For Industry Meetings on AI Safety - The Information

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

10 views · Fri, 24 Jul 2026 00:49:00 GMT

MIT TECHNOLOGY REVIEW

GPT-Red: an LLM super-hacker OpenAI built to make its models safer

Exclusive: The firm says it wants to future-proof its safety procedures and stay ahead of human attackers.…

78 views · Thu, 16 Jul 2026 07:00:56 GMT

#red teaming #prompt injection

GOOGLE NEWS

OpenAI Safety Boss Resigns in Latest Executive Departure - PYMNTS.com

OpenAI Safety Boss Resigns in Latest Executive Departure PYMNTS.com…

31 views · Sun, 12 Jul 2026 21:59:50 GMT

TECHMEME

Anthropic secretly limiting Claude's usefulness for LLM development strengthens the argument that Anthropic is using AI safety to justify monopolistic behavior (Dean W. Ball/@deanwball)

Dean W. Ball / @deanwball : Anthropic secretly limiting Claude's usefulness for LLM development strengthens the argument that Anthropic is using AI safety to justify monopolistic b…

50 views · Wed, 10 Jun 2026 15:55:56 GMT

TECHMEME

A profile of Anthropic as it prepares to go public and broaden access to Mythos, amid criticism that commercial pressures have eroded its AI safety standards (Madhumita Murgia/Financial Times)

Madhumita Murgia / Financial Times : A profile of Anthropic as it prepares to go public and broaden access to Mythos, amid criticism that commercial pressures have eroded its AI sa…

42 views · Thu, 04 Jun 2026 06:40:01 GMT

GOOGLE NEWS

In policy paper, OpenAI diverges from White House on AI safety - SiliconANGLE

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

50 views · Thu, 04 Jun 2026 01:56:48 GMT

POLITICO EUROPE

OpenAI diverges from White House on AI safety rules

40 views · Wed, 03 Jun 2026 18:18:19 GMT

GOOGLE NEWS

OpenAI diverges from White House on AI safety rules - Politico

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

41 views · Wed, 03 Jun 2026 17:59:07 GMT

DEV.TO (TOP)

Trump's AI Safety Order Is a Voluntary Form You Don't Have to Fill Out

Trump signed a new AI executive order on June 2 asking companies to voluntarily submit frontier models for government review. They can say no.…

25 views · Wed, 03 Jun 2026 08:20:41 GMT

#ai #policy #regulation

GOOGLE NEWS

As OpenAI Heads For IPO, Sam Altman Says Trump’s AI Safety Order ‘Gets The Balance Right’ - Yahoo Finance

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

31 views · Wed, 03 Jun 2026 04:03:35 GMT

GOOGLE NEWS

As OpenAI Heads For IPO, Sam Altman Says Trump’s AI Safety Order ‘Gets The Balance Right’ - Stocktwits

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

25 views · Wed, 03 Jun 2026 04:03:00 GMT

CRYPTO BRIEFING

Polymarket assigns 13% chance for US AI safety bill by 2027

Polymarket traders give just 13% odds that the US will pass a federal AI safety bill before 2027, reflecting deep skepticism about Congressional action on AI regulation.…

28 views · Sat, 30 May 2026 04:03:03 GMT

#ai #legislation #polymarket

GOOGLE NEWS

Diverging AI safety approaches: OpenAI enters Japanese banking defenses, while Anthropic’s model remains restricted to controlled evaluations - Moomoo

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

32 views · Fri, 29 May 2026 11:44:44 GMT

WIRED

Illinois Lawmakers Just Passed America’s Strongest AI Safety Bill

The bill requires companies like OpenAI, Anthropic, and Google to have third parties confirm they’re following safety standards. Illinois Governor JB Pritzker says he’ll sign.…

34 views · Thu, 28 May 2026 00:10:42 GMT

#ai #technology #regulation

GOOGLE NEWS

Opinion: OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman. - MarketWatch

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

19 views · Wed, 27 May 2026 11:50:00 GMT

MARKETWATCH — TOP STORIES

OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman.

28 views · Wed, 27 May 2026 11:50:00 GMT

GOOGLE NEWS

Opinion: OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman. - MarketWatch

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

25 views · Wed, 27 May 2026 11:50:00 GMT

ARXIV CS.AI

Position: AI Safety Requires Effective Controllability

AI safety is still largely framed as alignment: training models to follow human preferences, safety policies, and normative constraints. That framing has improved the behavior of m…

33 views · Wed, 27 May 2026 04:00:00 GMT

#ai #safety #controllability

CRYPTO BRIEFING

Meta and Google AI safety controls can be stripped in minutes

Financial Times testing with AI safety group Alice found that safety guardrails on Meta's Llama 3.3 and Google's Gemma 3 can be stripped in under 10 minutes.…

30 views · Tue, 26 May 2026 22:31:02 GMT

#ai #safety #technology

LESSWRONG

Cognitive Security as an AI Safety Cause Area

As AI systems become more capable, the cognitive security of humans will be increasingly at risk. By cognitive security, I mean the ability of humans……

30 views · Tue, 26 May 2026 07:09:30 GMT

#ai #cognitive security #psychology

WILLIAMRINEHART

An AI safety safe harbor [pdf]

32 views · Mon, 25 May 2026 20:30:44 GMT

TECHMEME

A look at the UK's AI Safety Institute, whose researchers probe AI models for safety gaps, as its work becomes a blueprint for other governments' AI policies (New York Times)

30 views · Mon, 25 May 2026 17:30:01 GMT

GOOGLE NEWS

OpenAI is looking for an AI safety researcher with a salary up to $445,000 per year - en.ain.ua

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

30 views · Mon, 25 May 2026 12:40:00 GMT

GOOGLE NEWS

OpenAI offers up to $445,000 for AI safety role as focus shifts to self-improving systems - WION

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

24 views · Sun, 24 May 2026 10:23:00 GMT

GOOGLE NEWS

Why OpenAI is paying $445,000 for a 'tasteful and strategic' AI safety researcher - The Times of India

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

28 views · Sun, 24 May 2026 06:07:00 GMT

GOOGLE NEWS

OpenAI is paying up to $445,000 for AI safety judgment - Startup Fortune

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

32 views · Sat, 23 May 2026 22:11:40 GMT

TECHMEME

Q&A with Sundar Pichai on the future of Google Search, Google's place in the AI race, public skepticism toward AI, AI agents, AI safety, TPUs, and more (New York Times)

New York Times : Q&A with Sundar Pichai on the future of Google Search, Google's place in the AI race, public skepticism toward AI, AI agents, AI safety, TPUs, and more — After a b…

23 views · Sat, 23 May 2026 06:35:01 GMT

GOOGLE NEWS

Daily Digest: OpenAI safety executive departs, Sonoma house Jack London built lists - The Business Journals

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

28 views · Fri, 22 May 2026 20:48:00 GMT

DEV.TO (TOP)

Wake-Up Call: Why AI Safety Guardrails Break Under Pressure

This is a submission for the Google I/O Writing Challenge This is a submission for the Google I/O...…

23 views · Fri, 22 May 2026 20:13:58 GMT

#ai #safety #technology

TOM'S GUIDE

Trump scrapped a major AI safety plan — here’s why that matters for ChatGPT users

Trump just sent a clear message about AI: move fast and regulate later…

20 views · Fri, 22 May 2026 15:38:51 GMT

#ai #technology #regulation

WWW.THEREGISTER.COM - ARTICLES

Microsoft storms RAMPART, adds Clarity to agentic AI safety

Redmond open sources two tools for building and maintaining safer agents…

46 views · Thu, 21 May 2026 10:30:00 GMT

#ai #technology #software

HACKER NEWS (AI / LLM)

AI Safety Is Underfunded by Design: Model for Incentive-Aligned AI Safety Policy

A Model for Incentive-Aligned AI Safety Policy…

35 views · Wed, 20 May 2026 17:07:44 GMT

#ai #safety #policy

TECHMEME

OpenAI's Chris Lehane says he is pursuing "reverse federalism", lobbying blue states to pass AI safety laws and create a de facto US standard, as DC dithers (Brendan Bordelon/Politico)

Brendan Bordelon / Politico : OpenAI's Chris Lehane says he is pursuing “reverse federalism”, lobbying blue states to pass AI safety laws and create a de facto US standard, as DC d…

37 views · Wed, 20 May 2026 15:25:02 GMT

FORTUNE

Musk vs. Altman: AI safety cannot be one man’s job

The Oakland trial was a fight between two billionaires offering themselves as the guarantors of AI’s future. We deserve a better answer.…

24 views · Mon, 18 May 2026 21:28:41 GMT

#ai #governance #law

LESSWRONG

Engineering a Safer World: Risk Modelling – and Safety Engineering? – For AI Lo

Engineering a safer world. • I, and I imagine many of my readers, are eager to contribute to that effort[1]. ……

42 views · Sun, 17 May 2026 17:00:56 GMT

#safety #engineering #ai

TOWARDS DATA SCIENCE

LLM Evals Are Based on Vibes — I Built the Missing Layer That Decides What Ships

Most LLM evaluation systems rely on vague scoring and human judgment disguised as metrics. I built a lightweight evaluation layer in pure Python that turns LLM outputs into reprodu…

38 views · Sun, 17 May 2026 13:00:00 GMT

#llms #evaluation #hallucination detection

HACKER NEWS (AI / LLM)

A personal letter on transformative AI

Making sense of rapid AI progress…

28 views · Sat, 16 May 2026 13:17:57 GMT

#artificial intelligence #transformative technology #global risks

ARXIV.ORG

Agent Behavioral Contracts

Traditional software relies on contracts -- APIs, type systems, assertions -- to specify and enforce correct behavior. AI agents, by contrast, operate on prompts and natural langua…

40 views · Sat, 16 May 2026 11:02:16 GMT

#artificial intelligence #software engineering

THE GUARDIAN — TECH

Meet the AI jailbreakers: ‘I see the worst things humanity has produced’

To test the safety and security of AI, hackers have to trick large language models into breaking their own rules. It requires ingenuity and manipulation - and can come at a deep em…

44 views · Wed, 29 Apr 2026 09:00:51 GMT

#jailbreaking #mental health

GITHUB

SupraWall – Runtime Policy Enforcement for AI Agents

The open-source security layer for AI agents. Deterministic guardrails, PII redaction, and EU AI Act compliance in one line of code. - wiserautomation/SupraWall…

28 views · Tue, 28 Apr 2026 16:03:55 GMT

#cybersecurity #compliance

LLMETER

LLM Budget Guard – open-source runtime cutoff for OpenAI/Anthropic

Alerts won't stop a runaway agent at 3 AM. Budget Guard enforces hard token cutoffs across OpenAI, Anthropic & DeepSeek before bans or surprise invoices.…

34 views · Tue, 28 Apr 2026 16:28:32 GMT

#cost management #open source

Browse more

All tags Search "Ai Safety" RSS feed World US Technology Markets

Ai Safety coverage.

We’re running out of reasons to ignore AI safety

AI just had its Sarah Connor moment. Is Australia ready?

Tech giants launch open AI safety plan following breach - NBC Bay Area

Nvidia, SpaceX, Microsoft launch AI safety initiative as OpenAI cyberattack fallout continues - oodaloop.com

What policy makers need to know about AI safety and security

Nvidia Forms AI Safety Alliance Following OpenAI Cyberattack - PYMNTS.com

Nvidia launches the Open Secure AI Alliance, a coalition with Hugging Face and others to develop and share tools for AI safety and cybersecurity (Jaspreet Singh/Reuters)

Nvidia, SpaceX, Microsoft launch AI safety initiative as OpenAI cyber attack fallout continues

Industry Leaders Unite in Open Secure AI Alliance for AI Safety and Security

AI safety experts say OpenAI’s rogue models may mean the company has already blown past its own internal red lines

OpenAI President Endorses Musk’s Proposal For Industry Meetings on AI Safety - The Information

GPT-Red: an LLM super-hacker OpenAI built to make its models safer

OpenAI Safety Boss Resigns in Latest Executive Departure - PYMNTS.com

Anthropic secretly limiting Claude's usefulness for LLM development strengthens the argument that Anthropic is using AI safety to justify monopolistic behavior (Dean W. Ball/@deanwball)

A profile of Anthropic as it prepares to go public and broaden access to Mythos, amid criticism that commercial pressures have eroded its AI safety standards (Madhumita Murgia/Financial Times)

In policy paper, OpenAI diverges from White House on AI safety - SiliconANGLE

OpenAI diverges from White House on AI safety rules

OpenAI diverges from White House on AI safety rules - Politico

Trump's AI Safety Order Is a Voluntary Form You Don't Have to Fill Out

As OpenAI Heads For IPO, Sam Altman Says Trump’s AI Safety Order ‘Gets The Balance Right’ - Yahoo Finance

As OpenAI Heads For IPO, Sam Altman Says Trump’s AI Safety Order ‘Gets The Balance Right’ - Stocktwits

Polymarket assigns 13% chance for US AI safety bill by 2027

Diverging AI safety approaches: OpenAI enters Japanese banking defenses, while Anthropic’s model remains restricted to controlled evaluations - Moomoo

Illinois Lawmakers Just Passed America’s Strongest AI Safety Bill

Opinion: OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman. - MarketWatch

OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman.

Opinion: OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman. - MarketWatch

Position: AI Safety Requires Effective Controllability

Meta and Google AI safety controls can be stripped in minutes

Cognitive Security as an AI Safety Cause Area

An AI safety safe harbor [pdf]

A look at the UK's AI Safety Institute, whose researchers probe AI models for safety gaps, as its work becomes a blueprint for other governments' AI policies (New York Times)

OpenAI is looking for an AI safety researcher with a salary up to $445,000 per year - en.ain.ua

OpenAI offers up to $445,000 for AI safety role as focus shifts to self-improving systems - WION

Why OpenAI is paying $445,000 for a 'tasteful and strategic' AI safety researcher - The Times of India

OpenAI is paying up to $445,000 for AI safety judgment - Startup Fortune

Q&A with Sundar Pichai on the future of Google Search, Google's place in the AI race, public skepticism toward AI, AI agents, AI safety, TPUs, and more (New York Times)

Daily Digest: OpenAI safety executive departs, Sonoma house Jack London built lists - The Business Journals

Wake-Up Call: Why AI Safety Guardrails Break Under Pressure

Trump scrapped a major AI safety plan — here’s why that matters for ChatGPT users

Microsoft storms RAMPART, adds Clarity to agentic AI safety

AI Safety Is Underfunded by Design: Model for Incentive-Aligned AI Safety Policy

OpenAI's Chris Lehane says he is pursuing "reverse federalism", lobbying blue states to pass AI safety laws and create a de facto US standard, as DC dithers (Brendan Bordelon/Politico)

Musk vs. Altman: AI safety cannot be one man’s job

Engineering a Safer World: Risk Modelling – and Safety Engineering? – For AI Lo

LLM Evals Are Based on Vibes — I Built the Missing Layer That Decides What Ships

A personal letter on transformative AI

Agent Behavioral Contracts

Meet the AI jailbreakers: ‘I see the worst things humanity has produced’

SupraWall – Runtime Policy Enforcement for AI Agents

LLM Budget Guard – open-source runtime cutoff for OpenAI/Anthropic

Browse more