WeSearch
TAG · #AI-SAFETY

AI Safety coverage.

Every story in the WeSearch catalog tagged with #ai-safety: 38 stories in publish-time order, with view counts. Tag pages update as new stories are ingested. Subscribe to the per-tag RSS feed to follow this topic in your reader of choice.

RELATED TAGS
#ai (4), #open-source (2), #autonomous-agents (2), #cost-management (1), #llm-operations (1), #cybersecurity (1), #compliance (1), #runtime-policy (1), #jailbreaking (1), #mental-health (1), #ethics (1), #software-engineering (1)
TECHMEME

Anthropic secretly limiting Claude's usefulness for LLM development strengthens the argument that Anthropic is using AI safety to justify monopolistic behavior (Dean W. Ball/@deanwball)

Dean W. Ball / @deanwball : Anthropic secretly limiting Claude's usefulness for LLM development strengthens the argument that Anthropic is using AI safety to justify monopolistic b…

16 views
TECHMEME

A profile of Anthropic as it prepares to go public and broaden access to Mythos, amid criticism that commercial pressures have eroded its AI safety standards (Madhumita Murgia/Financial Times)

Madhumita Murgia / Financial Times : A profile of Anthropic as it prepares to go public and broaden access to Mythos, amid criticism that commercial pressures have eroded its AI sa…

24 views
GOOGLE NEWS

In policy paper, OpenAI diverges from White House on AI safety - SiliconANGLE

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…

24 views
POLITICO EUROPE

OpenAI diverges from White House on AI safety rules

18 views
GOOGLE NEWS

OpenAI diverges from White House on AI safety rules - Politico

15 views
DEV.TO (TOP)

Trump's AI Safety Order Is a Voluntary Form You Don't Have to Fill Out

Trump signed a new AI executive order on June 2 asking companies to voluntarily submit frontier models for government review. They can say no.…

10 views
#ai #policy #regulation
GOOGLE NEWS

As OpenAI Heads For IPO, Sam Altman Says Trump’s AI Safety Order ‘Gets The Balance Right’ - Yahoo Finance

14 views
GOOGLE NEWS

As OpenAI Heads For IPO, Sam Altman Says Trump’s AI Safety Order ‘Gets The Balance Right’ - Stocktwits

9 views
CRYPTO BRIEFING

Polymarket assigns 13% chance for US AI safety bill by 2027

Polymarket traders give just 13% odds that the US will pass a federal AI safety bill before 2027, reflecting deep skepticism about Congressional action on AI regulation.…

12 views
#ai #legislation #polymarket
GOOGLE NEWS

Diverging AI safety approaches: OpenAI enters Japanese banking defenses, while Anthropic’s model remains restricted to controlled evaluations - Moomoo

15 views
WIRED

Illinois Lawmakers Just Passed America’s Strongest AI Safety Bill

The bill requires companies like OpenAI, Anthropic, and Google to have third parties confirm they’re following safety standards. Illinois Governor JB Pritzker says he’ll sign.…

19 views
#ai #technology #regulation
GOOGLE NEWS

Opinion: OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman. - MarketWatch

10 views
MARKETWATCH — TOP STORIES

OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman.

12 views
GOOGLE NEWS

Opinion: OpenAI’s biggest problem isn’t AI safety. It’s Sam Altman. - MarketWatch

14 views
ARXIV CS.AI

Position: AI Safety Requires Effective Controllability

AI safety is still largely framed as alignment: training models to follow human preferences, safety policies, and normative constraints. That framing has improved the behavior of m…

11 views
#ai #safety #controllability
CRYPTO BRIEFING

Meta and Google AI safety controls can be stripped in minutes

Financial Times testing with AI safety group Alice found that safety guardrails on Meta's Llama 3.3 and Google's Gemma 3 can be stripped in under 10 minutes.…

16 views
#ai #safety #technology
LESSWRONG

Cognitive Security as an AI Safety Cause Area

As AI systems become more capable, the cognitive security of humans will be increasingly at risk. By cognitive security, I mean the ability of humans…

15 views
#ai #cognitive security #psychology
WILLIAMRINEHART

An AI safety safe harbor [pdf]

18 views
TECHMEME

A look at the UK's AI Safety Institute, whose researchers probe AI models for safety gaps, as its work becomes a blueprint for other governments' AI policies (New York Times)

20 views
GOOGLE NEWS

OpenAI is looking for an AI safety researcher with a salary up to $445,000 per year - en.ain.ua

11 views
GOOGLE NEWS

OpenAI offers up to $445,000 for AI safety role as focus shifts to self-improving systems - WION

13 views
GOOGLE NEWS

Why OpenAI is paying $445,000 for a 'tasteful and strategic' AI safety researcher - The Times of India

15 views
GOOGLE NEWS

OpenAI is paying up to $445,000 for AI safety judgment - Startup Fortune

16 views
TECHMEME

Q&A with Sundar Pichai on the future of Google Search, Google's place in the AI race, public skepticism toward AI, AI agents, AI safety, TPUs, and more (New York Times)

New York Times : Q&A with Sundar Pichai on the future of Google Search, Google's place in the AI race, public skepticism toward AI, AI agents, AI safety, TPUs, and more — After a b…

11 views
GOOGLE NEWS

Daily Digest: OpenAI safety executive departs, Sonoma house Jack London built lists - The Business Journals

14 views
DEV.TO (TOP)

Wake-Up Call: Why AI Safety Guardrails Break Under Pressure

This is a submission for the Google I/O Writing Challenge…

9 views
#ai #safety #technology
TOM'S GUIDE

Trump scrapped a major AI safety plan — here’s why that matters for ChatGPT users

Trump just sent a clear message about AI: move fast and regulate later…

12 views
#ai #technology #regulation
WWW.THEREGISTER.COM - ARTICLES

Microsoft storms RAMPART, adds Clarity to agentic AI safety

Redmond open sources two tools for building and maintaining safer agents…

22 views
#ai #technology #software
HACKER NEWS (AI / LLM)

AI Safety Is Underfunded by Design: Model for Incentive-Aligned AI Safety Policy

A Model for Incentive-Aligned AI Safety Policy…

18 views
#ai #safety #policy
TECHMEME

OpenAI's Chris Lehane says he is pursuing "reverse federalism", lobbying blue states to pass AI safety laws and create a de facto US standard, as DC dithers (Brendan Bordelon/Politico)

Brendan Bordelon / Politico : OpenAI's Chris Lehane says he is pursuing “reverse federalism”, lobbying blue states to pass AI safety laws and create a de facto US standard, as DC d…

15 views
FORTUNE

Musk vs. Altman: AI safety cannot be one man’s job

The Oakland trial was a fight between two billionaires offering themselves as the guarantors of AI’s future. We deserve a better answer.…

13 views
#ai #governance #law
LESSWRONG

Engineering a Safer World: Risk Modelling – and Safety Engineering? – For AI Lo

Engineering a safer world. I, and I imagine many of my readers, are eager to contribute to that effort…

17 views
#safety #engineering #ai
TOWARDS DATA SCIENCE

LLM Evals Are Based on Vibes — I Built the Missing Layer That Decides What Ships

Most LLM evaluation systems rely on vague scoring and human judgment disguised as metrics. I built a lightweight evaluation layer in pure Python that turns LLM outputs into reprodu…

14 views
#llms #evaluation #hallucination detection
HACKER NEWS (AI / LLM)

A personal letter on transformative AI

Making sense of rapid AI progress…

10 views
#artificial intelligence #transformative technology #global risks
ARXIV.ORG

Agent Behavioral Contracts

Traditional software relies on contracts -- APIs, type systems, assertions -- to specify and enforce correct behavior. AI agents, by contrast, operate on prompts and natural langua…

14 views
#artificial intelligence #software engineering
THE GUARDIAN — TECH

Meet the AI jailbreakers: ‘I see the worst things humanity has produced’

To test the safety and security of AI, hackers have to trick large language models into breaking their own rules. It requires ingenuity and manipulation - and can come at a deep em…

15 views
#jailbreaking #mental health
LLMETER

LLM Budget Guard – open-source runtime cutoff for OpenAI/Anthropic

Alerts won't stop a runaway agent at 3 AM. Budget Guard enforces hard token cutoffs across OpenAI, Anthropic & DeepSeek before bans or surprise invoices.…

14 views
#cost management #open source
GITHUB

SupraWall – Runtime Policy Enforcement for AI Agents

The open-source security layer for AI agents. Deterministic guardrails, PII redaction, and EU AI Act compliance in one line of code. - wiserautomation/SupraWall…

10 views
#cybersecurity #compliance