20 stories tagged with #autonomous-agents, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
⌘ RSS feed for this tag → or search "Autonomous Agents"
Identity is not enough for autonomous agents
Identity tells you who is talking. The autonomous economy needs to know whether to act on what they say.…
What Benchmarks Don't Measure: The Case for Evaluating Abstention Competence in Autonomous Agents
Benchmarks for autonomous agents measure whether agents complete tasks, yet this framing is systematically blind to whether an agent should have proceeded at all. Agents trained un…
Beyond Monolithic AI: How to Build a Pluggable "Brain" Architecture for Autonomous Agents
Imagine you’re building a personal research assistant. Its job is to ingest hundreds of academic...…
Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows
Large Language Models (LLMs) are increasingly deployed as autonomous agents that reason, use tools, and act over multiple steps. Yet most hallucination benchmarks still evaluate on…
Building an agent-ready website: how to make your site readable for ChatGPT, Perplexity and autonomous agents
ChatGPT and Perplexity don't read your site the way Google does. Here's the four-layer stack we shipped for FollowNow to get cited by LLMs: llms.txt, OpenAPI 3.1, .well-known/agent…
Foundation Protocol: A Coordination Layer for Agentic Society
Autonomous agents are moving from tools into a layer of social infrastructure: they browse, purchase, deploy software, manage systems, and increasingly interact with one another. A…
Authorization layer for AI agents (OAuth has no idea what your agent is doing)
Open-source PDP for AI agents. Verify identity, validate delegation chains, detect behavioral drift.…
Autonomous Agents Need Receipts, Not Just Reasoning
Most AI agent demos optimize for the wrong screenshot. They show the moment where the agent produces...…
Alibaba (BABA) Unveils Zhenwu M890 AI Chip for Autonomous Agents
Autonomous Agents: what breaks first (and why that's the real product) [2026-05-23]
Most "AI agents" you see online are basically: a scheduled loop a prompt a couple API calls That...…
SOLAR: A Self-Optimizing Open-Ended Autonomous Agent for Lifelong Learning and Continual Adaptation
Despite the remarkable success of large language models (LLMs), they still face bottlenecks while deploying in dynamic, real-world settings with primary challenges being concept dr…
Show HN: ANML – A machine-first markup language for the agentic web (IETF Draft)
ANML is a machine-first markup language for agent-to-agent and agent-to-service communication over the internet.…
Cron & Scheduled Tasks in Garudust Agent — Autonomous Agents That Run Without You
Most AI agents wait. They sit idle until a human types something, respond, then go back to...…
Trustworthy Agent Network: Trust in Agent Networks Must Be Baked In, Not Bolted On
The rapid advancement of Large Language Models has given rise to autonomous LLM-based agents capable of complex reasoning and execution. As these agents transition from isolated op…
A11 as a Cognitive Layer for Autonomous Agents in Isolated Execution Environments
1. Purpose This article formalizes: the architecture of autonomous LLM‑based agents...…
Look Before You Leap: Autonomous Exploration for LLM Agents
Large language model based agents often fail in unfamiliar environments due to premature exploitation: a tendency to act on prior knowledge before acquiring sufficient environment-…
Agent Behavioral Contracts
Traditional software relies on contracts -- APIs, type systems, assertions -- to specify and enforce correct behavior. AI agents, by contrast, operate on prompts and natural langua…
I Gave Hermes Agent 5 Impossible Tasks
This is a submission for the Hermes Agent Challenge Let me be honest with you before we start. I...…
Let the AI Do the Experimenting
Using autoresearch to optimise marketing campaigns under budget constraints…
LLM Budget Guard – open-source runtime cutoff for OpenAI/Anthropic
Alerts won't stop a runaway agent at 3 AM. Budget Guard enforces hard token cutoffs across OpenAI, Anthropic & DeepSeek before bans or surprise invoices.…