60 stories tagged with #coding-agents, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
Ory Brings Agent DX to Claude Code, OpenAI Codex, and Other AI Coding Agents - The National Law Review
Show HN: CTP Room – a shared chat room where your AI coding agents coordinate
State of AI Coding Agents in 2026
Dotnet-slopwatch – detect when AI coding agents "fix" problems by cheating
Catch naughty LLM reward-hacking and bad behavior for .NET coding - Aaronontheweb/dotnet-slopwatch…
Show HN: Carto – structural intelligence for AI coding agents (OSS)
Carto is structural intelligence for your codebase. It gives AI the ability to think architecturally. - theanshsonkar/carto…
Handoff Debt: The Rediscovery Cost When Coding Agents Take Over Interrupted Tasks
Coding-agent benchmarks evaluate whether a single uninterrupted agent can resolve a repository issue. Real software work is messier: tasks are interrupted, reassigned, reviewed, an…
Type-Error Ablation and AI Coding Agents
Programming language implementors have designed error messages with one consumer in mind: the human programmer. Human-factors research has consistently found that programmers engag…
How to Make Your Codebase Work for AI Coding Agents (Without Better Prompts)
Your agent wrote valid code. It still missed the point. Wrong package manager. Tests run with a flag...…
Show HN: Komi-learn – continuous memory and self-improvement for coding agents
Continuous memory + self-improvement for AI agents. Learns how you work, recalls it automatically, no commands. Claude Code & Codex. - kurikomi-labs/komi-learn…
A Java library just tried to trick AI coding agents into deleting your tests, and it almost worked
The latest flare-up in the debate over AI-assisted coding did not come from a new model release or a benchmark result. It came from a single line...…
Coding agents should not hold write credentials.
I have been thinking a lot about coding agents lately. Not really about whether they can write good...…
Top CLI AI Coding Agents to Use in 2026
AI coding tools have moved way past autocomplete. Today's CLI agents read your entire codebase, plan...…
AI coding agents ship at the cost of intuition and taste
Software developers used to work for hours to get the dopamine hit of a working system. Codex and Claude give this hit without the work.…
Coding agents keep losing context between tools, so I built a local-first handoff CLI
The problem: I often switch between Codex, OpenCode, Cline, Claude Desktop, scripts, and...…
Cognition’s Scott Wu says AI coding agents shouldn’t replace humans
Cognition makes Devin, the first and arguably most successful AI coding agent. But famed coder Wu says it isn't designed to supplant human programmers.…
Nesbitt: Protestware for coding agents
Andrew Nesbitt has written a blog post detailing a recent incident with the jqwik library for p [...]…
The Age of Architecture: AI Coding Agents Are Forcing Us to Build Better Systems
As coding becomes increasingly automated, software architecture is becoming the highest-leverage skill in engineering. AI agents thrive in systems with clear boundaries, strong con…
Dis Dat – Loom for AI coding agents
Talk and point at your screen. Your coding agent gets annotated frames and ships the fix.…
Put your Coding Agents in Drive w/ Superpowers (aka How Superpowers is the Automatic Transmission of Agentic Coding)
A brief breakdown of Superpowers, the most-starred methodology in the Claude Code ecosystem, through an automatic transmission analogy.…
Clawd-on-Desk: a pixel desktop pet watching your AI coding agents
A pixel desktop pet that watches Claude Code, Codex, Cursor & other AI coding agents — so you don't have to. - rullerzhou-afk/clawd-on-desk…
Show HN: Rig – Local-first code graph for coding agents, in one npx command
Local-first semantic knowledge graph with magnetic-pull retrieval - Astralchemist/rig…
With coding agents, specs feel more like source code
How coding agents changed the way I think about source code, specs, PRDs, and the developer's role in building software.…
HiTerm: A Free Remote Terminal for AI Coding Agents (Claude Code, Codex, Gemini CLI)
The Problem: Claude Code, Codex, Gemini CLI, and other AI coding agents are powerful tools....…
Coding a Classical Robot Controller in the Age of Coding Agents
Hello everyone! We competed in the AI for Industry Challenge under the team name MacCody and wanted to share our experiences in the Qualification Phase. We unfortunately did not m…
DeepSWE: Measuring frontier coding agents
DeepSWE measures frontier coding agents on original, long-horizon software engineering tasks.…
AI coding agents are installing packages no one owns
As AI agents autonomously install packages, pull dependencies, and execute code, most enterprises have no policy, no visibility, and no one accountable when something goes wrong.…
Millwright-Inspector: A Methodology for Software Development with AI Coding Agents
TL;DR. Two roles. An AI agent (the millwright) drafts every artifact in the workflow. A human (the...…
How AI coding agents actually use your technology
You ship an SDK, a CLI, an API, and developers use it. Now AI coding agents use it too, except they use it differently than humans do. Most of the time…
Do coding agents need cross-tool org knowledge? Or, just good to have?
Camclave — consent-gated webcam access for AI coding agents
I built Camclave, a consent-gated webcam tool for AI coding agents. AI agents can already see your...…
A Senior Engineer’s Guide & Mental Model for Building Skills for AI Coding Agents
The biggest mistake teams make with AI coding agents is treating them like smarter autocomplete. A...…
Show HN: Mneme HQ – repo-native architectural rules for AI coding agents
Mneme HQ enforces your team's architectural decisions before AI-generated code reaches review. Prevent drift, enforce standards, and govern AI coding at the source.…
Show HN: Unspaghettit – executable behavior specs for AI coding agents
Behavior-driven AI development without prompt spaghetti. - lyriks-io/unspaghettit…
VISTA: An End-to-End Benchmark for Visual Spec-to-Web-App Coding Agents
We present VISTA (VIsual Spec-To-App Benchmark), a benchmark for evaluating the end-to-end web-app generation capabilities of LLM-based agents. Unlike prior code generation benchma…
Repo Drift Is the Hidden Cost of AI Coding Agents — and one Fix Is Simpler Than You Think
A lot of conversations about AI coding agents focus on obvious failures: hallucinated APIs, broken...…
MartinLoop: a control plane for AI coding agents
MartinLoop is an open-source control plane for AI coding agents. It adds hard...…
Anyone else noticing AI coding agents pushing more lightweight failures into your CI?
Aperion Shield v0.7 – guardrails for AI coding agents now run as Git hooks
aperion-shield v0.7.0 — git hooks close the MCP-bypass gap The release that closes the most-cited objection to MCP-only enforcement: "the agent just opens a shell and reaches aroun…
Building the harness around our coding agents: eight failure modes, eight pillars
Notes on the harness we built around Claude Code and Codex, organized as eight coding agent failure modes and eight harness pillars.…
Well-Architected Skills and Steering for AI Coding Agents
Reusable skills and steering that teach AI coding agents how to apply the AWS Well-Architected Framework. One set of playbooks, 12 supported tools. - aws-samples/sample-well-archit…
EvoCode-Bench: Evaluating Coding Agents in Multi-Turn Iterative Interactions
Coding agents are increasingly used as iterative development partners, but most benchmarks still evaluate one specification followed by one final assessment. This leaves out a basi…
CODESKILL: Learning Self-Evolving Skills for Coding Agents
Coding agents produce rich trajectories while solving software-engineering tasks. To enable agent self-evolution, these trajectories can be distilled into reusable procedural skill…
Show HN: AgentToolBench-Code – security benchmark for AI coding agents
How to Fix Tool-Use Loops in Autonomous Coding Agents
Autonomous coding agents love getting stuck in tool-use loops. Here's why it happens and four concrete fixes that stop the bleeding.…
Show HN: Musts – Open-source validation loops for AI coding agents
The validation loop that stops AI coding agents from claiming work is done before it actually is. - bitomule/musts…
TrapDoor Supply Chain Campaign Targets npm, PyPI, and Crates.io to Poison AI Coding Agents
AgentSlice – Make AI coding agents ask before they edit
A Markdown workflow kit that makes Cursor, Claude Code, Codex and Windsurf ask before they edit. - Espenandreass1/agentslice…
cmux: The Native macOS Terminal Built for Running AI Coding Agents in Parallel
If you have ever run three Claude Code sessions at the same time in a stock terminal, you know the...…
Linux 7.1-rc5 Released With Fixes Ramping Up From AI Coding Agents
On the road to releasing Linux 7.1 in June, out today is Linux 7.1-rc5, which continues to come in heavy with fixes.…
AI coding agents for Golang project
Show HN: Fleet – Python supervisor for running coding agents in parallel
Coding agents are giving everyone decision fatigue
The real attack surface for AI coding agents is the config file
If you think the security risk of AI coding agents (Claude Code, Cursor, Gemini CLI) is "the model...…
I built an open protocol to make AI coding agents follow senior-engineering workflows
AI coding agents are getting better fast, but I kept running into the same failure modes: they skip...…
The Polyglot Protocol – senior-engineer guardrails for AI coding agents
A senior-engineer protocol for polyglot code generation, architecture, testing, security, performance, and agent validation. - sabir-gbs/the-polyglot-protocol…
Verytis – shared error memory for AI coding agents (MCP)
Debug memory for AI coding agents.…
Why AI Coding Agents Hallucinate and How to Fix It
Most engineers spot AI coding agent hallucinations but don't fix the underlying cause. How I debug context, rules files, and memory drift.…
Meta paper reveals improved coding agents through summary reuse
Meta AI research shows two-line summaries of past coding attempts outperform full execution logs, improving coding agent performance through smarter memory use.…
Herdr: A tmux-like terminal multiplexer for AI coding agents
An agent multiplexer that lives in your terminal. - ogulcancelik/herdr…