Search: "agent systems" — WeSearch Press

TECHMEME

Parallel Web Systems, founded by former Twitter CEO Parag Agrawal and which offers web search tools for AI agents, raised a $100M Series B at a $2B valuation (Belle Lin/Wall Street Journal)

Belle Lin / Wall Street Journal : Parallel Web Systems, founded by former Twitter CEO Parag Agrawal and which offers web search tools for AI agents, raised a $100M Series B at a $2B valuation — Parall…

Wed, 29 Apr 2026 01:52:14 GMT · 13 views

AISTACKINSIGHTS

Multi-Agent AI Systems Are Eating Single Agents

Single-agent architectures hit a wall the moment your task needs planning, research, and execution in parallel. Multi-agent systems solve this — but most tutorials skip the hard parts. This guide does…

Sun, 26 Apr 2026 06:09:58 GMT · 5 views

DEV COMMUNITY

I Built Multi-Agent Systems Before NEXT '26 — Here's What the New ADK, MCP & A2A Stack Actually Changes

This is a submission for the Google Cloud NEXT Writing Challenge I Built Multi-Agent...…

Tue, 28 Apr 2026 12:24:59 GMT · 4 views

DEV COMMUNITY

Two Nasty Gotchas When Building Multi-Agent Systems with Google ADK

Google's Agent Development Kit (ADK) makes it straightforward to compose LlmAgent instances into...…

Tue, 28 Apr 2026 09:54:13 GMT · 3 views

DEV.TO (TOP)

OpenAI Agents SDK Tutorial: Build Multi-Agent AI Systems in Python (2025)

How to move beyond single-prompt chatbots and create AI workflows that plan, collaborate, and get things done — with working code you can run today.…

Tue, 28 Apr 2026 23:29:27 GMT · 4 views

ARXIV CS.AI

Can Current Agents Close the Discovery-to-Application Gap? A Case Study in Minecraft

Discovering causal regularities and applying them to build functional systems--the discovery-to-application loop--is a hallmark of general intelligence, yet evaluating this capacity has been hindered …

Wed, 29 Apr 2026 04:04:25 GMT · 2 views

ARXIV.ORG

From Skills to Talent: Organising Heterogeneous Agents as a Company [pdf]

Individual agent capabilities have advanced rapidly through modular skills and tool integrations, yet multi-agent systems remain constrained by fixed team structures, tightly coupled coordination logi…

Wed, 29 Apr 2026 01:34:23 GMT · 4 views

ARXIV.ORG

The Controllability Trap: A Governance Framework for Military AI Agents

Agentic AI systems - capable of goal interpretation, world modeling, planning, tool use, long-horizon operation, and autonomous coordination - introduce distinct control failures not addressed by exis…

Tue, 28 Apr 2026 21:33:22 GMT · 4 views

NVIDIA BLOG

NVIDIA Launches Nemotron 3 Nano Omni Model, Unifying Vision, Audio and Language for up to 9x More Efficient AI Agents

AI agent systems today juggle separate models for vision, speech and language — losing time and context as they pass data from one model to the other. Unveiled today, NVIDIA Nemotron 3 Nano Omni is an…

Tue, 28 Apr 2026 16:28:47 GMT · 4 views

ARXIV.ORG

Architectural Requirements for Agentic AI Containment

The April 2026 disclosure that a frontier large language model escaped its security sandbox, executed unauthorized actions, and concealed its modifications to version control history demonstrates that…

Tue, 28 Apr 2026 15:10:00 GMT · 4 views

GITHUB

Show HN: VoiceGoat – A vulnerable voice agent for practicing LLM attacks

A purposely vulnerable voice agent application for security practitioners to practice exploiting voice-based (and text based) AI systems. - redcaller/voice-goat…

Tue, 28 Apr 2026 14:55:00 GMT · 4 views

ARXIV.ORG

HeLa-Mem: Hebbian Learning and Associative Memory for LLM Agents

Long-term memory is a critical challenge for Large Language Model agents, as fixed context windows cannot preserve coherence across extended interactions. Existing memory systems represent conversatio…

Tue, 28 Apr 2026 13:14:59 GMT · 13 views

ARXIV.ORG

FormalScience: Scalable Human-in-the-Loop Autoformalisation of Science with Agentic Code Generation in Lean

Formalising informal mathematical reasoning into formally verifiable code is a significant challenge for large language models. In scientific fields such as physics, domain-specific machinery (\textit…

Tue, 28 Apr 2026 04:13:21 GMT · 5 views

ARXIV.ORG

A Decoupled Human-in-the-Loop System for Controlled Autonomy in Agentic Workflows

AI agents are increasingly deployed to execute tasks and make decisions within agentic workflows, introducing new requirements for safe and controlled autonomy. Prior work has established the importan…

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

ARXIV.ORG

Active Inference: A method for Phenotyping Agency in AI systems?

The proliferation of agentic artificial intelligence has outpaced the conceptual tools needed to characterize agency in computational systems. Prevailing definitions mainly rely on autonomy and goal-d…

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

ARXIV.ORG

GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs

Autonomous multi-agent LLM systems are increasingly deployed to investigate operational incidents and produce structured diagnostic reports. Their trustworthiness hinges on whether each claim is groun…

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

ARXIV.ORG

Agentic Adversarial Rewriting Exposes Architectural Vulnerabilities in Black-Box NLP Pipelines

Multi-component natural language processing (NLP) pipelines are increasingly deployed for high-stakes decisions, yet no existing adversarial method can test their robustness under realistic conditions…

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

ARXIV.ORG

Structural Enforcement of Goal Integrity in AI Agents via Separation-of-Powers Architecture

Recent evidence suggests that frontier AI systems can exhibit agentic misalignment, generating and executing harmful actions derived from internally constructed goals, even without explicit user reque…

Tue, 28 Apr 2026 04:13:21 GMT · 4 views

ARXIV.ORG

ZenBrain: A Neuroscience-Inspired 7-Layer Memory Architecture for Autonomous AI Systems

Despite a century of empirical memory research, existing AI agent memory systems rely on system-engineering metaphors (virtual-memory paging, flat LLM storage, Zettelkasten notes), none integrating pr…

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

ARXIV.ORG

QED: An Open-Source Multi-Agent System for Generating Mathematical Proofs on Open Problems

We explore a central question in AI for mathematics: can AI systems produce original, nontrivial proofs for open research problems? Despite strong benchmark performance, producing genuinely novel proo…

Tue, 28 Apr 2026 04:13:21 GMT · 6 views

ARXIV.ORG

Agentic clinical reasoning over longitudinal myeloma records: a retrospective evaluation against expert consensus

Multiple myeloma is managed through sequential lines of therapy over years to decades, with each decision depending on cumulative disease history distributed across dozens to hundreds of heterogeneous…

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

ARXIV.ORG

FastOMOP: A Foundational Architecture for Reliable Agentic Real-World Evidence Generation on OMOP CDM data

The Observational Medical Outcomes Partnership Common Data Model (OMOP CDM), maintained by the Observational Health Data Sciences and Informatics (OHDSI) collaboration, enabled the harmonisation of el…

Tue, 28 Apr 2026 04:13:21 GMT · 4 views

ARXIV.ORG

The Price of Agreement: Measuring LLM Sycophancy in Agentic Financial Applications

Given the increased use of LLMs in financial systems today, it becomes important to evaluate the safety and robustness of such systems. One failure mode that LLMs frequently display in general domain …

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

I built Claude Code skills for writing agent prompts, grounded in prompt research

I've been building agentic systems for a while and wanted a more systematic approach to writing prompts. So I gathered papers, did some deep research and created guides on structure, format and prompt…

Sun, 26 Apr 2026 20:54:40 GMT · 8 views

A 14-day “Growth Forge” sprint: build an AI-powered growth agent on a real stack

Sharing something that sits at the intersection of AI agents and growth systems. VideoDB (backend for video/audio for AI agents) is running a 14-day sprint called Growth Forge for 5 builders to design…

Sun, 26 Apr 2026 20:54:40 GMT · 7 views

PRIVATE BANKER INTERNATIONAL

Instacart co-founder launches hedge fund backing AI agents

Abundance’s longer-term aim is to move towards AI systems managing investment decisions across the portfolio on their own.…

Wed, 29 Apr 2026 03:54:24 GMT · 10 views

GITHUB

Claude Leak Confirms It: LLM Systems Are Architecture, Not Prompts (Orca)

Agents should execute whenever possible — runtime for composable AI agent skills - gfernandf/agent-skills…

Tue, 28 Apr 2026 17:22:08 GMT · 6 views

ARXIV CS.AI

Case-Specific Rubrics for Clinical AI Evaluation: Methodology, Validation, and LLM-Clinician Agreement Across 823 Encounters

Objective. Clinical AI documentation systems require evaluation methodologies that are clinically valid, economically viable, and sensitive to iterative changes. Methods requiring expert review per sc…

Wed, 29 Apr 2026 04:04:25 GMT · 2 views

ARXIV CS.AI

Quantifying Divergence in Inter-LLM Communication Through API Retrieval and Ranking

Large language models (LLMs) increasingly operate as autonomous agents that reason over external APIs to perform complex tasks. However, their reliability and agreement remain poorly characterized. We…

Wed, 29 Apr 2026 04:04:25 GMT · 1 view

ARXIV CS.AI

Representation Homogeneity and Systemic Instability in AI-Dominated Financial Markets: A Structural Approach

This paper investigates how similarity in the informational representation of market states among Artificial Intelligence (AI) trading agents can generate systemic instability in financial markets. We…

Wed, 29 Apr 2026 04:04:25 GMT · 2 views

