WeSearch
Hub / Ai Research
ai-research · WeSearch

Ai Research news.

Page 2 of Ai Research headlines on WeSearch — deduped and updated continuously from 10+ editorial sources.

ARXIV CS.AI

SAGE: A Quantitative Evaluation of Socialized Evolution in Agent Ecosystems

6/3/2026 · 11 views
ARXIV CS.AI

From Prompt to Service: An SLM-Based Agent Orchestration Gateway for AI-Driven Virtual Worlds

6/3/2026 · 13 views
ARXIV CS.AI

Cross-Lingual Token Arbitrage: Optimizing Code Agent Context Windows via Local LLM Preprocessing

6/3/2026 · 17 views
ARXIV CS.AI

Bridging Auxiliary Constraints to Resolve Instruction Following in Large Reasoning Models

6/3/2026 · 14 views
ARXIV CS.AI

TSQAgent: Rating Time Series Data Quality via Dedicated Agentic Reasoning

6/3/2026 · 13 views
ARXIV CS.AI

Gender-Dependent Diagnostic Substitution in LLM Medical Triage: Same Symptoms, Unequal Urgency

6/3/2026 · 18 views
ARXIV CS.AI

Towards Non-Monotonic Entailment in Propositional Defeasible Standpoint Logic

6/3/2026 · 12 views
ARXIV CS.AI

Diagnosing Knowledge Gaps in LLM Tool Use: An Agentic Benchmark for Novel API Acquisition

6/3/2026 · 13 views
ARXIV CS.AI

From Answers to States: Verifiable Process-Level Evaluation of Chemical Reasoning in Large Language Models

6/3/2026 · 16 views
ARXIV CS.AI

EvoDrive: Pareto Evolution for Safety-Critical Autonomous Driving via Self-Improving LLM Agents

6/3/2026 · 14 views
ARXIV CS.AI

The DeepSpeak-Agentic Dataset

6/3/2026 · 13 views
ARXIV CS.AI

SkillPyramid: A Hierarchical Skill Consolidation Framework for Self-Evolving Agents

6/3/2026 · 27 views
ARXIV CS.AI

Dynamic Objective Selection with Safeguards and LLM Oversight for Financial Decision-Making

6/3/2026 · 20 views
ARXIV CS.AI

Code-on-Graph: Iterative Programmatic Reasoning via Large Language Models on Knowledge Graphs

6/3/2026 · 21 views
ARXIV CS.AI

Unveiling the Structure of Do-Calculus Reasoning via Derivation Graphs

6/3/2026 · 23 views
ARXIV CS.AI

When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning

6/3/2026 · 20 views
ARXIV CS.AI

Proof-Refactor: Refactoring Generated Formal Proofs into Modular Artifacts

6/3/2026 · 20 views
ARXIV CS.AI

LAP: An Agent-to-Instrument Protocol for Autonomous Science

6/3/2026 · 18 views
GOOGLE NEWS

How AI is Transforming Scientific Discovery While Keeping Humans at the Center - Stanford HAI

5/27/2026 · 16 views
ARXIV CS.AI

BrickAnything: Geometry-Conditioned Buildable Brick Generation with Structure-Aware Tokenization

5/27/2026 · 23 views
ARXIV CS.AI

Can LLMs Introspect? A Reality Check

5/27/2026 · 20 views
ARXIV CS.AI

Is Agent Memory a Database? Rethinking Data Foundations for Long-Term AI Agent Memory

5/27/2026 · 30 views
ARXIV CS.AI

Personalizing Embodied Multimodal Large Language Model Agents over Long-term User Interactions

5/27/2026 · 23 views
ARXIV CS.AI

Constraint acquisition needs better benchmarks

5/27/2026 · 17 views
ARXIV CS.AI

Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems

5/27/2026 · 21 views
ARXIV CS.AI

Experiments in Agentic AI for Science

5/27/2026 · 18 views
ARXIV CS.AI

Anchor: Mitigating Artifact Drift in Agent Benchmark Generation

5/27/2026 · 22 views
ARXIV CS.AI

OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling

5/27/2026 · 20 views
ARXIV CS.AI

JobBench: Aligning Agent Work With Human Will

5/27/2026 · 26 views
ARXIV CS.AI

Managing Uncertainty in LLM-Generated Procedural Knowledge for Virtual Laboratory Planning

5/27/2026 · 20 views
ARXIV CS.AI

ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence

5/27/2026 · 20 views
ARXIV CS.AI

Automatic Layer Selection for Hallucination Detection

5/27/2026 · 19 views
ARXIV CS.AI

Exploiting Local Dynamics Regularity for Reusable Skills in Offline Hierarchical RL

5/27/2026 · 22 views
ARXIV CS.AI

Advancing Creative Physical Intelligence in Large Multimodal Models

5/27/2026 · 25 views
ARXIV CS.AI

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

5/27/2026 · 20 views
ARXIV CS.AI

Reasoning, Code, or Both? How Large Language Models Handle Variations in Math Questions

5/27/2026 · 21 views
ARXIV CS.AI

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

5/27/2026 · 21 views
ARXIV CS.AI

Which Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoning

5/27/2026 · 22 views
ARXIV CS.AI

PolyFusionAgent: A Multimodal Foundation Model and Autonomous AI Assistant for Polymer Property Prediction and Inverse Design

5/27/2026 · 17 views
ARXIV CS.AI

MobileExplorer: Accelerating On-Device Inference for Mobile GUI Agents via Online Exploration

5/27/2026 · 21 views
ARXIV CS.AI

MedGuideX: Internalizing Decision Logic from Executable Guidelines into Large Language Models for Clinical Reasoning

5/27/2026 · 21 views
ARXIV CS.AI

AGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agents

5/27/2026 · 18 views
ARXIV CS.AI

FAST-GOAL: Fast and Efficient Global-local Object Alignment Learning

5/27/2026 · 19 views
ARXIV CS.AI

Tail-Aware HiFloat4: W4A4 Post-Training Quantization for Wan2.2

5/27/2026 · 17 views
ARXIV CS.AI

UnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systems

5/27/2026 · 21 views
ARXIV CS.AI

Completion vs Optimality: Policy Gradient in Long-Horizon Cumulative-Damage Problems

5/27/2026 · 20 views
ARXIV CS.AI

MemFail: Stress-Testing Failure Modes of LLM Memory Systems

5/27/2026 · 21 views
ARXIV CS.AI

Mind the Tool Failures: Achieving Synergistic Tool Gains for Medical Agents

5/27/2026 · 20 views
ARXIV CS.AI

Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation

5/27/2026 · 24 views
ARXIV CS.AI

It's Not the Capability: Harness Sensitivity Is Non-Monotone Across LLM Agent Tiers

5/27/2026 · 18 views

Sources in Ai Research

Other categories