AI Research news

Page 13 of AI Research headlines on WeSearch, deduplicated and updated continuously from 10+ editorial sources.

How Far Are We From True Auto-Research?

5/20/2026 · 16 views

Discoverable Agent Knowledge -- A Formal Framework for Agentic KG Affordances (Extended Version)

5/20/2026 · 19 views

Hallucination as Exploit: Evidence-Carrying Multimodal Agents

5/20/2026 · 21 views

Not all uncertainty is alike: volatility, stochasticity, and exploration

5/20/2026 · 19 views

SimGym: A Framework for A/B Test Simulation in E-Commerce with Traffic-Grounded VLM Agents

5/20/2026 · 16 views

Can Large Language Models Revolutionize Survey Research? Experiments with Disaster Preparedness Responses

5/20/2026 · 19 views

Causal Evidence for Attention Head Imbalance in Modality Conflict Hallucination

5/20/2026 · 20 views

AQuaUI: Visual Token Reduction for GUI Agents with Adaptive Quadtrees

5/20/2026 · 21 views

Swimming with Whales: Analysis of Power Imbalances in Stake-Weighted Governance

5/20/2026 · 16 views

MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization

5/20/2026 · 14 views

Agentic Trading: When LLM Agents Meet Financial Markets

5/20/2026 · 15 views

Generative Recursive Reasoning

5/20/2026 · 15 views

PRISM: A Benchmark for Programmatic Spatial-Temporal Reasoning

5/20/2026 · 14 views

Conflict-Resilient Multi-Agent Reasoning via Signed Graph Modeling

5/20/2026 · 13 views

What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents

5/20/2026 · 16 views

Generative Auto-Bidding with Unified Modeling and Exploration

5/20/2026 · 15 views

Beyond Mode Collapse: Distribution Matching for Diverse Reasoning

5/20/2026 · 12 views

Attention-Guided Reward for Reinforcement Learning-based Jailbreak against Large Reasoning Models

5/20/2026 · 14 views

Position: The Turing-Completeness of Real-World Autoregressive Transformers Relies Heavily on Context Management

5/20/2026 · 13 views

BLINKG: A Benchmark for LLM-Integrated Knowledge Graph Generation

5/20/2026 · 13 views

Efficient Elicitation of Collective Disagreements

5/20/2026 · 10 views

Generative-Evaluative Agreement: A Necessary Validity Criterion for LLM-Enabled Adaptive Assessment

5/20/2026 · 12 views

Library Drift: Diagnosing and Fixing a Silent Failure Mode in Self-Evolving LLM Skill Libraries

5/20/2026 · 13 views

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

5/20/2026 · 16 views

Towards Multi-Model LLM Schedulers: Empirical Insights into Offloading and Preemption

5/20/2026 · 9 views

Formal Skill: Programmable Runtime Skills for Efficient and Accurate LLM Agents

5/20/2026 · 12 views

EMO-BOOST: Emotion-Augmented Audio-Visual Features for Improved Generalization in Deepfake Detection

5/20/2026 · 13 views

When Tabular Foundation Models Meet Strategic Tabular Data: A Prior Alignment Approach

5/20/2026 · 17 views

Pseudocode-Guided Structured Reasoning for Automating Reliable Inference in Vision-Language Models

5/20/2026 · 9 views

Transforming Constraint Programs to Input for Local Search

5/20/2026 · 15 views

Beyond Rational Illusion: Behaviorally Realistic Strategic Classification

5/20/2026 · 14 views

Projecting Latent RL Actions: Towards Generalizable and Scalable Graph Combinatorial Optimization

5/20/2026 · 11 views

EngiAI: A Multi-Agent Framework and Benchmark Suite for LLM-Driven Engineering Design

5/20/2026 · 12 views

Memory-Augmented Reinforcement Learning Agent for CAD Generation

5/20/2026 · 14 views

CogScale: Scalable Benchmark for Sequence Processing

5/20/2026 · 15 views

What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code

5/20/2026 · 9 views

GroupAffect-4: A Multimodal Dataset of Four-Person Collaborative Interaction

5/20/2026 · 13 views

Minimax Optimal Variance-Aware Regret Bounds for Multinomial Logistic MDPs

5/20/2026 · 15 views

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

5/20/2026 · 9 views

Distribution-Free Uncertainty Quantification for Continuous AI Agent Evaluation

5/20/2026 · 11 views

From SGD to Muon: Adaptive Optimization via Schatten-p Norms

5/20/2026 · 14 views

Prior Knowledge or Search? A Study of LLM Agents in Hardware-Aware Code Optimization

5/20/2026 · 9 views

From Prompts to Pavement Through Time: Temporal Grounding in Agentic Scene-to-Plan Reasoning

5/20/2026 · 15 views

Explainable Wastewater Digital Twins: Adaptive Context-Conditioned Structured Simulators with Self-Falsifying Decision Support

5/20/2026 · 15 views

Streamlined Constraint Reasoning via CNN Pattern Recognition on Enumerated Solutions

5/20/2026 · 14 views

PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents

5/20/2026 · 11 views

Robotics-Inspired Guardrails for Foundation Models in Socially Sensitive Domains

5/20/2026 · 11 views

Probabilistic Tiny Recursive Model

5/20/2026 · 16 views

GeoX: Mastering Geospatial Reasoning Through Self-Play and Verifiable Rewards

5/20/2026 · 16 views

When Skills Don't Help: A Negative Result on Procedural Knowledge for Tool-Grounded Agents in Offensive Cybersecurity

5/20/2026 · 8 views

Sources in AI Research

Lilian Weng (Lil'Log)

Andrej Karpathy

Sebastian Ruder

Distill.pub

Papers with Code Trending

arXiv cs.AI

arXiv cs.LG (Machine Learning)

arXiv cs.CL (Computation/Language)

arXiv cs.CV (Computer Vision)

Stanford HAI