Hub / Ai Research
ai-research · WeSearch
Ai Research news.
Page 13 of Ai Research headlines on WeSearch — deduped and updated continuously from 10+ editorial sources.
ARXIV CS.AI
How Far Are We From True Auto-Research?
ARXIV CS.AI
Discoverable Agent Knowledge -- A Formal Framework for Agentic KG Affordances (Extended Version)
ARXIV CS.AI
Hallucination as Exploit: Evidence-Carrying Multimodal Agents
ARXIV CS.AI
Not all uncertainty is alike: volatility, stochasticity, and exploration
ARXIV CS.AI
SimGym: A Framework for A/B Test Simulation in E-Commerce with Traffic-Grounded VLM Agents
ARXIV CS.AI
Can Large Language Models Revolutionize Survey Research? Experiments with Disaster Preparedness Responses
ARXIV CS.AI
Causal Evidence for Attention Head Imbalance in Modality Conflict Hallucination
ARXIV CS.AI
AQuaUI: Visual Token Reduction for GUI Agents with Adaptive Quadtrees
ARXIV CS.AI
Swimming with Whales: Analysis of Power Imbalances in Stake-Weighted Governance
ARXIV CS.AI
MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization
ARXIV CS.AI
Agentic Trading: When LLM Agents Meet Financial Markets
ARXIV CS.AI
Generative Recursive Reasoning
ARXIV CS.AI
PRISM: A Benchmark for Programmatic Spatial-Temporal Reasoning
ARXIV CS.AI
Conflict-Resilient Multi-Agent Reasoning via Signed Graph Modeling
ARXIV CS.AI
What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents
ARXIV CS.AI
Generative Auto-Bidding with Unified Modeling and Exploration
ARXIV CS.AI
Beyond Mode Collapse: Distribution Matching for Diverse Reasoning
ARXIV CS.AI
Attention-Guided Reward for Reinforcement Learning-based Jailbreak against Large Reasoning Models
ARXIV CS.AI
Position: The Turing-Completeness of Real-World Autoregressive Transformers Relies Heavily on Context Management
ARXIV CS.AI
BLINKG: A Benchmark for LLM-Integrated Knowledge Graph Generation
ARXIV CS.AI
Efficient Elicitation of Collective Disagreements
ARXIV CS.AI
Generative-Evaluative Agreement: A Necessary Validity Criterion for LLM-Enabled Adaptive Assessment
ARXIV CS.AI
Library Drift: Diagnosing and Fixing a Silent Failure Mode in Self-Evolving LLM Skill Libraries
ARXIV CS.AI
SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects
ARXIV CS.AI
Towards Multi-Model LLM Schedulers: Empirical Insights into Offloading and Preemption
ARXIV CS.AI
Formal Skill: Programmable Runtime Skills for Efficient and Accurate LLM Agents
ARXIV CS.AI
EMO-BOOST: Emotion-Augmented Audio-Visual Features for Improved Generalization in Deepfake Detection
ARXIV CS.AI
When Tabular Foundation Models Meet Strategic Tabular Data: A Prior Alignment Approach
ARXIV CS.AI
Pseudocode-Guided Structured Reasoning for Automating Reliable Inference in Vision-Language Models
ARXIV CS.AI
Transforming Constraint Programs to Input for Local Search
ARXIV CS.AI
Beyond Rational Illusion: Behaviorally Realistic Strategic Classification
ARXIV CS.AI
Projecting Latent RL Actions: Towards Generalizable and Scalable Graph Combinatorial Optimization
ARXIV CS.AI
EngiAI: A Multi-Agent Framework and Benchmark Suite for LLM-Driven Engineering Design
ARXIV CS.AI
Memory-Augmented Reinforcement Learning Agent for CAD Generation
ARXIV CS.AI
CogScale: Scalable Benchmark for Sequence Processing
ARXIV CS.AI
What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code
ARXIV CS.AI
GroupAffect-4: A Multimodal Dataset of Four-Person Collaborative Interaction
ARXIV CS.AI
Minimax Optimal Variance-Aware Regret Bounds for Multinomial Logistic MDPs
ARXIV CS.AI
OpenComputer: Verifiable Software Worlds for Computer-Use Agents
ARXIV CS.AI
Distribution-Free Uncertainty Quantification for Continuous AI Agent Evaluation
ARXIV CS.AI
From SGD to Muon: Adaptive Optimization via Schatten-p Norms
ARXIV CS.AI
Prior Knowledge or Search? A Study of LLM Agents in Hardware-Aware Code Optimization
ARXIV CS.AI
From Prompts to Pavement Through Time: Temporal Grounding in Agentic Scene-to-Plan Reasoning
ARXIV CS.AI
Explainable Wastewater Digital Twins: Adaptive Context-Conditioned Structured Simulators with Self-Falsifying Decision Support
ARXIV CS.AI
Streamlined Constraint Reasoning via CNN Pattern Recognition on Enumerated Solutions
ARXIV CS.AI
PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents
ARXIV CS.AI
Robotics-Inspired Guardrails for Foundation Models in Socially Sensitive Domains
ARXIV CS.AI
Probabilistic Tiny Recursive Model
ARXIV CS.AI
GeoX: Mastering Geospatial Reasoning Through Self-Play and Verifiable Rewards
ARXIV CS.AI