ai-research · WeSearch
AI Research news.
Page 5 of AI Research headlines on WeSearch, deduplicated and updated continuously from 10+ editorial sources.
ARXIV CS.AI
HyperGuide: Hyperbolic Guidance for Efficient Multi-Step Reasoning in Large Language Models
ARXIV CS.AI
Neuro-Inspired Inverse Learning for Planning and Control
ARXIV CS.AI
Palette: A Modular, Controllable, and Efficient Framework for On-demand Authorized Safety Alignment Relaxation in LLMs
ARXIV CS.AI
Inference Time Context Sparsity: Illusion or Opportunity?
ARXIV CS.AI
EPPC-OASIS: Ontology-Aware Adaptation and Structured Inference Refinement for Electronic Patient-Provider Communication Mining in Secure Messages
ARXIV CS.AI
A Sober Look at Agentic Misalignment in Automated Workflows
ARXIV CS.AI
When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs
ARXIV CS.AI
Identifying and Mitigating Systemic Measurement Bias in Production LLM Inference Benchmarks
ARXIV CS.AI
Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows
ARXIV CS.AI
How Well Do Models Follow Their Constitutions?
ARXIV CS.AI
Toward Enactive Artificial Intelligence
ARXIV CS.AI
Safety-Oriented Routing Analysis of Mixtral MoE Under Benign and Harmful Prompts
ARXIV CS.AI
When Does Synthetic Patent Data Help? Volume-Fidelity Trade-offs in Low-Resource Multi-Label Classification
ARXIV CS.AI
Adaptive Human-AI Coordination via Hierarchical Action Disentanglement
ARXIV CS.AI
Partner-Aware Hierarchical Skill Discovery for Robust Human-AI Collaboration
ARXIV CS.AI
Distilling Game Code World Model Generation into Lightweight Large Language Models
ARXIV CS.AI
A governance horizon for ethical-use constraints in open-weight AI models
ARXIV CS.AI
Understanding and Mitigating Premature Confidence for Better LLM Reasoning
ARXIV CS.AI
ConceptM$^3$oE: Concept-Guided Multimodal Mixture of Experts for Interpretable Computational Pathology
ARXIV CS.AI
Advancing Graph Few-Shot Learning via In-Context Learning
ARXIV CS.AI
The Model Is Not the Product: A Dual-Pillar Architecture for Local-First Psychological Coaching
ARXIV CS.AI
JT-SAFE-V2: Safety-by-Design Foundation Model with World-Context Data
ARXIV CS.AI
Benchmarking the Limits of In-Context Reinforcement Learning for Ad-Hoc Teamwork
ARXIV CS.AI
SAM: State-Adaptive Memory for Long-Horizon Reasoning Agent
ARXIV CS.AI
SPACE: Unifying Symmetric and Asymmetric Routing Problems for Generalist Neural Solver
ARXIV CS.AI
AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning
ARXIV CS.AI
TIGER: Text-Informed Generalized Enzyme-Reaction Retrieval
ARXIV CS.AI
Market Regime Council for Dynamic Credit Assignment in Multi-Agent LLM Decision Systems
ARXIV CS.AI
Reasoning as an Attack Surface: Adaptive Evolutionary CoT Jailbreaks for LLMs
ARXIV CS.AI
Hypothesis Generation and Inductive Inference in Children and Language Models
ARXIV CS.AI
DemoEvolve: Overcoming Sparse Feedback in Agentic Harness Evolution with Demonstrations
ARXIV CS.AI
Emission-Aware Reinforcement Learning for Sustainable Electric Vehicle Charging and Carbon Dioxide Reduction Under Varying Renewable Penetration
ARXIV CS.AI
Beyond Control-Flow: Integrating the Resource Perspective into Multi-Collaborative Process Modeling from Text
ARXIV CS.AI
PALoRA: Projection-Adaptive LoRA for Preserving Reasoning in Large Language Models
ARXIV CS.AI
Jailbreak to Protect: Buffering and Reinforcing via Temporary Jailbreaking for Safe Fine-Tuning in Large Language Models
ARXIV CS.AI
Summoning the Oracle to Slay It: Mitigating Look-Ahead Bias in Financial Backtesting with Large Language Models
ARXIV CS.AI
Associations between echocardiographic traits and AI-ECG predictions of heart failure
ARXIV CS.AI
HeartBeatAI: An Interpretable and Robust Deep Learning Framework for Multi-Label ECG Arrhythmia Detection
ARXIV CS.AI
Learning to Reason Efficiently with A* Post-Training
ARXIV CS.AI
Hera: Learning Long-Horizon Coordination for Device-Cloud Collaborative LLM Agents
ARXIV CS.AI
Agent-as-Peer-Debriefer: A Multi-Agent Framework with Perspective-Based Refinement for Qualitative Analysis
ARXIV CS.AI
Lattice theory and algebraic models for deep convolutional learning based on mathematical morphology
ARXIV CS.AI
GlobalDentBench: A Multinational Benchmark for Evaluating LLM Clinical Reasoning in Dentistry with Expert Calibration
ARXIV CS.AI
AVBench: Human-Aligned and Automated Evaluation Benchmark for Audio-Video Generative Models
ARXIV CS.AI
Beyond Inference-Only Deployment: Comparing Weight-Based Consolidation Against Cascading Compaction
ARXIV CS.AI
Measuring Reasoning Quality in LLMs: A Multi-Dimensional Behavioral Framework
ARXIV CS.AI
When Mean CE Fails: Median CE Can Better Track Language Model Quality
ARXIV CS.AI
Exploration of Perceptual Speech Features for Clinical Decision-Support in Mental Health Care
ARXIV CS.AI
Emotional intelligence in large language models is fragmented across perception, cognition, and interaction
ARXIV CS.AI