AI Research news.
Page 3 of AI Research headlines on WeSearch, deduped and updated continuously from 10+ editorial sources.
ARXIV CS.AI
A Dataset of Robot-Patient and Doctor-Patient Medical Dialogues for Spoken Language Processing Tasks
ARXIV CS.AI
Beyond a Single Direction: Chain-of-Thought Disrupts Simple Steering of Refusal
ARXIV CS.AI
The Attribution Blind Spot: Detecting When Language Models Rely on Memory Rather Than Retrieved Context
ARXIV CS.AI
LiveK12Bench: Have Large Multimodal Models Truly Conquered High School-level Examinations?
ARXIV CS.AI
Composition Collapse: Stable Factual Knowledge Does Not Imply Compositional Reasoning
ARXIV CS.AI
What Makes Chain-of-Thought Work at Probe Time? Local Co-occurrence Rather Than Global Derivation
ARXIV CS.AI
Helicase: Uncertainty-Guided Supply Chain Knowledge Graph Construction with Autonomous Multi-Agent LLMs
ARXIV CS.AI
Multi-Stakeholder LLM Alignment: Decomposing Estimation from Aggregation
ARXIV CS.AI
On the Detection of Commutative Factors in Factor Graphs: Necessary and Sufficient Conditions
ARXIV CS.AI
TADDLE: A Tool-Augmented Agent for Detecting Deficient LLM-Generated Peer Reviews
ARXIV CS.AI
From Norms to Indicators (N2I-RAG): An Agentic Retrieval-Augmented Generation Framework for Legal Indicator Computation
ARXIV CS.AI
Developing a Totally Unimodular Linear Program for Optimal Conformance Checking: When and Why It Complements A*
ARXIV CS.AI
Neuro-Symbolic Verification of LLM Outputs for Data-Sensitive Domains (extended preprint)
ARXIV CS.AI
LELA: An End-to-end LLM-based Entity Linking Framework with Zero-shot Domain Adaptation
ARXIV CS.AI
Generating Robust Portfolios of Optimization Models using Large Language Models
ARXIV CS.AI
ORCA: An End-to-End Interactive Copilot for Optimized Root Cause Analysis
ARXIV CS.AI
Boosting Knowledge Graph Foundation Models via Enhanced Negative Sampling
ARXIV CS.AI
BatteryMFormer: Multi-level Learning for Battery Degradation Trajectory Forecasting
ARXIV CS.AI
Traceable Knowledge Graph Reasoning Enables LLM-Assisted Decision Support for Industrial VOCs in the Steel Industry
ARXIV CS.AI
Can Broad Biomedical Knowledge be Contextualized into Scenario-Grounded Propositions?
ARXIV CS.AI
Counteraction-Aware Multi-Teacher On-Policy Distillation for General Capability Recovery with Domain Preservation
ARXIV CS.AI
Position: AI Safety Requires Effective Controllability
ARXIV CS.AI
Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation
ARXIV CS.AI
ICCU: In-Context Continual Unlearning via Pattern-Induced Refusal Rules
ARXIV CS.AI
StepOPSD: Step-Aware Online Preference Distillation for Agent Reinforcement Learning
ARXIV CS.AI
VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions
ARXIV CS.AI
Detecting Is Not Resolving: The Monitoring Control Gap in Retrieval Augmented LLMs
ARXIV CS.AI
Query Symbolically or Retrieve Semantically? A Dataset and Method for Semi-Structured Question Answering
ARXIV CS.AI
The Compressive Knowledge Graph Hypothesis: Which Graph Facts Matter for Scientific Hypothesis Generation?
ARXIV CS.AI
Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments
ARXIV CS.AI
Gumbel Machine: Counterfactual Student Writing Generation via Gumbel Noise Steering
ARXIV CS.AI
SIA: Self Improving AI with Harness & Weight Updates
ARXIV CS.AI
Modeling Agentic Technical Debt and Stochastic Tax: A Standalone Framework for Measurement, Simulation, and Dashboarding
ARXIV CS.AI
Maat: The Agentic Legal Research Assistant for Competition Protection
ARXIV CS.AI
2-ASP(Q) programs with weak constraints: Complexity and efficient implementation
ARXIV CS.AI
Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases
ARXIV CS.AI
Natural Language Query to Configuration for Retrieval Agents
ARXIV CS.AI
MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation
ARXIV CS.AI
Xe-Forge: Multi-Stage LLM-Powered Kernel Optimization for Intel GPU
ARXIV CS.AI
Edge AI Deployment Beyond Models: A BSP-Aware Systems Framework for Industrial Embedded Platforms
ARXIV CS.AI
GEM: Geometric Entropy Mixing for Optimal LLM Data Curation
ARXIV CS.AI
Pretraining Data Exposure in Large Language Models: A Survey of Membership Inference, Data Contamination, and Security Implications
ARXIV CS.AI
Eroding Trust in Real Speech: A Large-Scale Study of Human Audio Deepfake Perception
ARXIV CS.AI
AssetGen: Deployable 3D Asset Generation at Interactive Speed
ARXIV CS.AI
VISTA: An End-to-End Benchmark for Visual Spec-to-Web-App Coding Agents
ARXIV CS.AI
Augment Engineering: A Methodology for Multi-Tool AI Orchestration Across Professional Domains
ARXIV CS.AI
MemMorph: Tool Hijacking in LLM Agents via Memory Poisoning
ARXIV CS.AI
When Does Adaptive Guidance Help? Belief-Aware Privileged Distillation for Autonomous Driving Under Partial Observability
ARXIV CS.AI
Turning Bias into Bugs: Bandit-Guided Style Manipulation Attacks on LLM Judges
ARXIV CS.AI