AI Research news.
Page 4 of AI Research headlines on WeSearch, deduplicated and updated continuously from 10+ editorial sources.
ARXIV CS.AI
TSFMAudit: Data Contamination Auditing in Forecasting Time Series Foundation Models
ARXIV CS.AI
On the Push-Based Asynchronous Federated Learning: A Bias-Correction Aggregation Approach
ARXIV CS.AI
Tool-Schema Compression Enables Agentic RAG Under Constrained Context Budgets
ARXIV CS.AI
Enhancing Autonomous Online Intrusion Detection for IoT with Balanced Learning, Reliable Pseudo-Labels, and Lightweight Architectures
ARXIV CS.AI
Planning Neural Dynamics with Lie Group Embedding through Supervised Projective Manifold Learning
ARXIV CS.AI
A Universal Cliff and a Design Fingerprint: Cross-Section Defect Detection Under LLM Orchestration
ARXIV CS.AI
InfoQuant: Shaping Activation Distributions for Low-Bit LLM Quantization
ARXIV CS.AI
PitchBench: Measuring Pitch Hearing in Audio-Language Models
ARXIV CS.AI
RepoMirage: Probing Repository Context Reasoning in Code Agents with Perturbations
ARXIV CS.AI
AutoDFT: A Closed-Loop Multi-Agent Framework for Autonomous DFT Calculations
ARXIV CS.AI
GAC: Noise-Aware Adaptive Mixing for Hybrid SFT-RL Post-Training
ARXIV CS.AI
SetupX: Can LLM Agents Learn from Past Failures in Functionality-Correct Code Repository Setup?
ARXIV CS.AI
In Search of the Ingredients of Open-Endedness: Replicating Picbreeder with Large Vision-Language Models
ARXIV CS.AI
Confidence Calibration in Large Language Models
ARXIV CS.AI
How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning
ARXIV CS.AI
Context: Proactive Goal-Directed Intelligence via Composable Sandboxed Programs, Declarative Wiring, and Structured Interaction
ARXIV CS.AI
Toward Reliable Design of LLM-Enabled Agentic Workflows: Optimizing Latency-Reliability-Cost Tradeoffs
ARXIV CS.AI
Quantum Frog: Emergent Cooperation and Difficulty Scaling in a Quantized-Time Cooperative Game
ARXIV CS.AI
BODHI: Precise OS Kernel Specification Inference
ARXIV CS.AI
When Correct Beliefs Collapse: Epistemic Resilience of LLMs under Clinical Pressure
ARXIV CS.AI
Practical Quantum CIM Empowerment via All-Domestic-Core Agentic Large Model
ARXIV CS.AI
Operationalizing Reconstructive Authority: Runtime Construction, Dependency Resolution, and Execution Gating in Autonomous Agent Systems
ARXIV CS.AI
Fuzzy, Neutrosophic, and Uncertain Graph Theory: Properties and Applications
ARXIV CS.AI
BoxLitE: A Faithful Knowledge Base Embedding Based on Convex Optimization
ARXIV CS.AI
Authority Inversion in LLM-Mediated Ubiquitous Systems: When Models Trust Users Over Sensors
ARXIV CS.AI
DRIVE: Modeling Skills at the Reasoning and Interaction Levels for Web Agents under Continual Learning
ARXIV CS.AI
Residual Drift Dominates Contradiction in Multi-Turn Constraint Reasoning
ARXIV CS.AI
MEMOR-E: In-Context and Fine-Tuned LLM Personalization for Alzheimer's Assistive Robotics
ARXIV CS.AI
A Dynamical Framework for Cognitive Processes Based on Transformations and Semantic Equivalence
ARXIV CS.AI
Spacetime Formation under Requirements: Contextual Realization and Form-Dependent Probability
ARXIV CS.AI
Right-Sizing Communication and Recommendation Set Size in AI-Assisted Search
ARXIV CS.AI
Accelerating Long-Tail Generation in Synchronous RLHF Training via Adaptive Tensor Parallelism
ARXIV CS.AI
Stop Comparing LLM Agents Without Disclosing the Harness
ARXIV CS.AI
Methods for Formal Verification of Agent Skills: Three Layers Toward a Mechanically Checkable Capability-Containment Proof
ARXIV CS.AI
Machine Psychometrics: A Mathematical Psychology of Artificial Intelligence
ARXIV CS.AI
From Accuracy to Auditability: A Survey of Determinism in Financial AI Systems
ARXIV CS.AI
QUIVER: A Formal Framework for Quantifying Perturbation Propagation and Bifurcation in Compound AI Systems
ARXIV CS.AI
Low-Cost Labels, Reliable Choices: Rollout-Calibrated Hyper-Heuristics for Job Shop Scheduling
ARXIV CS.AI
LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs
ARXIV CS.AI
Why We Need World Models for AGI: Where LLMs Fail and How World Models May Outperform
ARXIV CS.AI
Saturating Scaling Laws for Equational Discovery: A Phenomenology of Growth Dynamics in Three Toy Substrates with Two Real-World Replications
ARXIV CS.AI
Beyond Predefined Learning Objects: A Thinking-Learning Interaction Model for Up-to-Date Autonomous Robot Learning
ARXIV CS.AI
Towards trustworthy agentic AI: a comprehensive survey of safety, robustness, privacy, and system security
ARXIV CS.AI
Reason–Imagine–Act: Closed-Loop LLM Decision Making with World Models for Autonomous Driving
ARXIV CS.AI
LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition
ARXIV CS.AI
EvoSci: A Bio-Inspired Multi-Agent Framework for the Evolution of Scientific Discovery
ARXIV CS.AI
Breaking the Chains of Probability: Neutrosophic Logic as a New Framework for Epistemic Uncertainty in Large Language Models
ARXIV CS.AI
EvoCode-Bench: Evaluating Coding Agents in Multi-Turn Iterative Interactions
ARXIV CS.AI
SkillEvolBench: Benchmarking the Evolution from Episodic Experience to Procedural Skills
ARXIV CS.AI