25 results for "stem research"
From pet to pest: Research warns invasive goldfish are reshaping freshwater ecosystems
SASTRA signs MoU with Tata Advanced Systems Ltd.
SASTRA and Tata Advanced Systems sign MoU to enhance collaboration in research, training, and capacity building.…
Agentic CEO – An AI research organism that hunts, critiques, and evolves itself
Autonomous multi-agent research system. 3,700+ knowledge entries, 173 hunts, 68 domains, 35 days of autonomous operation, ~$25 total. - brcrusoe72/agentic-ceo…
Judging the Judges: A Systematic Evaluation of Bias Mitigation Strategies in LLM-as-a-Judge Pipelines
LLM-as-a-Judge has become the dominant paradigm for evaluating language model outputs, yet LLM judges exhibit systematic biases that compromise evaluation reliability. We present a comprehensive empir…
AI Identity: Standards, Gaps, and Research Directions for AI Agents
AI agents are now running real transactions, workflows, and sub-agent chains across organizational boundaries without continuous human supervision. This creates a problem no current infrastructure is …
SoccerRef-Agents: Multi-Agent System for Automated Soccer Refereeing
Refereeing is vital in sports, where fair, accurate, and explainable decisions are fundamental. While intelligent assistant technologies are being widely adopted in soccer refereeing, current AI-assis…
ZenBrain: A Neuroscience-Inspired 7-Layer Memory Architecture for Autonomous AI Systems
Despite a century of empirical memory research, existing AI agent memory systems rely on system-engineering metaphors (virtual-memory paging, flat LLM storage, Zettelkasten notes), none integrating pr…
QED: An Open-Source Multi-Agent System for Generating Mathematical Proofs on Open Problems
We explore a central question in AI for mathematics: can AI systems produce original, nontrivial proofs for open research problems? Despite strong benchmark performance, producing genuinely novel proo…
I built Claude Code skills for writing agent prompts, grounded in prompt research
I've been building agentic systems for a while and wanted a more systematic approach to writing prompts. So I gathered papers, did some deep research and created guides on structure, format and prompt…
Multi-Agent AI Systems Are Eating Single Agents
Single-agent architectures hit a wall the moment your task needs planning, research, and execution in parallel. Multi-agent systems solve this — but most tutorials skip the hard parts. This guide does…
Can Current Agents Close the Discovery-to-Application Gap? A Case Study in Minecraft
Discovering causal regularities and applying them to build functional systems--the discovery-to-application loop--is a hallmark of general intelligence, yet evaluating this capacity has been hindered …
How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks
The wide adoption of AI agents in complex human workflows is driving rapid growth in LLM token consumption. When agents are deployed on tasks that require a significant amount of tokens, three questio…
PivotMerge: Bridging Heterogeneous Multimodal Pre-training via Post-Alignment Model Merging
Multimodal Large Language Models (MLLMs) rely on multimodal pre-training over diverse data sources, where different datasets often induce complementary cross-modal alignment capabilities. Model mergin…
IntrAgent: An LLM Agent for Content-Grounded Information Retrieval through Literature Review
Scientific research relies on accurate information retrieval from literature to support analytical decisions. In this work, we introduce a new task, INformation reTRieval through literAture reVIEW (In…
Claude for Creative Work
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.…
Study finds infrasound the likely horror in hauntings
Noise below the range of human hearing from old pipes, machinery and ventilation systems can induce stressful sensations, according to a study published by Canadian researchers in Frontiers in Behavio…
HeLa-Mem: Hebbian Learning and Associative Memory for LLM Agents
Long-term memory is a critical challenge for Large Language Model agents, as fixed context windows cannot preserve coherence across extended interactions. Existing memory systems represent conversatio…
AI prefers resumes written by itself: Self-preferencing in Algorithmic Hiring
As artificial intelligence (AI) tools become widely adopted, large language models (LLMs) are increasingly involved on both sides of decision-making processes, ranging from hiring to content moderatio…
Should I persue EdgeAI? [D]
Should I persue EdgeAI For context I'll be joining my engineering college year, I wanna study computer engineering and am really interested into embedded systems, I researched and found out about Edge…
Scammers use Gmail dot alias trick to spoof Robinhood in phishing scam
Robinhood users have been reporting a new phishing email sent directly from Robinhood's email server, with security researchers pointing to Gmail’s dot alias feature and flaws in Robinhood’s account c…
Analytica: Soft Propositional Reasoning for Robust and Scalable LLM-Driven Analysis
Large language model (LLM) agents are increasingly tasked with complex real-world analysis (e.g., in financial forecasting, scientific discovery), yet their reasoning suffers from stochastic instabili…
FinGround: Detecting and Grounding Financial Hallucinations via Atomic Claim Verification
Financial AI systems must produce answers grounded in specific regulatory filings, yet current LLMs fabricate metrics, invent citations, and miscalculate derived quantities. These errors carry direct …
Watching TV with the Second-Party
Smart TVs implement a unique tracking approach called Automatic Content Recognition (ACR) to profile viewing activity of their users. ACR is a Shazam-like technology that works by periodically capturi…
A 14-day “Growth Forge” sprint: build an AI-powered growth agent on a real stack
Sharing something that sits at the intersection of AI agents and growth systems. VideoDB (backend for video/audio for AI agents) is running a 14-day sprint called Growth Forge for 5 builders to design…
ChatGPT-psychosis: How it can occur and how to avoid it.
Hey everyone, If there are AI developers, prompt engineers, or system architects here, this is especially for you. You should really take this into account. We have all seen the reports about the nega…