60 stories tagged with #llm, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
⌘ RSS feed for this tag → or search "Llm"
LLM Budget Guard – open-source runtime cutoff for OpenAI/Anthropic
Hillman Solutions Corp. (HLMN) Q1 2026 Earnings Call Transcript
PAVO-Bench – 50K voice turns and an 85K-param router for ASR→LLM→TTS
A 50K-turn voice pipeline benchmark and an 85K-param meta-controller that cuts P95 latency 10.3% and energy 71% vs fixed cloud. TMLR 2026. - vnmoorthy/pavo-bench…
Show HN: VoiceGoat – A vulnerable voice agent for practicing LLM attacks
A purposely vulnerable voice agent application for security practitioners to practice exploiting voice-based (and text based) AI systems. - redcaller/voice-goat…
The loss curve said tie. The judges said otherwise. Seeking replication for an early LLM training result [R]
Stride earnings up next: Can enrollment rebound from tech crisis?
Built a Character Portrait Generator that reads books, identifies characters, and generates consistent portraits using ComfyUI (full RAG pipeline, local LLM, open-source)
Hillman Solutions Q1 Earnings Call Highlights
Home Assistant's local LLM support outperforms Gemini for Home, and Google knows it
LLM Prompt Engineering in Practice: CoT, Few-Shot, and System Prompt Design
LLM Prompt Engineering in Practice: CoT, Few-Shot, and System Prompt Design Stop getting...…
Conestoga Capital Advisors Sold Hillman Solutions Corp. (HLMN) As Secular Challenges Impacted Its Growth.
The private NotebookLM alternative I'm moving all my notes to
AnythingLLM gave me what NotebookLM couldn't on my Android phone…
The Bot Left a Fingerprint: Detecting and Attributing LLM-Generated Passwords
HeLa-Mem: Hebbian Learning and Associative Memory for LLM Agents
Long-term memory is a critical challenge for Large Language Model agents, as fixed context windows cannot preserve coherence across extended interactions. Existing memory systems r…
Hillman Solutions Corp. 2026 Q1 - Results - Earnings Call Presentation
2026-04-28. The following slide deck was published by Hillman Solutions Corp.…
LLM from pre-1930 derives quantum mechanics and relativity
LLMs Corrupt Your Documents When You Delegate
Large Language Models (LLMs) are poised to disrupt knowledge work, with the emergence of delegated work as a new interaction paradigm (e.g., vibe coding). Delegation requires trust…
Sage-Wiki: An LLM-compiled personal knowledge base
An LLM-compiled personal knowledge base. Drop in your papers, articles, and notes. sage-wiki compiles them into a structured, interlinked wiki — with concepts extracted, cross-ref…
Yann LeCun: LLMs Are Nearing the End, but Better AI Is Coming (2025)
Yann LeCun, Chief AI Scientist at Meta, believes LLMs are doomed due to their inability to represent the high-dimensional spaces that characterize our world…
Show HN: Knowerage – code coverage for LLM analysis
Local MCP server that tracks AI analysis coverage against your codebase - MTimma/knowerage…
A Primer on LLM Post-Training
We Fixed Karpathy’s LLM Wiki - PENgram Is the Typed Knowledge Graph Pipeline Everyone Asked For
We recently published an article about the gaps in Karpathy's LLM Wiki pattern. The thesis was...…
NARE: An LLM agent that amortizes reasoning into memory and executable rules
Contribute to starface77/Neuro-Adaptive-Reasoning-Engine development by creating an account on GitHub.…
Centene’s Obamacare Enrollment Drops By 2 Million After Congress Strips Subsidies
John Wiley & Sons: Strong Fundamentals But Colleges Are In A Bad Way
John Wiley & Sons is downgraded to Hold amid falling college enrollment and demographic shifts. Click here to read my latest analysis of WLY stock.…
News Finance AI – Cutting through the noise with LLM sentiment analysis
Why the same LLM gives different answers in different environments
What I found diagnosing a failure mode in my own system, and the moment retrieval turned out to be already shaped before it started…
🦊GoClaw Deep Dive 🤖 — A Builder's Guide to a Multi-Tenant AI Agent Platform 📘
Source: https://github.com/nextlevelbuilder/goclaw — a Go-based, multi-tenant AI agent gateway with...…
How to build advanced features for AI chatbots on SSE
Agents used to be a thing you talked to synchronously. Now they’re a thing that runs in the background while you work. When you make that change, the ……
Porting a Scratch-Built 500M LLM Training Pipeline to ROCm on Strix Halo
A lightweight transformer language model built from scratch in PyTorch, trained on a single consumer GPU with a full pipeline for data processing, pretraining, and instruction tuni…
OpenAI and the New Cognitive Architecture of Software Repositories
TL;DR OpenAI's latest harness engineering report suggests something deeper than "agents...…
Show HN: Waiting for LLMs Suck – Give your user a game
Give your user a game while they wait for the LLM to return a result.…
Don't Make the LLM Read the Graph: Make the Graph Think
We investigate whether explicit belief graphs improve LLM performance in cooperative multi-agent reasoning. Through 3,000+ controlled trials across four LLM families in the coopera…
Analytica: Soft Propositional Reasoning for Robust and Scalable LLM-Driven Analysis
Large language model (LLM) agents are increasingly tasked with complex real-world analysis (e.g., in financial forecasting, scientific discovery), yet their reasoning suffers from …
Towards Automated Ontology Generation from Unstructured Text: A Multi-Agent LLM Approach
Automatically generating formal ontologies from unstructured natural language remains a central challenge in knowledge engineering. While large language models (LLMs) show promise,…
PhySE: A Psychological Framework for Real-Time AR-LLM Social Engineering Attacks
The emerging threat of AR-LLM-based Social Engineering (AR-LLM-SE) attacks (e.g. SEAR) poses a significant risk to real-world social interactions. In such an attack, a malicious ac…
Judging the Judges: A Systematic Evaluation of Bias Mitigation Strategies in LLM-as-a-Judge Pipelines
LLM-as-a-Judge has become the dominant paradigm for evaluating language model outputs, yet LLM judges exhibit systematic biases that compromise evaluation reliability. We present a…
From Coarse to Fine: Self-Adaptive Hierarchical Planning for LLM Agents
Large language model-based agents have recently emerged as powerful approaches for solving dynamic and multi-step tasks. Most existing agents employ planning mechanisms to guide lo…
CAP-CoT: Cycle Adversarial Prompt for Improving Chain of Thoughts in LLM Reasoning
Chain-of-Thought (CoT) prompting has emerged as a simple and effective way to elicit step-by-step solutions from large language models (LLMs). However, CoT reasoning can be unstabl…
LEGO: An LLM Skill-Based Front-End Design Generation Platform
Existing LLM-based EDA agents are often isolated task-specific systems. This leads to repeated engineering effort and limited reuse of successful design and debugging strategies. W…
GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs
Autonomous multi-agent LLM systems are increasingly deployed to investigate operational incidents and produce structured diagnostic reports. Their trustworthiness hinges on whether…
When Corrective Hints Hurt: Prompt Design in Reasoner-Guided Repair of LLM Overcaution on Entailed Negations under OWL~2~DL
We report a reproducible error pattern in GPT-5.4 on OWL~2~DL compliance queries: the model frequently answers ``unknown'' when the reasoner-entailed answer is ``no'' under \emph{F…
Expert Evaluation of LLM's Open-Ended Legal Reasoning on the Japanese Bar Exam Writing Task
Large language models (LLMs) have shown strong performance on legal benchmarks, including multiple-choice components of bar exams. However, their capacity for generating open-ended…
ClawTrace: Cost-Aware Tracing for LLM Agent Skill Distillation
Skill-distillation pipelines learn reusable rules from LLM agent trajectories, but they lack a key signal: how much each step costs. Without per-step cost, a pipeline cannot distin…
LLM-Augmented Traffic Signal Control with LSTM-Based Traffic State Prediction and Safety-Constrained Decision Support
Traffic signal control is a critical task in intelligent transportation systems, yet conventional fixed-time and rule-based methods often struggle to adapt to dynamic traffic deman…
Context-Aware Hospitalization Forecasting Evaluations for Decision Support using LLMs
Medical and public health experts must make real-time resource decisions, such as expanding hospital bed capacity, based on projected hospitalization trends during large-scale heal…
LLM-Guided Agentic Floor Plan Parsing for Accessible Indoor Navigation of Blind and Low-Vision People
Indoor navigation remains a critical accessibility challenge for the blind and low-vision (BLV) individuals, as existing solutions rely on costly per-building infrastructure. We pr…
Multi-Dimensional Evaluation of Sustainable City Trips with LLM-as-a-Judge and Human-in-the-Loop
Evaluating nuanced conversational travel recommendations is challenging when human annotations are costly and standard metrics ignore stakeholder-centric goals. We study LLMs-as-Ju…
STELLAR-E: a Synthetic, Tailored, End-to-end LLM Application Rigorous Evaluator
The increasing reliance on Large Language Models (LLMs) across diverse sectors highlights the need for robust domain-specific and language-specific evaluation datasets; however, th…
The Price of Agreement: Measuring LLM Sycophancy in Agentic Financial Applications
Given the increased use of LLMs in financial systems today, it becomes important to evaluate the safety and robustness of such systems. One failure mode that LLMs frequently displa…
Self-hosting an LLM agent for incident response — does anyone here actually do this? What's working / not working?
I built an open-source tool to distill books into knowledge graphs
I have a bad habit: I buy books faster than I read them. Not because I'm lazy — I start most of...…
PrePrompt – MCP server that rewrites vague prompts before they reach the LLM
MCP server that intercepts and optimizes prompts in Claude Code and Cursor before they reach the LLM. Zero noise, sub-ms latency, runs locally.…
Lightport – AI gateway that makes LLM providers OpenAI-compatible
How do you handle RP-style prompts (actions + dialogue) in LLM systems?
TradingAgents v0.2.4: A Multi-Agent LLM Framework That Simulates an Entire Trading Firm
TL;DR UCLA Tauric Research released TradingAgents v0.2.4 (2026-04-25) — a LangGraph-based...…
the karpathy "llm wiki" idea, but as shared context for your team
Google DeepMind Paper Argues LLMs Will Never Be Conscious | Philosophers said the paper’s argument is sound, but that “all these arguments have been presented years and years ago.”
Skymizer Taiwan Inc. Unveils Breakthrough Architecture Enabling Ultra-Large LLM Inference on a Single Card
Source Article excerpt: With a single PCIe card — powered by six HTX301 chips and 384 GB of memory — enterprises can now run 700B-parameter model inference locally at just ~240W pe…
Running Local LLMs Offline on a Ten-Hour Flight
I flew from London to Google Cloud Next 2026 in Las Vegas. Ten hours with no in-flight wifi. I used the time to test how far a modern MacBook can carry engineering work on local LL…