60 stories tagged with #local-llm, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
TensorSharp: Open-Source Local LLM Inference Engine
A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama…
Microsoft forms partnership with Unsloth AI about local LLM execution
Local models are coming to your laptop soon! 🚀 We're excited to partner with @Microsoft to enable millions of developers run local models on Windows!…
Cross-Lingual Token Arbitrage: Optimizing Code Agent Context Windows via Local LLM Preprocessing
AI-assisted coding agents are bottlenecked by input-token cost. Two pathologies of raw human input drive much of this overhead: tokenization inefficiency for non-English text and s…
Cost Analysis of my $6.4k Local LLM Server
We built an app that runs AI completely offline on your phone (Local LLMs). Perfect for flights, camping, or dead zones.
BMO (Adventure Time) local LLM on smartphone?
Which Coding Agent Features Are Useful For Local LLMs
Run your own local LLM with rate limits via API-keys
Small Ruby prototype for an OpenAI-compatible LLM proxy with a refillable token bucket - skorotkiewicz/llm-rt…
I replaced NotebookLM with this free tool that uses my local LLMs
It lets me use my own local LLMs instead of being locked into Google's models…
Is a 128 GB MacBook Pro M5 Max actually too slow for large-context local LLM coding workflows?
I replaced ChatGPT, Claude, and Gemini on my phone with a local LLM, and it's a mobile upgrade I didn't expect
Local AI is on my phone now…
Local LLM Hosting in Switzerland: Real Costs, Latency & Compliance
When a Lausanne fintech burned through CHF 4,200 in a single day after a GDPR‑triggered API outage,...…
What GUI are you using for local LLMs on Mac?
I finally stopped forcing local LLMs and switched back to cloud AI
Cloud AI isn't perfect, but it actually works.…
Running Local LLM - 0$ Personal Agentic AI Assistant - Part 3
Introduction Part 3 of the Zero Dollar personal AI Assistant series, running Local LLMs on...…
Linux + local LLMs = pure crack
Insults & Cutlasses, Local LLM Sword Fighting on Melee Island
This is a submission for the Gemma 4 Challenge: Build with Gemma 4 What I Built Insults...…
RTX 6000 Ada vs RTX PRO Blackwell for local LLM inference?
I built GemmaPod - A truly composable and portable AI agent solution powered by your local LLM
What I Built GemmaPod is a composable, portable AI agent platform that packages local...…
Build for Local LLM (Revised)
Is NVIDIA still the default best choice for local LLMs in 2026?
Show HN: A picker that maps local LLMs to hardware, hardware to LLMs
I ditched LM Studio for llama.cpp, and my local LLM doesn't feel like a downgrade anymore
My new main runner…
Local LLMs perform so much better when you teach them to ask before they answer
One small change leads to more helpful answers.…
Getting Started: Run Your First Local LLM in 5 Minutes
No experience needed. Install Ollama, pull your first model, and start chatting — all on your own computer with zero API costs. A step-by-step guide for absolute beginners.…
Hardware Guide: What Do You Actually Need to Run Local LLMs?
No matter what computer you have, there's a model that will run on it. GPU comparison table, budget builds from $0-$2500, CPU-only guide, Mac/AMD/Intel support, and RAM/VRAM calcul…
I added these MCP servers to my local LLM stack, and one of them replaces a $249 paid tool
These MCP servers make my local LLM even better.…
LLMKube – A Kubernetes operator for local LLMs across Nvidia and Mac fleets
Run production LLMs for pennies. Self-hosted inference on consumer GPUs with Kubernetes-native orchestration. 20x cheaper than cloud.…
I use Claude and local LLMs together now, and it costs half as much while being twice as fast
What's the average cost between $20 and $0?…
I tested 3 tiny local LLMs for everyday work, and only one of them impressed me
Small but not useless…
[Showoff Saturday] I built a Local LLM VRAM Calculator to instantly check if your GPU can run Llama 4, Qwen3, and DeepSeek-V4 locally
Show HN: BonzAI – self-sovereign, local LLM inference in the browser
Generate unlimited AI content offline. Train custom models and earn crypto by serving them on our decentralized P2P network powered by Chainlink.…
I tried a new 8B local LLM, and its design might be the biggest shift since DeepSeek R1
Zaya1-8B is a huge shift in LLMs, and the results are impressive.…
[Fully Buffered] I Ran a Full Local LLM on a Pentium 4 (NetBurstGPT)
GraphRAG on Consumer Hardware: Benchmarking Local LLMs for Healthcare EHR Schema Retrieval
Graph-based Retrieval Augmented Generation (GraphRAG) extends retrieval-augmented generation to support structured reasoning over complex corpora, but its reliability under resourc…
I Built a Private AI Assistant That Queries My Git History and Project Management Data — Using Only Local LLMs
No API keys. No cloud. All data stays on my machine. The Problem As a web developer, I...…
Local LLM Code Completion Showdown: Zed AI vs Continue vs Cursor (Honest 2026 Review)
Local LLM Code Completion in 2026: Zed AI vs Continue vs Cursor If you have been...…
How to Build a Local LLM Agent to Automate Work List Generation from Monthly Reports (With Jira Integration)
Our management team spent hours manually extracting work items (“bug fix”, “released version 1”,...…
Built a small desktop app for running local LLMs with Ollama.
I made an App that uses local LLMs to monitor your screen
Self-Hosted Pomodoro Timer for Local LLM Reliability
Self-Hosted Pomodoro Timer: Mastering Focus with Local AI Tools We don’t do cloud...…
Show HN: Local LLM code-generation with Gemma 4 e2B via JSON AST to Clojure
JSON AST > Clojure. Contribute to quadracollision/llmisp development by creating an account on GitHub.…
Show HN: Llama-dash – local LLM operators dashboard and proxy
The operations layer for your local LLM stack. Contribute to ndom91/llama-dash development by creating an account on GitHub.…
Local LLM for building AI Security platform
The Infrastructure Behind Making Local LLM Agents Useful
Lessons from building a fast, reliable single-cell analysis agent on open-weight models…
Capturing ideas with voice, local LLMs, and Obsidian
a more coherent problem solving exercise…
What happens to local LLM if/when LLMs are no longer released for free?
I built my own Googlebook with a Raspberry Pi, local LLMs, and old hardware
It turned out much better than I expected…
My local LLM can call Claude when it's stuck, and it changed everything about my local-first setup
Local LLMs aren't very good on their own…
Is the Beelink SER9 Pro HX370 worth buying at $950 in 2026 for local LLMs and AI agents?
I use this local AI tool to turn boring documents into cool narrations
Since it runs locally, I don't have to spend a dime on expensive cloud platforms…
How I Built a Completely Free Local AI Stack — Inspired by a 60-Second YouTube Short
How I Built a Completely Free Local AI Stack — Inspired by a 60-Second YouTube Short By...…
OpenClaw: 13 Errors, $1.50/Month, and an AI Team That Doesn’t Need the Cloud
I run a team of AI agents on a Mac I bought in 2022. They handle my Slack, run research, draft...…
ClickBook – Offline Android eReader with local LLM inference via llama.rn
Tap any word to instantly understand it. Offline AI ereader for EPUB and PDFs.…
Langfuse v4 + Ollama: Tracing Local LLMs Without Mocks or Monkey-Patches
Disclosure: I learn topics like this through LLM dialogue. The prompts are mine, the depth comes from...…
Show HN: AI/ML benchmark for local LLM inference and XGBoost training on GPU/CPU
A suite to benchmark CPU/GPU Python performance in training ML models and running local LLMs - albedan/ai-ml-gpu-bench…
Building an Ambient Developer Daemon with Nous Hermes
A hands-on experiment in what changes when your dev assistant lives on your machine, runs...…
You don't need an expensive GPU to run a local LLM that actually works
Sometimes smaller is better.…
How to Replace Google Gemini with a local LLM as Your Phone's Default Assistant
Your Android phone ships with Google Gemini as the default assistant. Long-press the power button,...…