35 stories tagged with #ollama, in publish-time order across the WeSearch catalog. Tag pages update as new stories are ingested.
TensorSharp: Open-Source Local LLM Inference Engine
A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama…
Local-first: a Model on Your Own Machine, Zero Cloud
This is the concrete, runnable walkthrough for Post 1 of the Portway series. The goal: stand up a...…
I Told a Robot to "Clean the Kitchen", and It Actually Did
I typed "Clean the kitchen" into a chat box. A robot turned toward the right room, drove over, swept...…
AI Workstation Build Check: £1100 Budget Tesla V100 32GB + Xeon 8268 + 64GB RAM in a Dell Precision T7820 (Ollama)
LLM-Manager: Orchestrating Ollama and Llama.cpp with Pure Bash
LLM-Manager is a lightweight, modular Bash suite with a dual JSON/Interactive interface designed to...…
Built a local AI voice control for my smart home — Ollama + faster-whisper + n8n, 2.4× faster than cloud
Tlamatini – Local-first AI dev assistant with 68 agents and hybrid RAG
Agentic Development AI Tempered. Contribute to XAIHT/Tlamatini development by creating an account on GitHub.…
Show HN: Local AI server with persistent memory, RAG and plugins
Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services. - jgoy-lab…
I quit ChatGPT for a free, private, and local AI called Ollama - here's why
Save your money, your privacy, and the planet. This installable AI offers several benefits you won't find with more traditional models like ChatGPT.…
Prompter – Compare and benchmark Ollama models side-by-side in your terminal
Terminal-based multi-model comparison, benchmarking, and evaluation tool for Ollama. Zero dependencies, one file. - whonixnetworks/prompter…
Ollama v0.30.0-rc23: "directly support llama.cpp" & "compatibility with GGUF"
This version of Ollama will change the architecture to directly support llama.cpp instead of building on top of GGML, and allows for compatibility with GGUF file format. MLX is use…
Building a Local-Only RAG System with Ollama and TypeScript
Most RAG tutorials send your...…
Running Gemma 4 on a Modest Machine: Unsloth vs LM Studio vs llama.cpp vs Ollama
This is a submission for the Gemma 4 Challenge: Write About Gemma 4 When local AI conversations...…
I built a local GUI for the TradingAgents framework — works with Ollama
Easy Agentic Tool Calling with Gemma 4
In this tutorial, we will give Gemma 4 two new tools and watch the model decide, on its own, when to look around and when to compute.…
Anyone built a local email summariser with Ollama? Trying to avoid sending work emails to OpenAI
I built a local Claude Code alternative with Ollama — here's how the agentic loop works
I Built a Local Autonomous Coding Agent with Ollama — Soul, Autonomy, and a 40-Round...…
Eve Agent V2 Unleashed – open-source local coding agent, powered by Ollama, FREE
Eve Agent V2 Unleashed — local-first autonomous AI coding agent powered by Ollama - JeffGreen311/eve-agent-v2-unleashed…
Gemma 4 wrote three summaries in one response. The middle one was a self-disclaimer.
At num_ctx=2048, Gemma 4 E2B writes a hallucinated meeting summary, notes that it's not actually in the transcript, then writes a more careful one. Every single time. Here's the 15…
Built a small desktop app for running local LLMs with Ollama.
Ollama vs llama.cpp vs vLLM: Which Should You Use in 2026?
Ollama vs llama.cpp vs vLLM compared — ease of use, speed, GPU needs. Which inference engine is right for your workflow?…
CrustAI – Self-Hosted AI for Telegram/WhatsApp/Discord via Ollama, Zero Cloud
What I shipped during I/O 2026 week: Gemma 4 on Ollama with a five-piece safety stack
Drafted in anticipation of the Google I/O 2026 Writing Challenge. Will add the devchallenge and...…
We built a tool that installs frameworks like ComfyUI, Ollama, OpenWebUI etc on any cloud GPU in one command and saves your whole setup between sessions [R]
Streaming Ollama Responses in Next.js: The SSE Pattern That Actually Works
Most Next.js +...…
5 empty responses from gemma4:e4b. 4 hypotheses. 0 root cause.
dev.to — Gemma 4 Challenge submission (Write track) Drafted: 2026-05-18 Track: Write...…
I built GHOST — an AI agent that actually fixes your slow laptop using Gemma 4 locally
This is a submission for the Gemma 4 Challenge: Build with Gemma 4 What I Built GHOST...…
Expanded to 4 nodes this week, running a local AI agent cluster with Ollama, CrewAI, and a custom command center UI
Swarm-Consensus Defense Achieves 98.2% Against Cloud-LLM Adversarial Attacks
5-defender consensus swarm + autohealer hit 100% defense rate by round 400 after only 6 breaches in...…
Looking to migrate off of Ollama and LMStudio
How I Built a Completely Free Local AI Stack — Inspired by a 60-Second YouTube Short
Running Local GGUF Models with Ollama (GPU Enabled)
1. Install & Start Ollama curl -fsSL https://ollama.com/install.sh | sh systemctl...…
Langfuse v4 + Ollama: Tracing Local LLMs Without Mocks or Monkey-Patches
Disclosure: I learn topics like this through LLM dialogue. The prompts are mine, the depth comes from...…
You don't need an expensive GPU to run a local LLM that actually works
Sometimes smaller is better.…
Show HN: PeopleMesh, Semantic Search for People
PeopleMesh is the AI-powered matching layer for modern organizations. It helps people discover the right colleagues, internal opportunities, communities, and projects through seman…