WeSearch

r/LocalLLaMA on WeSearch

Recent social headlines from r/LocalLLaMA.

R/LOCALLLAMA

I’ve done it!!! FINALLY I have become a (quasi-local) summoner!!! AMA [imtiredboss.jpg]

5/22/2026 · 5 views
R/LOCALLLAMA

Low-level coding dataset

5/22/2026 · 10 views
R/LOCALLLAMA

Anyone evaluated the difference between Qwen Code for the local qwen models vs another harness? CC, OC, LC, Aider etc..

5/22/2026 · 19 views
R/LOCALLLAMA

What model weights (quantized included) under 150GB have the best general knowledge depth?

5/22/2026 · 9 views
R/LOCALLLAMA

When your LLM treats data center GPUs like an optional DLC

5/22/2026 · 14 views
R/LOCALLLAMA

Latest b9274 Addresses MTP VRAM leak

5/21/2026 · 16 views
R/LOCALLLAMA

Waiting for Qwen 3.7 open weight... The new King has arrived...

5/21/2026 · 13 views
R/LOCALLLAMA

Gorgon Halo is 6.7% faster than predecessor Strix Halo

5/21/2026 · 16 views
R/LOCALLLAMA

Strix Halo 128GB vs M5 pro 64GB

5/21/2026 · 17 views
R/LOCALLLAMA

Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.

5/21/2026 · 12 views
R/LOCALLLAMA

110 tok/s with 12GB VRAM on Qwen3.6 35B A3B and ik_llama.cpp

5/21/2026 · 12 views
R/LOCALLLAMA

Open-source LLMs are still weak against long reasoning jailbreaks, even with lightweight defenses

5/21/2026 · 17 views
R/LOCALLLAMA

Model Golf for some Runpod Credits!

5/21/2026 · 14 views
R/LOCALLLAMA

Back again, many changes have taken place.

5/21/2026 · 9 views
R/LOCALLLAMA

How can you stop your model from looping

5/21/2026 · 14 views
R/LOCALLLAMA

"AWS secures rare Mac Studios while ordinary Apple customers remain completely locked out"

5/20/2026 · 11 views
R/LOCALLLAMA

Guide to building smoltorrent | A Distributed ML Checkpoint Storage System

5/20/2026 · 10 views
R/LOCALLLAMA

What small speech to text (STT) model is best at recognizing whispered speech?

5/20/2026 · 11 views
R/LOCALLLAMA

Gemma 4 MTP with LlamaCPP

5/20/2026 · 13 views
R/LOCALLLAMA

Impulse Purchase.

5/20/2026 · 9 views
R/LOCALLLAMA

Qwen3.7 Max scored by Artificial Analysis, 27B/35B waiting room

5/20/2026 · 8 views
R/LOCALLLAMA

Guardrails take an 8B model from 53% to 99% on agentic tasks [ACM CAIS '26 preprint]

5/20/2026 · 9 views
R/LOCALLLAMA

Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks

5/20/2026 · 8 views
R/LOCALLLAMA

LM Studio finally added support for MTP Speculative Decoding

5/20/2026 · 12 views
R/LOCALLLAMA

Claude Code plugins a risk to local ecosystem?

5/19/2026 · 19 views
R/LOCALLLAMA

anyone else spending more time managing ai markdown files than actually coding?

5/19/2026 · 16 views
R/LOCALLLAMA

Carbon: Decoding the Language of Life

5/19/2026 · 9 views
R/LOCALLLAMA

Llama-server and MTP

5/19/2026 · 11 views
R/LOCALLLAMA

Qwen is cooking hard

5/19/2026 · 12 views
R/LOCALLLAMA

We have sub-agents at home

5/19/2026 · 16 views
R/LOCALLLAMA

Why might MTP be net negative for tool heavy agentic flows?

5/19/2026 · 15 views
R/LOCALLLAMA

Is there any <3B model with usable 200k+ context window?

5/19/2026 · 11 views
R/LOCALLLAMA

How many GPUs do you have on your local system/server/AI PC?

5/19/2026 · 10 views
R/LOCALLLAMA

favorite Agentic Coding Harness

5/18/2026 · 13 views
R/LOCALLLAMA

Still happy for yall

5/18/2026 · 6 views
R/LOCALLLAMA

Is the llama.cpp nixos flake just broken?

5/18/2026 · 15 views
R/LOCALLLAMA

MTP (Multi-Token Prediction): 2x Faster Token Generation on AMD Strix Halo & Radeon 9700 AI Pro

5/18/2026 · 14 views
R/LOCALLLAMA

Qwen cant wait to release 3.7 models

5/18/2026 · 11 views
R/LOCALLLAMA

Qwen 35b a3b surprises me

5/18/2026 · 10 views
R/LOCALLLAMA

Hopes and dreams for Google IO tomorrow? 👀

5/18/2026 · 16 views
R/LOCALLLAMA

What happens to local LLM if/when LLMs are no longer released for free?

5/18/2026 · 9 views
R/LOCALLLAMA

I tested 42 LLMs on their willingness to build the apocalypse. The "safest" closed-source models are lying to you.

5/18/2026 · 15 views
R/LOCALLLAMA

Quantizing MTP KV Cache = free lunch?

5/18/2026 · 15 views
R/LOCALLLAMA

GGUF with MTP vs MLX without. Is mlx still the way to go for mac users?

5/18/2026 · 13 views
R/LOCALLLAMA

New models when? Forecasting release date.

5/18/2026 · 14 views
R/LOCALLLAMA

The Lurk Report - The last 30 days of r/LocalLLaMA

5/18/2026 · 8 views
R/LOCALLLAMA

Is anyone prioritizing code quality checks via a small local model?

5/18/2026 · 17 views
R/LOCALLLAMA

Big new memory tool with local benchmarks

5/18/2026 · 19 views
R/LOCALLLAMA

I built a coding agent that gets 87% on benchmarks with a 4B parameter model, here's how

5/18/2026 · 20 views
R/LOCALLLAMA

May 2026 updated chart of strix halo mini pc size chart

5/18/2026 · 16 views
