WeSearch
TAG · #SPECULATIVE-DECODING

Speculative Decoding coverage.

The 5 stories in the WeSearch catalog tagged #speculative-decoding, listed in publish-time order with view counts. This page updates as new stories are ingested. Subscribe to the per-tag RSS feed to follow the topic in your reader of choice.


RELATED TAGS
#deepseek-v4 (1) · #sglang (1) · #miles (1) · #hybrid-attention (1)
LOCALLLAMA

Simple to use vLLM Docker Container for Qwen3.6 27b with Lorbus AutoRound INT4 quant and MTP speculative decoding - 118 tokens/second on 2x 3090s

2 views
REDDIT

Speculative Decoding Implementations: EAGLE-3, Medusa-1, PARD, Draft Models, N-gram and Suffix Decoding from scratch

I’ve been working on an educational implementation repo for speculative decoding: The goal is not to wrap existing libraries, but to implement several speculative decoding methods …

5 views
REDDIT

Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks

So for my project I was using up until now either Gemini 3 / 2.5 Flash or Flash-lite. All my use cases are not agentic, simply LLM workflows for atomic tasks like extracting refere…

8 views
REDDIT

Speculative Decoding Implementations: EAGLE-3, Medusa-1, PARD, Draft Models, N-gram and Suffix Decoding from scratch [P]

I’ve been working on an educational implementation repo for speculative decoding: The goal is not to wrap existing libraries, but to implement several speculative decoding methods …

6 views
LMSYS

DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles

We are thrilled to announce Day-0 support for DeepSeek-V4 across both inference and RL training. SGLang and Miles form the first open-source stack to serve and train DeepSeek-V4 on…

4 views
#deepseek-v4 #sglang #miles