loading every MCP server on every prompt was quietly destroying my token budget

May 2, 2026 · 3:07 AM UTC · 0 reactions · 0 comments · 1 view

via

ClaudeAI

had like 5 or 6 MCP servers configured and did not realize all of them were loading every single time i sent a prompt. even for the dumbest simplest questions. found a routing layer that only loads the relevant ones per prompt and token usage dropped a lot. prompts feel faster too. honestly cannot believe i let it go on that long without checking

Original article

ClaudeAI

Read full at ClaudeAI →

Anonymous · no account needed

Discussion

0 comments

loading every MCP server on every prompt was quietly destroying my token budget

Discussion

More from ClaudeAI