WeSearch

loading every MCP server on every prompt was quietly destroying my token budget


had 5 or 6 MCP servers configured and did not realize all of them were loading every single time i sent a prompt, even for the dumbest, simplest questions. found a routing layer that only loads the relevant ones per prompt, and token usage dropped a lot. prompts feel faster too. honestly cannot believe i let it go on that long without checking.
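
for anyone wondering what "only loads the relevant ones per prompt" could look like, here is a rough sketch in Python. this is not the routing layer from the post, just a guess at the general idea using plain keyword matching. the server names, the keyword sets, and the `route` function are all made up for illustration, and a real router may well pick servers some other way.

```python
import re

# Hypothetical server names and trigger keywords (assumptions for this sketch,
# not taken from any real routing tool or MCP configuration).
SERVER_KEYWORDS = {
    "github": {"repo", "commit", "pull", "issue", "branch"},
    "postgres": {"sql", "query", "table", "database"},
    "filesystem": {"file", "directory", "folder", "path"},
    "browser": {"url", "website", "page", "scrape"},
}


def route(prompt: str, server_keywords: dict = SERVER_KEYWORDS) -> list:
    """Return the names of servers whose keywords appear in the prompt."""
    # Lowercase the prompt and split it into alphanumeric words.
    words = set(re.findall(r"[a-z0-9]+", prompt.lower()))
    # Keep a server only if at least one of its keywords is in the prompt.
    return sorted(
        name for name, keywords in server_keywords.items() if words & keywords
    )


if __name__ == "__main__":
    # A trivial question matches no keywords, so no servers would be loaded.
    print(route("what is 2 + 2?"))
    # A prompt that mentions a repo selects only the hypothetical github server.
    print(route("list open issues in the repo"))
```

with something like this in front of the client, a simple question would go out with no server attached, and only prompts that mention a repo, a database, and so on would pull in the matching server.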

Original article
ClaudeAI