Cut AI token usage by 96%?

Frederic Lardinois · 6 min read
AWS developer advocate Morgan Willis on Strands Agents, intent-based tools, MCP gateways, and how smarter tool design cut agent token usage from 52K to 2K.

Original article
The New Stack · Frederic Lardinois
Opening excerpt (first ~120 words)

AWS sponsored this post. For this episode of The New Stack Makers, I sat down with AWS developer advocate Morgan Willis to talk about Strands Agents, the company’s open source agentic framework, which has seen over 14 million downloads since it launched just under a year ago. Willis brought a hands-on demo built around a simple accounting API to show what building with Strands looks like in practice. The demo walks through three iterations of the same task: looking up the latest invoice for a customer. First, Willis mapped each API endpoint directly to an agent tool, the way most developers would by default. The agent needed five chained API calls and burned roughly 52,000 tokens. Then she swapped in intent-based tools that are built around an outcome rather than a data operation.
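The contrast between endpoint-mapped tools and intent-based tools can be sketched in plain Python. This is only an illustration under stated assumptions, not the demo from the episode and not the Strands Agents API: the in-memory data, the function names (list_customers, list_invoice_ids, get_invoice, get_latest_invoice), and the call counts are all hypothetical. It shows why a tool built around the outcome lets the agent make one call and receive a small payload instead of chaining several calls and carrying every intermediate response in its context.

```python
from typing import Optional

# Hypothetical in-memory stand-in for a simple accounting API (assumption).
CUSTOMERS = [
    {"id": "c1", "name": "Acme Corp"},
    {"id": "c2", "name": "Globex"},
]
INVOICES = [
    {"id": "inv-1", "customer_id": "c1", "date": "2025-01-15", "total": 120.0},
    {"id": "inv-2", "customer_id": "c2", "date": "2025-02-01", "total": 75.5},
    {"id": "inv-3", "customer_id": "c1", "date": "2025-03-10", "total": 310.0},
]


# Style 1: one tool per endpoint (hypothetical names). Each call returns raw
# data that an agent would have to read and then decide what to call next.
def list_customers() -> list:
    return list(CUSTOMERS)


def list_invoice_ids(customer_id: str) -> list:
    return [inv["id"] for inv in INVOICES if inv["customer_id"] == customer_id]


def get_invoice(invoice_id: str) -> dict:
    return next(inv for inv in INVOICES if inv["id"] == invoice_id)


def latest_invoice_via_endpoints(customer_name: str):
    """Simulate the chain of calls an agent would make with endpoint tools.

    Returns the latest invoice (or None) together with the number of tool calls.
    """
    calls = 0
    customers = list_customers()
    calls += 1
    customer = next((c for c in customers if c["name"] == customer_name), None)
    if customer is None:
        return None, calls
    invoice_ids = list_invoice_ids(customer["id"])
    calls += 1
    invoices = []
    for invoice_id in invoice_ids:
        invoices.append(get_invoice(invoice_id))
        calls += 1
    if not invoices:
        return None, calls
    # ISO-8601 date strings sort chronologically, so max() finds the latest.
    return max(invoices, key=lambda inv: inv["date"]), calls


# Style 2: one intent-based tool built around the outcome the user wants.
def get_latest_invoice(customer_name: str) -> Optional[dict]:
    """Return only the fields needed to answer 'latest invoice for X'."""
    customer = next((c for c in CUSTOMERS if c["name"] == customer_name), None)
    if customer is None:
        return None
    own = [inv for inv in INVOICES if inv["customer_id"] == customer["id"]]
    if not own:
        return None
    latest = max(own, key=lambda inv: inv["date"])
    return {"invoice_id": latest["id"], "date": latest["date"], "total": latest["total"]}


if __name__ == "__main__":
    chained, calls = latest_invoice_via_endpoints("Acme Corp")
    print(f"endpoint tools: {calls} calls -> {chained['id']}")
    print(f"intent tool: 1 call -> {get_latest_invoice('Acme Corp')}")
```

In this toy setup both paths reach the same invoice, but the endpoint path needs one call per intermediate step and returns full records at each one. With a real model in the loop, every one of those responses would be added to the context, which is the kind of overhead the intent-based design is meant to avoid.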

Excerpt limited to ~120 words for fair-use compliance. The full article is at The New Stack.
