The amount of wasted tokens from repeated tool calls, metadata lookups, and semantically-equivalent messages in AI agents today is staggering.
In 2025, I demonstrated how to build idempotent tools and how to design cacheable APIs for agent use. But that wasn't enough. 🧵