This is not stupid. This is expensive.
A very simple and non-controversial loop: “monitor all PRs for broken tests and fix them”
In a 200k codebase, this alone will blow through $1-2k of tokens a month.
You will have very high cache hit rate yes.
But you will also have the agent get an error, scan, fix, retry, get another error, and so on.
Each failure eventually costs $5/fix. This is great economics when a developer costs hundreds per hour.
But compound these loops and you begin to have issues with costs.
Here’s your monthly reminder that you shouldn’t be prompting coding agents anymore.
You should be designing loops that prompt your agents.