🗺️One Message. 60M Tokens. $30. Still
Running.
Theo ran one session on Copilot. 52 million input tokens. 838K output. $30 of inference and the session was still going. then he did the math: the current billing model gives you 1,500 messages per cycle, regardless of how expensive each one is. on a $40 plan, you could theoretically run $45,000 of compute.
GitHub confirmed they are moving off this billing model on June 1st. Theo ran 15 more sessions and hit $221 of tokens total, which was 1.6% of his $40 plan.
@cheatyyyy said what everyone running serious coding agents is quietly thinking: "i leave my agent on and come back to it asking a stupid question, it's been too long and i see it charge me a dollar in input costs on next message LMAO"
the per-message model was never going to survive real usage. the opportunity is in smarter routing: local model fallbacks, usage-aware orchestration, anything that stops sending every call to the frontier model.
Builder signal: cost-optimized agent orchestration,
local/hybrid model switching, token-aware routing
Part of today's InnoFlow Newsletter/Podcast → →
innoflows.net/
via
@theo @cheatyyyy
I sent a single message on Copilot and it did over 60m tokens. It's still going. $30 of inference so far.
In their current billing model, you get 1,500 messages, regardless of how expensive each is. I'm pretty sure I can do $45,000 of messaging on this plan