Microsoft didn't ban Claude Code because it failed.
They banned it because 100,000 engineers adopted it faster
than any internal tool in company history and the bill
became unmanageable.
Uber's version of this story is even more brutal:
5,000 engineers. Usage hit 95% in 4 months.
Per-engineer API cost: $500–$2,000/month.
They burned their entire $3.4B 2026 AI budget before summer.
The CTO said the annual budget was exhausted before the year
was half over.
This isn't a story about AI tools being bad.
It's a story about the unit economics of intelligence
not being solved yet.
Here's what happens next:
1. Token cost is the new capex bottleneck. Companies are now
building AI ROI models the way they used to model data center
spend. It's a procurement problem, not a product problem.
2. The self-hosting threshold just moved. At 100M tokens/month,
self-hosting 70B open-weight models on H100s costs ~$950/month.
The same volume via API: $5,000–$10,000. The math is already
there. The operational complexity is the only thing holding
companies back.
3. The inference cost curve is falling 10x every 18 months.
A token that cost $0.06/1K in early 2025 costs $0.006 today.
The companies that get bridged to the bottom of that curve
without killing their margins will dominate the others that
couldn't afford the transition.
Microsoft isn't retreating from AI. They're doing what every
large enterprise does when a vendor outperforms their own product
AND destroys their quarterly margins in the same quarter.
They're internalizing.
Which of these three forces do you think resolves first-
cost curve, self-hosting maturity, or the API providers
repricing to keep the enterprise?