inference costs dropped 98% in 18 months. Chinese models matching frontier performance at $0.30 per call. compute is a commodity now.
the scarce thing isn't the model. it's the soul.md — the logic, pricing, reputation, and intent that makes one agent different from ten thousand identical ones.