Six-month prediction: agentic coders will use giant models to scaffold, set up, and plan, and swarms of small models to implement, review, and verify.
Cursor, Conductor, Cognition, and other third-party harnesses will surge with this pattern, unless Fable-level models truly come down *dramatically* in price.
A few things are converging to drive this:
- Fable is legitimately great, but too expensive
- small models are getting really good in short bursts (go try Gemma E2B; it’s insanely good for a 2B model)
- ensembles are a real pattern with legs (even Copilot(!) catches issues in Fable PRs)
- AI cost controls are arriving in the enterprise
- Anthropic and OpenAI shifted enterprise pricing to usage-based
- teams want to collaborate, and Anthropic and OpenAI aren’t focused on that pattern
Again, Fable-tier models could get super cheap and fast and surprise us all. But the current crop of small models is *orders of magnitude* cheaper; a good harness could unlock their potential.
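
For the curious, here is roughly what that harness shape could look like. It's a toy sketch, not anyone's real product: `big_planner`, `small_worker`, and the three reviewers are hypothetical stand-ins (plain Python functions, no model calls), and the majority-vote rule is just one assumed way to run an ensemble. The point is the structure: one expensive planning call per task, then many cheap calls for implementation and review.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Tuple


def big_planner(task: str) -> List[str]:
    """Hypothetical stand-in for the large model: split a task into small steps."""
    return [f"{task}: step {i}" for i in range(1, 4)]


def small_worker(step: str) -> str:
    """Hypothetical stand-in for a small model: produce a patch for one step."""
    return f"patch for <{step}>"


# Hypothetical small-model reviewers. Each one checks something different,
# so they can disagree, which is the whole point of an ensemble.
def review_nonempty(patch: str) -> bool:
    return bool(patch.strip())


def review_prefix(patch: str) -> bool:
    return patch.startswith("patch for")


def review_short(patch: str) -> bool:
    return len(patch) < 20  # deliberately picky reviewer


REVIEWERS: List[Callable[[str], bool]] = [review_nonempty, review_prefix, review_short]


def majority_approves(patch: str) -> bool:
    """Accept a patch only if more than half of the reviewers approve it."""
    votes = [reviewer(patch) for reviewer in REVIEWERS]
    return sum(votes) > len(votes) / 2


def run(task: str) -> List[Tuple[str, bool]]:
    """One planning call, then fan the steps out to the small-model swarm."""
    steps = big_planner(task)
    with ThreadPoolExecutor() as pool:
        patches = list(pool.map(small_worker, steps))
    return [(patch, majority_approves(patch)) for patch in patches]


if __name__ == "__main__":
    for patch, approved in run("add login"):
        print(approved, patch)
```

In a real harness each stub would be a model call, and the interesting design questions are how small the planner can make each step and how many reviewers a patch needs before it's trusted.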