The fix isn't prompt caching. It's architecture.
AI generates the rule engine, templates, knowledge base, validators — once.
Then the system runs deterministically. No model in the hot path. Cost per interaction: $0.001.
The crane builds the building. It doesn't live inside it.
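
What "no model in the hot path" might look like, as a minimal sketch. Everything here is hypothetical: the `Rule` type, the `RULES` list, the patterns, and the reply texts stand in for artifacts a model would generate once, offline, and a human would review. At request time only plain Python runs.

```python
import re
from dataclasses import dataclass
from string import Template


@dataclass(frozen=True)
class Rule:
    pattern: re.Pattern   # when this matches the message...
    template: Template    # ...render this reply


# Build-time artifacts (hypothetical). In the architecture described above,
# a model would generate these once; they are then checked in like any code.
RULES = [
    Rule(
        re.compile(r"\brefund\b.*\border\s+#?(?P<order_id>\d+)", re.I),
        Template("Refund request logged for order $order_id."),
    ),
    Rule(
        re.compile(r"\b(hours|open)\b", re.I),
        Template("We are open 9:00-17:00, Monday to Friday."),
    ),
]
FALLBACK = "Sorry, I can't help with that. Routing you to a person."


def respond(message: str) -> str:
    """Runtime hot path: first matching rule wins. No model call."""
    for rule in RULES:
        match = rule.pattern.search(message)
        if match:
            # Named groups captured by the pattern fill the template slots.
            return rule.template.substitute(match.groupdict())
    return FALLBACK


if __name__ == "__main__":
    print(respond("I want a refund for order #1234"))
    print(respond("What are your hours?"))
    print(respond("Tell me a joke"))
```

The same input always produces the same output, so the runtime can be unit-tested, cached, and audited like ordinary code. Anything the rules don't cover falls through to an explicit fallback instead of a guess.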