1 Million token context windows are a luxury, not an architecture.
My AI side-project hit a wall when the system prompt bloated to 120,000 tokens per message.
Here is how I fixed it using a Knowledge Graph, a "Waterfill" budget, and Agentless GraphRAG. 🧵👇