1/ I ran Claude Code for six months. Then I ran Codex. Then I ran both at the same time, hoping the combined cost would finally justify the output.
It did not.
The problem was not the models. Claude is excellent. The frontier models are the best available.
The problem was memory.
Every session starts from zero. The agent does not remember what you taught it last week, what worked, what failed, or why a decision was made.
I wanted an agent that gets smarter over time because it remembers what it learns.
đź§µ