The Transformer architecture is fundamentally a parallel processor of context, but reasoning is a sequential, iterative process.
To solve complex problems, a model needs a "scratchpad" not just in its output CoT, but in its internal state. A differentiable way to loop, branch, and backtrack until the model finds a solution that works.