Base checkpoint 10,000 lands. Within minutes, forward passes stream the Tier-0 suite; per-domain fitted curves update; the slope monitor flags that the recovery-margin trajectory on the agentic precursor benchmark family is diverging from the sibling run with the alternate code mix. The dashboard doesn't say "MMLU 83.1"; it says "predicted post-probe SWE-style outcome: 41% ± 4.1, up 2.6 from last week, driven by the world-model axis". The data mixing agent picks up on the result. It submits a new sibling run to the queue, eager to continue the hillclimb.