Phase 3: the system runs experiments - continuously - at scale
A Strategist agent proposes experiments, Worker agents implement and submit them to a GPU cluster via Slurm, and after each run the system analyzes what happened and why. Each experiment moves through a kanban board: queued -> implement -> execute -> analyze -> done (with a fix state for failures). It learns from every result - what works, what doesn't, what to try next - and the strategy evolves over the course of the campaign
In practice what we love about this is - it scales to whatever cluster you have and it just lives there. Whenever there's an idle GPU, AlphaLab can fill it with something interesting. If someone needs the GPU back, just kill the job, doesn't matter, it adapts and moves on. you can basically just leave it running in the background and it's always making progress.
We cap campaigns at 50 experiments in the paper for fair comparison, but in practice it just keeps going as long as you let it