The elegant part is that the same asymmetry that makes inverse design hard also suggests the fix. Scientists have already built forward simulators for many of these problems. Those simulators can be repurposed as RL environments: propose a design, evaluate it, learn from the reward.
In our case studies, training on simulator feedback lets a small 8B model outperform much larger frontier models on selected scientific design tasks.