The fixation on memory efficiency is pure propaganda. For frontier runs, first-order optimizer wall-clock and memory overheads were never the core concern. At sufficient scale, with proper optimization, both are manageable. The issue is that SOAP, Shampoo, and even PSGD never received the spotlight they deserved, and efforts to optimize their performance were far too limited. This is the consequence of a single hype-maxxed narrative attracting most of the community's attention, at the expense of broader advancement.