Often found scribbling down math with intermittent bursts of bashing out code.

Joined June 2011
106 Photos and videos
Pinned Tweet
Do you want to do a Postdoc developing new methods/theory in Optimization for deep learning/ML? Do you enjoy bluesky open research and discussions on black boards? Then Apply to the Flatiron Fellowship in the Center of Computational Mathematics simonsfoundation.org/flatiro… 1/3
1
7
25
5,539
Robert M. Gower retweeted
"It's easier to tune the LR for method A than for B." We tried to formalize this for model-based stochastic optimization methods. We find a key quantity, called stability index, that describes how stable a (weakly) convex bound is as a function of LR. 📚arxiv.org/abs/2602.09842
3
9
67
7,245
And now we are very proud and humbled to have received the ICLR 2026 Honorable Mention award for this work blog.iclr.cc/2026/04/23/anno… Very fun to have found this useful math nugget that can actually speed-up LLM training.

Are you interested in the new Muon/Scion/Gluon method for training LLMs? To run Muon, you need to approximate the matrix sign (or polar factor) of the momentum matrix. We've developed an optimal method *The PolarExpress* just for this! If you're interested, climb aboard 1/x
9
58
4,042
Very happy that this has now been accepted to ICML2026! Great, systematic work done by @CrichaelMawshaw
We've just finished some work on improving the sensitivity of Muon to the learning rate, and exploring a lot of design choices. If you want to see how we did this, follow me ....1/x (Work lead by the amazing @CrichaelMawshaw)
4
46
3,599
And now we got the Honorable paper mention of ICLR 2026 for our work on Muon PolarExpress!
Are you interested in the new Muon/Scion/Gluon method for training LLMs? To run Muon, you need to approximate the matrix sign (or polar factor) of the momentum matrix. We've developed an optimal method *The PolarExpress* just for this! If you're interested, climb aboard 1/x
1
3
59
3,707
Robert M. Gower retweeted
4 Dec 2025
Check out my poster today (Thurs) at 11am--2pm session. Exhibit Hall C,D,E Poster Location: #602 "Fisher meets Feynman: score-based variational inference with a product of experts" (NeurIPS spotlight) with @gowerrobert David Blei and Lawrence Saul @FlatironInst #NeurIPS2025
2
10
63
5,369
Robert M. Gower retweeted
24 Nov 2025
We’re recruiting for both postdoc and open-rank positions. Learn more about ML@CCM 👉 users.flatironinstitute.org/… I’ll also be in San Diego for NeurIPS — feel free to DM if you’re interested in #AIforScience or #GenerativeAI

Want to do fundamental ML research in NYC? 🧠 The Center for Computational Mathematics @FlatironInst @SimonsFdn is hiring! – Flatiron Research Fellow (postdoc, by Dec 1): apply.interfolio.com/173401 – Open Rank (by Jan 15): apply.interfolio.com/173640
1
3
27
4,915
Diana was the driving force behind all our variational inference work, and any department would be lucky to have her!
7 Nov 2025
I'm on the academic job market! I design and analyze probabilistic machine-learning methods---motivated by real-world scientific constraints, and developed in collaboration with scientists in biology, chemistry, and physics. A few highlights of my research areas are:
1
1
14
3,018
We've just finished some work on improving the sensitivity of Muon to the learning rate, and exploring a lot of design choices. If you want to see how we did this, follow me ....1/x (Work lead by the amazing @CrichaelMawshaw)
6
23
189
29,732
Our paper covers a lot ground, including exploring different product norms, formalizing MuonAdam as steepest descent, introducing the combination of truncation Muon, and a lot of experiments! Here are the details -> arxiv.org/pdf/2510.09827 and ...

1
2
23
1,814
Robert M. Gower retweeted
27 Oct 2025
Fisher meets Feynman! 🤝 We use score matching and a trick from quantum field theory to make a product-of-experts family both expressive and efficient for variational inference. To appear as a spotlight @ NeurIPS 2025. #NeurIPS2025 (link below)
5
44
401
35,328
Robert M. Gower retweeted
Call for participation:  KAUST Workshop on Distributed Training in the Era of Large Models kaust.edu.sa/events/dtelm25/ location: KAUST, Saudi Arabia dates: Nov 24-26, 2025. There will be a chance for some participants to present a poster and/or give a lightning talk.
2
10
20
3,785