CS PhD @McGillU and @mila_quebec, working on 🍒 and 🤖 stuff / ex- @LetsUnifyAI, @NUSComputing, @EngineeringOSU

Joined February 2017
Pinned Tweet
📢 New paper out! We introduce QWM: a single locomotion world model trained across 8 quadrupeds and deployed zero-shot on robots it had never seen (ANYmal-D and Unitree Go1) by conditioning on their morphology specs 🦾 No fine-tuning, no warm-up, no retraining from scratch. The key insight: robot morphology isn't a latent variable to infer from motion history; it's a known engineering spec sitting in the USD (or URDF) file. So we just use it directly.
Mohamad H. Danesh retweeted
The absolute peak of doing science is witnessing, for the first time, something that the world hasn't seen before. Happened to me today. Can't wait to tell you more about it.
Our method adds as few as ~10 lines on top of TD3+BC: pull generated actions toward the data, push them apart from each other. Go check it out!
The code is now available! 🚀 DriftQL learns a one-step Q-guided actor corrected by a learned drift field. We beat baselines with no denoising, solvers, auxiliary actors, or distillation. 💻 Code: github.com/anashoussaini/dri…
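To make the "pull toward the data, push apart from each other" idea concrete, here is a minimal sketch of such a regularizer in plain Python. It is not the DriftQL code and not the authors' loss: the function names, the RBF-style repulsion term, and `push_weight` are all assumptions chosen only to illustrate the two forces.

```python
import math
from itertools import combinations


def sq_dist(a, b):
    """Squared Euclidean distance between two action vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def pull_push_penalty(generated, data_action, push_weight=0.1):
    """Hypothetical regularizer for actions generated for one state.

    generated:   list of action vectors sampled from the actor
    data_action: the action stored in the offline dataset
    push_weight: assumed trade-off coefficient (not from the paper)
    """
    n = len(generated)
    # Pull: mean squared distance of each generated action to the data action.
    pull = sum(sq_dist(a, data_action) for a in generated) / n
    # Push: mean RBF similarity between distinct generated actions.
    # Minimizing it moves the samples away from each other.
    pairs = list(combinations(generated, 2))
    push = sum(math.exp(-sq_dist(a, b)) for a, b in pairs) / len(pairs) if pairs else 0.0
    return pull + push_weight * push
```

In an actor update this penalty would be added to the usual Q-maximization term; how the two are actually weighted in DriftQL is left to the paper and the released code.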
Excited to share DriftQL☄️, a new paradigm for offline RL. Instead of fitting a behavior prior, DriftQL learns a one-step Q-guided actor whose samples are corrected by a drift field. Simple. SOTA on OGBench/D4RL. No denoising. No solvers. No auxiliary actor. No distillation. With my co-authors @mo_danesh, Amin Abyaneh, Scott Fujimoto, Hsiu-Chin Lin, David Meger 🌐 driftql.github.io 🧵
AutoEval appears to be paused and may be discontinued. For my research, I've trained policies on BridgeData V2 and need a remote setup for real-world evaluation. Are there any alternative remote evaluation platforms, shared testbeds, or labs that support Bridge-style setups and allow external researchers to deploy policies remotely?
Excited to announce Michael Rabbat (Co-Founder & VP World Models - AMI labs @amilabs) as a new speaker joining our stellar lineup ✨ 🌐 worldmodels-rlc.github.io/
📢 Call for Papers! #RLC2026 🇨🇦 🌎 We are now inviting contributions to the Workshop on Model-based RL in the Era of Generative World Models at @RL_Conference in Montreal, Canada! 🇨🇦 🔗 Webpage worldmodels-rlc.github.io/ 📄 Submit paper now! openreview.net/group?id=rl-c… 🧵Format
Mohamad H. Danesh retweeted
World models are becoming a powerful approach for making the most of available data, but how do we create them to help build better agents? Come check out this workshop at @RL_Conference and submit related ideas!
This is what made QWM possible 🏗️ Training 8 different quadrupeds simultaneously in one sim was a prerequisite for learning a policy (or a world model if you will) that generalizes across morphologies. Full blog post: modanesh.github.io/blog/hete…
I trained a single PPO policy across 8 quadrupeds simultaneously: Spot, ANYmal (B, C, D), Unitree (Go1, Go2, A1, B2). 🤖 Same weights. Same compute as training on 1 robot. No core Isaac Lab changes. Here's how we broke Isaac Lab's homogeneity assumption to make it work. 🧵👇 x.com/mo_danesh/status/20429…
Cleaning at Montreal airport is going autonomous. Robots taking over quietly
For some reason the video got deleted, so here I'm posting it again:
📢 New paper out! We introduce QWM: a single locomotion world model trained across 8 quadrupeds and deployed zero-shot on robots it had never seen (ANYmal-D and Unitree Go1) by conditioning on their morphology specs 🦾 No fine-tuning, no warm-up, no retraining from scratch. The key insight: robot morphology isn't a latent variable to infer from motion history; it's a known engineering spec sitting in the USD (or URDF) file. So we just use it directly.
The trick: stop treating morphology as a mystery to infer, and start treating it as what it actually is: a known engineering spec 📐 We read the robot's USD file, encode its kinematics, mass & actuation, and inject that into the world model's dynamics at every step. No adaptation lag. No warm-up. No dangerous trial-and-error on a real robot 🤖
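A rough sketch of what "inject the spec at every step" could look like, in plain Python. USD parsing is deliberately left out, and everything here is an assumption for illustration: the `MorphologySpec` fields, the input layout, and the toy dynamics function stand in for the real encoder and learned model, which are not described in this thread.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class MorphologySpec:
    """Hypothetical summary of a robot spec; values would come from its USD/URDF file."""
    leg_lengths_m: Tuple[float, ...]   # kinematics
    base_mass_kg: float                # mass
    max_joint_torque_nm: float         # actuation

    def encode(self) -> List[float]:
        # Flatten the spec into a fixed-order feature vector.
        return [*self.leg_lengths_m, self.base_mass_kg, self.max_joint_torque_nm]


def rollout(dynamics: Callable[[List[float]], List[float]],
            state: List[float],
            actions: List[List[float]],
            spec: MorphologySpec) -> List[List[float]]:
    """Roll a dynamics model forward, feeding the same spec vector at every step."""
    morph = spec.encode()
    trajectory = [state]
    for action in actions:
        # The model input is [state, action, morphology]; nothing is inferred from history.
        state = dynamics(state + action + morph)
        trajectory.append(state)
    return trajectory


def toy_dynamics(x: List[float]) -> List[float]:
    """Stand-in for a learned model. Assumed layout: [position, command, ..., mass, torque]."""
    position, command = x[0], x[1]
    mass, torque = x[-2], x[-1]
    return [position + command * torque / mass]
```

Because the spec is an explicit input, swapping in a different robot only changes the `spec` argument; the same `dynamics` function is reused without any warm-up phase.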
Excited to co-organize the MBRL WM Workshop at @RL_Conference 2026 in Montreal! 🚀 Working on model-based RL, world models, or embodied AI? Submit your work and join the conversation. Looking forward to seeing what the community brings! 🤝
The result: one training run, 8 robots, morphology-agnostic locomotion. 🚀 The pattern is "composition over modification": no Isaac Lab core classes were forked. Adding a new robot = add a config block. No env logic changes needed. ♻️
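The "adding a new robot = add a config block" pattern can be sketched without touching any simulator code. The snippet below does not use Isaac Lab's API: `RobotCfg`, `ROBOT_REGISTRY`, the asset paths, and the round-robin assignment are hypothetical names and choices made up for this example.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class RobotCfg:
    """Hypothetical per-robot config block."""
    asset_path: str
    num_joints: int


# Registry of robots; placeholder names and paths, not real assets.
ROBOT_REGISTRY: Dict[str, RobotCfg] = {
    "robot_a": RobotCfg("assets/robot_a.usd", 12),
    "robot_b": RobotCfg("assets/robot_b.usd", 12),
}


def assign_robots(num_envs: int, registry: Dict[str, RobotCfg]) -> List[str]:
    """Spread parallel environments across all registered robots, round-robin."""
    names = sorted(registry)
    return [names[i % len(names)] for i in range(num_envs)]


# Adding a robot is one more entry; assign_robots itself does not change.
extended = {**ROBOT_REGISTRY, "robot_c": RobotCfg("assets/robot_c.usd", 12)}
```

The environment logic only ever reads from the registry, which is the sense in which the setup composes configs instead of modifying core classes.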