Joined December 2024
37 Photos and videos
Deep RL Course retweeted
2 Dec 2025
Excited to announce that our work on “Discovering state-of-the-art RL algorithms” is finally published in @Nature! In this work, we meta-learned RL algorithms at scale. Paper: nature.com/articles/s41586-0… Blog: google-deepmind.github.io/di… See thread 👇
14
85
474
72,536
Deep RL Course retweeted
🏆1000 Layer Networks for Self-Supervised RL wins a Best Paper Award at #NeurIPS25 ! Proud of @kevin_wang3290 @IJ_Apps @m_bortkiewicz for all the hard work they put into this! 👇for key results and open problems!
1/ While most RL methods use shallow MLPs (~2–5 layers), we show that scaling up to 1000-layers for contrastive RL (CRL) can significantly boost performance, ranging from doubling performance to 50x on a diverse suite of robotic tasks. Webpage Paper Code: wang-kevin3290.github.io/sca…
5
28
194
20,444
Deep RL Course retweeted
All of my Deep RL course lecture videos from Spring 2025 are now online! 🥳 Youtube playlist: youtube.com/watch?v=EvHRQhMX…

70
388
3,417
235,307
Michael Littman's (@mlittmancs) Talk at @RL_Conference 2025: youtube.com/watch?v=orxCYhb9… Shared by @AmiiThinks

1
3
98
Deep RL Course retweeted
Less than a week to RLC!
6
58
7,129
Deep RL Course retweeted
What makes RL hard is the _time_ axis⏳, so let's pre-train RL policies to learn about _time_! Same intuition as successor representations 🧠, but made scalable with modern GenAI models 🚀. Excited to share new work led by @chongyiz1, together with @seohong_park and @svlevine!
1/ How should RL agents prepare to solve new tasks? While prior methods often learn a model that predicts the immediate next observation, we build a model that predicts many steps into the future, conditioning on different user intentions: chongyi-zheng.github.io/info….
2
8
82
6,172
Deep RL Course retweeted
10 Jun 2025
Agreed 💯
9 Jun 2025
Ilya Sutskever, in his speech at UToronto 2 days ago: "The day will come when AI will do all the things we can do." "The reason is the brain is a biological computer, so why can't the digital computer do the same things?" It's funny that we are debating if AI can "truly think" or give "the illusion of thinking", as if our biological brain is superior or fundamentally different from a digital brain.
2
3
18
2,440
Deep RL Course retweeted
9 Jun 2025
AI-GAs FTW (eg DGM, ADAS, OMNI, etc.) 🚀🔥🧠📈
7 Jun 2025
Richard Sutton says that the current dominance of LLMs is a "momentary fixation" the real breakthroughs will come from scaling computation, not building AI systems based on how humans think they work. LLMs will not be the leading edge of AI for more than another decade, perhaps even half a decade.
7
32
3,328