Scaling up RL at OpenAI 🍓 Optimization and long context research before. Math PhD, MIT.

Joined October 2016
54 Photos and videos
In AI everything behaves according to a log-log scaling law, including the compensation
6
1
65
6,662
yasmin has been goat maxxing since forever
May 27
Most VCs wouldn’t touch Anthropic in 2023. Yasmin Razavi did. The Spark Capital partner led a $450M round when Anthropic had no public product, no revenue and a massive capital need. Now the AI giant’s rise has landed her on the Forbes Midas List for the first time. forbes.com/sites/iainmartin/… (Photo: Guerin Blask For Forbes) #ForbesMidas
1
3
46
12,415
Mo Bavarian retweeted
New blackboard lecture w @reinerpope How do chips actually work – starting with basic logic gates, and working up to why GPUs, TPUs, FPGAs, and the human brain each look the way they do. 0:00:00 – Building a multiply-accumulate from logic gates 0:16:20 – Muxes and the cost of data movement 0:25:59 – How systolic arrays work 0:39:00 – Clock cycles and pipeline registers 0:51:40 – FPGAs vs ASICs 1:03:14 – Cache vs scratchpad 1:07:16 – Why CPU cores are much bigger than GPU cores 1:11:49 – Brains vs chips 1:15:22 – A GPU is just a bunch of tiny TPUs Look up Dwarkesh Podcast on YouTube/Spotify/etc to watch. Enjoy!
94
725
5,595
925,844
Mo Bavarian retweeted
AI has now solved a major open problem -- one of the best known Erdos problems called the unit distance problem, one of Erdos's favourite questions and one that many mathematicians had tried. openai.com/index/model-dispr…
75
614
3,561
1,489,740
Mo Bavarian retweeted
Banger take
Gavin Baker: "I've been optimistic that the fundamental shortage of wafers, which is really controlled by Taiwan Semi, will prevent a bubble." "If Taiwan Semi did what Jensen wanted, Nvidia could sell $2 trillion of GPUs in 2026 or 2027. But there is a limit where consumers would consume so much that you'd probably be in an overbuild. And you are starting to see companies go to Intel and Samsung. A lot of this may come down to the degree to which Taiwan Semi can maintain a lead over Intel and Samsung and the pace at which they expand capacity. If I were to watch one thing to understand whether there's a bubble, it's Taiwan Semi's capacity decisions. There's a Goldilocks zone where they expand enough to make it hard for Intel or Samsung to emerge as a second source, but they also keep the fundamental constraint on wafers that helps us avoid a bubble."
31
35
838
170,033
Stop living in a bubble. The key to happiness, the true meaning of life, is not impact or wealth or fame, but health, friendship, and care for others. Living authentically is not that expensive. In other words, money or success will not fill the hole in your heart.
3
4
103
34,592
Mo Bavarian retweeted
I remind myself of this quite often.
86
455
5,978
757,660
Stochastic gradient descent
9
33
421
19,409
One of the biggest benefits of peptides is that it has distracted a decent fraction of grifters (ever-present in SF tech scene) away from AI. God bless 🙏
3
1
82
6,438
It's goblins all the way down
artificial goblin intelligence achieved
10
1,785
San Francisco— the birthplace of AGI
4
1
39
2,489
It was fun doing this panel with my friends @dwarkesh_sp @_sholtodouglas and Melvin Johnson, and thanks @labelbox for hosting. It's the first time for me facing the ever-spicy Dwarkesh questions in front of an audience. It's v important for frontier lab researchers to stay connected. Less adversity, more comradeship & cooler heads is necessary as the stakes for this technology gets higher.
This week, we had the pleasure of hosting 50 researchers and builders from leading AI companies to meet, talk and socialize (MTS 😎) at Labelbox HQ. Huge thanks to @dwarkesh_sp, Sholto Douglas (Anthropic), Mo Bavarian (OpenAI), and Melvin Johnson (DeepMind) for leading our fireside chat on scaling RL and the pursuit of AGI.
1
3
54
15,088
Mo Bavarian retweeted
JRR Tolkien used genz brainrot slang over 70 years ago, that's how ahead he was
138
1,659
21,895
1,141,662
Good cooking by a cracked team 🧑‍🍳
Mar 10
Two massive updates for the ComfyUI ecosystem today: 1️⃣ App Mode: The power of the node graph, now behind an easy-to-use interface. Turn complex workflows into custom apps. 2️⃣ ComfyHub: A brand new home to discover, run, and share community workflows and apps instantly via URL. Try ComfyHub preview via links.comfy.org/4dke0ki Create in App Mode. Share on ComfyHub. Learn more here: links.comfy.org/4bAOjuz
4
1,582
Mo Bavarian retweeted
I do not share the cynicism of some with respect to OpenAI’s actions in the DoW/Ant dispute. It basically seems to me as though OpenAI was attempting to deescalate last week; whether they executed well is a separate question, but in their defense good execution in such chaos was nearly impossible. But from where I sit it seems OpenAI tried to reduce tensions and find a productive path forward, while allowing its employees considerable latitude to speak their minds. The easy thing would have been for management to stay quiet and let this happen; they did not do that, and they also stood firm in opposition to the supply-chain risk designation. In general, OpenAI is unjustly maligned. This is the thing that bothers me the most about Dario’s leaked memo; it spends so much time on OpenAI conspiracies and cynicism that I fear industry solidarity in the future will be harder than it needs to be. This is not the last time we will see state interference into frontier AI, and until we build formalized structures for such interference it will be important for the industry to hang tough together. I fear that will be less likely now.
39
40
517
42,516
Anthropic SCR designation is unfair, unwise, and an extreme overreaction. Anthropic is filled with brilliant hard-working well-intentioned people who truly care about Western civilization & democratic nations success in frontier AI. They are real patriots. Designating an organization which has contributed so much to pushing AI forward and with so much integrity does not serve the country or humanity well.
13
32
404
36,332
I don’t think there is an un-crossable gap between what Anthropic wants and DoW’s demands. With cooler heads it should be possible to cross the divide. Even if divide is un-crossable, off-boarding from Anthropic models seems like the right solution for USG. The solution is not designating a great American company by the SCR label, which is reserved for the enemies of the US and comes with crippling business implications.
2
2
57
5,046
As an American working in frontier for the last 5 years (at Anthropic’s biggest rival, OpenAI), it pains me to see the current unnecessary drama between Admin & Anthropic. I really hope the Admin realies its mistake and reverses course. USA needs Anthropic and vice versa! 🇺🇸
2
2
78
3,771
Mo Bavarian retweeted
I can’t help but think (and feel) that the world is generally very sad right now. Injured really. Yesterday I was in Utah with family. Three generations. We played sports, enjoyed good food, saw friends, and just messed around all day. One of the best days in recent memory for all of us. This is where I grew up. It took me back to my childhood. Allowing me to embody those psychological states and feel the comparative difference between then and now. The hollowing and sadness of the modern world seems to stem in part from our phones, social media, and the ferocious need to be seen and relevant in every moment. We have mistakenly idolized a specific kind of dysfunction: a manic, sleepless hyper-vigilance that needs to be omnipresent. Everyone I know who’s unplugged for a week, returns reporting life-changing levels of improved life satisfaction. I’ve never met anyone who didn’t return feeling spry and vibrant and clear-eyed about the corrosive nature of current social culture. The science supports them feeling that way. They were in a dopamine deficit from the hyper-stimulated state of the world so everything felt gray.  So why don’t we unplug more and more often? We’re all kind of trapped in a prisoner's dilemma. Most want to move to the mountains and be relieved of it all but are terrified that if they unplug, they’ll be invisible. Real life consequences of reduced power and status. So we stay plugged in and drink the poison. This hypervigilant state keeps us in chronic fight or flight (anxiety). Simultaneously, our addiction creates a dopamine deficit (the emptiness/grayness feeling) and a background hum of anxiety. Mammals are biologically hardwired to co-regulate: physical touch, eye contact, proximity and in-person vibes. Things which release oxytocin and activate the vagal nerve's parasympathetic system. Screens eliminate all of this goodness. There are small wins to be had here. More in-person time. A day off technology per week. A block of 4 hours. One hour before bedtime.  I hope that there’s a collective awakening that we’re all being mined for engagement. Then we get trapped. And then trap each other.
351
402
6,075
803,930