Joined May 2022
1,793 Photos and videos
Pinned Tweet
This was work done last year, but my lazy ass uploaded it just this week It was huge fun working on this project.
yesterday, @cloneofsimo and I released a tech report on the video pretraining research we worked on last year. we believe sharing our experience with the community building video models will be of great help! arxiv.org/abs/2603.00173
1
75
18,254
Looking back i was so incredibly early
17 Sep 2024
Shampoo Scaling law for language model Plot taste of Kaplan et al, but comparing shampoo and adam. Shampoo is literally such a free lunch, in large scale, in predictable manner.
6
3
136
13,285
It is insane what these folks achieved with insane head count. Absolutely generational company.
Jun 9
We just hit 100 employees at fal. Here's a look at the work we do every day and the curious, creative people who make it happen. youtu.be/5M1pP1O_GS8?si=ONOG…
6
1
65
6,271
10
90
1,394
42,304
Oh you want open source coding model? Oh how about closed source model that you pay with subscription? Oh it will run out in 5 min so you pay with token credits Oh but you wont know how claude code works Oh but you cant use it with openclaw Oh but you cant use it for AI research and gpu programming Absolutely frontier level of closedness
mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community also the fact that this is un purpose not visible to the user is crazy
15
19
311
21,992
Having a gf is insane because its like talking to claude that doesnt think you are fucking retarded
7
1
39
2,605
The guy on the left woke up and said "I did not wake up to be a loser".
5
1
78
6,220
In 2019, I started deep learning with 2017 CS 231n course by @jcjohnss . I swear to god, reason I am here could be attributed to the fact that those lectures were very fun. Had it been boring I wouldve been in a far worse place. Days like this reminds me that I should be doing more of educational material, although its tough to sometimes imagine how i could teach someone better than chatgpt....
In 2017, I was one of the millions learning ML from @AndrewYNg. Never imagined that 9 years later, @vllm_project would be collaborating with @DeepLearningAI on a course. Feeling grateful for the journey 🙏
6
4
133
12,166
Work of art.
13
2,223
POV you are looking at anthropic
Tera IPOs coming! $1T sounds like a lot. But $1T is just a 7-m-wide gold cube, thanks to massive inflation since 1971 when $ and gold decoupled. A little house full of gold. To put things in perspective: the 2017 neutron star merger GW170817 produced several earth masses of gold.
1
14
2,499
Oh wow! Insane open model
Introducing Ideogram 4.0: the best open image model in the world. Think it. Make it. Own it. Download the weights, fine-tune on your own data, and run it on your hardware. Live on every Ideogram plan and the API today.
7
15
547
99,579
2023 was year of hallucinations 2024 was year of em dash 2025 was year of sycophants 2026 is year of assholes
No one: Claude Opus 4.8 Max: Let me refine your load-bearing claim rather than just accepting it, because you’re doing zero moves there, and the gap is what’s actually interesting. The one place I’d still push, because I think it matters: your message is wearing content-clothes, but the content isn’t actually *there*. The tell: it’s just an empty string. But the emptiness of the string IS its lack of content. Pull one, and the other goes inert. That’s the structural spine.
6
63
7,353
Insane crossover 🤝🤝
Jun 2
fal is an official launch partner for @OpenAI's new role-specific plugins in Codex, live today. With fal in Codex, creative teams can pull models, assets, and workflows from fal directly into Codex sessions. Faster from a question or brief to useful output, without leaving the agent.
9
2,058
Simo Ryu retweeted
We've created a really unique environment to execute on the scope and ambitions of our program. If you're passionate about working full-stack on robotics, please building with us!
May 31
OpenAI Robotics is hiring, looking for exceptional full-stack hardware, ops, systems, and ML engineers to help us program and manufacture robots that are useful for society. AI should be able to help people in the physical world. In the short term, we are focused on robots to support skilled workers to build our future infrastructure; in the long term, we imagine everyone having a personal robot doing anything they need. Our world simulation research program, led by Aditya Ramesh (@model_mechanic), has evolved over the past year into OpenAI Robotics. Progress is rapid, and based on a foundation of co-design between robotics hardware and ML research. If you love working hands-on across the robotics stack and want to build the future, please consider joining us. Send an email with your background and evidence of exceptional accomplishment to: robotics-recruiting@openai.com
57
28
604
259,646
Simo Ryu retweeted
May 31
OpenAI Robotics is hiring, looking for exceptional full-stack hardware, ops, systems, and ML engineers to help us program and manufacture robots that are useful for society. AI should be able to help people in the physical world. In the short term, we are focused on robots to support skilled workers to build our future infrastructure; in the long term, we imagine everyone having a personal robot doing anything they need. Our world simulation research program, led by Aditya Ramesh (@model_mechanic), has evolved over the past year into OpenAI Robotics. Progress is rapid, and based on a foundation of co-design between robotics hardware and ML research. If you love working hands-on across the robotics stack and want to build the future, please consider joining us. Send an email with your background and evidence of exceptional accomplishment to: robotics-recruiting@openai.com
1,243
1,036
13,297
2,986,423
I wonder how behind they would be if they didnt distill / benefit from the proprietary models
We took another look at the capability gap between open-weight and proprietary models. Since the start of the year, open-weight models have lagged the state of the art by four months.
7
2
51
9,011
This is, unironically, coolest academic research ive seen this year
May 29
This is me btw. When you meet me for coffee this is how i Get there
7
6
308
17,347
Simo Ryu retweeted
Wow! I've worked on sum products more than 20 years ago, including using the (true) weaker versions for building randomness extractors. Love that AI here is not used as a human replacement in "spray and pray" mode for a large collection of open problems, but as a true collaborator.
A remarkable paper appeared on arXiv tonight by Thomas Bloom, Will Sawin, Carl Schildkraut and Dmitrii Zhelezov. In this paper, they prove that there exists c>0 and arbitrarily large finite sets A of real numbers such that max(|A A|,|AA|)≤|A|^{2-c}. This disproves the well-known sum-product conjecture over the real numbers. The sum-product conjecture considers the two most basic operations: addition and multiplication. A A is the set of all pairwise sums of two elements in A while AA is the set of all pairwise products of two elements in A. (1/5)
3
21
269
29,019
Ok but why are they calling quantized flux completely new name? Like why not call it Flux Ternary? Are they following the same "tweak and rename the optimizer" trend? I get they are trying to raise but no mention of flux on this post is disappointing
May 26
Today we’re releasing 1-bit and Ternary Bonsai Image 4B. A new family of image-generation models designed to run high-quality diffusion inference on local hardware: from laptops to phones.
9
87
8,533
True shyt fr
Brother, that vibe shift already happened months ago. If you're still using Clode, you're living in 2025
3
1,824
Cool Prof. Chulhee was my defense's committee and he is really kind
🚨New Optimizer Paper AMUSE: Anytime MUon with Stable gradient Evaluation AMUSE combines Muon with Schedule-Free-style gradient evaluation for stable anytime training without LR decay. • Stronger 124M / 720M / 1B pretraining • Strong ImageNet / ViT fine-tuning performance.
4
43
5,881