Founding Member @Recursive_SI. Prev: @GoogleDeepMind | @UBC_CS | @UniofOxford | @SakanaAILabs | @Waymo | @MSFTResearch. The AI Scientist | Genie 3 | SIMA

Joined October 2019
62 Photos and videos
Pinned Tweet
Jun 11
Recursive just came out of stealth, and the team has been cooking šŸ”„ Our first results: an automated AI research system that can improve AI across 3 very different settings across training and GPU kernel optimization. recursive.com/articles/first…
19
35
336
46,307
Cong Lu retweeted
Detailed explanation of our first results by our amazing teammate @cong_ml !
Jun 11
Recursive just came out of stealth, and the team has been cooking šŸ”„ Our first results: an automated AI research system that can improve AI across 3 very different settings across training and GPU kernel optimization. recursive.com/articles/first…
1
3
66
17,777
Cong Lu retweeted
Lots of people have been asking me what we're up to at @Recursive_SI. We still can't say much quite yet, but we thought we'd share a little early demo. It turns out our automated researchers are pretty effective at performance optimization!
6
14
93
10,586
Cong Lu retweeted
Look ma, no hands! tldr: Recursive's automation loop improves AIs by itself šŸš€šŸš€šŸš€ more in the šŸ‘‡
Jun 11
Recursive just came out of stealth, and the team has been cooking šŸ”„ Our first results: an automated AI research system that can improve AI across 3 very different settings across training and GPU kernel optimization. recursive.com/articles/first…
2
14
4,462
Cong Lu retweeted
AI is now doing our AI research. At Recursive we set out to build recursive self-improving superintelligence (RSI) to automate knowledge discovery. The best way to expand humanity’s knowledge is through the scientific method. RSI leads to better ideas, explanations and inventions which lead to better RSI. Automating the scientific method requires closing the loop between ideation, implementation and validation, and being able to run it over extended periods of time. Today, we are excited to share the first outputs of Recursive’s automated open-ended discovery system. To be clear, this system is merely a milestone towards RSI, a v0.1 of what I sometimes call the ā€œEureka Machineā€. It is one program that you can point at any hard problem and get useful inventions out. Though it’s still very early, we've run it on three AI tasks and achieved state-of-the-art results on all three. These results demonstrate that even this early version of the system can solve a variety of autoresearch problems in AI and improve over prior state of the art. Concretely, it did this on the community benchmarks NanoGPT speedrun, NanoChat, and NVIDIA's Sol-ExecBench. AI is code and AI can code. The code and ideas that lead to these results were not invented by our team but by the AI system itself. To do RSI safely, we need to investigate its inventions. That's best done transparently with the community. @Recursive_SI we are open-sourcing the system’s discoveries, demonstrating that it finds creative and benign solutions instead of focusing on obvious optimizations or dangerous ideas. Link below.
36
86
605
115,759
Cong Lu retweeted
Very excited to share some of our first results already!
Excited to show results of the first steps towards automated AI research at @Recursive_SI. The same general system achieved state of the art on @NVIDIAAI's SOL-ExecBench GPU Kernel Optimization, nanoGPT Speedrun, and @karpathy's NanoChat autoresearch benchmarks.
1
2
14
1,013
Cong Lu retweeted
Jun 11
Excited to share these preliminary results on our internal autoresearch system @Recursive_SI, where we achieve SOTA on nanochat / nanogpt speedrun / kernel benchmarks using the same underlying system without task-specific adaptations. blog: recursive.com/articles/first…
3
24
108
16,238
Cong Lu retweeted
First results from Recursive on AI that improves AI! šŸš€ ✨ šŸ“ˆ Our automated AI research system incorporates principles from open-ended and AI-generating algorithms. 🧠 šŸ’”šŸŒ±šŸ§¬ It conducts key parts of the science loop: proposing ideas, implementing them, testing them, and picking the next ideas based on the data. šŸ”¬ šŸ”­ šŸ§Ŗāš—ļø The same general system produces state-of-the-art results on three different problems (two on training language models, one on speeding up AI via kernel optimization). • NanoChat Autoresearch: 1.3x faster to reach the same loss than the best solution produced by an entire community of humans agents over months, and 1.8x faster than the initial hand-optimized solution • NanoGPT Speedrun: 3% speedup of a very efficient solution produced by entire community of humans agents over 2 years • GPU Kernel Optimization: 18% reduction in gap to theoretically optimal score on Nvidia’s SOL-ExecBench These are early tests of our system. We’re very excited for what the future holds! Post: recursive.com/articles/first… Great work by the amazing team at Recursive!
6
14
82
11,034
Cong Lu retweeted
Early results from Recursive šŸš€šŸš€ SotA results from our open-ended knowledge discovery system: 1ļøāƒ£NanoChat 5min pre-training (0.9372 bpb -> 0.9109 bpb, 2.8% lower Bits-Per-Byte than long-standing community SoTA) 2ļøāƒ£NanoGPT SpeedRun (79.7s -> 77.5s, 2.8% faster than long-standing community SoTA) 3ļøāƒ£GPU kernel optimization (Overall 7.8% better than SoTA performance in SOL- ExecBench, hosted by NVIDIA) To achieve that, our system automatically finds and combines innovations together to create better solutions than current ones carefully designed by expert humans in various domains. We have open-sourced resulting artifacts found by our system so you can check the output yourself. See a full breakdown and technical writeup: recursive.com/articles/first…
6
35
351
52,415
Cong Lu retweeted
very cool results! i really believe speedruns by @kellerjordan0 @karpathy and others are a great testbed for RSI like tasks, both for evaluating and improving how models do research there are still limitations in how current speedruns are designed, and work needed on the system side to make agents efficient. seeing what models can do in math/code, i believe one missing piece to achieve the same for ai research is giving them the right environment
5
15
265
25,882
Jun 11
Recursive just came out of stealth, and the team has been cooking šŸ”„ Our first results: an automated AI research system that can improve AI across 3 very different settings across training and GPU kernel optimization. recursive.com/articles/first…
19
35
336
46,307
Jun 11
The common thread is the automated research loop. Search over ideas, turn them into executable changes, test them, validate cross-check the result, and use empirical evidence to plan the next ideas. Extremely excited to keep pushing @Recursive_SI !!
1
12
840
Cong Lu retweeted
Excited to show results of the first steps towards automated AI research at @Recursive_SI. The same general system achieved state of the art on @NVIDIAAI's SOL-ExecBench GPU Kernel Optimization, nanoGPT Speedrun, and @karpathy's NanoChat autoresearch benchmarks.
6
12
134
9,039
Cong Lu retweeted
Jun 11
initial results on automated ai research from our team on small scale pre-training and kernel optimization. we also open-source the corresponding artifacts. it's been great seeing all the amazing progress here in such a short time!
2
5
54
7,359
Cong Lu retweeted
Recursive’s autonomous research system just set new SOTAs on @karpathy’s nanogpt nanochat and @nvidia’s kernel benchmarks!! No humans in the loop. The system discovered these gains on its own. Open-sourcing results today.
6
9
60
9,194
Cong Lu retweeted
Thrilled to finally share some of the work we at @Recursive_SI has been doing since we launched!šŸ”„ We've made a system that autonomously conducts AI research and tested it across three different settings: Small Language Model training, NanoGPT speedrunning, and kernel engineering! A bit like Karpathy's Autoresearch, but scaled up and designed to be open-ended, we can push our system to optimize models, training algorithms, and kernels in really cool ways. Blogpost: recursive.com/articles/first…
2
11
53
3,748