Joined June 2012
6 Photos and videos
This model and system prompt @tobi 🌟😆
I love this idea from Tobi Lutke: “I take the view that I'm a corporate raider. That Shopify went bankrupt and I bought it on a fire sale and I'm marching in on day one, and that previous management was crazy and we need to turn this place around.”
155
A new pathway to intelligent systems. This is the way. Congrats @ylecun, @sainingxie and team.
Advanced Machine Intelligence (AMI) is building a new breed of AI systems that understand the world, have persistent memory, can reason and plan, and are controllable and safe. We’ve raised a $1.03B (~€890M) round from global investors who believe in our vision of universally intelligent systems centered on world models. This round is co-led by Cathay Innovation, Greycroft, Hiro Capital, HV Capital, and Bezos Expeditions, along with other investors and angels across the world. We are a growing team of researchers and builders, operating in Paris, New York, Montreal and Singapore from day one. Read more: amilabs.xyz/ AMI - Real world. Real intelligence.
2
255
Inspiring team ✨🌲
New paper out! We present a training method for multimodal generative models, called Self-Flow, which combines classic flow matching and representation learning. Why? Unlike most representation alignment methods, our new approach does not require external, pretrained models and thus scales gracefully to joint multimodal training on images, videos and audio. How? It combines per-timestep flow matching with dual-timestep representation learning, improving the models' internal representations. This approach outperforms prior methods and shows promising scaling behavior in multimodal pretraining. It also enables downstream applications such as action prediction for embodied AI. webpage paper: bfl.ai/research/self-flow code: github.com/black-forest-labs… Credit to @hila_chefer, @pess_r, Dominik, @dustin_podell, Vikash, @Vinh_Suhi and Antonio. If you enjoy doing open research like this, come and join BFL! We are actively hiring🌲
1
3
299
Adam Perold retweeted
Introducing VL-JEPA: Vision-Language Joint Embedding Predictive Architecture for streaming, live action recognition, retrieval, VQA, and classification tasks with better performance and higher efficiency than large VLMs. • VL-JEPA is the first non-generative model that can perform general-domain vision-language tasks in real-time, built on a joint embedding predictive architecture. • We demonstrate in controlled experiments that VL-JEPA, trained with latent space embedding prediction, outperforms VLMs that rely on data space token prediction. • We show that VL-JEPA delivers significant efficiency gains over VLMs for online video streaming applications, thanks to its non-autoregressive design and native support for selective decoding. • We highlight that our VL-JEPA model, with an unified model architecture, can effectively handle a wide range of classification, retrieval, and VQA tasks at the same time. by @Delong0_0 @MustafaShukor1 @TheoMoutakanni @willyhcchung Jade Lei Yu Tejaswi Kasarla @AllenBolourchi @ylecun @pascalefung arxiv.org/abs/2512.10942
13
86
555
90,086
Adam Perold retweeted
Yann LeCun (Chief AI Scientist, Meta, @ylecun), @PimDeWitte (CEO, General Intuition), and Aude Durand (Kyutai, @aude_drn), talk about world models, embodied agents, Yann's new company, and the limitations of LLMs 0:00 - Introduction to World Models 5:00 - Why World Models, Intuition & Introducing Yann's new company 10:00 - Architectures Merging Language & Interaction Data towards General Agents 20:00 - Open Source, Sovereign AI & @kyutai_labs Partnership Keynote for #aiPULSE2025 at Station F in Paris 🇫🇷
29
124
642
110,776
Great decomposition @karpathy
Finally had a chance to listen through this pod with Sutton, which was interesting and amusing. As background, Sutton's "The Bitter Lesson" has become a bit of biblical text in frontier LLM circles. Researchers routinely talk about and ask whether this or that approach or idea is sufficiently "bitter lesson pilled" (meaning arranged so that it benefits from added computation for free) as a proxy for whether it's going to work or worth even pursuing. The underlying assumption being that LLMs are of course highly "bitter lesson pilled" indeed, just look at LLM scaling laws where if you put compute on the x-axis, number go up and to the right. So it's amusing to see that Sutton, the author of the post, is not so sure that LLMs are "bitter lesson pilled" at all. They are trained on giant datasets of fundamentally human data, which is both 1) human generated and 2) finite. What do you do when you run out? How do you prevent a human bias? So there you have it, bitter lesson pilled LLM researchers taken down by the author of the bitter lesson - rough! In some sense, Dwarkesh (who represents the LLM researchers viewpoint in the pod) and Sutton are slightly speaking past each other because Sutton has a very different architecture in mind and LLMs break a lot of its principles. He calls himself a "classicist" and evokes the original concept of Alan Turing of building a "child machine" - a system capable of learning through experience by dynamically interacting with the world. There's no giant pretraining stage of imitating internet webpages. There's also no supervised finetuning, which he points out is absent in the animal kingdom (it's a subtle point but Sutton is right in the strong sense: animals may of course observe demonstrations, but their actions are not directly forced/"teleoperated" by other animals). Another important note he makes is that even if you just treat pretraining as an initialization of a prior before you finetune with reinforcement learning, Sutton sees the approach as tainted with human bias and fundamentally off course, a bit like when AlphaZero (which has never seen human games of Go) beats AlphaGo (which initializes from them). In Sutton's world view, all there is is an interaction with a world via reinforcement learning, where the reward functions are partially environment specific, but also intrinsically motivated, e.g. "fun", "curiosity", and related to the quality of the prediction in your world model. And the agent is always learning at test time by default, it's not trained once and then deployed thereafter. Overall, Sutton is a lot more interested in what we have common with the animal kingdom instead of what differentiates us. "If we understood a squirrel, we'd be almost done". As for my take... First, I should say that I think Sutton was a great guest for the pod and I like that the AI field maintains entropy of thought and that not everyone is exploiting the next local iteration LLMs. AI has gone through too many discrete transitions of the dominant approach to lose that. And I also think that his criticism of LLMs as not bitter lesson pilled is not inadequate. Frontier LLMs are now highly complex artifacts with a lot of humanness involved at all the stages - the foundation (the pretraining data) is all human text, the finetuning data is human and curated, the reinforcement learning environment mixture is tuned by human engineers. We do not in fact have an actual, single, clean, actually bitter lesson pilled, "turn the crank" algorithm that you could unleash upon the world and see it learn automatically from experience alone. Does such an algorithm even exist? Finding it would of course be a huge AI breakthrough. Two "example proofs" are commonly offered to argue that such a thing is possible. The first example is the success of AlphaZero learning to play Go completely from scratch with no human supervision whatsoever. But the game of Go is clearly such a simple, closed, environment that it's difficult to see the analogous formulation in the messiness of reality. I love Go, but algorithmically and categorically, it is essentially a harder version of tic tac toe. The second example is that of animals, like squirrels. And here, personally, I am also quite hesitant whether it's appropriate because animals arise by a very different computational process and via different constraints than what we have practically available to us in the industry. Animal brains are nowhere near the blank slate they appear to be at birth. First, a lot of what is commonly attributed to "learning" is imo a lot more "maturation". And second, even that which clearly is "learning" and not maturation is a lot more "finetuning" on top of something clearly powerful and preexisting. Example. A baby zebra is born and within a few dozen minutes it can run around the savannah and follow its mother. This is a highly complex sensory-motor task and there is no way in my mind that this is achieved from scratch, tabula rasa. The brains of animals and the billions of parameters within have a powerful initialization encoded in the ATCGs of their DNA, trained via the "outer loop" optimization in the course of evolution. If the baby zebra spasmed its muscles around at random as a reinforcement learning policy would have you do at initialization, it wouldn't get very far at all. Similarly, our AIs now also have neural networks with billions of parameters. These parameters need their own rich, high information density supervision signal. We are not going to re-run evolution. But we do have mountains of internet documents. Yes it is basically supervised learning that is ~absent in the animal kingdom. But it is a way to practically gather enough soft constraints over billions of parameters, to try to get to a point where you're not starting from scratch. TLDR: Pretraining is our crappy evolution. It is one candidate solution to the cold start problem, to be followed later by finetuning on tasks that look more correct, e.g. within the reinforcement learning framework, as state of the art frontier LLM labs now do pervasively. I still think it is worth to be inspired by animals. I think there are multiple powerful ideas that LLM agents are algorithmically missing that can still be adapted from animal intelligence. And I still think the bitter lesson is correct, but I see it more as something platonic to pursue, not necessarily to reach, in our real world and practically speaking. And I say both of these with double digit percent uncertainty and cheer the work of those who disagree, especially those a lot more ambitious bitter lesson wise. So that brings us to where we are. Stated plainly, today's frontier LLM research is not about building animals. It is about summoning ghosts. You can think of ghosts as a fundamentally different kind of point in the space of possible intelligences. They are muddled by humanity. Thoroughly engineered by it. They are these imperfect replicas, a kind of statistical distillation of humanity's documents with some sprinkle on top. They are not platonically bitter lesson pilled, but they are perhaps "practically" bitter lesson pilled, at least compared to a lot of what came before. It seems possibly to me that over time, we can further finetune our ghosts more and more in the direction of animals; That it's not so much a fundamental incompatibility but a matter of initialization in the intelligence space. But it's also quite possible that they diverge even further and end up permanently different, un-animal-like, but still incredibly helpful and properly world-altering. It's possible that ghosts:animals :: planes:birds. Anyway, in summary, overall and actionably, I think this pod is solid "real talk" from Sutton to the frontier LLM researchers, who might be gear shifted a little too much in the exploit mode. Probably we are still not sufficiently bitter lesson pilled and there is a very good chance of more powerful ideas and paradigms, other than exhaustive benchbuilding and benchmaxxing. And animals might be a good source of inspiration. Intrinsic motivation, fun, curiosity, empowerment, multi-agent self-play, culture. Use your imagination.
419
27 Sep 2025
Congrats @robrombach @andi_blatt ! 🌲💥
FLUX is now also in Meta AI. What a day. 🥵
3
501
27 Jan 2025
Open source winning means consumers and enterprise win, distribution and application layers win, we all build on top of these innovations together, advancing society, and it's fantastic to see companies across the world taking this approach regardless of country, but especially heartening to see in this context specifically.
24 Jan 2025
Nice job! Open research / open source accelerates progress.
1
603
22 Jul 2024
Imagine @JeffBezos leading team USA. @lexfridman : please ask him if he will run. He'll win. It will be good.
14 Dec 2023
Here's my conversation with @jeffbezos, founder of Amazon and Blue Origin. This is his first time doing a long-form conversation of this kind, and it was an epic one. It's here on X in full & is up on YouTube, Spotify, and everywhere else. Timestamps: 0:00 - Introduction 0:24 - Texas ranch and childhood 4:02 - Space exploration and rocket engineering 16:36 - Physics 26:10 - New Glenn rocket 1:08:59 - Lunar program 1:18:55 - Amazon 1:36:16 - Principles 1:54:56 - Productivity 2:05:34 - Future of humanity
695
1. And I always try to "write" emails away from my keyboard. Then typing them is easy/fast :-)
Running and swimming are 100% where I get my best ideas, I just run a bit slower to make sure I keep some blood for the brain. You just need to find what’s best for you.
1,009
Part 14:15 ✅
"The best thing founders can do is subtraction. It's much much easier to add things than it is to remove." Tobi is acutely brilliant & focused; it's why working with him the last 12yrs has been the privilege of my life. Great episode @tobi & @shaneparrish bit.ly/3TyLXAk
698
11 Jan 2024
Well presented by brilliant entrepreneur running one of the companies that might ultimately measurably impact GDP @amasad @replit 💫✨
9 Jan 2024
My Ted talk is on YouTube. I did not watch it so I’m not sure if any good. But I tried to envision what the future of AI-powered software creation will look like in the next few years, and what it means for the world. Of course, the best way to predict the future is to build it.
3
787
17 Nov 2023
Wow, @sedielem ! I'm so excited for this! volunteering trusted tester please 🙋‍♂️🎶
5-6 years ago I was working on music generation at DeepMind, but let me tell you, this is... something else. Incredibly excited to be able to finally share what our team has been working on!
3
812
Adam Perold retweeted
1/ We’ve submitted a letter to President Biden regarding the AI Executive Order and its potential for restricting open source AI. We believe strongly that open source is the only way to keep software safe and free from monopoly. Please help amplify.
113
884
2,665
2,056,794
After a long week, LLM’s always love to hit the dance floor and Prisencolinensinainciusol! 🕺💃✨
Italian singer Adriano Celentano released a song in the 70s with nonsense lyrics meant to sound like American English, apparently to prove Italians would like any English song. It was a hit
2
554
21 Oct 2023
💛
21 Oct 2023
A reminder that people can disagree about important things but still be good friends.
1
358
21 Sep 2023
It will be enlightening in 5 years to list how many fundamental advancements in society were built on top of open source AI technologies, and how much credit is due to the efforts of the people working tirelessly for it today @ylecun @MetaAI @huggingface
21 Sep 2023
My opening statement at the Senate Intelligence Committee yesterday.
2
5
70
161,977
17 Sep 2023
🔥🔥
15 Sep 2023
I began working on Transformers in 2018 at @StanfordAILab -- training and inference in the early days was anything but fun. Today it takes 3 lines of code and zero setup time to use a multi-B LLM on @Replit. Releasing Replit ModelFarm is one of my career dreams come true. 1/ 🧵
1
558
27 Jul 2023
Exciting to cheer on one of the world's most-loved collaborative coding communities as they launch new innovations in the generative coding era! The heart and soul of StackOverflow can't be replicated. Congrats @pchandrasekar @StackOverflow !
Today we officially launch the next stage of community and AI here at @StackOverflow: OverflowAI! Just shared the exciting news on the @WeAreDevs keynote stage. If you missed it, watch highlights of our announcements and visit stackoverflow.co/labs/?utm_s….
4
429