retired!! former cofounder of @adeptailabs, vp engineering @openai and @amazon, and @google LLMs lead. all about type II fun.

Joined March 2009
24 Photos and videos
🫔 tons of respect for Teddy Collins et al!
We’re excited to introduce Inherent, a lab designed from scratch to build AI agents that discover new knowledge. The coming era of machine-driven scientific inquiry demands a new kind of research institution and a new kind of AI. To achieve our mission, we live within the experiment, recursively self-improving the entire research organisation.
We investigate questions including:
- What does ‘AI taste’ look like in the sciences, and how can we build an institution that embraces this new aesthetic of discovery?
- What new kinds of human-machine teaming will make the most of AI that can truly innovate?
- How can we build recursive self-improvement at the collective level that continually increases human agency over outcomes?
We have just closed a $50m seed round led by @IndexVentures and @radicalvcfund, with participation from other outstanding investors including NVentures (@nvidia's venture capital arm), @buildexante, Metaplanet, Macroscopic, @MythosVentures, Charlie Songhurst, @chalfs, @jluan, @dwarkesh_sp, @Thom_Wolf, @j_foerst and @maxjaderberg. We are advised by @matthewclifford. Inherent is a Public Benefit Corporation headquartered in London.
2
14
3,309
Nathan's a real OG of AI investing, is friends with a bunch of legit researchers, and did a lot to help me during the Adept days. Respect!
News! @airstreet has raised $232,323,232 for Fund III to back AI-first companies from the earliest stages in the US and Europe. Now the largest solo GP venture firm in Europe. Our third epoch begins today. Join us!
3
15
4,612
apparently this is what the chinese AI ecosystem thinks our grand american AI master plan is! (this is not a joke! forwarded to me by an attendee at a real chinese ai conference!)
238
335
3,545
1,043,158
David Luan retweeted
Excited to release PostTrainBench v1.0! This benchmark evaluates the ability of frontier AI agents to post-train language models in a simplified setting. We believe this is a first step toward tracking progress in recursive self-improvement 🧵:
46
94
723
174,511
I’ll be leaving Amazon at the end of this week to cook up something new! Thanks to the Adept deal, I’ve spent the last ~2 years learning from @ajassy et al while leading Amazon’s agents R&D effort and our San Francisco AI lab. As a childhood EC2 Micro Instance fanboy, it was fun to speedrun launching our own tier-1 AWS service. We scaled up the Adept agent recipes, did new RL research, and shipped it to AWS customers like Hertz, 1Password, and Amazon.com itself. And it's cool to see Nova Act on top of realevals.xyz (at least for now). There’s incredible work to be done at Amazon and I'm grateful for the opportunities to take on more here. But with AGI so close, I want to spend 100% of my time on teaching AI systems brand new capabilities. At OpenAI, I was lucky to incubate the first GPTs; at Adept, we went all-in on agents before anyone else–our tech/people now drive computer-use efforts at every major lab. I have a bet for what's next. ;) This wasn't an easy decision, and I'm sad to leave this wonderful team. I’m grateful for the trust our execs placed in me during an important moment for Amazon and the field. I'm excited to swing at the next idea!
21
2
301
31,913
Reiner is perhaps the strongest chip LLM person I have ever gotten to work with, and he and @MikeGunter_ have built just as legit of a team. I'm stoked to have been an investor in an early round. Tapeout in under a year! 😅
We’re building an LLM chip that delivers much higher throughput than any other chip while also achieving the lowest latency. We call it the MatX One. The MatX One chip is based on a splittable systolic array, which has the energy and area efficiency that large systolic arrays are famous for, while also getting high utilization on smaller matrices with flexible shapes. The chip combines the low latency of SRAM-first designs with the long-context support of HBM. These elements, plus a fresh take on numerics, deliver higher throughput on LLMs than any announced system, while simultaneously matching the latency of SRAM-first designs. Higher throughput and lower latency give you smarter and faster models for your subscription dollar. We’ve raised a $500M Series B to wrap up development and quickly scale manufacturing, with tapeout in under a year. The round was led by Jane Street, one of the most tech-savvy Wall Street firms, and Situational Awareness LP, whose founder @leopoldasch wrote the definitive memo on AGI. Participants include @sparkcapital, @danielgross and @natfriedman’s fund, @patrickc and @collision, @TriatomicCap, @HarpoonVentures, @karpathy, @dwarkesh_sp, and others. We’re also welcoming investors across the supply chain, including Marvell and Alchip. @MikeGunter_ and I started MatX because we felt that the best chip for LLMs should be designed from first principles with a deep understanding of what LLMs need and how they will evolve. We are willing to give up on small-model performance, low-volume workloads, and even ease of programming to deliver on such a chip. We’re now a 100-person team with people who think about everything from learning rate schedules, to Swing Modulo Scheduling, to guard/round/sticky bits, to blind-mated connections—all in the same building. If you’d like to help us architect, design, and deploy many generations of chips in large volume, consider joining us.
1
4
65
13,038
29 Aug 2025
Tried to maximize hot takes per unit time -- including one that @HarryStebbings asked @Benioff about ;) Really fun talking with Alex!
21 Aug 2025
I enjoyed chatting with Amazon's @jluan about what he has been up to since kickstarting its AGI / agents research lab last year. David has seen it all and is refreshingly candid. theverge.com/decoder-podcast…
3
1
20
8,913
16 Jul 2025
Since launch, it's been really cool to see Nova Act handle real agentic workflows for real enterprises, such as scaling out public benefits and QA testing. Knowledge work is much bigger than just chatting and coding! You can now take these agents to production. Use cases here:
Nova Act is now ⚡️ enterprise ready ⚡️ and we've added new capabilities to our preview to help you take your prototype to production, with 90% reliability across our early enterprise customer use cases!
2
1
30
8,438
16 Jul 2025
We're working on a really cool agent RL training recipe across a bajillion gym environments with core contributors from verl, sglang, Adept, and replay.io! Shoot me a message if you're interested :)
1
12
2,238
31 Mar 2025
Stoked about the first release from our new lab: our browser use agent lets you MapReduce over the web! This early preview moves us closer to reliable agents that learn from rewards across a wide range of digital and physical environments. Love our Adept Amazon team so much!
Meet Amazon Nova Act: an effortless way to build AI agents that can reliably use browsers 🧑‍💻 With our new model, compose robust steps into complex workflows; handle everything from bookings to QA testing. Getting started takes just 3 lines of code. See what Nova Act can do 🧵👇
4
7
75
11,536
12 Mar 2025
Incredible what happens when you bring the existing world knowledge and intuition of VLMs to the physical world... only a few more missing pieces in the recipe for AGI... really feeling the agi with this one
Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖 Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵 goo.gle/gemini2-robotics
3
1
18
4,715
David Luan retweeted
I’m excited to announce Tolan, our first Embodied Companion. With no launch or press we’ve quietly hit 500,000 downloads, over $1m in ARR, and a #1 app store category ranking. Today I’m also announcing our $10m seed round (more on that below) and sharing some of what we’ve learned building an AI companion for consumers.
62
52
841
219,932
David Luan retweeted
28 Jan 2025
1. Breakdown of DeepSeek V3 efficiency vs Llama 3:
- Better: 11x fewer FLOPs per token, thanks to MoE [37B vs 405B activated params]
- Better: 2x faster numerics [fp8 vs bf16 training]
- Worse: 0.5x flops utilization [16% vs 33% end-to-end MFU*]
- Neutral: similar hardware platform [H800 and H100 both have 2Pflops/s dense fp8]
- Neutral: same training data volume [14.8T vs 15T tokens]
Llama 3’s design was obviously and intentionally conservative: dense model (not MoE), bf16 training (not fp8), GQA attention (not cheaper alternatives). DeepSeek benefited by being aggressive on all these fronts, at the cost of being later to market.
2. The core algorithmic improvements were already known; the closed source LLM labs were probably already doing similar things. DeepSeek’s improvements are real, but far more modest than the Llama comparison would suggest; my wild guess is closer to 1.5x improvement. MoE was published in 2017; in 2021 Switch Transformer reported 7x speedups vs dense models, similar to DeepSeek’s 11x. OpenAI is widely rumored to have been using MoE models for years. NVIDIA published their fp8 training paper in 2022.
3. NVIDIA’s stock price is down 15% after DeepSeek. Should it be? LLM compute is like a gas: it expands to fill the available budget. Over the last 3 years the labs have grown their budgets, despite algorithms and hardware improving. There’s no reason to expect this to change now: you win by making the best model, not by shrinking your budget. The more meaningful question: do algorithmic improvements like DeepSeek’s mean that margins will shift from hardware vendors to labs? Hard to see why. Algorithmic improvements are quickly copied from one lab to another, making it hard for them to maintain technological differentiation. Hardware improvements take much longer to copy.
7
40
383
68,335
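The ratios quoted in the DeepSeek V3 vs Llama 3 breakdown above can be checked with a few lines of arithmetic. The sketch below uses only the figures stated in the thread (37B vs 405B activated params, 2x numerics, 16% vs 33% MFU). Multiplying the three factors into one number is an assumption made here for illustration; the thread itself does not state a combined figure, and the variable and function names are hypothetical.

```python
# Figures quoted in the thread (DeepSeek V3 vs Llama 3).
LLAMA3_ACTIVE_PARAMS_B = 405.0   # dense model: all params active per token
DEEPSEEK_ACTIVE_PARAMS_B = 37.0  # MoE: activated params per token
NUMERICS_SPEEDUP = 2.0           # fp8 vs bf16 training, as stated in the thread
DEEPSEEK_MFU = 0.16              # end-to-end MFU quoted for DeepSeek V3
LLAMA3_MFU = 0.33                # end-to-end MFU quoted for Llama 3


def naive_combined_factor(flops_ratio: float, numerics: float, mfu_ratio: float) -> float:
    """Multiply the three factors together.

    Assumption for illustration only: the factors are treated as
    independent and multiplicative, which the thread does not claim.
    """
    return flops_ratio * numerics * mfu_ratio


# FLOPs per token scale roughly with activated params, so the ratio of
# activated params gives the "11x fewer FLOPs per token" figure.
flops_ratio = LLAMA3_ACTIVE_PARAMS_B / DEEPSEEK_ACTIVE_PARAMS_B

# Utilization ratio behind the "0.5x flops utilization" figure.
mfu_ratio = DEEPSEEK_MFU / LLAMA3_MFU

combined = naive_combined_factor(flops_ratio, NUMERICS_SPEEDUP, mfu_ratio)

print(f"FLOPs-per-token ratio: {flops_ratio:.2f}x")
print(f"MFU ratio:             {mfu_ratio:.2f}x")
print(f"Naive combined factor: {combined:.2f}x")
```

Under these assumptions the rounded figures in the thread (11x and 0.5x) fall out directly. The naive product compares DeepSeek V3 only against Llama 3, which is why the thread's own guess of about 1.5x, measured against what closed labs were probably already doing, is so much smaller.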
David Luan retweeted
new sf ai lab at amazon led by @jluan (ex-adept) and @pabbeel (ex-covariant/berkeley)! exciting to see :)
9 Dec 2024
!! @pabbeel and I are building a new AI research lab in SF for Amazon! We’re focused on the remaining major problems to build generally intelligent agents and are looking for a few dozen intrinsically motivated people to join our team and work with the Adept folks here. DM me!
1
1
22
8,395
9 Dec 2024
!! @pabbeel and I are building a new AI research lab in SF for Amazon! We’re focused on the remaining major problems to build generally intelligent agents and are looking for a few dozen intrinsically motivated people to join our team and work with the Adept folks here. DM me!
9 Dec 2024
Super-excited about what's ahead. Want to move the AI research frontier? Join us! amazon.science/blog/amazon-o… AGI-SFLab-Jobs@amazon.com
19
17
386
72,287
9 Dec 2024
By putting together a super talent-dense team focused on a small number of targeted research bets, we’re able to maximize the amount of compute per capita -- aka do more wild experiments faster.
3
2
37
8,984
9 Dec 2024
It’s been cool to get the space to do longer-term R&D here, and we’re interested in problems ranging from combining LLMs with RL to world models for computer agents!
1
20
3,059
27 Sep 2024
Ending Twitter hibernation to congratulate @mnaficy and team on @arcade_ai's launch--I've loved playing with it, turning natural language into physical objects. First product shape I've ever seen that could only be possible with diffusion models! Proud to be a (small) backer!
26 Sep 2024
1/ Announcing $17M in total fundraising to build Arcade, the world's first AI product creation platform
15
4,170
28 Jun 2024
Returning from a long hiatus to say -- this was an exceptionally fun podcast thanks to @HarryStebbings! Give it a watch for a collection of hot takes on path to AGI, limitations, hardware-model vertical integration, and the crucial role that interaction design now plays...
Replying to @HarryStebbings
3. Why Every Cloud Provider Must Have a Model Play As models become smarter, they’ll become the base computing primitive. The logic of software will be handled by LLMs in the future. Whoever controls the model layer controls all of the underlying compute.
4
1
36
4,814
28 Jun 2024
Shoutout to @HarryStebbings who unbeknownst to me logged onto this podcast filming having just had his wisdom teeth removed. Legend!
1
10
2,074