building aligned general learners. cofounder @si_pbc. follows do not imply endorsement.

Joined March 2020
i've wanted a competent version of this for so long. amazing.
Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇
7
776
kudos for making the right decision here quickly!
NEW: Anthropic is walking back Claude Fable 5's policy to covertly degrade performance for competing AI researchers, after facing fierce backlash. “We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” Anthropic tells WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”
9
638
devansh retweeted
delighted to have helped, and so incredibly excited for what the Foundation will do for AI Resilience :)
AI is advancing quickly. Society’s ability to manage its risks must advance just as fast. Today we’re sharing our vision for AI Resilience, with more than $130M in initial grants underway across bio-resilience, cyber-resilience, AI model safety, and AI’s impact on young people: openaifoundation.org/news/re…
1
5
60
7,681
is this not literally half of the combined company's total revenue??
Anthropic is paying $1.25B a month to SpaceX for compute
21
5
1,738
365,869
chatgpt for personal finances is absolutely amazing oml
8
639
we've been waiting for someone to do this since late 2024, when we switched away from hertz-dev and our audio work! huge congrats to our friends at thinky :)
Today we're sharing our work on interaction models. A new class of model trained from scratch to handle real-time interaction natively, instead of gluing it onto a turn-based one. youtu.be/A12AVongNN4
47
3,301
devansh retweeted
Back when we were raising our seed round, Lachy was one of the only people in Silicon Valley who saw our idea, immediately got it, and wrote the check that let us train FDM-1. Incredibly grateful to have him as an early supporter.
1
1
111
11,868
devansh retweeted
(4/5) One thing we’ve built is a “kittens” virtual machine that takes over the whole GPU and allows new kinds of co-optimization. We can go past the traditional sequential kernel model – for example, fusing entire training runs into a single kernel and even weirder stuff.
28
56
676
246,331
anthropic employee, asked how they're going to pay for the house they're buying in SF: "half cash, half stock"
13
1,027
vinay has been incredibly helpful in the mad dash to get compute allocation before it dries up! incredibly thankful.
.@devanshpandey and Galen are exceptional. Proud to be a small angel on this journey and watch them blow up. @mcannonbrookes @scottfarkas there are more than a few interesting partnership angles here.
1
3
1,910
devansh retweeted
Apr 30
Lachy called me up on Thanksgiving Day back in 2024 to offer to lead our seed round. At the time we had a few weeks of runway left and no one else had the conviction for a research bet. He's an incredibly impressive investor and an amazing person.
2
1
164
19,612
devansh retweeted
World class team achieving great results. We’re proud to have been early supporters.
We’ve raised $75M in new funding from Sequoia and Spark Capital—partnering with @sonyatweetybird, @MikowaiA, and @YasminRazavi, all of whom are deeply supportive of our long-term mission. We’ve also brought on angels & advisors including @karpathy, @tszzl, and @_milankovac_. ----- Our early results with FDM-1 moved computer use from a data-constrained regime to a compute-constrained one; this latest round of funding unlocks several orders of magnitude of compute scaling for that work. With the FDM model series we have a path to scale agentic capabilities through video pretraining, and we expect to achieve superhuman performance on general computer tasks in the same way that current language models have superhuman performance on coding tasks. We’re also now able to invest in the blue-sky research necessary to our long-term mission of building aligned general learners. To realize the civilizationally transformative impacts of AI, models must generalize far out of their training distributions, actively exploring and building skills in new environments. This capability represents a substantial shift from the current paradigm of model training. We believe that current alignment techniques are insufficient to predictably and safely steer a model with human-level learning capabilities, and so we’re doing work to study small versions of this problem in controlled environments to develop a science of alignment for general learners. We’re a team of 6 people in San Francisco. We’re hiring world-class researchers and engineers to help us achieve our mission. If that’s you, please get in touch.
2
2
55
5,796
devansh retweeted
VPT (openai.com/index/vpt/) blew my mind back in 2022 so I was very excited to see SI scale up the idea with FDM-1, but for knowledge work / computer use. Excited and looking forward to more!
16
21
401
50,210
devansh retweeted
Instead of predicting text tokens, @si_pbc learns to use a computer from raw screen data, predicting the next mouse movement, click, and keystroke from the pixels in front of it. This is the @Tesla FSD approach applied to knowledge work on computer screens. Excited to have not one but two @Tesla_AI goats @_milankovac_ and @karpathy join us on the cap table!
6
4
98
12,652
devansh retweeted
@G413N, @devanshpandey, and the @si_pbc team have been quietly building on the frontier of a new pre-training paradigm: foundation models that learn from raw video, not language and screenshots. They have built FDM-1, their first model; an 11M-hour computer-action dataset (the largest in the industry); a video encoder ~50x more token-efficient than the alternatives; and a 30-petabyte cluster racked in SF for under $500K. FDM-1 already extrudes CAD gears in Blender, fuzzes software, and drives a real car around San Francisco after an hour of fine-tuning. We at @sparkcapital could not be more thrilled to partner with them, alongside @sonyatweetybird and the @sequoia team.
3
7
46
8,268
devansh retweeted
New from me this morning: standard intelligence has raised $75m @ $500m to develop computer use models. Their hypothesis is that video pretraining gives a better action prior than text and screenshots ➡️ continual learning. And their training runs are very brat.
2
3
41
13,244
devansh retweeted
Apr 25
one day not so long from now human use of computers will be over and we can all go to the park
310
205
3,607
257,369
within 10%? in adversarial testing? for gummies? brb buying a pallet of these
Create gummies was sued in a false advertising class action today. The plaintiff says Create advertises that each 3-gummy serving contains 4.5g of creatine, but testing shows that the products contain about 10% less than that.
5
924