Joined March 2023
Pinned Tweet
Apr 30
I'm really grateful that we get to work with such thoughtful and mission-oriented investors. Exciting times ahead!
We’ve raised $75m in new funding from Sequoia and Spark Capital, partnering with @sonyatweetybird, @MikowaiA, and @YasminRazavi, all of whom are deeply supportive of our long-term mission. We’ve also brought on angels & advisors including @karpathy, @tszzl, and @_milankovac_.

Our early results with FDM-1 moved computer use from a data-constrained regime to a compute-constrained one; this latest round of funding unlocks several orders of magnitude of compute scaling for that work. With the FDM model series we have a path to scale agentic capabilities through video pretraining, and we expect to achieve superhuman performance on general computer tasks in the same way that current language models have superhuman performance on coding tasks.

We’re also now able to invest in the blue-sky research necessary for our long-term mission of building aligned general learners. To realize the civilizationally transformative impacts of AI, models must generalize far out of their training distributions, actively exploring and building skills in new environments. This capability represents a substantial shift from the current paradigm of model training. We believe that current alignment techniques are insufficient to predictably and safely steer a model with human-level learning capabilities, and so we’re studying small versions of this problem in controlled environments to develop a science of alignment for general learners.

We’re a team of 6 people in San Francisco. We’re hiring world-class researchers and engineers to help us achieve our mission. If that’s you, please get in touch.
Apr 30
Lachy called me up on Thanksgiving Day back in 2024 to offer to lead our seed round. At the time we had a few weeks of runway left and no one else had the conviction for a research bet. He's an incredibly impressive investor and an amazing person.
galen retweeted
VPT (openai.com/index/vpt/) blew my mind back in 2022, so I was very excited to see SI scale up the idea with FDM-1, this time for knowledge work / computer use. Excited and looking forward to more!
Apr 30
*hundreds of h100s, dozens of nodes. thousands soon!
New from me this morning: Standard Intelligence has raised $75m @ $500m to develop computer use models.
Their hypothesis is that video pretraining gives a better action prior than text and screenshots ➡️ continual learning.
And their training runs are very brat.
galen retweeted
!!! time for more scale and more MFU optimization fun stuff (spent a week making our gpu traces go brrrr)
galen retweeted
Delayed life update — I left @xai to join the amazing crew at @si_pbc. Loving the small team vibes and fast research cycle. Excited to show you what we’ve been cooking!
Feb 23
general intuition is really something special, it's been amazing to watch Pim go in an entirely new direction as a founder and blow it away on execution, the culture there is incredible and they're doing great work, honored to have made a difference :)
Very excited for the SI team - fun fact, General Intuition likely would not have existed without Galen and his early mentorship as I was getting started in the field after @lachygroom introduced us. Having mostly traditional researchers in my network, and nobody who was self-taught like Galen, it was great seeing people paving their own path and being so far ahead of the curve. Follow this team!
Feb 23
computer use is too important to relegate to post-training. this has been many months in the making, I'm super proud of what we've achieved as a team and excited to scale!
Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.
1 Feb 2025
we’re assembling a 30PB storage cluster in downtown sf. got custom engraved drives for people helping. dm if you’d like to drop by
31 Jan 2024
built a 4x4090 space heater recently, took abt a week of debugging to get it running nicely. Thread to add public knowledge---
3 Nov 2024
At Standard Intelligence we’ve been researching scalable cross-modality learning. We’re excited to share some early results in the form of 𝗵𝗲𝗿𝘁𝘇-𝗱𝗲𝘃, an open-source, first-of-its-kind base model for full-duplex conversational audio. 1/
3 Nov 2024
chatvae
27 May 2024
PSA: in PyTorch 2.3 the is_causal flag is no longer just a hint. You now need to set it to avoid silently falling back to the MemEff attention kernel, because the Flash kernel won't take any mask as input.
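To make the point above concrete, here is a minimal, dependency-free sketch (not PyTorch code, and not the author's) showing why a causal flag can stand in for an explicit mask tensor: both produce identical attention outputs. The helper names `attention` and `causal_mask` are hypothetical and exist only for this illustration. In PyTorch the analogous choice is, as far as I know, between calling `torch.nn.functional.scaled_dot_product_attention` with `is_causal=True` and passing an explicit `attn_mask`; which kernel gets selected in each case is the behavior the post describes and is not reproduced here.

```python
import math


def attention(q, k, v, mask=None, is_causal=False):
    """Single-head scaled dot-product attention over lists of row vectors.

    Illustrative only: `mask` is a boolean matrix (True = may attend),
    `is_causal` applies the lower-triangular rule without building a mask.
    """
    if mask is not None and is_causal:
        raise ValueError("pass either an explicit mask or is_causal, not both")
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for i, qi in enumerate(q):
        # Scaled scores of query i against every key j.
        scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k]
        for j in range(len(k)):
            if is_causal:
                allowed = j <= i          # rule applied on the fly
            elif mask is not None:
                allowed = mask[i][j]      # rule read from a materialized mask
            else:
                allowed = True
            if not allowed:
                scores[j] = float("-inf")
        # Numerically stable softmax over the (possibly masked) scores.
        top = max(scores)
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append([
            sum(w * vj[d] for w, vj in zip(weights, v)) / total
            for d in range(len(v[0]))
        ])
    return out


def causal_mask(n):
    """Explicit n x n lower-triangular boolean mask."""
    return [[j <= i for j in range(n)] for i in range(n)]


q = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
k = [[1.0, 2.0], [0.5, -1.0], [2.0, 0.0]]
v = [[1.0, 0.0], [0.0, 1.0], [3.0, 3.0]]

with_mask = attention(q, k, v, mask=causal_mask(3))
with_flag = attention(q, k, v, is_causal=True)
unmasked = attention(q, k, v)
```

Since the flag and the mask give the same result, a kernel that cannot accept a mask tensor can still serve the causal case as long as the caller communicates causality through the flag rather than through the mask.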
25 Apr 2024
deeply suspicious rn
25 Apr 2024
ignore the sinusoid loss before the cliff; that’s just me accidentally overflowing the scheduler
4 Feb 2024
pro tip: instead of buying a wake-up lamp you can just wire an industrial warehouse light to a smart plug
31 Jan 2024
setting the thermostat for heat in the morning.
31 Jan 2024
after lots of network debugging it's up with ssh/tailscale and working nicely. thanks to @kognise7 and @MiniUlisse for all the help getting this set up. Also thanks to @_pranavnt for help with proxy-purchasing to get around the 1-gpu nvidia limits.
31 Jan 2024
that's all that comes to mind for now. feel free to reach out if you have any questions about what worked and what didn't. Also consider just using cloud @sfcompute