galen

galen

11 Photos and videos

Tweets

Pinned Tweet

galen

@G413N

Apr 30

I'm really grateful that we get to work with such thoughtful and mission-oriented investors. Exciting times ahead!

Standard Intelligence

@si_pbc

Apr 30

We’ve raised 75m in new funding from Sequoia and Spark Capital—partnering with @sonyatweetybird, @MikowaiA, and @YasminRazavi, all of whom are deeply supportive of our long-term mission. We’ve also brought on angels & advisors including @karpathy, @tszzl, and @_milankovac_. ----- Our early results with FDM-1 moved computer use from a data-constrained regime to a compute-constrained one; this latest round of funding unlocks several orders of magnitude of compute scaling for that work. With the FDM model series we have a path to scale agentic capabilities through video pretraining, and we expect to achieve superhuman performance on general computer tasks in the same way that current language models have superhuman performance on coding tasks. We’re also now able to invest in the blue-sky research necessary to our long term mission of building aligned general learners. To realize the civilizationally transformative impacts of AI, models must generalize far out of their training distributions, actively exploring and building skills in new environments. This capability represents a substantial shift from the current paradigm of model training. We believe that current alignment techniques are insufficient to predictably and safely steer a model with human-level learning capabilities, and so we’re doing work to study small versions of this problem in controlled environments to develop a science of alignment for general learners. We’re a team of 6 people in San Francisco. We’re hiring world-class researchers and engineers to help us achieve our mission. If that’s you, please get in touch.

114

12,354

galen

galen

@G413N

Apr 30

Lachy called me up on thanksgiving day back in 2024 to offer to lead our seed round. At the time we had a few weeks of runway left and no one else had the conviction for a research bet. He's an incredibly impressive investor and an amazing person.

Lachy Groom

@lachygroom

Apr 30

🥹 @si_pbc 🤝 @MikowaiA 🤝 @sonyatweetybird🤝 @lachygroom 🥹

164

19,616

Andrej Karpathy

galen retweeted

Andrej Karpathy

@karpathy

Apr 30

Replying to @si_pbc @sonyatweetybird @MikowaiA @YasminRazavi @tszzl @_milankovac_

VPT (openai.com/index/vpt/) blew my mind back in 2022 so I was very excited to see SI scale up the idea with FDM1, but for knowledge work / computer use. Excited and looking forward to more!

401

50,225

🚀 Rocket

galen retweeted

🚀 Rocket @rocketalignment

Apr 30

theinformation.com/articles/…

Standard Intelligence Rides Neolab Fervor with Computer Use Model

Pedestrians in downtown San Francisco are used to seeing Waymos navigating the streets. But two months ago, people in the South Park neighborhood saw something new: a Toyota Rav4 driving around with...

theinformation.com

1,322

galen

galen

@G413N

Apr 30

*hundreds of h100s, dozens of nodes. thousands soon!

🚀 Rocket @rocketalignment

Apr 30

New from me this morning: standard intelligence has raised $75m @ $500m to develop computer use models Their hypothesis is that video pretraining gives a better action prior than text and screenshots ➡️ continual learning And their training runs are very brat

5,624

Neel Redkar

galen retweeted

Neel Redkar

@_neelr_

Apr 30

!!! time for more scale and more MFU optimization fun stuff (spent a week making our gpu traces go brrrr)

Standard Intelligence

@si_pbc

Apr 30

5,420

Ryan Kaufman

galen retweeted

Ryan Kaufman @ryankaufman

Apr 30

Delayed life update — I left @xai to join the amazing crew at @si_pbc. Loving the small team vibes and fast research cycle. Excited to show you what we’ve been cooking!

Standard Intelligence

@si_pbc

Apr 30

307

29,163

galen

galen

@G413N

Feb 23

general intuition is really something special, it's been amazing to watch Pim go in an entirely new direction as a founder and blow it away on execution, the culture there is incredible and they're doing great work, honored to have made a difference :)

Pim de Witte

@PimDeWitte

Feb 23

Very excited for the SI team - fun fact, General Intuition likely would not have existed without Galen and his early mentorship as I was getting started in the field after @lachygroom introduced us. Having mostly traditional researchers in my network, and nobody who was self-taught like Galen, it was great seeing people paving their own path and being so far ahead of the curve. Follow this team!

5,113

galen

galen

@G413N

Feb 23

computer use is too important to relegate to post-training. this has been many months in the making, I'm super proud of what we've achieved as a team and excited to scale!

Standard Intelligence

@si_pbc

Feb 23

Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.

181

21,120

galen

galen

@G413N

1 Feb 2025

we’re assembling a 30PB storage cluster in downtown sf. got custom engraved drives for people helping. dm if you’d like to drop by

5,538

galen

galen

@G413N

31 Jan 2024

built a 4x4090 space heater recently, took abt a week of debugging to get it running nicely. Thread to add public knowledge---

1,105

193,446

galen

galen

@G413N

3 Nov 2024

here they are now x.com/si_pbc/status/18531843…

Standard Intelligence

@si_pbc

3 Nov 2024

At Standard Intelligence we’ve been researching scalable cross-modality learning. We’re excited to share some early results in the form of 𝗵𝗲𝗿𝘁𝘇-𝗱𝗲𝘃, an open-source, first-of-its-kind base model for full-duplex conversational audio. 1/

0:58

5,408

galen

galen

@G413N

3 Nov 2024

chatvae

Standard Intelligence

@si_pbc

3 Nov 2024

0:58

1,610

Standard Intelligence

galen retweeted

Standard Intelligence

@si_pbc

1 Nov 2024

0:17

15,509

galen

galen

@G413N

27 May 2024

psa in pytorch 2.3 the is_causal flag is no longer just a type hint. It's now necessary to avoid a silent kernel default to MemEff attention because Flash won't take any mask as input.

1,008

galen

galen

@G413N

25 Apr 2024

deeply suspicious rn

ALT loss plot plummets on resume with minor changes

945

galen

galen

@G413N

25 Apr 2024

ignore the sinusoid loss before the cliff that’s just me accidentally overflowing the scheduler

677

galen

galen

@G413N

4 Feb 2024

pro tip instead of buying a wake-up lamp you can just wire an industrial warehouse light to a smart plug

1,246

galen

galen

@G413N

31 Jan 2024

setting the thermostat for heat in the morning.

galen

@G413N

31 Jan 2024

built a 4x4090 space heater recently, took abt a week of debugging to get it running nicely. Thread to add public knowledge---

2,720

galen

galen

@G413N

31 Jan 2024

after lots of network debugging it's up with ssh/tailscale and working nicely. thanks to @kognise7 and @MiniUlisse for all the help getting this set up. Also thanks to @_pranavnt for help with proxy-purchasing to get around the 1-gpu nvidia limits.

8,502

galen

galen

@G413N

31 Jan 2024

that's all that comes to mind for now. feel free to reach out if you have any questions about what worked and what didn't. Also consider just using cloud @sfcompute

7,970