Timaeus is an AI Safety Research Organisation working on Singular Learning Theory and Developmental Interpretability.

Joined June 2023
2 Photos and videos
Pinned Tweet
Timaeus is joining ⊢ Sequent Research!
We are starting a new, nonprofit alignment organization, ⊢ Sequent Research, bringing together researchers previously on UK AISI’s Alignment Team, Timaeus, and elsewhere to research how to align superintelligence. We are hiring! 🧵
1
3
61
4,285
Timaeus retweeted
Please reach out if you’re interested in working with us! Sequent will have a large in-person presence in Berkeley, as well as researchers remote from London, Melbourne, and elsewhere. 🇺🇸🇬🇧🇦🇺 1. Full post: sequent.org/launch 2. Express interest: sequent.org/apply
5
3
79
3,775
RT @jesse_hoogland: I’m incredibly proud of what we’ve built with Timaeus and all the research we’ve accomplished. But given how much resea…
1
Timaeus retweeted
This is why a strong human component is still necessary in alignment research at Sequent. Maybe that’s you. You should consider dropping what you’re doing and helping. A lot of other theoretical and empirical research can be left to the ASI, but alignment can’t (responsibly).
1
1
29
2,414
Timaeus retweeted
“Neural networks are grown, not programmed” We’re changing that. Mechinterp investigates how models generalize beyond their training data by studying the resulting internal structure. We introduce patterning as the dual: given desired structure, determine what data produces it.
19
157
1,084
91,731
RT @jesse_hoogland: Well, we haven't solved all the problems yet, so the offer stands... If you want to solve deep scientific problems in…
4
Timaeus retweeted
Training Data Attribution (TDA) should account for learning dynamics! The same data can influence model behavior in dramatically different ways at different time points of training. We call for a shift towards stagewise data attribution and the study of influence dynamics. 1/11
1
11
45
4,367
Timaeus retweeted
How does training data shape model behavior? Well, it’s complicated… 1/10
15
140
971
96,424
We're hiring for research engineers! Apply by *this Sunday* (Sep 14th)!
1
4
19
6,947
Using singular learning theory, we study how data shapes the structures that neural networks learn during training, and how those structures enable generalization. This gives us a unique lens on interpretability and alignment. x.com/danielmurfet/status/19…

Neural networks are grown, not programmed. What does that growth process look like? Like this! This is a small language model (3M) across training, visualised with a new interpretability technique: susceptibilities. We call this handsome critter the rainbow serpent.
1
2
414
If you're excited to support our researchers and contribute to this agenda, apply today! airtable.com/appMGBqrZQpiKXR… More details: timaeus.co/blog/updates/2025…

3
245
Timaeus retweeted
1/ AI is accelerating. But can we ensure that AIs truly share our values and follow our goals? We argue that aligning advanced AI systems requires cracking a core scientific challenge: how data shapes AI's internal structure, and how that structure determines behavior.
29
91
544
101,515
Timaeus retweeted
11 Feb 2025
Our paper has been accepted to ICLR as a spotlight paper! We introduced the refined Local Learning Coefficient, which measures “how much structure” there is in particular parts of the model associated to particular datasets or behaviors.
16 Oct 2024
1/ How do attention heads form? With our new approach, we show that attention heads have distinct developmental signatures. These signatures reveal how heads develop distinct functional roles specialized to different subsets of data. In the process, we discover a new circuit.
4
14
92
8,423
We’re hiring! We’re looking for research scientists, engineers, and a research lead to work on AI alignment through singular learning theory
1
9
36
9,406
Timaeus retweeted
1/8 How do transformers learn? In our new work, we find that transformers develop in-context learning in discrete stages that can be automatically discovered. 🧵 arxiv.org/abs/2402.02364 Joint work w/ @georgeyw_, Matthew Farrugia-Roberts, @lemmykc, Susan Wei, @danielmurfet
2
87
427
37,044