(me talking with confucius) oh maybe i could use this time loop to kickstart alignment millenia early! okay so in the future people are going to create new intelligent life -
kongzi: like children?
if any anthropic employees would like to put money where their mouth is on opus 4.5 self-reported uplift numbers (results resolving to METR-style RCT), i am keen to make bets with you. x.com/HjalmarWijk/status/199β¦
Anthropic says in their system card that *all* their AI R&D evals are close to saturation, and report a median self-reported uplift of 2X (mean over 3X!) for power users. They provide very little evidence ruling out imminent dramatic AI R&D acceleration.
x.com/eli_lifland/status/199β¦
In 2024, OP's Technical AI Safety team had 2 grantmakers and spent $40m. In 2025, we had 3 and spent $130m. If you join the team, it will enable us to spend even more next year, and weβll be directly influenced by your takes.
Come work with me!
I am proud to announce I am founding Node, an independent Nodule on a perpendicular vector in the Network of Centers of Alignment for AI Alignment Centers
I don't think @So8res and @ESYudkowsky have an extreme view. If we build superintelligence with anything remotely like our current level of understanidng, the idea that we retain control or steer the outcome is AT LEAST as wild as the idea that we'll lose control by default
OKAY PEOPLE: If I've gained any good will through all these years of tweeting, PLEASE PLEASE PLEASE like this tweet so I can get an advance copy of the book
I'M BEGGING