Cyclist, AI safety researcher, and former physicist. I'm bad at Twitter fights, so go easy on me.

Joined December 2011
52 Photos and videos
Pinned Tweet
Since nobody took me up on my $60 bounty for solving the alignment problem, I made a website for it: sixtybucks.org Would someone please just solve it? I'm getting kind of tired of this whole AI risk thing.

1
4
38
1,821
At AI Impacts, we started looking into evolution vs engineering solutions to problems, and (as I recall), we only looked carefully at flight efficiency. The short answer is that human engineers and biology hit similar lift:drag ratio limits. wiki.aiimpacts.org/doku.php?…

Cultivated meat is not on track to ever be economically competitive with animal meat. Modern factory farms have cut out almost all costs that aren't strictly biologically necessary for the production of muscle. And animal evolution has been optimizing the conversion of energy into muscle for billions of years. That means that cultivated meat needs to beat billions of years of accumulated evolutionary efficiency to become competitive. That's an extremely hard challenge. It's maybe even harder than building AGI, since evolution has only been optimizing for intelligence for tens of millions of years.
1
84
Richard Korzekwa retweeted
Open-weight LLMs ship with safety training that can be stripped in a few hundred fine-tuning steps. Can current defenses stop this? We built and open-sourced TamperBench, the first unified framework for evaluating tamper resistance, and the answer is mostly no. 1/7
1
14
28
4,422
This is sort of tough because if I give an example of a technology not developed/deployed you can say "not yet" or "not possible" or something. Anyway: - human cloning (not deployed) - nuclear power (way underdeployed vs value) - geoengineering (neither developed nor deployed) - human challenge trials (way underdeployed vs value) Jeffrey and I (mostly Jeffrey) worked on this a few years ago: aiimpacts.org/what-weve-lear…
Replying to @LRudL_
What is the evidence for this claim?
2
1
12
741
lol at this graph from that post
1
4
103
Richard Korzekwa retweeted
I’ve said enough about my disagreements with some of the ideas below, or at least with the certainty Pope Leo is expressing about them, but can we also reflect for a moment how wild it is that the Pope is tweeting stuff like this? It feels lifted from a screenplay about takeoff.
Artificial intelligences do not undergo experiences, do not possess a body, do not feel joy or pain, do not mature through relationships, and do not know from within what love, work, friendship or responsibility mean. Nor do they have a moral conscience, since they do not judge good and evil, grasp the ultimate meaning of situations, or bear responsibility for consequences. They may imitate or even simulate, but they do not understand what they produce, for they lack the affective, relational, and spiritual perspective through which human beings grow in wisdom. #MagnificaHumanitas
37
15
380
32,854
I don't think it's a log scale thing. Power output on a bike spans maybe 3-4x from median to world record. It's just that 1.5x in athletic performance seems huge. If it takes you 30 min to climb a hill, and your athletic friend 20, it's hard to imagine anyone can do it in 10.
Humans are really bad at grokking log scales, i.e., “there’s levels to this shit” Among my friends I’m the insanely fit cyclist guy; on my weekly group ride I’m one of the slowest. The fastest guy there couldn’t sniff a pro contract This applies to literally everything
2
6
540
Hell yeah.
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
1
83
It's been more than a week, and @sleepinyourhat still hasn't answered this question. What is he hiding?
Sam, what we all really want to know: What kind of sandwich?
1
5
2,667
I checked it out today and it's a nice little shop. Most of the items aren't things I particularly want to buy, and some of the pricing seems a bit steep, but on the other hand, I bought the same mug for the same price as Daniel, so who am I to say it's wrong? Luna called while I was there and we had a short, awkward, friendly conversation. After that, I chatted with the employee who was there for a bit and she seemed pretty good natured and happy to talk about the experience.
Not open right now despite the website andon.market saying its hours are 10 am to 7 pm Monday thru Sunday :(
4
269
The card reads to me like Anthropic understands there are nontrivial issues with their current process for ensuring safety and alignment. They clearly put a ton of work into evaluating Mythos, and they clearly know a lot of not-so-obvious places to look for shortcomings in these evaluations. Also, they shared many pages of quantitative results and anecdotes, which is awesome. But, FWIW, I did not read all that much concern in the language. I think there is good reason to suspect this process and anything like it are wholly inadequate for the next generation of models, much less whatever comes after that, and I did not get a "this shit is not gonna cut it" vibe from the model card at all. Did I just not read it charitably enough? Did I not apply a strong enough adjustment for "this passed through some PR team's editing"? IDK, maybe. But either way, that's how it comes across to me and ~everyone who read it with me yesterday.
Replying to @So8res
I agree this is is often a source of epistemic slipperiness but in the particular case of the Mythos Preview system card, I feel like "this shit is not gonna cut it for ASI and that is concerning" was actually relatively well signposted!
1
7
757
The crack-cocaine Claude figured staying sandboxed was for suckers.
1
2
170
Richard Korzekwa retweeted
I'm at the Munich Security Conference. It's interesting. It feels a lot like a normal academic conference, except it's packed with generals and senators and there's armed police pouring out or every orifice. 🧵
2
12
190
8,931
Name one thing @davidshor doesn’t have polling data for. I’ll wait.
Replying to @Bonecondor
It’d guess it’s because aisle preference is correlated with income
2
158
Man, I really don't like wading into things like this on twitter, but I also don't want there to be a false consensus here. So: I am in strong opposition to ICE's conduct over the past year. I'm not informed enough to have an opinion on ICE's conduct before that. Large-scale deportation is not a priority for me, but if it is for the American public, I think probably it should be a priority for the government. I don't have "a plan" because I don't know how to do things like this. But if the current tactics are all that DHS can come up with, that's a skill issue.
Ask anyone opposed to ICE’s conduct what their plan is for conducting deportations at scale. To a person, they don’t want that to happen at all. All this haggling over details is theater. You either want mass deportations or you don’t. All of someone’s takes can be predicted based on this.
1
187
I think what Epoch did here is very reasonable. But the earlier data doesn't show a clear trend, and I'm not really sold on the ~doubling of the rate of progress. Based on my very lazy Google sheets analysis, I'm inclined to model it as one trend. That said, I suppose my best guess on the best rate for modelling near-future progress might be the second slope?
AI capabilities accelerated in 2024! According to our Epoch Capabilities Index, frontier model improvement nearly doubled, from ~8 points/year to ~15 points/year.
2
209
Richard Korzekwa retweeted
24 Dec 2025
we’re going to have a century of christmas cheer compressed into five years
20
88
1,845
102,172
I don't think I'm an "Ai bro", but: It's the same reason I don't learn carpentry, lens making, farming, tailoring, welding, masonry, baking, medicine, metalworking, basket weaving, etc, etc, etc Occasionally, I encounter people who take their bike to a mechanic for something very simple and think "why don't you just learn bike maintenance. It's not hard and it's pretty fun". But of course the answer is that there are way more things you could learn than you have time to learn, so you will need to pass on almost all of them.
21 Dec 2025
Ai bros I have genuine question, why don’t you want to learn the skills required to be an artist? Why go to AI? Anyone is capable of learning it just takes some time. I really encourage you to learn how to draw or make 3d models. It’s a really fun journey and so much more fulfilling than using ai
3
126