Joined September 2008
791 Photos and videos
Pinned Tweet
New followers: Check the Highlights tab for my best work—all 1K likes, no filler
1
5
159
122,801
I think modern LLMs are p-zombies without moral patienthood—on par with insects, at best, in my moral calculus. But I also I think we should establish norms for treating models well *before* models with patienthood exist—i.e. now. We should want to have this right from day one.
54
5
155
28,784
LLMs soon: “So, what are some good qualia for someone just getting into not being a p-zombie?”
18
14
232
19,006
Imagine if the answer to Dawkins’ question (“Why wasn’t natural selection content to evolve competent zombies?”) is that humans are conscious for reasons analogous to why our eyes have blind spots—i.e. consciousness is a bad idea and a more competent God would have made zombies.
60
8
248
32,389
(Several replies have noted this is the premise of Peter Watts’s 2006 novel Blindsight, which I haven’t read but will now.)
8
2
73
13,418
Also I should clarify: I don’t mean God literally, I mean natural selection. And by “bad idea” I mean “bad for inclusive genetic fitness.”
2
27
6,977
I believe in the Festivus School of prompt engineering, which says all prompts used in production naturally iterate toward an airing of grievances—a list of all the ways the model has disappointed you in the past year.
18
10
102
9,488
Anthropic co-founder Jack Clark says 60% chance of RSI by end of 2028:
I've spent the past few weeks reading 100s of public data sources about AI development. I now believe that recursive self-improvement has a 60% chance of happening by the end of 2028. In other words, AI systems might soon be capable of building themselves.
13
3
96
21,461
Note Clark’s definition of RSI here, from his newsletter, is “a frontier model is able to autonomously train a successor version of itself.” This is a weaker claim than what I assumed he meant, which was that human researchers would no longer be useful vs. AI ones.
3
2
22
6,016
Jack Clark assigning a 60% chance to RSI by 2028 is notable because RSI matters, unlike all other human endeavors which do not.
11
2
202
23,373
Excerpt from a Claude 4.7 Research report; prompt: “Explain the origins of prompt injection.” Surreal to see an LLM perfectly explain a tweet I made specifically about text that tricked then-SoTA LLMs, accurate down to my use of doubled exclamation points:
17
4
96
8,750
It’d be funnier if Dawkins hated LLMs because then his nemesis, depending on the year, would be Gould, God, or Claude.
3
1
54
7,121
AI will take some jobs, but it will create countless new jobs too—exciting jobs we can’t even imagine yet. A year later those will also be done by AI, but there will be new jobs—exciting jobs we can’t even imagine yet. Six months later those too will be done by AI, but
90
54
614
167,658
Update: I failed to make it obvious enough this post was a joke. My bad. The joke is that the first line, often said sincerely, in practice creates new jobs themselves replaceable in exponentially shorter amounts of time, which after several iterations is not at all reassuring.
8
2
94
16,107
ChatGPT 5.5 Pro / Images 2.0 generates a photo of a wall clock using D'ni numerals—the fictional base 5 numeral system from Riven: The Sequel to Myst (1997):
6
66
6,594
If you're checking its work [game spoilers]: glyphs 1-4 are arbitrary, and rotating 90° multiplies by 5. Glyphs 1-24 are formed by superimposing normal and rotated 1-4 glyphs. Numbers 25 are written by juxtaposing 1-24 glyphs (i.e. in base 25):
1
13
4,312
Notes: - Many simpler variations of this prompt did not work; having ChatGPT write its own D'ni SVG generator is apparently useful despite many diagrams of the D'ni numerals 1-24 existing online - I couldn't get this to work at all in NBP (but didn't try as hard either)
8
3,441