jsd

Tweets

Pinned Tweet

jsd

@datagenproc

Jan 2

Hi all, I'm interested in feedback! You can leave anonymous comments here: admonymous.co/jsd

2,962

jsd

@datagenproc

Jun 13

The moment is ripe for natural experiments on (R&D compute, labor) complementarities!

Adam Karvonen

@a_karvonen

Jun 13

Some Anthropic researchers are probably thrilled to have tons of extra research compute for the next few days with Fable turned off 😂

7,910

jsd

@datagenproc

15h

@PeterMcCrory @akorinek @tylercowen

458

jsd

@datagenproc

Jun 12

I think there are people in my broader community who sacrificed a lot / dealt with enormous stress to fix some things or keep the (good parts of the) show running in the wake of the FTX catastrophe, and who haven't gotten much praise for it. I want to extend my gratitude to them.

481

jsd retweeted

Epoch AI

@EpochAIResearch

Jun 11

How big a leap is Mythos in cyber capabilities? @timotheechauvin, @AlexBarry4, @js_denain, and @ansonwhho compiled the public evidence and found that while it’s unclear if Mythos was ahead of trend in discovering vulnerabilities, it represents a big jump in exploiting them. 🧵

350

33,578

jsd

@datagenproc

Jun 11

This is awesome work on an important topic.

Dewi Gould @dswg97

Jun 10

New paper! Think Fast: Estimating No-CoT Task-Completion Time Horizons of Frontier AI Models
@METR_Evals showed that models' time horizons have doubled every few months. We ask: what length of tasks can models complete without any CoT?

700

jsd

@datagenproc

Jun 10

2 virtues I care about, and want to embody more:
- wholesomeness
- seeing, and being willing to name, inconvenient/uncomfortable truths (think, Elephant in the Brain stuff)
These can often seem in tension. I'm interested in pointers for how to reconcile them.

2,611

jsd

@datagenproc

Jun 10

Some that come to mind: joecarlsmith.com/2022/12/23/… lesswrong.com/posts/ufBXmcpx…

On sincerity - Joe Carlsmith

Nearby is the country they call life.

joecarlsmith.com

163

jsd retweeted

Noah Smith 🐇🇺🇸🇺🇦🇹🇼

@Noahpinion

Jun 2

I recently wrote about the effort to end the torture of pigs. On Thursday, @Dwarkesh_sp, @AvitalBalwit, and @NanRansohoff are hosting an in-person party for folks in the Bay Area to support the effort! If you'd like to attend, sign up here! forms.gle/8EKxF6y6bnpYMfE38

Save Our Pigs

We're hosting two urgent events to bring together folks in the Bay to stop the Save Our Bacon Act being considered by the US Senate. If it passes, it will mean federal preemption for animal welfare,...

docs.google.com

685

113,955

jsd

@datagenproc

Jun 7

I find it surprisingly hard to predict whether someone lost their faith because of "problem of evil" vs "I just don't believe in the existence of God".
I'm in the latter camp, and expected the same of most people very similar to me, but that doesn't seem to be the case!

614

jsd

@datagenproc

Jun 5

reminder en.wikipedia.org/wiki/Freedo…

Freedom Riders - Wikipedia

en.wikipedia.org

202

jsd

@datagenproc

Jun 2

Enjoyed this podcast a lot. Quick clarification on Rosetta stone (≈ ECI)
① @rohinmshah says the ECI is mostly linear over time. I disagree with this, cf @AlexBarry4 and my recent analysis in "Have AI Capabilities Accelerated?" epoch.ai/publications/have-a…. (Note that this analysis is from April 2026, while the podcast was recorded in December 2025)
② I agree with something like "the ECI trend is overall very smooth compared to what you might expect from a statistical model that does not encode time information at all".
③ I think that this acceleration in ECI has come largely from increasing correlation between benchmarks and tasks that AIs are trained on directly. If we were looking at a broader set of tasks, including harder to benchmark tasks, we'd likely see less (or no) acceleration.

Have AI Capabilities Accelerated?

We investigate progress trends on four capability metrics to determine whether AI capabilities have recently accelerated. Three of four metrics show strong evidence of acceleration, driven by...

epoch.ai

Rob Wiblin

@robertwiblin

Jun 2

My best interview in some time. Rohin Shah leads AGI alignment/safety at DeepMind. And he has a lot of spicy personal takes:
- We probably won’t get catastrophic misalignment (00:49)
- Safety 'commitments' have severe limitations (10:38)
- The intelligence explosion probably isn't imminent (1:52:44)
- Why he's not working to pause AI advances (51:44)
- Pre-deployment evals aren't the right focus (for catastrophic risks) (37:41)
- Signalling concern for safety sometimes diverts resources from actually making AI safe (01:09:51)
- Reading AI thoughts is v useful for safety – and we'll probably be able to for years to come (54:17)
- Governance is somewhat more likely to be the bottleneck than alignment (43:55)
- Rohin's team doesn't have a veto, and that's OK (27:36)
- Central banks are a promising model for regulating AI (33:34)
Also:
- Google DeepMind's actual plan for building AGI safely (1:40:29)
- How external researchers can positively influence big AI companies (2:21:55)
- The roles GDM most needs to hire for (2:37:03)
On the 80,000 Hours Podcast. Links below - enjoy! (@rohinmshah)

2:48:27

2,869

jsd

@datagenproc

Jun 2

Separately, Rohin cautions against using the ECI to compare open and closed models, since open models are likely more overfit to benchmarks than closed ones. I agree that this is a major issue with this method, but I still think it's worth doing, if only to get a lower bound on the gap.
We're updating the Limitation section here to mention this limitation. epoch.ai/data-insights/open-…
I'm also excited about analyses that compare the gap on private vs public benchmarks, for example from @htihle here: lesswrong.com/posts/rJcCrXyE….

Open models lag state-of-the-art closed models by 4 months

Since January 2026, the most capable open-weight models have lagged frontier closed models by an average of four months, or 8 ECI points.

epoch.ai

949

jsd retweeted

Rob Wiblin

@robertwiblin

Jun 2

2:48:27

847

153,125

jsd

@datagenproc

Jun 1

I love the album Spirit Phone by Lemon Demon!

285

jsd

@datagenproc

Jun 1

music.youtube.com/playlist?l…

Spirit Phone - Album by Lemon Demon

Spirit Phone is the seventh studio album by Lemon Demon, a musical project created by American musician Neil Cicierega. The album was released digitally through Bandcamp on February 29, 2016, marking...

music.youtube.com

334

jsd

@datagenproc

Jun 1

A lot of philosophical ethics and metaethics seems very confused to me. I like "Base Camp for Mount Ethics", which IMO cuts through a lot of the confusion. nickbostrom.com/papers/mount…

739

jsd retweeted

tom cunningham

@testingham

May 29

I think most domains look like this at the moment: the returns to expenditure on agents diminish much more quickly than the returns to expenditure on human labor: (1/n)

708

182,541

jsd

@datagenproc

May 28

Ok, but what *does* the European Commission think about UDASSA?

2,111

jsd retweeted

Epoch AI

@EpochAIResearch

May 26

Help us produce the most useful work on AI by taking our 5-minute survey: docs.google.com/forms/d/e/1F… (You can sign up at the end to join our compensated user research panel.)

Epoch AI Survey

We're trying to learn what's most important to people who use Epoch AI's work so we can focus on the right things. This takes about 5 minutes.

docs.google.com

9,951