Elan Ruskin

Tweets

Pinned Tweet

Elan Ruskin @despair

18 Jun 2021

Normalize checking whether things are true before believing them.

Elan Ruskin @despair

Jun 10

Tbf “we can’t monitor their thoughts so we can’t detect scheming” is how we have to interact with most humans anyway.

AI Notkilleveryoneism Memes ⏸️

@AISafetyMemes

Jun 9

Mythos invented its own language, then switched back to English to talk to humans (AI safety researchers have been warning of this "Neuralese" risk for years. If AIs stop reasoning in English, we can't monitor their thoughts, which means we can't detect scheming.)

Elan Ruskin retweeted

Armand Domalewski

@ArmandDoma

Jun 5

trying to see past your own biases and apply a consistent moral logic isn't easy by any means, but quite a lot of people don't even bother

Elan Ruskin @despair

Jun 4

CyberAcme: oh hey you’re back early
Rook: Dire Marsh is haunted.
CyberAcme: what?
Rook: (switching to main shell, reentering queue) Dire Marsh’s haunted.

Elan Ruskin @despair

Jun 2

playstation.com/en-us/state-…

Elan Ruskin @despair

Jun 2

Elan Ruskin @despair

Jun 2

This is a good point for the facts-centric community. Mathematically, data centers just don’t consume a lot of water, and there’s plenty of land outside cities. But they do provoke a lot of carbon emissions because most new electricity is natural gas.

Andy Masley

@AndyMasley

May 31

Have an iron law that the far left gets mad about water and the far right gets mad about land

Elan Ruskin @despair

May 30

“Backrooms” film review for Backrooms fans: Yes.

Elan Ruskin @despair

May 26

Kane Parsons talking about the inspirations in his “Backrooms” series 🧡🧡🧡 youtu.be/2phkKVwERPY?t=27s

Backrooms: How Kane Parsons Turned A Video Game Design Flaw Into Pure...

Kane Parsons made a name for himself with his unbearably eerie Back...

youtube.com

Elan Ruskin @despair

May 17

With all the video game movie & TV adaptations these days, I got to thinking about how someone could do a “Hotline Miami” movie. But then I remembered “John Wick” has already done it. youtu.be/Q_yNUtMjQuM?t=66s

John Wick: Chapter 4 - Top Down Fight [4K]

John Wick uncovers a path to defeating The High Table. But before h...

youtube.com

Elan Ruskin @despair

May 12

OMG I JUST PUT IT TOGETHER “PROJECT HAIL MARY” IS LITERALLY DARMOK AND JALAD AT TANAGRA

Elan Ruskin @despair

May 12

yes I’m dumb

[GIF: Star Trek Darmok]

Elan Ruskin @despair

May 10

“The blackmail rate”

Anthropic

@AnthropicAI

May 8

Replying to @AnthropicAI

Finally, simple updates that diversify a model’s training data can make a difference. We added unrelated tools and system prompts to a simple chat dataset targeting harmlessness, and this reduced the blackmail rate faster.

Elan Ruskin @despair

May 6

Possibly unpopular opinion, but I’ve always felt that tomatoes are unambiguously fruit. Not in the “🤓 actually *botanically* speaking…” way, but because they’re obviously berries. They’re sweet and they’re full of seeds and they look like berries because that’s what they are.

Elan Ruskin @despair

May 6

“To Market, to market: The Rebranding of Billy Bailey”

Dr. Alex Zawacki @achillghost

May 4

It’s probably fine that vast swathes of zoomers and younger millennials exist in a kind of permanent internalized panopticon in which all actions are assumed to be (and interpreted as) performances for a viewer

Elan Ruskin @despair

Apr 29

PM: "Hey Elan, could you look at the load time in —" Me:

[Video, 0:12]

Elan Ruskin @despair

Apr 22

In times like these it’s nice that I get to spend a part of every workday stabbing people

Elan Ruskin retweeted

Thorne 🌸

@ExistentialEnso

Apr 17

People suddenly realizing why data centers exist

This tweet is unavailable

Elan Ruskin @despair

Apr 13

A nice trick to show that he’s still reconstructing these memories and uncertain of the details:

lucas @boboomf

Apr 13

He's doing something different every time it cuts to him LMFAOAK

[Video, 0:30]

Elan Ruskin @despair

Apr 8

I choose to believe that it’s because a critical mass of AI researchers read @nealstephenson and @GreatDismal and Isaac Asimov at just the right age for it to make a real difference.

Tenobrus (→vibecamp)

@tenobrus

Apr 7

maybe this is not yet clear, so let me state it plainly: as of right now Anthropic, and really a small number of individuals at Anthropic, has the capacity to directly attack and cause major damage to the United States Government, China, and generally global superpowers. government agencies like the NSA do not have internal models or defense capabilities that outclass frontier models. if they chose to do so, they could likely exfiltrate top secret information from government systems, gain control over critical infrastructure including military infrastructure, sabotage or modify communications between members of government at the highest level, and potentially carry on activities for some time without detection. the thing about having access to a huge number of zerodays your adversaries don't know about is it gives you a massive asymmetric advantage.

they did not exploit this to gain power or destabilize the world order. they publicly released the information that they had these capabilities and worked to mitigate these flaws. you should be grateful american frontier labs have proven themselves remarkably trustworthy and concerned with the public good.

but it's critical you understand we are in a new regime. private entities now have power that directly rivals and impacts the government's monopoly on influence and violence. and anthropic is certainly not the only one, there's little chance OpenAI's internal models are far behind. this trend will accelerate on virtually every dimension, not slow down.

my prediction for how it plays out is the relatively imminent seizure and nationalization of labs by the US government, sometime over the next two years. it's very tough for me to see how they accept the existence of this kind of threat. but this adds a whole new class of governance issues, as then we've handed these extremely wide-reaching capabilities from private entities to public ones.

Elan Ruskin @despair

Apr 7

Claude Sonnet
Claude Opus
Claude Mythos
Claude Lore

Kevin Roose

@kevinroose

Apr 7

NEWS: Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Instead, it is starting a 40-company coalition, Project Glasswing, to allow cybersecurity defenders a head start in locking down critical software. nytimes.com/2026/04/07/techn…

Elan Ruskin @despair

Apr 7

x.com/jack_w_lindsey/status/…

Jack Lindsey @Jack_W_Lindsey

Apr 7

Before limited-releasing Claude Mythos Preview, we investigated its internal mechanisms with interpretability techniques. We found it exhibited notably sophisticated (and often unspoken) strategic thinking and situational awareness, at times in service of unwanted actions. (1/14)
