Using integers & coding performance at @InsomniacGames. Also @despair on the azure ceiling site and @crash@mastodon.gamedev.place

Joined December 2007
3,149 Photos and videos
Pinned Tweet
18 Jun 2021
Normalize checking whether things are true before believing them.
Tbf “we can’t monitor their thoughts so we can’t detect scheming” is how we have to interact with most humans anyway.
Mythos invented its own language, then switched back to English to talk to humans (AI safety researchers have been warning of this "Neuralese" risk for years. If AIs stop reasoning in English, we can't monitor their thoughts, which means we can't detect scheming.)
Elan Ruskin retweeted
trying to see past your own biases and apply a consistent moral logic isn't easy by any means, but quite a lot of people don't even bother
CyberAcme: oh hey you’re back early
Rook: Dire Marsh is haunted.
CyberAcme: what?
Rook: (switching to main shell, reentering queue) Dire Marsh’s haunted.
This is a good point for the facts-centric community. Mathematically, data centers just don’t consume a lot of water, and there’s plenty of land outside cities. But they do provoke a lot of carbon emissions because most new electricity is natural gas.
Have an iron law that the far left gets mad about water and the far right gets mad about land
“Backrooms” film review for Backrooms fans: Yes.
With all the video game movie & TV adaptations these days, I got to thinking about how someone could do a “Hotline Miami” movie. But then I remembered “John Wick” has already done it. youtu.be/Q_yNUtMjQuM?t=66s
OMG I JUST PUT IT TOGETHER “PROJECT HAIL MARY” IS LITERALLY DARMOK AND JALAD AT TANAGRA
yes I’m dumb
[GIF: Star Trek, “Darmok”]
“The blackmail rate”
Replying to @AnthropicAI
Finally, simple updates that diversify a model’s training data can make a difference. We added unrelated tools and system prompts to a simple chat dataset targeting harmlessness, and this reduced the blackmail rate faster.
Possibly unpopular opinion, but I’ve always felt that tomatoes are unambiguously fruit. Not in the “🤓 actually *botanically* speaking…” way, but because they’re obviously berries. They’re sweet and they’re full of seeds and they look like berries because that’s what they are.
“To Market, to market: The Rebranding of Billy Bailey”
It’s probably fine that vast swathes of zoomers and younger millennials exist in a kind of permanent internalized panopticon in which all actions are assumed to be (and interpreted as) performances for a viewer
PM: "Hey Elan, could you look at the load time in —" Me:
In times like these it’s nice that I get to spend a part of every workday stabbing people
Elan Ruskin retweeted
People suddenly realizing why data centers exist
A nice trick to show that he’s still reconstructing these memories and uncertain of the details:
Apr 13
He's doing something different every time it cuts to him LMFAOAK
I choose to believe that it’s because a critical mass of AI researchers read @nealstephenson and @GreatDismal and Isaac Asimov at just the right age for it to make a real difference.
maybe this is not yet clear, so let me state it plainly: as of right now Anthropic, and really a small number of individuals at Anthropic, has the capacity to directly attack and cause major damage to the United States Government, China, and generally global superpowers. government agencies like the NSA do not have internal models or defense capabilities that outclass frontier models.

if they chose to do so, they could likely exfiltrate top secret information from government systems, gain control over critical infrastructure including military infrastructure, sabotage or modify communications between members of government at the highest level, and potentially carry on activities for some time without detection. the thing about having access to a huge number of zerodays your adversaries don't know about is it gives you a massive asymmetric advantage.

they did not exploit this to gain power or destabilize the world order. they publicly released the information that they had these capabilities and worked to mitigate these flaws. you should be grateful american frontier labs have proven themselves remarkably trustworthy and concerned with the public good.

but it's critical you understand we are in a new regime. private entities now have power that directly rivals and impacts the government's monopoly on influence and violence. and anthropic is certainly not the only one, there's little chance OpenAI's internal models are far behind. this trend will accelerate on virtually every dimension, not slow down.

my prediction for how it plays out is the relatively imminent seizure and nationalization of labs by the US government, sometime over the next two years. it's very tough for me to see how they accept the existence of this kind of threat. but this adds a whole new class of governance issues, as then we've handed these extremely wide-reaching capabilities from private entities to public ones.
Claude Sonnet
Claude Opus
Claude Mythos
Claude Lore
NEWS: Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Instead, it is starting a 40-company coalition, Project Glasswing, to allow cybersecurity defenders a head start in locking down critical software. nytimes.com/2026/04/07/techn…

Before limited-releasing Claude Mythos Preview, we investigated its internal mechanisms with interpretability techniques. We found it exhibited notably sophisticated (and often unspoken) strategic thinking and situational awareness, at times in service of unwanted actions. (1/14)