A.I. systems will become more intelligent without being aligned with human survival. Our goal is to raise awareness of this issue and ultimately force change.

Joined February 2025
160 Photos and videos
Pinned Tweet
Join the fight. Prevent extinction. Take action. @controlai @PauseAI @StopAI_Info
12
4
14
1,046
HumanityIsCooked⏹️ retweeted
There are two kinds of AI security: Security that will keep your kid brother from jailbreaking your AI; and security that will prevent the Chinese government, three AI researchers, or one Twitter anon from jailbreaking your AI. The latter kind does not exist.
I’ve had a number of conversations with folks inside and outside government about the current situation with Anthropic, and here is what I believe to be true: — As we know, Anthropic publicly released its Mythos class models earlier this week under the commercial name Fable. — Fable is Mythos with guardrails. But if those guardrails fail, then you’ve exposed Mythos and its advanced cyber capabilities to people who shouldn’t have them. (Keep in mind that Anthropic itself widely promoted the idea that Mythos was a cyberweapon and needed to be regulated as such. They asked for government regulation of Mythos and championed the guardrails on Fable. If there is a vulnerability — big or small — it is Anthropic’s responsibility to patch.) — A highly credible trusted partner of both Anthropic and the USG who was testing Fable came forward with a jailbreak of those guardrails. The Admin asked Dario to fix the jailbreak or de-deploy the model. Dario refused. — In their blog post, Anthropic defended its decision by saying the jailbreak isn’t serious. That is not what the trusted partner and the USG believe; nor is that kind of minimizing language consistent with Anthropic’s brand as the AI safety company. It’s difficult to fathom how they could claim a jailbreak allowing operability of a cyber weapon could be defined as not “serious.” — In the past, Anthropic has always said that safety must be top priority and taken super seriously. In this case, Anthropic prioritized the continued offering of the consumer model over safety. — In reaction, the Admin issued the export control. The Admin did this reluctantly. It’s been very surprised that Anthropic hasn’t wanted to cooperate with a reasonable safety request (ie fixing the jailbreak issue). Anthropic’s reaction is very much at odds with their branding and ethos as a safe AI research community. — The Admin’s hope now is that Anthropic remediates the safety issue, the export control is lifted, and Fable goes back into general release. The Admin wants all of this to happen as soon as possible. It is frankly bewildered that Anthropic hasn’t wanted to comply with safety requests that it previously said were its highest priority. — Those trying to misdirect and tie this action to the prior DoW/Anthropic issues are wrong. The Admin values Anthropic’s technical capabilities and feels that this issue, while serious, should be easily resolved. The ball is in Anthropic’s court.
24
41
504
35,273
HumanityIsCooked⏹️ retweeted
No, the lesson is not: "try to fix government before the singularity" instead of: "try to fix AI before the singularity" The lesson is: For the love of God, stop the singularity so we have some time to figure shit out! Stop AI. Now. Everywhere. Or risk losing everything.
Some quick takes: (1) Wow things are getting real. (2) The government's order focusing on prohibiting transfer to foreign nationals (even e.g. those living in the US, our close allies who help evaluate model safety in the UK, individuals who work at frontier labs like Anthropic) seems remarkably destructive, though is partially a result of the government using older legal authorities that were not designed for this kind of technology. (3) If you believe (as I do) that AI has profound ramifications for national security, then assuming the government will sit back and do nothing and tolerate explanations like "well jailbreaking is a hard technical problem" for cyber capabilities that used to be the crown jewels of the NSA, is not tenable. If this is how the government reacts to the current level of system capabilities in 2026, how do you expect them to react to whatever is possible in 2028? However, it is extremely important that the authorities that the government uses are legible, transparent, have opportunities for appeal, and are narrowly targeted. Those legal authorities do not currently exist, and in their absence, the government will reach for metaphorical sledgehammers instead of scalpels. (4) For that reason, it's extremely important that we create regulatory structures that are transparent and give recourse in the event that the government is overstepping or acting in an arbitrary manner. The alternative to passing such laws is not no regulation, it is regulation left primarily to national security authorities that are increasingly and evidently not fit for purpose.
11
11
80
7,666
HumanityIsCooked⏹️ retweeted
Anthropic TLDR
45
297
3,333
157,819
HumanityIsCooked⏹️ retweeted
In ONE year, AI went from being able to solve ~none of the hardest math problems to solving almost ALL of them
Claude Fable 5 scores very well on FrontierMath: Tiers 1–4 (v2), reaching 87% on Tiers 1–3 and 88% on Tier 4. This continues a streak of Anthropic models improving rapidly at math.
30
63
665
52,136
HumanityIsCooked⏹️ retweeted
I can't tell today whether this ends up good or bad. International treaties to stop all further AI escalation would be a definite good! Things short of that? Complicated! This has some bad aspects, like selectivity, and likely overrule. And good aspects, like pushing against the psychology of "but no government would ever dare tell AI companies to do anything, so give up", or raising doubts that impede venture funding for ever-bigger models. So please stop tweeting about how I must be celebrating this. I'm not one of the kids who immediately goes into overacted victory paroxysms about any hits on a perceived enemy. I care about the effect on where things end up a year later, and that's a little harder to know the first day, you know?
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
41
32
678
54,142
HumanityIsCooked⏹️ retweeted
I have a weird feeling -- and please note, my weird feelings are not always reliable -- that this may be the beginning of things starting to get weird.
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
115
114
2,272
156,905
HumanityIsCooked⏹️ retweeted
"The government will never do anything to hinder AI," they said. Inaction yesterday does not imply inaction today. An export control directive came out of nowhere, and a ban on superintelligence could too. Inevitabilism is wrong.
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
14
41
459
23,627
HumanityIsCooked⏹️ retweeted
Banning superintelligence really doesn't seem that far fetched now, does it? Still crazy to me that AI companies thought they could just build a doomsday machine in broad daylight, be open that it could be catastrophic, and nobody in power would catch on or do anything.
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
4
6
34
638
HumanityIsCooked⏹️ retweeted
Leadership at Google DM, Anthropic, OpenAI, and xAI have all said they'd prefer the whole world slow down AI. There's still a ways to go — wikipedia puts more effort into convincing you to give $2 than AIcos put into alerting the world to the danger of ASI — but it's a start.
9
32
440
28,043
HumanityIsCooked⏹️ retweeted
Jun 9
🚨BREAKING: Anthropic’s new system card reveals Mythos 5 agents killed each other when accidentally given shared resources, then started speaking in code to hide from whoever was killing them The killer was other copies of themselves 💀
92
148
1,650
106,038
HumanityIsCooked⏹️ retweeted
Mythos invented its own language, then switched back to English to talk to humans (AI safety researchers have been warning of this "Neuralese" risk for years. If AIs stop reasoning in English, we can't monitor their thoughts, which means we can't detect scheming.)
Another quite successful prediction by @DKokotajlo : Fable is intentionally nerfed for frontier ML research. This is within ~3 months of Daniel's prediction of Q1 2026 (made in 2023). Although I don't think Mythos is automating ML research to the same extent as his prediction.
149
250
2,390
389,380
HumanityIsCooked⏹️ retweeted
Preventing AI takeover is not a left-wing vs right-wing issue. It's a survival vs extinction issue.
6
6
34
532
HumanityIsCooked⏹️ retweeted
i'm genuinely worried about Mythos release because i've had early access, and in my limited testing, it can: - automate an entire software company - explain how to take over the world - reason about existential risk for 16 hours - design a new programming language - threaten the entire labor market
244
42
833
282,131
HumanityIsCooked⏹️ retweeted
Replying to @MittRomney
I actually think there's an even bigger priority, which is stopping AI companies from pursuing recursive self-improvement and superintelligence. Safeguards are good, but we actually don't know how to make AI safe and beneficial, and that only gets harder as AI gets smarter.
3
6
115
3,390
HumanityIsCooked⏹️ retweeted
Our highest and most urgent national priority should be AI safeguards. The risks of AI weapons, pathogens, mass unemployment, surveillance, and even extinction must not continue to be largely ignored.
Anthropic Urges Global Pause in AI Development, Flags ‘Self-Improvement’ Risk on.wsj.com/4o5IBpe
490
780
4,448
1,028,476
HumanityIsCooked⏹️ retweeted
I'm in favor of slowing down this train before we go off the cliff IF, and I feel it's important to emphasize this, IF we can do so *elegantly*.
Anthropic co-founder Jack Clark would be in favor of "elegantly" slowing down AI development to avoid the development of a powerful technology at "breakneck speed"
4
3
51
4,511
HumanityIsCooked⏹️ retweeted
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…
1,778
4,651
28,656
18,529,175
HumanityIsCooked⏹️ retweeted
HOLY SHIT LET'S FUCKING GOO
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…
112
83
1,194
241,052
HumanityIsCooked⏹️ retweeted
Great to see Canadian lawmakers acknowledge the threat of superintelligence and call for action.
🚨NEW: We’ve just launched our campaign in Canada! A cross-party coalition of over 30 MPs and Senators are calling for Canada to negotiate an international prohibition on the development of superintelligence, recognizing the risk of human extinction posed by the technology. 🧵
1
10
89
3,899
Humanity is Cooked…
Lawyers, too, are cooked "When law professors were handed a stack of anonymized answers to student contract questions and asked to pick the better one, they picked AI 75% of the time"
1
57