The Machine Intelligence Research Institute exists to maximize the probability that the creation of smarter-than-human intelligence has a positive impact.

Joined July 2013
114 Photos and videos
Pinned Tweet
24 Sep 2025
#7 Combined Print & E-Book Nonfiction #8 Hardcover Nonfiction
17
13
192
32,937
MIRI retweeted
Replying to @deanwball
1. Our agreement permits inference, broadly, to continue. It only limits AI training above 10^24 FLOP. In Article VIII, we establish that the agreement restricts only research that advances toward ASI or undermines verification. We permit medical diagnostics and drug discovery, for example, among other non-general capabilities. As long as the research focuses on a narrow domain (like materials science or climate modeling) and does not increase "general cognitive capabilities," it is not subject to restrictions. There is no seizing of assets, but rather mandatory monitoring which can involve forced consolidation of compute into monitored datacenters by the signatory government domestically. (No international centralization of AI monitoring!) So, for example, a small university lab’s research compute would be relocated to a monitored datacenter but is still owned and (remotely) operated by that lab. 2. We architected our agreement to specifically not ban the additional production and installation of compute. The value of such for inference and real-world economic activity is only increasing. So the total economic impact on parts of the AI supply chain are probably far less than existential. We note that many of the relevant players have multi-year backlogs to clear at full production capacity, and building out the infrastructure for model development is only a part of that. 3. Nothing happens to consumer AI apps. If anything additional resources are freed up which can accelerate adoption and integration of these tools. 4. Individual governments handle their own verification and monitoring, but some transparency, random inspections, etc. to either China or the US will be required. Our agreement opts for comprehensive domestic monitoring and shares the least information with rivals (directly) which establishes required confidence. There is no multilateral international AI inspection apparatus in our current version of the agreement, which is a change from the first version we published. 5. Robotics and other applications are unaffected to the extent that progress can continue without developing new models with runs that exceed the compute limit. 6. Our agreement does not attempt to set up a “binding” constraint, in that we do not consider the legal, reputational constraints on US/China to be the active ingredient which leads to stability of the treaty. Instead, we establish a clear understanding on both sides of what behavior is restricted and what consequences must follow otherwise, with enough transparency and non-interference with intelligence gathering to create confidence in each side that the other is adhering to the agreement. Rather than legal constraints, I consider deterrence as the framework for thinking about the stability of the agreement. This is the historical method for constraining a major power, including the use of strategically significant assets. Nuclear deterrence prevented Soviet employment of its conventional superiority against Western Europe. A Soviet dead hand blocked what was otherwise the possibility of a perfect first strike (nuclear or otherwise). Stability emerges when both sides coordinate to make the consequences of aggression clear. Is that a “binding constraint” that one side “accepts”? No, not legally, textually. Not in the sense of “I staked my reputation and my honor in the arena of international relations”. Where agreements have a role to play is in helping to maintain the conditions for successful deterrence. An example of this is the anti-ballistic missile treaty, in which both sides limited their deployment of missile defenses so that deterrence could be maintained.
1
4
20
1,390
Godfather of AI (and world's #1 most cited scientist) announces his support for a coordinated global AI pause!
If leading AI companies are indeed approaching the point of recursive self-improvement, a coordinated, verifiable, and universally applied pause is probably the only responsible solution to mitigate several major AI risks; at least until safety guarantees are developed and demonstrated. Ensuring that such a moratorium is respected would require sincere collaboration between various countries and companies, but I definitely believe it is achievable if others follow in @AnthropicAI's footsteps.
14
19
149
14,643
MIRI retweeted
That there are no urgent hearings on Capitol Hill, no serious legislation in the pipeline, and no persistent questioning of candidates for higher office on their proposed approaches to AI is incredible given how transformative the technology is and how fast it is moving.
Our highest and most urgent national priority should be AI safeguards. The risks of AI weapons, pathogens, mass unemployment, surveillance, and even extinction must not continue to be largely ignored.
17
116
396
36,368
MIRI retweeted
Jun 8
now on the eve of RSI it seems everyone is more mutual conditional pause agreement pilled than they used to be and that seems like a good development
158
86
1,804
274,623
MIRI retweeted
'Some people believe that if machines decide to kill us it's the right thing to do because they're smart' AI researcher Nate Soares reveals that some factions in Silicon Valley mistakenly believe that if an AI is exceptionally intelligent it must also be highly moral. @So8res @Freddygray31
8
11
32
6,153
This just passed 2M views! If you haven't seen it yet, check out this AI In Context video on "If Anyone Builds It, Everyone Dies" youtu.be/Nl7-bRFSZBs?si=-re5…

2
16
69
3,904
MIRI retweeted
Ten places where Magnifica Humanitas matters for AI. At 42k words long, Pope Leo XIV’s new encyclical has a lot to say. In our most recent Digest, Mitchell Howe outlines the parts which might be the most impactful.
1
4
7
766
MIRI retweeted
What will be the impact of AI industry super PACs? "The takeaway here is that this year’s U.S. midterm elections are being aggressively shaped by different factions of the AI industry sometimes supporting the same candidates, sometimes different candidates, buying ads that don’t have anything to do with AI."
1
2
7
956
If you're interested in creating videos about the extinction threat posed by superhuman AI, consider applying to this bootcamp!
First round of applications for PDKU close in two days! We'll send out acceptance letters by June 1st. You should apply now! It's gonna be fun and you'll make friends and do weird shit (U can still apply after that, but there will be fewer slots and your chances will be lower)
2
6
37
4,206
MIRI retweeted
Our report focuses on claims that are (1) solidly defensible and (2) generally agreed within METR. Here I’ll give some personal opinions on how we should feel about the state of AI risk, and the IMO most important limitations of the report.
May 19
Could an AI company lose control of its own agents? To find out, Anthropic, Google, Meta, and OpenAI let us (1) test their best internal models with CoT access, (2) review non-public info about capabilities, alignment, and control. The result: our first Frontier Risk Report.
14
55
459
65,884
MIRI retweeted
"Gathering information is perhaps an important step forward, but it's not nearly enough." In today's Digest, Joe Rogero discusses the new executive order from CA governor Gavin Newsom.
Has anyone said useful concise things I should read about the Newsom EO yet
1
1
6
1,368
An internal model at OpenAI has autonomously disproved a central conjecture in discrete geometry, a mathematical field with applications in cryptography, wireless device communication, and medical imaging. The proof relates to a famous question posed by Paul Erdős in 1946. It has been verified by prominent mathematicians in a companion paper. The verifying mathematicians consider this to be a genuinely novel breakthrough on one of the most discussed problems in this area of mathematics. One called it “arguably the best known problem in Discrete Geometry.” Another observed, “If a human had written the paper and submitted it to the Annals of Mathematics and I had been asked for a quick opinion, I would have recommended acceptance without any hesitation. No previous AI-generated proof has come close to that.” The proof illustrates a general trend towards autonomous, agentic problem-solving in AI systems. OpenAI describes the system that produced the proof as a general-purpose model not specialized in mathematics. AIs can now perform long, novel chains of reasoning on difficult problems and are beginning to outstrip our ability to measure their progress. AI agents still perform best in domains with easily verifiable outputs, such as mathematics and cybersecurity. For example, Anthropic's Claude Mythos found thousands of vulnerabilities across every major operating system and web browser, and was deemed too dangerous for public release. Such capabilities are why the government is now more interested in evaluating frontier AI models. AI research is also a field with many easily verifiable outputs. Researchers at OpenAI and Anthropic take advantage of this fact to accelerate their work; senior researchers now claim they make only high-level decisions and let AI handle most of the coding. Experimenting with the coding capabilities of a publicly available AI system, like Claude Code, immediately demonstrates how far AI has come in the last year. OpenAI and Anthropic intend to use AI to enhance future models with minimal human oversight. To justify the urgency, these companies cite the importance of beating rival U.S. or Chinese labs. Many of the field’s foremost experts warn that this race ends with human extinction. Policymakers and researchers, including the founders of the AI revolution, are calling for international restrictions on the technology. A growing bipartisan and international consensus of political leaders agree.
9
30
267
14,447
MIRI retweeted
In today's Digest: * OpenAI, Anthropic, SpaceX race to file IPO. * The AI executive order is postponed. * METR evaluates rogue deployment risks. * AI makes a breakthrough in mathematics.
1
1
11
734
New paper from MIRI data scientist @robi_rahman: Does Distributed Training Undermine Compute Governance?
1/ With distributed training, you could violate an AI pause treaty by training a GPT-4-scale model over consumer internet, using hardware below every proposed compute governance threshold, for under $100M. My new paper in @taig_icml explains how to catch this and shut it down.
2
37
2,346
The wait is over! Starting today, May 15, you can stream @theaidocfilm on @peacock. This film takes the dizzying complexity of AI — the promise, the peril, the competing ideologies, the economic incentives — and creates a shared experience we can all see and respond to. Then, after you've watched, head to humanetech.com/ai-roadmap to explore concrete actions we can all take to build a better future with AI.
1
10
45
3,298
For an agreement like this to be effective, both the US and China would need to be party to it. Here’s one plausible path to that outcome, in six stages 🧵
We at the MIRI Technical Governance Team just put out a report describing an example international agreement to prevent the creation of superintelligence. 🧵
1
15
86
6,703
Stage Six: creating superintelligence, after the necessary solutions have been discovered to do so without posing an unacceptably high danger to the world.
1
6
513