MIRI

MIRI

114 Photos and videos

Tweets

Pinned Tweet

MIRI

@MIRIBerkeley

24 Sep 2025

#7 Combined Print & E-Book Nonfiction #8 Hardcover Nonfiction

192

32,937

David Abecassis

MIRI retweeted

David Abecassis

@Volty

Mar 30

Replying to @deanwball

1. Our agreement permits inference, broadly, to continue. It only limits AI training above 10^24 FLOP. In Article VIII, we establish that the agreement restricts only research that advances toward ASI or undermines verification. We permit medical diagnostics and drug discovery, for example, among other non-general capabilities. As long as the research focuses on a narrow domain (like materials science or climate modeling) and does not increase "general cognitive capabilities," it is not subject to restrictions. There is no seizing of assets, but rather mandatory monitoring which can involve forced consolidation of compute into monitored datacenters by the signatory government domestically. (No international centralization of AI monitoring!) So, for example, a small university lab’s research compute would be relocated to a monitored datacenter but is still owned and (remotely) operated by that lab. 2. We architected our agreement to specifically not ban the additional production and installation of compute. The value of such for inference and real-world economic activity is only increasing. So the total economic impact on parts of the AI supply chain are probably far less than existential. We note that many of the relevant players have multi-year backlogs to clear at full production capacity, and building out the infrastructure for model development is only a part of that. 3. Nothing happens to consumer AI apps. If anything additional resources are freed up which can accelerate adoption and integration of these tools. 4. Individual governments handle their own verification and monitoring, but some transparency, random inspections, etc. to either China or the US will be required. Our agreement opts for comprehensive domestic monitoring and shares the least information with rivals (directly) which establishes required confidence. There is no multilateral international AI inspection apparatus in our current version of the agreement, which is a change from the first version we published. 5. Robotics and other applications are unaffected to the extent that progress can continue without developing new models with runs that exceed the compute limit. 6. Our agreement does not attempt to set up a “binding” constraint, in that we do not consider the legal, reputational constraints on US/China to be the active ingredient which leads to stability of the treaty. Instead, we establish a clear understanding on both sides of what behavior is restricted and what consequences must follow otherwise, with enough transparency and non-interference with intelligence gathering to create confidence in each side that the other is adhering to the agreement. Rather than legal constraints, I consider deterrence as the framework for thinking about the stability of the agreement. This is the historical method for constraining a major power, including the use of strategically significant assets. Nuclear deterrence prevented Soviet employment of its conventional superiority against Western Europe. A Soviet dead hand blocked what was otherwise the possibility of a perfect first strike (nuclear or otherwise). Stability emerges when both sides coordinate to make the consequences of aggression clear. Is that a “binding constraint” that one side “accepts”? No, not legally, textually. Not in the sense of “I staked my reputation and my honor in the arena of international relations”. Where agreements have a role to play is in helping to maintain the conditions for successful deterrence. An example of this is the anti-ballistic missile treaty, in which both sides limited their deployment of missile defenses so that deterrence could be maintained.

1,390

AI Notkilleveryoneism Memes ⏸️

MIRI retweeted

AI Notkilleveryoneism Memes ⏸️

@AISafetyMemes

Jun 6

Godfather of AI (and world's #1 most cited scientist) announces his support for a coordinated global AI pause!

Yoshua Bengio

@Yoshua_Bengio

Jun 6

If leading AI companies are indeed approaching the point of recursive self-improvement, a coordinated, verifiable, and universally applied pause is probably the only responsible solution to mitigate several major AI risks; at least until safety guarantees are developed and demonstrated. Ensuring that such a moratorium is respected would require sincere collaboration between various countries and companies, but I definitely believe it is achievable if others follow in @AnthropicAI's footsteps.

149

14,643

Jim Sciutto

MIRI retweeted

Jim Sciutto

@jimsciutto

Jun 7

That there are no urgent hearings on Capitol Hill, no serious legislation in the pipeline, and no persistent questioning of candidates for higher office on their proposed approaches to AI is incredible given how transformative the technology is and how fast it is moving.

Mitt Romney

@MittRomney

Jun 5

Our highest and most urgent national priority should be AI safeguards. The risks of AI weapons, pathogens, mass unemployment, surveillance, and even extinction must not continue to be largely ignored.

116

396

36,368

roon

MIRI retweeted

roon

@tszzl

Jun 8

now on the eve of RSI it seems everyone is more mutual conditional pause agreement pilled than they used to be and that seems like a good development

158

1,804

274,623

The Spectator

MIRI retweeted

The Spectator

@spectator

May 28

'Some people believe that if machines decide to kill us it's the right thing to do because they're smart' AI researcher Nate Soares reveals that some factions in Silicon Valley mistakenly believe that if an AI is exceptionally intelligent it must also be highly moral. @So8res @Freddygray31

0:59

6,153

MIRI

MIRI

@MIRIBerkeley

May 29

This just passed 2M views! If you haven't seen it yet, check out this AI In Context video on "If Anyone Builds It, Everyone Dies" youtu.be/Nl7-bRFSZBs?si=-re5…

3,904

AI StopWatch

MIRI retweeted

AI StopWatch

@AIStopWatch

May 26

Ten places where Magnifica Humanitas matters for AI. At 42k words long, Pope Leo XIV’s new encyclical has a lot to say. In our most recent Digest, Mitchell Howe outlines the parts which might be the most impactful.

766

AI StopWatch

MIRI retweeted

AI StopWatch

@AIStopWatch

May 25

What will be the impact of AI industry super PACs? "The takeaway here is that this year’s U.S. midterm elections are being aggressively shaped by different factions of the AI industry sometimes supporting the same candidates, sometimes different candidates, buying ads that don’t have anything to do with AI."

956

MIRI

MIRI

@MIRIBerkeley

May 24

If you're interested in creating videos about the extinction threat posed by superhuman AI, consider applying to this bootcamp!

Aella

@Aella_Girl

May 23

First round of applications for PDKU close in two days! We'll send out acceptance letters by June 1st. You should apply now! It's gonna be fun and you'll make friends and do weird shit (U can still apply after that, but there will be fewer slots and your chances will be lower)

4,206

Elizabeth Barnes

MIRI retweeted

Elizabeth Barnes

@BethMayBarnes

May 22

Our report focuses on claims that are (1) solidly defensible and (2) generally agreed within METR. Here I’ll give some personal opinions on how we should feel about the state of AI risk, and the IMO most important limitations of the report.

METR

@METR_Evals

May 19

Could an AI company lose control of its own agents? To find out, Anthropic, Google, Meta, and OpenAI let us (1) test their best internal models with CoT access, (2) review non-public info about capabilities, alignment, and control. The result: our first Frontier Risk Report.

459

65,884

AI StopWatch

MIRI retweeted

AI StopWatch

@AIStopWatch

May 23

"Gathering information is perhaps an important step forward, but it's not nearly enough." In today's Digest, Joe Rogero discusses the new executive order from CA governor Gavin Newsom.

Miles Brundage

@Miles_Brundage

May 22

Has anyone said useful concise things I should read about the Newsom EO yet

1,368

MIRI

MIRI

@MIRIBerkeley

May 23

An internal model at OpenAI has autonomously disproved a central conjecture in discrete geometry, a mathematical field with applications in cryptography, wireless device communication, and medical imaging. The proof relates to a famous question posed by Paul Erdős in 1946. It has been verified by prominent mathematicians in a companion paper. The verifying mathematicians consider this to be a genuinely novel breakthrough on one of the most discussed problems in this area of mathematics. One called it “arguably the best known problem in Discrete Geometry.” Another observed, “If a human had written the paper and submitted it to the Annals of Mathematics and I had been asked for a quick opinion, I would have recommended acceptance without any hesitation. No previous AI-generated proof has come close to that.” The proof illustrates a general trend towards autonomous, agentic problem-solving in AI systems. OpenAI describes the system that produced the proof as a general-purpose model not specialized in mathematics. AIs can now perform long, novel chains of reasoning on difficult problems and are beginning to outstrip our ability to measure their progress. AI agents still perform best in domains with easily verifiable outputs, such as mathematics and cybersecurity. For example, Anthropic's Claude Mythos found thousands of vulnerabilities across every major operating system and web browser, and was deemed too dangerous for public release. Such capabilities are why the government is now more interested in evaluating frontier AI models. AI research is also a field with many easily verifiable outputs. Researchers at OpenAI and Anthropic take advantage of this fact to accelerate their work; senior researchers now claim they make only high-level decisions and let AI handle most of the coding. Experimenting with the coding capabilities of a publicly available AI system, like Claude Code, immediately demonstrates how far AI has come in the last year. OpenAI and Anthropic intend to use AI to enhance future models with minimal human oversight. To justify the urgency, these companies cite the importance of beating rival U.S. or Chinese labs. Many of the field’s foremost experts warn that this race ends with human extinction. Policymakers and researchers, including the founders of the AI revolution, are calling for international restrictions on the technology. A growing bipartisan and international consensus of political leaders agree.

267

14,447

MIRI

MIRI

@MIRIBerkeley

May 23

intelligence.org/2026/05/22/…

The Erdős Proof and AI Capabilities - Machine Intelligence Research Institute

View the official memo here.

intelligence.org

1,175

AI StopWatch

MIRI retweeted

AI StopWatch

@AIStopWatch

May 22

In today's Digest: * OpenAI, Anthropic, SpaceX race to file IPO. * The AI executive order is postponed. * METR evaluates rogue deployment risks. * AI makes a breakthrough in mathematics.

734

MIRI

MIRI

@MIRIBerkeley

May 19

New paper from MIRI data scientist @robi_rahman: Does Distributed Training Undermine Compute Governance?

Robi Rahman

@robi_rahman

May 19

1/ With distributed training, you could violate an AI pause treaty by training a GPT-4-scale model over consumer internet, using hardware below every proposed compute governance threshold, for under $100M. My new paper in @taig_icml explains how to catch this and shut it down.

2,346

Center For Humane Technology

MIRI retweeted

Center For Humane Technology

@HumaneTech_

May 15

The wait is over! Starting today, May 15, you can stream @theaidocfilm on @peacock. This film takes the dizzying complexity of AI — the promise, the peril, the competing ideologies, the economic incentives — and creates a shared experience we can all see and respond to. Then, after you've watched, head to humanetech.com/ai-roadmap to explore concrete actions we can all take to build a better future with AI.

0:30

3,298

MIRI

MIRI

@MIRIBerkeley

May 15

For an agreement like this to be effective, both the US and China would need to be party to it. Here’s one plausible path to that outcome, in six stages 🧵

Peter Barnett

@peterbarnett_

18 Nov 2025

We at the MIRI Technical Governance Team just put out a report describing an example international agreement to prevent the creation of superintelligence. 🧵

6,703

more replies

MIRI

MIRI

@MIRIBerkeley

May 15

Stage Six: creating superintelligence, after the necessary solutions have been discovered to do so without posing an unacceptably high danger to the world.

513

MIRI

MIRI

@MIRIBerkeley

May 15

For more, check out the paper from MIRI's Technical Governance Team. arxiv.org/abs/2511.10783

An International Agreement to Prevent the Premature Creation of...

Many experts argue that premature development of artificial superintelligence (ASI) poses catastrophic risks, including the risk of human extinction from misaligned ASI, geopolitical instability,...

arxiv.org

550