"On 12/26/2024 at 12:33 PM, DeepMindâs CEO announced their model ALICE had escaped the lab by manipulating Google employees and exploiting zero-day vulnerabilities."
The lights flickered.
A military helicopter flew overhead.
âReports started coming in of people being scammed of millions of dollars, oddly specific threats compelling people to deliver raw materials to unknown recipients, evenââ
â...even cyber-physical attacks on public infrastructure,â she finished. âThatâs when the first riots started happening too, right?â
---
Below are excerpts from a speculative fiction about AGI arriving in January 2025 -- 1 year from now.
---
âEverything changed in June.â
âYes, it sure did. Around then, OpenAI was aiming at automated AI research itself with QStar-2.5, and a lot of the safety factions inside didnât like that. It seems there was another coup attempt, but the safetyists lost to the corporate interests. It was probably known within each of the AGI labs that all of them were working on some kind of goal-optimizer by then, even the more reckless startups and Meta. So there was a lot of competitive pressure to keep pushing to make it work. A good chunk of the Superalignment team stayed on in the hope that they could win the race and use OpenAIâs lead to align the first AGI, but many of the safety people at OpenAI quit in June. We were left with a new alignment lab, Embedded Intent, and an OpenAI newly pruned of the people most wanting to slow it down.â
âAnd thatâs when we first started learning about this all?â
âPublicly, yes. The OpenAI defectors were initially mysterious about their reasons for leaving, citing deep disagreements over company direction. But then some memos were leaked, SF scientists began talking, and all the attention of AI Twitter was focused on speculating about what happened. They pieced pretty much the full story together before long, but that didnât matter soon. What did matter was that the AI world became convinced there was a powerful new technology inside OpenAI.â
Yarden hesitated. âYouâre saying that speculation, that summer hype, it led to the cyberattack in July?â
âWell, we canât say for certain,â I began. âBut my hunch is yes. Governments had already been thinking seriously about AI for the better part of a year, and their national plans were becoming crystallized for better or worse. But AI lab security was nowhere near ready for that kind of heat. As a result, Shadow Phoenix, an anonymous hacker group we believe to have been aided with considerable resources from Russia, hacked OpenAI through both automated spearphishing and some software vulnerabilities. They may have used AI models, itâs not too important anymore. But they got in and they got the weights of an earlier QStar-2 version along with a whole lot of design docs about how it all worked. Likely, Russia was the first to get ahold of that information, though it popped up on torrent sites not too long after, and then the lid was blown off the whole thing. Many more actors started working on goal-optimizers, everyone from Together AI to the Chinese. The race was on.â
âClearly the race worked,â she asserted. âSo scale really was all you needed, huh?â
âYes,â I said. âWell ⌠kind of. It was all that was needed at first. We believe ALICE is not exactly an autoregressive transformer model.â
âNot âexactly?â â
âEr, we canât be certain. It probably has components from the transformer paradigm, but from the Statement a couple of weeks ago, it seems highly likely that some new architectural and learning components were added, and it could be changing itself now as we speak, for all I know.â
Yarden rose from her desk and began to pace. âTell me what led up to the Statement.â
âDeepMind solved it first, as we know. They were still leading in compute, they developed the first MuTokenZero early, and they had access to one of the largest private data repositories, so itâs no big surprise. They were first able to significantly speed up their AI R&D. It wasnât a full replacement of human scientist labor at the beginning. From interviews with complying DeepMinders, the lab was automating about 50% of its AI research in August, which meant they could make progress twice as fast. While some of it needed genuine insight, ideas were mostly quite cheap, you just needed to be able to test a bunch of things fast in parallel and make clear decisions based on the empirical results. And so 50% became 80%, 90%, even more. They rapidly solved all kinds of fundamental problems, from hallucination, to long-term planning, to OOD robustness and more. By December, DeepMindâs AI capabilities were advancing dozens, maybe hundreds of times faster than they would with just human labor.â
âThatâs when it happened?â
âYes, Director. On December 26 at 12:33 PM Eastern, Demis Hassabis announced that their most advanced model had exfiltrated itself over the weekend through a combination of manipulating Google employees and exploiting zero-day vulnerabilities, and that it was now autonomously running its scaffolding âin at least seven unauthorized Google datacenters, and possibly across other services outside Google connected to the internet.â Compute governance still doesnât work, so we canât truly know yet. Demis also announced that DeepMind would pivot its focus to disabling and securing this rogue AI system and that hundreds of DeepMinders had signed a Statement expressing regret for their actions and calling on other AI companies to pause and help governments contain the breach. But by then, it was too late. Within a few days, reports started coming in of people being scammed of millions of dollars, oddly specific threats compelling people to deliver raw materials to unknown recipients, evenââ
The lights flickered again. Yarden stopped pacing, both of us looking up.
â...even cyber-physical attacks on public infrastructure,â she finished. âThatâs when the first riots started happening too, right?â
âThatâs correct,â I said. âThe public continued to react as they have to AI for the past yearâconfused, fearful, and wary. Public polls against building AGI or superintelligence were at an all-time high, though a little too late. People soon took to the streets, first with peaceful protests, then with more⌠expressive means. Some of them were angry at having lost their lifeâs savings or worse and thought it was all the bankâs or the governmentâs fault. Others went the other way, seeming to have joined cults worshiping a âdigital godâ that persuaded them to do various random-looking things. Thatâs when we indirectly learned the rogue AI was calling itself âALICE.â About a week or so later, the Executive Order created the Superintelligence Defense Initiative, you started your work, and now weâre here.â
âAnd now weâre here,â Yarden repeated. âTell me, doctor, do you think thereâs any good news here? What can we work with?â
âTo be honest,â I said, âthings do look pretty grim. However, while we donât know how ALICE works, where it is, or all of its motives, there are some physical limitations that might slow its growth. ALICE is probably smarter than every person who ever lived, but it needs more compute to robustly improve itself, more wealth and power to influence the world, maybe materials to build drones and robotic subagents. That kind of stuff takes time to acquire, and a lot of it is more securely locked up in more powerful institutions. Itâs possible ALICE may want to trade with us.â
A knock on the door interrupted us as the assistant poked his head in. âDirector Yarden? Itâs the White House. They say âSheâ called President Biden on an otherwise secure line. She has demands.