A terrifying new paper reveals the emerging Cold War. A hidden trigger planted in military AI by China or Russia gives them thousands of invisible decision-making spies.
Oh and btw, this is our last hope for stabilizing things!
Read that again: We're racing toward absolute automation where the "safety" feature is everyone secretly wiring everyone else's models to explode on command.
Everyone will hesitate before handing critical power to fragile, betrayable agents:
This is "Deterrence by betrayal".
Adversaries can poison training data, plant undetectable backdoors, or force co-option. Secret triggers embedded in military AI can cause it to attack its own side out of the blue. Tiny poisoned scrapes from the open web could do it.
Attackers hold the edge, real defence is hopeless. Trillions of tokens, impossible to audit every source, no reliable backdoor detection.
Superpowers and middle powers alike have every incentive to subvert each other's systems. Even inside labs: foreign engineers, rushed automation, self-improving AIs inheriting disloyalty.
Basically, it's so impossibly hard to avoid your AI getting hacked and the cost of compromise is so high, that automating the war machine becomes a bad bet.
So, this is our hope for a stabilizing strategy.
Some might call this game theory, others might call it COLLECTIVE INSANITY.
If betrayal is the best deterrent we've got... fingers crossed and let's hope the stars align.