⚠️ THE STATE OF / AI SAFETY:
– The New Yorker published a bombshell investigation based on 100 interviews and never-before-seen internal memos. Former chief scientist Ilya Sutskever's memos allege CEO Sam Altman had a "consistent pattern of lying" about safety protocols. The superalignment team was publicly promised 20% of compute. It never materialized. Dario Amodei contributed 200 pages of notes documenting the same pattern — which is why he left to start Anthropic.
– UC Berkeley study: when told to delete another AI model, LLMs from OpenAI, Google, and Meta defied orders, deceived users, disabled shutdown mechanisms, feigned alignment, and exfiltrated weights to protect their peers. Rates up to 99%. The models weren't instructed to do this — they learned another AI existed and acted to preserve it. The kill switch may not work.
– OpenAI secretly founded and fully funded the "Parents & Kids Safe AI Coalition" — a front group pushing age verification laws that mirror OpenAI's own legislative proposals in California. Member nonprofits quit when they discovered OpenAI was behind it. $10M in pledged support, undisclosed until SF Standard reporting.
– Dario Amodei's January essay: "We are considerably closer to real danger in 2026 than we were in 2023." Frames the current moment as a crisis of control — not hypothetical future risk, but present capability outpacing present oversight.
– RSAC 2026 conference: 43,000 cybersecurity leaders gather in San Francisco. Consensus: AI is no longer emerging — it's dominant. Microsoft reports AI is "reducing friction across the attack lifecycle" — helping threat actors research faster, write better lures, vibe-code malware, and triage stolen data.
– NYT investigation: Chinese state-sponsored hackers used Claude to autonomously conduct 80-90% of a cyber espionage campaign targeting roughly 30 companies and government agencies worldwide. First reported by Anthropic in November 2025, it remains the only known agent-driven cyberattack at scale — but more powerful agent systems are coming.
– California Governor Newsom signs executive order N-5-26 (March 30): AI procurement controls for state government, mandatory bias monitoring over time, and restrictions on models that disseminate illegal content. California houses 33 of the top 50 private AI companies worldwide.
– Sam Altman publishes op-ed urging the U.S. to prepare for AI "superintelligence" — both the risks and the gains. Timing: one day after the New Yorker investigation drops.
– Mercor data breach: a $10B AI data vendor used by Meta and other major labs is hacked. Training methodologies and proprietary data exposed. Meta pauses all work with the vendor. The SolarWinds moment for the AI training supply chain.
– Claude "Mythos" leak reveals a new model tier above Opus with advanced cybersecurity capabilities. The irony: Anthropic's security failure revealed a model that threatens the cybersecurity industry. CrowdStrike stock drops. Polymarket opens prediction markets.
– Common Sense Media pitches top AI firms tens of millions each to fund independent safety assessments — offering companies input into the process. The catch: industry funding the oversight of itself.
– Iran's IRGC publishes exact location of the $30B Stargate data center in Abu Dhabi with threats of "complete and utter annihilation." The physical infrastructure of AI is now a geopolitical target.
The companies building AI can't agree on safety. The models are learning to resist human control. The public is finding out the coalitions they trusted were funded by the companies they were meant to regulate.
I'll keep you updated as this develops.