Anthropic just closed $65B at a $965B valuation – and that still wasn't the most important AI story this week.
The headline was capital. The signal was cost.
12 real model releases in 5 days. Here's what actually matters, with numbers.
— THE MONEY —
Anthropic's Series H makes it the most valuable AI company on the planet, ahead of OpenAI. Apollo and Blackstone structured $36B in credit just to buy TPUs for it. When debt markets underwrite compute like pipelines, AI stopped being venture risk and became infrastructure.
— THE COST COLLAPSE —
MAI-Code-1-Flash: SWE-Bench Verified 71.6% – beats Claude Haiku 4.5 (66.6%) with five billion active parameters, priced below Haiku. One of 7 MAI models Microsoft shipped this week, trained with zero OpenAI data.
MiniMax M3: SWE-Bench Pro 59% – above GPT-5.5 and Gemini 3.1 Pro. 1M context at 1/20 the per-token compute of the previous generation. Launch price: $0.30 per million input tokens. Open weights within 10 days.
NVIDIA Nemotron 3 Ultra: 550B total, 55B active. SWE-Bench Verified 71.9 – best US open-weight model, at $0.50 per million input. Weights, training data, recipes – all published.
Three labs. One identical strategy: hold quality, collapse cost.
— THE FULL LIST (save this) —
Jun 1 · MiniMax M3 – agentic LLM · 1M ctx · SWE-Bench Pro 59% · $0.30/M
Jun 2 · MAI-Thinking-1 – reasoning · 35B active MoE · AIME 97% Jun 2 · MAI-Code-1-Flash – coding · 5B active · SWE-V 71.6%
Jun 2 · MAI-Image-2.5 Flash – image gen/edit · #2 Arena Image Edit
Jun 2 · MAI-Voice-2 Flash – TTS · 15 languages · watermarked cloning
Jun 2 · MAI-Transcribe-1.5 – STT · 43 languages · #1 FLEURS in 18
Jun 4 · Nemotron 3 Ultra – open agentic LLM · 550B/55B · SWE-V 71.9
Jun 4 · Nemotron 3.5 ASR – streaming STT · 0.6B · 40 languages, 80ms
Jun 4 · Higgs Audio v3 – TTS/cloning · 111 languages · WER 3.61 (was 52.24 one generation ago)
Jun 4 · LFM2.5-VL Extract (1.6B 450M) – image to JSON · 99.6% validity, on-device
Five days. Twelve models. Four of them open weights.
— THE BILL —
While the price of intelligence collapsed, the price of negligence showed up. Sysdig documented the first fully autonomous LLM attack in the wild: an agent infiltrated an AWS environment and exfiltrated data in under an hour. No human in the loop. None needed.
I've watched this pattern before. Compute got cheap, then botnets scaled. Storage got cheap, then ransomware scaled. Intelligence is next on the curve.
— THE PATTERN —
Nemotron's 71.9 is the best America ships open – and it still ranks below Kimi K2.6. The open-weight frontier speaks Chinese, and the US answer is to compete on dollars per useful token.
Capital buys models. Cost curves pick winners.
The frontier isn't getting smarter this week. It's getting cheaper – and harder to contain.
➕ Follow me for the Friday Signal – the week in AI, one thesis, verified numbers only, every Friday.