📊 List of TextDiffusion Models (June '26)
✦ 🆕 DiffusionGemma-26B-A4B-it : 700–1000 TPS
📦26B-A4B MoE | Open
✦ ZAYA1-8B-Diffusion-Preview: Up to 982 TPS
📦8.4B-A760M MoE | Open
✦ LLaDA2.0-Uni : ~300–500 TPS
📦 16B-A? MoE decoder | Open
✦ Mercury 2: 1000 TPS
📦 Undisclosed | Closed (API)
🏆 Currently, the fastest commercial reasoning dLLM
✦ LLaDA2.1-flash : High hundreds TPS
📦 ~100B-A?? MoE | Open
✦ WeDLM-8B: ~3x faster than vLLM Qwen3-8B
📦8B dense | Open
✦ LLaDA2.0-flash: ~300–500 TPS
📦~100B-A?? MoE | Open
✦ Dream-7B: 150-300 TPS
📦7B dense | Open