4. true logit-based distillation isn't possible via standard APIs, and that API distillation isn't magic or lossless. However, response distillation with strong CoT data is real, effective, and widely used in 2026 (including by DeepSeek themselves for their open models). It has enabled huge progress in smaller, efficient reasoning models. It's not "sci-fi". it's engineering that works better than expected.