🚀 Keye-VL-2.0-30B-A3B released — fully open.
The first VLM to land DSA (DeepSeek Sparse Attention) in production — nearly lossless reasoning over 256K ultra-long context.
30B params. Fully open. 🤗
🧵👇
🎉 The Kwai Keye-VL-2.0 Technical Report is now live.
Keye-VL-2.0 is the first model to adapt DeepSeek Sparse Attention (DSA) to GQA-based multimodal architectures, enabling nearly lossless 256K-context processing for long-video understanding.
Paper: huggingface.co/papers/2606.1…
▸ Tops video benchmarks at its scale
▸ Outperforms Qwen3-VL-235B on LongVideoBench (74.1)
▸ First Keye base model with built-in Agent: Code · Tool · Search