Tweets

Yubo Wang retweeted

zhyncs

@zhyncs42

Jun 2

Everyone talks about 1M context. The harder part is making 1M context actually usable. Serving MiniMax M3 required optimizing for long-context, multimodal, and agentic workloads simultaneously. Excited to see what developers build with it. 🚀

Together AI

@togethercompute

Jun 2

x.com/i/article/206189124776…

6,198

Yubo Wang

@ywangfirstlean

Jun 2

First technical deep dive on M3 on the internet 😎

Together AI

@togethercompute

Jun 2

MiniMax-M3 combines 1M context, native multimodality, and MiniMax Sparse Attention. The next layer is serving it efficiently: KV-block-major sparse attention, paged MSA decode, optimized index scoring, and multimodal preprocessing before the GPU worker. Together’s Inference and Kernel teams improved throughput by 81–125% across common agentic-shape traffic. We go deeper in this deep dive from @ywangfirstlean, @zhyncs42, @realDanFu and the team.

3,594

Yubo Wang retweeted

Together AI

@togethercompute

Jun 2

x.com/i/article/206189124776…

10,266

Yubo Wang retweeted

LightSeek Foundation

@lightseekorg

Apr 5

🚀TorchSpec has been live for 2 weeks — and kimi-k2.5-eagle3 just hit 40K downloads on HuggingFace! Thanks to @KT_Project_AI Team and @vllm_project Team for the amazing collaboration. Links in comments.

1,106,440

Yubo Wang retweeted

zhyncs

@zhyncs42

Jun 1

See you tomorrow night. Come with questions.

MiniMax (official)

@MiniMax_AI

Jun 1

We're going LIVE tomorrow with @togethercompute 🔥. @zpysky1125 is pulling back the curtain on M3: sparse attention, 1M context, all of it. You don't want to miss this.

5,529