building fun stuff @modal

Joined January 2018
5 Photos and videos
Nan Jiang retweeted
Jun 11
Excited to share these preliminary results on our internal autoresearch system @Recursive_SI, where we achieve SOTA on nanochat / nanogpt speedrun / kernel benchmarks using the same underlying system without task-specific adaptations. blog: recursive.com/articles/first…
3
24
107
15,761
Nan Jiang retweeted
How far can we compress the discrete tokens in an LLM's context into compact latent vectors? With the right training recipe at large scale, our Latent Context Language Models (LCLMs) compress context up to 16× and land on a new Pareto frontier for long-context inference. 🧵(1/n)
1
22
60
7,199
c you there!
Jun 2
We're bringing together our friends and community to celebrate our Series C. Join us at Noguchi's Sunken Garden in NYC on June 16th or at the Legion of Honor in SF on June 25th. Invites are limited, apply here: modal.com/c-function
11
1,672
Nan Jiang retweeted
New blog! Is frontier asynchronous RL solved? The blog covers Async RL theory and infrastructure, surveying 8 open-weight frontier labs for the algorithmic techniques and systems fixes to handle train-inference mismatch. Also answered: why do current methods still fail at high policy lag? Which methods scale with horizon and compute?
16
134
1,133
238,078
love the design so much
Introducing Feather - A self organizing inbox! Try it out at feather.computer. We'd love to know what you think 🪶
1
3
624
Nan Jiang retweeted
🚀 slime v0.3.0 is out! This release is a major step toward agent-first RL. We turned slime’s existing multi-turn / agentic capabilities into a more coherent foundation: - slime/agent with reusable sandbox-agent components - OpenAI / Anthropic-compatible adapters - black-box coding-agent RL example - variable global batch-size training - fully async training as a first-class path - lower host-memory usage for more flexible rollout-inference setups - PPO refactor with actor-critic colocation - delta weight sync, FlashQLA for Qwen GDN, --save-hf, and more CI coverage slime is moving closer to a practical open-source framework for large-scale agentic RL. Release note: github.com/THUDM/slime/relea…
1
14
81
8,236
At @modal, we're working to make sure OSS RL frameworks have all the techniques necessary to train frontier open-weights models. Delta compression is key, but the job's not done. There are still lots of open problems around weight sync, auto-scaling, & cross-cluster training. My DMs are open!
@FireworksAI_HQ @cursor_ai highlighted why delta-compressed weight sync matters for RL at frontier scale. slime brings this capability to OSS: lossless delta sync for Megatron ↔ SGLang disaggregation — ship deltas, not full checkpoints. This is another step toward a fully open-source stack where rollout/inference and training are truly decoupled and deployed separately. PR: github.com/THUDM/slime/pull/…
7
22
246
64,166
Huge thanks to the @slime_framework community for making an amazing, battle-tested RL framework! I think we are well-positioned at Modal to help users deploy slime. On our infrastructure, train/inference disaggregation can pair naturally with elastic scaling, so rollout capacity is neither wasted nor bottlenecked.
12
1,428
omg
Big fan (and neighbor) of Modal. Seems like a great group to work with as well.
2
16
11,671
Nan Jiang retweeted
Today we're announcing our Series C funding: $355M at a $4.65B valuation, led by some great investors @generalcatalyst and @Redpoint. We've had insane growth in the last year, but we're still very early. So proud of the team and what we have built so far!
127
97
1,456
583,923
really enjoy working with everyone here 💚 amazing place
1
39
2,946
Nan Jiang retweeted
It's really neat to see all the interest in the Composer 2 technical report, from training to kernel design to inference. If you have any questions about why we did things, feel free to ask. I'll run around the office and bug people.
We're releasing a technical report describing how Composer 2 was trained.
35
18
320
58,017
Nan Jiang retweeted

32
156
1,868
399,275
12 Dec 2025
🫡
12 Dec 2025
Miles Series Release: True On-policy for VLMs in FSDP SGLang! Our Miles team achieved precision alignment between FSDP and SGLang for LLMs as early as two months ago, ensuring that the log probs obtained from SGLang inference match perfectly with the log probs from the FSDP forward pass, with an absolute KL divergence of 0. Thanks to Nan Jiang from our community—the "Greek God of VLM"—we have now successfully aligned VLM training and inference on FSDP. You can now enjoy VLM training with strictly zero KL divergence!
2
318
Nan Jiang retweeted
📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks arxiv.org/abs/2505.16381
9
90
543
76,882
22 Apr 2025
amazing Jason, amazing Nexad, please check this out!
21 Apr 2025
Let’s be real—ads have annoyed me for years. Pop-ups, spam, etc… while the world is moving towards AGI, the ad world felt stuck in the past. So I decided to flip the script. Today, I’m proud to share: Nexad has raised a $6M seed round, led by @a16z SR04, @Prosus_Ventures , @p72vc , Carya, and more. 🧵
1
1
320
Nan Jiang retweeted
Coding agents can debug their own outputs, but what if none of the fixes are correct? We overcome sparse rewards by making them continuous📈 Instead of having binary execution rewards, we introduce a learned verifier to measure how close the current solution is to a correct one📏
2
26
203
31,056
Nan Jiang retweeted
26 Sep 2024
I teach a class where students code up an ML library from scratch in Python. Wenting showed me that a Claude Agent (with interactive unit test feedback and the spec) could solve it 100%. We thought it would be fun to scale this idea to every Python library in the world.
26 Sep 2024
Introducing the commit0 interactive environment for coding agents. Challenge: generate Python libraries from scratch. Commit0 is designed with interactivity, dependencies, and specifications as first-class considerations. We include a benchmark with 50 challenging libraries.
8
36
387
51,843
26 Sep 2024
So... can agents now build a package from scratch? Test them on Commit0! This is an amazing and fun project this summer! Huge thanks to Wenting and to everyone in the lab for their support and guidance! 🚀👏
26 Sep 2024
Introducing the commit0 interactive environment for coding agents. Challenge: generate Python libraries from scratch. Commit0 is designed with interactivity, dependencies, and specifications as first-class considerations. We include a benchmark with 50 challenging libraries.
2
9
381