CLS

CLS

5 Photos and videos

Tweets

Nan Jiang retweeted

CLS

@ChengleiSi

Jun 11

Excited to share these preliminary results on our internal autoresearch system @Recursive_SI, where we achieve SOTA on nanochat / nanogpt speedrun / kernel benchmarks using the same underlying system without task-specific adaptations. blog: recursive.com/articles/first…

First Steps Toward Automated AI Research - Recursive

Early results from Recursive’s automated AI research system on model training and GPU kernel benchmarks

recursive.com

Recursive

@Recursive_SI

Jun 11

x.com/i/article/206456979931…

107

15,761

Leon

Nan Jiang retweeted

Leon

@iamleonli

Jun 9

How far can we compress the discrete tokens in an LLM's context into compact latent vectors? With the right training recipe at large scale, our Latent Context Language Models (LCLMs) compress context up to 16× and land on a new Pareto frontier for long-context inference. 🧵(1/n)

7,199

Nan Jiang

Nan Jiang

@nanjiangwill

Jun 2

c you there!

Modal

@modal

Jun 2

We're bringing together our friends and community to celebrate our Series C. Join us at Noguchi's Sunken Garden in NYC on June 16th or at the Legion of Honor in SF on June 25th. Invites are limited, apply here: modal.com/c-function

0:15

1,672

Luke J. Huang

Nan Jiang retweeted

Luke J. Huang

@whatthelukh

Jun 1

New blog! Is frontier asynchronous RL solved? The blog covers Async RL theory and infrastructure, surveying 8 open-weight frontier labs for the algorithmic techniques and systems fixes to handle train-inference mismatch. Also answered: why do current methods still fail at high policy lag? Which methods scale with horizon and compute?

134

1,133

238,078

Nan Jiang

Nan Jiang

@nanjiangwill

Jun 1

love the design so much

Feather @feather___co

May 23

Introducing Feather - A self organizing inbox! Try it out at feather.computer. We'd love to know what you think 🪶

1:36

624

slime

Nan Jiang retweeted

slime

@slime_framework

Jun 1

🚀 slime v0.3.0 is out! This release is a major step toward agent-first RL. We turned slime’s existing multi-turn / agentic capabilities into a more coherent foundation: - slime/agent with reusable sandbox-agent components - OpenAI / Anthropic-compatible adapters - black-box coding-agent RL example - variable global batch-size training - fully async training as a first-class path - lower host-memory usage for more flexible rollout-inference setups - PPO refactor with actor-critic colocation - delta weight sync, FlashQLA for Qwen GDN, --save-hf, and more CI coverage slime is moving closer to a practical open-source framework for large-scale agentic RL. Release note: github.com/THUDM/slime/relea…

Release v0.3.0 · THUDM/slime

We are excited to announce the release of slime v0.3.0! This release marks a major step toward agent-first reinforcement learning. While slime has supported multi-turn and agentic workloads from ea...

github.com

8,236

Nan Jiang

Nan Jiang

@nanjiangwill

May 30

At @modal, we're working to make sure OSS RL frameworks have all the techniques necessary to train frontier open-weights models. Delta compression is key, but the job's not done. There are still lots of open problems around weight sync, auto-scaling, & cross-cluster training. My DMs are open!

slime

@slime_framework

May 30

@FireworksAI_HQ @cursor_ai highlighted why delta-compressed weight sync matters for RL at frontier scale. slime brings this capability to OSS: lossless delta sync for Megatron ↔ SGLang disaggregation — ship deltas, not full checkpoints. This is another step toward a fully open-source stack where rollout/inference and training are truly decoupled and deployed separately. PR: github.com/THUDM/slime/pull/…

246

64,166

Nan Jiang

Nan Jiang

@nanjiangwill

May 30

Huge thanks to the @slime_framework community for making an amazing, battle-tested RL framework! I think we are well-positioned at Modal to help users deploy slime. On our infrastructure, train/inference disaggregation can pair naturally with elastic scaling, so rollout capacity is neither wasted nor bottlenecked.

1,428

Nan Jiang

Nan Jiang

@nanjiangwill

May 21

omg

Sasha Rush

@srush_nlp

May 21

Big fan (and neighbor) of Modal. Seems like a great group to work with as well.

11,671

Erik Bernhardsson

Nan Jiang retweeted

Erik Bernhardsson

@bernhardsson

May 21

Today we're announcing our Series C funding: $355M at a $4.65B valuation, led by some great investors @generalcatalyst and @Redpoint. We've had insane growth in the last year, but we're still very early. So proud of the team and what we have built so far!

0:46

Modal

@modal

May 21

x.com/i/article/205723780724…

127

1,456

583,923

Nan Jiang

Nan Jiang

@nanjiangwill

May 21

really enjoy working with everyone here 💚 amazing place

Modal

@modal

May 21

x.com/i/article/205723780724…

2,946

Sasha Rush

Nan Jiang retweeted

Sasha Rush

@srush_nlp

Mar 25

It's really neat to see all the interest in the Composer 2 technical report, from training to kernel design to inference. If you have any questions about why we did things, feel free to ask. I'll run around the office and bug people.

Cursor

@cursor_ai

Mar 24

We're releasing a technical report describing how Composer 2 was trained.

320

58,017

Jack Morris

Nan Jiang retweeted

Jack Morris

@jxmnop

Mar 9

x.com/i/article/203102900413…

156

1,868

399,275

Nan Jiang

Nan Jiang

@nanjiangwill

12 Dec 2025

🫡

LMSYS Org

@lmsysorg

12 Dec 2025

Miles Series Release: True On-policy for VLMs in FSDP SGLang! Our Miles team achieved precision alignment between FSDP and SGLang for LLMs as early as two months ago, ensuring that the log probs obtained from SGLang inference match perfectly with the log probs from the FSDP forward pass, with an absolute KL divergence of 0. Thanks to Nan Jiang from our community—the "Greek God of VLM"—we have now successfully aligned VLM training and inference on FSDP. You can now enjoy VLM training with strictly zero KL divergence!

318

Christopher Manning

Nan Jiang retweeted

Christopher Manning

@chrmanning

7 Oct 2025

This paper by Ivan Lee (@ivn1e) & @BergKirkpatrick was great! Best thing I’ve seen at #COLM2025 so far! Readability ≠ Learnability: Rethinking the Role of Simplicity in Training Small Language Models openreview.net/forum?id=AFMG…

Readability ≠ Learnability: Rethinking the Role of Simplicity in...

Recent studies suggest that very small language models (SLMs) can generate surprisingly coherent text when trained on simplified, child-directed corpora such as TinyStories. These findings have...

openreview.net

271

24,258

Songlin Yang

Nan Jiang retweeted

Songlin Yang

@SonglinYang4

24 May 2025

📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks arxiv.org/abs/2505.16381

PaTH Attention: Position Encoding via Accumulating Householder...

The attention mechanism is a core primitive in modern large language models (LLMs) and AI more broadly. Since attention by itself is permutation-invariant, position encoding is essential for...

arxiv.org

543

76,882

Nan Jiang

Nan Jiang

@nanjiangwill

22 Apr 2025

amazing Jason, amazing Nexad, please check this out!

Jason Hu

@onjas_6

21 Apr 2025

Let’s be real—ads have annoyed me for years. Pop-ups, spam, etc… while the world is moving towards AGI, the ad world felt stuck in the past. So I decided to flip the script. Today, I’m proud to share: Nexad has raised a $6M seed round, led by @a16z SR04, @Prosus_Ventures , @p72vc , Carya, and more. 🧵

1:26

320

Wenting Zhao

Nan Jiang retweeted

Wenting Zhao

@wzhao_nlp

4 Mar 2025

Coding agents can debug their own outputs, but what if none of the fixes are correct? We overcome sparse rewards by making them continuous📈 Instead of having binary execution rewards, we introduce a learned verifier to measure how close the current solution is to a correct one📏

203

31,056

Sasha Rush

Nan Jiang retweeted

Sasha Rush

@srush_nlp

26 Sep 2024

I teach a class where students code up an ML library from scratch in Python. Wenting showed me that a Claude Agent (with interactive unit test feedback and the spec) could solve it 100%. We thought it would be fun to scale this idea to every Python library in the world.

Wenting Zhao

@wzhao_nlp

26 Sep 2024

Introducing the commit0 interactive environment for coding agents. Challenge: generate Python libraries from scratch. Commit0 is designed with interactivity, dependencies, and specifications as first-class considerations. We include a benchmark with 50 challenging libraries.

387

51,843

Nan Jiang

Nan Jiang

@nanjiangwill

26 Sep 2024

So... can agents now build a package from scratch? Test them on Commit0! This is an amazing and fun project this summer! Huge thanks to Wenting and to everyone in the lab for their support and guidance! 🚀👏

Wenting Zhao

@wzhao_nlp

26 Sep 2024

381