Grateful to share that SSAIL lab had 4 papers accepted to ICLR 2026 and 1 paper accepted to MLSys 2026, with all results released on the same day!
The accepted works span long-context extension, automated sequence parallelism, lookahead GRPO optimization, agentic multimodal tool using, and high-performance LLM inference.
Huge credit to the students and collaborators for pushing these ideas through many iterations, and thanks to the reviewers for the thoughtful feedback.
ICLR'26
1. From Collapse to Control: Understanding and Extending Context Length in Emerging Hybrid Models via Universal Position Interpolation,
openreview.net/pdf?id=MjmORK…, fantastic work by Haochen Shen and Zheng Wang, in collaboration with IBM researchers Davis Wertheimer, Naigang Wang, Mudhakar Srivatsa, Raghu Ganti
2. AutoSP: Unlocking Long-Context LLM Training Via Compiler-Based Sequence Parallelism,
openreview.net/pdf?id=0fgsHv…, led by Ahan Gupta, with Zhihao Wang, Neel Dani, and collaborators Tunji Ruwase and Masahiro Tanaka
3. Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning,
openreview.net/pdf?id=xBlHiH…, Co-led by Zheng Wang and the team.
4. VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use,
openreview.net/pdf?id=Idst6X…, led by Mingyuan Wu.
MLSys'26
5. SuperInfer: SLO-Aware Rotary Scheduling and Memory Management for LLM Inference on Superchips, led by Jiahuan Yu with Mingtao Hu and Zichao Lin.
#SSAIL #MLSys #ICLR