Manling Li

Manling Li

Photos and videos

Tweets

KnowledgeLM Workshop retweeted

Manling Li

@ManlingLi_

Mar 19

What left for humans with powerful coding agents? Right now, we evaluate agents mostly on Success Rate. But if fixing one simple issue by adding 2000 lines of spaghetti code, is that a win? I see the AI agents solve problems by endlessly adding new functions, growing into chaotic, million-line codebase that no humans can manage. But top engineers indeed care about the elegant simplicity beneath the mess (hello, Occam's Razor). What is left for humans? Might be just this. Yeah I became more and more excited about Abstraction. This paper is only about Abstracting and Reusing Skills, like macro functions. But might be a baby-step start.

Shiqi Chen @shiqi_chen17

Mar 9

📍 Can LLMs discover, abstract, and reuse higher-level tool skills across tasks? Existing tool-use benchmarks test solving tasks with fixed tools. But real workflows contain recurring structures where efficiency comes from reusable tool compositions, not isolated calls. We introduce SkillCraft: 126 tasks across 6 domains designed to test whether LLM agents can acquire compositional skills, not just call atomic tools. We also propose Skill Mode, a lightweight protocol with four MCP primitives that let agents compose, verify, cache, and reuse tool chains at test time. Our Key findings across evaluating 8 SOTA models: ⚡Skill Mode enables agents to self-discover and reuse skills, leading to higher success and efficiency than agents without it. The gains are larger for stronger models. 🧠 Stronger models (e.g., Claude) discover more generalizable skills, which transfer across tasks and even across models. 🔍 Deeper composition ≠ better — shallow, well-tested skills generalize best. 🔗 Paper: arxiv.org/abs/2603.00718 💻 Code: github.com/shiqichen17/Skill… 🏠 Page: skillcraft-website.github.io… (1/7)

1:16

15,135

Manling Li

KnowledgeLM Workshop retweeted

Manling Li

@ManlingLi_

Mar 13

Failure mode of LLM Agent RL training: reasoning shrinks, shorter and more similar. "diversity" has been a key to make LLM Agent RL training work, but I have always been wondering how to define "diversity". RAGEN used Entropy; RAGEN-v2 introduces Mutual Information (MI). The key insight comes from this decomposition: H(Z) = H(Z|X) I(X;Z) So we can systemically classify four types of reasoning evolving patterns: - diverse reasoning - compression reasoning - entropy collapse - template collapse Top-p filtering: The most fascinating thing is that we find top-p filtering using reward variance is simple, but effective! We also try to explain this failure mode from gradient updates, check more at @wzenus 's threads 👇

Zihan "Zenus" Wang

@wzenus

Mar 12

In Agent RL, models suffer from Template Collapse. They generate vast, diverse outputs (High Entropy) that lose all meaningful connection to the input prompt (Low Mutual Information). In other words, agent learn different ways to say nothing. 🚀 Introducing RAGEN-v2 -- Here's how we define and fix such silent failure modes in Agent RL. 🧵

0:58

137

37,753

Manling Li

KnowledgeLM Workshop retweeted

Manling Li

@ManlingLi_

Feb 16

1. What is a good exploration? More steps ≠ more information. Good exploration = prioritize information gain per step, so that forming a complete internal map of the world. It is about knowing what you don’t know, and choosing actions that reduce that uncertainty. We ask LLMs/VLMs the best action to take next: not to solve a task, not to maximize a task reward, but to reduce spatial uncertainty, to build an internal spatial belief of the world that can support future spatial reasoning.

0:16

2,522

KnowledgeLM Workshop

KnowledgeLM Workshop @lm_knowledge

15 Dec 2025

RT @ManlingLi_: Huge congrats to @hengjinlp on being named an ACL Fellow! I still feel incredibly lucky to have been advised by her. Sub…

Manling Li

KnowledgeLM Workshop retweeted

Manling Li

@ManlingLi_

3 Dec 2025

VAGEN poster at #NeurIPS: ⏲️11am-2pm Wed 📍Exhibit Hall C,D,E #5502 We look forward to discussing with you about: 1. MDP → POMDP 2. World modeling in agent internal belief 3. What is a good representation in agent internal belief for visual states? 4. How to use World Modeling to help reward shaping? 5. How to do turn-level critic learning? Drop by if you are interested in related topics!

0:26

Zihan "Zenus" Wang

@wzenus

3 Dec 2025

VAGEN poster 𝐭𝐨𝐦𝐨𝐫𝐫𝐨𝐰 at #NeurIPS! 🎮🧠 - 🕚 11am–2pm Wed - 📍 Exhibit Hall C,D,E #5502 We had much fun exploring: • How 𝐰𝐨𝐫𝐥𝐝 𝐦𝐨𝐝𝐞𝐥𝐢𝐧𝐠 helps VLM RL agents learn better policies • 𝐌𝐮𝐥𝐭𝐢-𝐭𝐮𝐫𝐧 𝐏𝐏𝐎 credit assignment via 𝐭𝐰𝐨-𝐥𝐞𝐯𝐞𝐥 𝐚𝐝𝐯𝐚𝐧𝐭𝐚𝐠𝐞 𝐞𝐬𝐭𝐢𝐦𝐚𝐭𝐨𝐫 (Bi-Level GAE) for turn-level and token-level critic learning Come chat about agents, RL, and world models 👀

118

15,667

Qineng Wang

KnowledgeLM Workshop retweeted

Qineng Wang

@qineng_wang

24 Nov 2025

Most VLM benchmarks watch the world; few ask how actions *change* it from a robot's eye. Embodied cognition tells us that intelligence isn't just watching – it's enacted through interaction. 👉We introduce ENACT: A benchmark that tests if VLMs can track the evolution of a home-scale environment from a robot's egocentric view. 🌐enact-embodied-cognition.git… 📄enact-embodied-cognition.git… 1/N

1:31

247

142,915

KnowledgeLM Workshop

KnowledgeLM Workshop @lm_knowledge

24 Nov 2025

Join her lab!

Manling Li

@ManlingLi_

24 Nov 2025

We are looking for PhDs and Postdocs! So proud of my students on achieving so many amazing things during their "very first year". I have been asked many times how I like being faculty, especially with funding cuts. My answer is always "it is the prefect job for me"! Still deep in the honeymoon phase. The only reason is the students are so amazing, making my transition so much easier. One year in, they already collected paper awards, orals, spotlights, etc What makes me proudest is they are vividly alive: curious, playful, confident in their own weird way, light up when talking about ideas, and never afraid to explore "the thing might fail". Everyone is just… themselves. And somehow, that version of themselves keeps shipping amazing work. In today's anxious academic world, this kind of aliveness is what I will try best to protect. Maybe the best part of being an advisor is that every student is so different and unique lol Interestingly, coming to second year, they've got their own passions, I can't just plug my ideas into their heads. So when I get excited about sth new, my first thought is: "Okay, time to find some fresh first-years who will be thrilled about this!" MLL lab is 1 year old, we started right in Oct 2024. We are growing and looking for more phds to join us! 1. Why our lab? (1/2) 2. Why @northwesterncs? (2/2) In 2025 alone: NU has 7 faculty as Sloan Fellows, plus a Nobel winner! Check more below

372

Niloofar ✈️ icml

KnowledgeLM Workshop retweeted

Niloofar ✈️ icml

@niloofar_mire

24 Jul 2025

🧵 Academic job market season is almost here! There's so much rarely discussed—nutrition, mental and physical health, uncertainty, and more. I'm sharing my statements, essential blogs, and personal lessons here, with more to come in the upcoming weeks! ⬇️ (1/N)

258

30,901

KnowledgeLM Workshop

KnowledgeLM Workshop @lm_knowledge

30 Jun 2025

What is the difference between spatial reasoning and text-based reasoning?

Manling Li

@ManlingLi_

30 Jun 2025

Can VLMs build Spatial Mental Models like humans? Reasoning from limited views? Reasoning from partial observations? Reasoning about unseen objects behind furniture / beyond current view? Check out MindCube! 🌐mll-lab-nu.github.io/mind-cu… 📰arxiv.org/pdf/2506.21458 🤗huggingface.co/datasets/MLL-… 👩‍💻github.com/mll-lab-nu/MindCu…

1:26

Manling Li

KnowledgeLM Workshop retweeted

Manling Li

@ManlingLi_

22 May 2024

[KnowledgeLM @ ACL24] @lm_knowledge 🚨 Update: We've extended the paper submission deadline to May 30 to accommodate COLM review releasing. 📢 We welcome submissions of Finding papers to present at our workshop! We have lined up wonderful speakers, and we are eager to engage with you in Thailand! Meet with our organizers: @ZoeyLi20 @hengjinlp @megamor2 @eunsolc @mjqzhang @peterbhase @mohitban47 @preslav_nakov @Meng_CS @JiaweiHan Website: knowledgeable-lm.github.io/

13,206

KnowledgeLM Workshop

KnowledgeLM Workshop @lm_knowledge

13 Apr 2024

🚀 Knowledgeable Language Model Workshop at ACL24 @aclmeeting Are you ever curious about how much LLMs know? Do you ever wish that LLMs could become smarter with more knowledge? Or maybe you are thinking about removing certain facts from its memory? knowledgeable-lm.github.io/

KnowFM Workshop @ ACL 2026 — Towards Knowledgeable Foundation Models

Exploring knowledge in foundation models: emergence, injection, updating, probing, and generation. San Diego, 2026.

knowledgeable-lm.github.io

15,123

more replies

KnowledgeLM Workshop

KnowledgeLM Workshop @lm_knowledge

13 Apr 2024

If you feel captivated by these problems, come join us at the Knowledge Language Model Workshop at ACL!

323

KnowledgeLM Workshop

KnowledgeLM Workshop @lm_knowledge

13 Apr 2024

We will have a Best Paper Award, supported by @amazon. Appreciate it!!

252