Lei Li

Lei Li

96 Photos and videos

Tweets

Lei Li @lileics

May 1

More inventive to submit to GenBio2026 @genbio_workshop. The best paper teams will win the super powerful DGX Spark! Thanks to the generous support from @nvidia !

GenBio Workshop @ ICML26 @genbio_workshop

Apr 22

The best two academic papers will be awarded one DGX Spark each -- We thank NVIDIA for their generous support! Paper Submission Instructions: genbio-workshop.github.io/20…

841

Lei Li

Lei Li @lileics

May 1

The ICML 2026 workshop on Generative and Agentic AI for Biology has extended the submission deadline to May 8, 2026 AOE! Please consider submitting your cool work at openreview.net/group?id=ICML…

ICML 2026 Workshop GenBio

Welcome to the OpenReview homepage for ICML 2026 Workshop GenBio

openreview.net

Lei Li @lileics

Apr 16

If you are working on AI for biology, chemistry, drug discovery, please consider submit your latest work to ICML 2026 workshop on Generative and Agentic AI for Biology!

1,816

Graham Neubig

Lei Li retweeted

Graham Neubig

@gneubig

Apr 30

Today I'll give a talk about "Two Futures of Programming" at Amazon Research Day! Looking forward to seeing people in Palo Alto for those who attend, I'm sharing my materials online for those who can't.

114

6,982

Lei Li

Lei Li @lileics

Apr 16

Very grateful and excited to receive the support from @LaudeInstitute Moonshot Program for our proposed project on Scientific Agents for Physical Experimentation! cmu.edu/news/stories/archive… @AkariAsai @gneubig and Newell Washburn.

CMU Teams Recognized in Moonshots AI Competition

CMU research recently earned top recognition in the Laude Institute's Moonshots competition, which concentrates on applying AI to some of society’s most pressing challenges.

cmu.edu

Akari Asai

@AkariAsai

Apr 15

Our proposal on scientific agents for physical experimentation in the lab received an Honorable Mention from the Laude Moonshot Grant! Grateful for the recognition, and excited to explore this direction with @gneubig, @lileics, and Newell Washburn 🥳 cmu.edu/news/stories/archive…

1,871

Lei Li

Lei Li @lileics

Apr 16

If you are working on AI for biology, chemistry, drug discovery, please consider submit your latest work to ICML 2026 workshop on Generative and Agentic AI for Biology!

GenBio Workshop @ ICML26 @genbio_workshop

Apr 15

Excited to introduce the 2026 workshop on Generative and Agentic AI for Biology at ICML 2026! genbio-workshop.github.io/20… 1/5

3,086

Lei Li

Lei Li @lileics

Feb 23

5/ Check out the full paper and our live leaderboard here: 🔗 Project Page: leililab.github.io/susvibes-…📄 Paper: arxiv.org/abs/2512.03262 #VibeCoding #CyberSecurity #LLM #SoftwareEngineering #AIAgent

Lei Li @lileics

Feb 23

4/ Key Leaderboard Highlights: 🏆 Security Leader: @OpenHands GLM4.7 🏆 Functionality Leader: SWE-agent Claude 4 Sonnet If we are moving toward an agent-led dev cycle, we need to talk about security now, not later.

1,420

Lei Li

Lei Li @lileics

Feb 23

Lei Li @lileics

Feb 23

3/ The "Vibe" Trap: Even when we gave agents hints about potential vulnerabilities, they struggled to mitigate the risks.

2,283

Lei Li

Lei Li @lileics

Feb 23

3/ The "Vibe" Trap: Even when we gave agents hints about potential vulnerabilities, they struggled to mitigate the risks.

Lei Li @lileics

Feb 23

2/, We tested the world’s leading coding agents, and the results are a wake-up call for the industry: Functionality ≠ Security: For example, while SWE-Agent with Claude 4 Sonnet solved 61% of tasks correctly, only 10.5% of those solutions were actually secure.

1,451

Lei Li

Lei Li @lileics

Feb 23

Lei Li @lileics

Feb 23

🚀 Is "Vibe Coding" actually safe for production? We’ve all seen the demos: give an LLM agent a prompt, watch it work its magic, and boom—you have a feature. But there’s a massive hidden risk. In our latest paper, we introduce SUSVIBES, a benchmark of 200 real-world SE tasks.

1,735

Lei Li

Lei Li @lileics

Feb 23

Guilherme Favaron

@guifav

Feb 21

Your vibe coded app works. But is it secure? New benchmark SusVibes from Songwen Zhao, Danqing Wang, Kexun Zhang, Jiaxuan Luo, Zhuo Li, and @lileics at @CarnegieMellon, @Columbia, and @JohnsHopkins tested 200 real world feature requests on coding agents. The results are sobering: SWE Agent with Claude 4 Sonnet produced functionally correct code 61% of the time, but only 10.5% of solutions were actually secure. Even adding security hints to prompts did not fix the problem. The gap between 'it works' and 'it is safe to deploy' is massive. 77 different CWE vulnerability types showed up across the benchmark. Worth thinking about next time someone says AI will replace software engineers. The harder question was never about writing code that runs. It was always about writing code that does not break under adversarial conditions. Source: arxiv.org/abs/2512.03262

3,004

Lei Li

Lei Li @lileics

10 Dec 2025

Congratulations to all students in the “Generative AI for Biomedicine”!Truly amazing and excellent posters beyond my expectation! Thanks for co-instructor @jmuiuc and superb TAs @ZhenqiaoSong @ramith__ to make this course successful!

Jian Ma

@jmuiuc

8 Dec 2025

Poster day for our “Generative AI in Biomedicine” course this semester. The students’ creativity, energy, and enthusiasm for this exciting area are truly inspiring!

1,744

Lei Li

Lei Li @lileics

2 Dec 2025

I am at #NeurIPS2025 this week and happy to meet and chat about coding/reasoning agents, LLM security, privacy/copyright of genAI, and AI for drug/protein design. Also happy to meet prospective phd applicants to CMU and applicants to CMU GenAI/LLM certificate program.

2,064

Lei Li

Lei Li @lileics

16 Oct 2025

Meet LLaMAX2: a strong multilingual LLM which excels on 17 language's translation and reasoning! (it is actually based on QWen3 but since there is a prior LLaMAX model, we just reuse the name convention). as always, feedback is welcome

FeYuan @t_feyuan

15 Oct 2025

Replying to @t_feyuan

Welcome to use our models. More Details: 🎉 Paper: LLaMAX2: Your Translation-Enhanced Model also Performs Well in Reasoning (huggingface.co/papers/2510.0…) 🎉 Code: github.com/CONE-MT/LLaMAX2.0 🎉 Model: huggingface.co/collections/L…

2,269

Lei Li

Lei Li @lileics

16 Oct 2025

Excited and Congratulations to my colleague Maarten Sap for winning the prestigious Packard Fellowship for Science and Engineering! #CMU #LTI

Maarten Sap (he/him)@MaartenSap

15 Oct 2025

I’m ✨ super excited and grateful ✨to announce that I'm part of the 2025 class of #PackardFellows (packard.org/2025fellows). The Packard Foundation and this fellowship will allow me to explore exciting research directions towards culturally responsible and safe AI 🌍🌈

2,370

Martin Jinye Zhang

Lei Li retweeted

Martin Jinye Zhang @martinjzhang

9 Oct 2025

Can AI develop methods like a seasoned statistical geneticist? 🤔 In 8 hrs, our new method TusoAI improve two popular tools in genetics: scDRS ( 40% power) & pgBoost ( 11% enrichment). Preprint: arxiv.org/abs/2509.23986 Great work by @AlistairTurcan with @KexinHuang5 @lileics

TusoAI: Agentic Optimization for Scientific Methods

Scientific discovery is often slowed by the manual development of computational tools needed to analyze complex experimental data. Building such tools is costly and time-consuming because...

arxiv.org

20,803

Lei Li

Lei Li @lileics

26 Aug 2025

Come join us on 9/12 at CMU AI for Science workshop to present and discuss about how modern generative AI and foundation models accelerate scientific discoveries. We have an outstanding lineup of speakers and various poster/panel/lab/social activities. cmu-ai-for-science-workshop.…

Welcome to CMU AI for Science Workshop, 2025! – AI for Science Workshop at CMU September 12

We are hosting AI for Science Workshop at Carnegie Mellon University, Pittsburgh, PA, USA on September 12, 2025.

cmu-ai-for-science-workshop.github.io

Jiayi Geng

@JiayiiGeng

26 Aug 2025

📢 We're thrilled to announce the CMU AI for Science Workshop on Sept 12 at CUC-MPW! Featuring an amazing lineup of speakers: - Akari Asai (AI2/CMU) - Gabe Gomes (CMU) - Chenglei Si (Stanford) - Keyon Vafa (Harvard) Join us on campus, submit your poster & register here: cmu-ai-for-science-workshop.… Questions? Feel free to email: cmu-ai-for-science-workshop@andrew.cmu.edu We look forward to see you there!🤗

1,213

Lei Li

Lei Li @lileics

25 Aug 2025

Wonderful results of benchmarking LLM on MCP use from @michaelqshieh 👍

Michael Qizhe Shieh

@michaelqshieh

25 Aug 2025

Introducing MCPMark, a collaboration with @EvalSysOrg and @lobehub! We created a challenging benchmark to stress-test MCP use in comprehensive contexts. - 127 high-quality data samples created by experts. - GPT-5 takes the current lead and achieves a Pass@1 of 46.96% while the other models fall in the range of 10-30%. - Diverse test cases on Notion, Github, Filesystem, Playwright (browser), and Postgres. 9🧵s ahead

1,669

Lei Li

Lei Li @lileics

17 Aug 2025

Congratulations to AI2 @allen_ai on getting major support from @NSF and @nvidia to advance AI for scientific discovery, which is major area modern generative AI and foundation models can accelerate the progress!

Ai2

@allen_ai

14 Aug 2025

With fresh support of $75M from @NSF and $77M from @NVIDIA, we’re set to scale our open model ecosystem, bolster the infrastructure behind it, and fast‑track reproducible AI research to unlock the next wave of scientific discovery. 💡

7,135

Lei Li

Lei Li @lileics

18 Jul 2025

The show is on. Welcome to 2025 Generative AI for Biology workshop. 7 invited talks a panel with 5 panelists 14 spotlight talks 121 poster presentations! Huge thanks to the workshop sponsors: Genesis Therapeutics, Genbio AI, and Tencent! genbio-workshop.github.io/20…

1,681

Lei Li

Lei Li @lileics

18 Jul 2025

We have an excellent lineup of distinguished speakers at the Gen AI for Bio workshop! Join us in the East Exhibition Hall A on July 18, starting at 8:45am. #GenBio2025 #ICML2025

GenBio Workshop @ ICML26 @genbio_workshop

17 Jul 2025

Hope to see you all tomorrow at the GenAI & Bio workshop!! #ICML2025 Schedule: genbio-workshop.github.io/20…

1,048