Generative AI for language and science. MT, LLM, GenAI Safety, Drug Discovery

Joined April 2010
96 Photos and videos
More inventive to submit to GenBio2026 @genbio_workshop. The best paper teams will win the super powerful DGX Spark! Thanks to the generous support from @nvidia !
The best two academic papers will be awarded one DGX Spark each -- We thank NVIDIA for their generous support! Paper Submission Instructions: genbio-workshop.github.io/20…
1
3
841
The ICML 2026 workshop on Generative and Agentic AI for Biology has extended the submission deadline to May 8, 2026 AOE! Please consider submitting your cool work at openreview.net/group?id=ICML…
If you are working on AI for biology, chemistry, drug discovery, please consider submit your latest work to ICML 2026 workshop on Generative and Agentic AI for Biology!
1
2
13
1,816
Lei Li retweeted
Today I'll give a talk about "Two Futures of Programming" at Amazon Research Day! Looking forward to seeing people in Palo Alto for those who attend, I'm sharing my materials online for those who can't.
1
16
114
6,982
Very grateful and excited to receive the support from @LaudeInstitute Moonshot Program for our proposed project on Scientific Agents for Physical Experimentation! cmu.edu/news/stories/archive… @AkariAsai @gneubig and Newell Washburn.
Our proposal on scientific agents for physical experimentation in the lab received an Honorable Mention from the Laude Moonshot Grant! Grateful for the recognition, and excited to explore this direction with @gneubig, @lileics, and Newell Washburn 🥳 cmu.edu/news/stories/archive…
1
15
1,871
If you are working on AI for biology, chemistry, drug discovery, please consider submit your latest work to ICML 2026 workshop on Generative and Agentic AI for Biology!
Excited to introduce the 2026 workshop on Generative and Agentic AI for Biology at ICML 2026! genbio-workshop.github.io/20… 1/5
9
3,086
5/ Check out the full paper and our live leaderboard here: 🔗 Project Page: leililab.github.io/susvibes-…📄 Paper: arxiv.org/abs/2512.03262 #VibeCoding #CyberSecurity #LLM #SoftwareEngineering #AIAgent

4/ Key Leaderboard Highlights: 🏆 Security Leader: @OpenHands GLM4.7 🏆 Functionality Leader: SWE-agent Claude 4 Sonnet If we are moving toward an agent-led dev cycle, we need to talk about security now, not later.
1
5
1,420
4/ Key Leaderboard Highlights: 🏆 Security Leader: @OpenHands GLM4.7 🏆 Functionality Leader: SWE-agent Claude 4 Sonnet If we are moving toward an agent-led dev cycle, we need to talk about security now, not later.
3/ The "Vibe" Trap: Even when we gave agents hints about potential vulnerabilities, they struggled to mitigate the risks.
1
3
2,283
3/ The "Vibe" Trap: Even when we gave agents hints about potential vulnerabilities, they struggled to mitigate the risks.
2/, We tested the world’s leading coding agents, and the results are a wake-up call for the industry: Functionality ≠ Security: For example, while SWE-Agent with Claude 4 Sonnet solved 61% of tasks correctly, only 10.5% of those solutions were actually secure.
1
1,451
2/, We tested the world’s leading coding agents, and the results are a wake-up call for the industry: Functionality ≠ Security: For example, while SWE-Agent with Claude 4 Sonnet solved 61% of tasks correctly, only 10.5% of those solutions were actually secure.
🚀 Is "Vibe Coding" actually safe for production? We’ve all seen the demos: give an LLM agent a prompt, watch it work its magic, and boom—you have a feature. But there’s a massive hidden risk. In our latest paper, we introduce SUSVIBES, a benchmark of 200 real-world SE tasks.
4
1,735
🚀 Is "Vibe Coding" actually safe for production? We’ve all seen the demos: give an LLM agent a prompt, watch it work its magic, and boom—you have a feature. But there’s a massive hidden risk. In our latest paper, we introduce SUSVIBES, a benchmark of 200 real-world SE tasks.
Your vibe coded app works. But is it secure? New benchmark SusVibes from Songwen Zhao, Danqing Wang, Kexun Zhang, Jiaxuan Luo, Zhuo Li, and @lileics at @CarnegieMellon, @Columbia, and @JohnsHopkins tested 200 real world feature requests on coding agents. The results are sobering: SWE Agent with Claude 4 Sonnet produced functionally correct code 61% of the time, but only 10.5% of solutions were actually secure. Even adding security hints to prompts did not fix the problem. The gap between 'it works' and 'it is safe to deploy' is massive. 77 different CWE vulnerability types showed up across the benchmark. Worth thinking about next time someone says AI will replace software engineers. The harder question was never about writing code that runs. It was always about writing code that does not break under adversarial conditions. Source: arxiv.org/abs/2512.03262
4
2
7
3,004
10 Dec 2025
Congratulations to all students in the “Generative AI for Biomedicine”!Truly amazing and excellent posters beyond my expectation! Thanks for co-instructor @jmuiuc and superb TAs @ZhenqiaoSong @ramith__ to make this course successful!
8 Dec 2025
Poster day for our “Generative AI in Biomedicine” course this semester. The students’ creativity, energy, and enthusiasm for this exciting area are truly inspiring!
9
1,744
2 Dec 2025
I am at #NeurIPS2025 this week and happy to meet and chat about coding/reasoning agents, LLM security, privacy/copyright of genAI, and AI for drug/protein design. Also happy to meet prospective phd applicants to CMU and applicants to CMU GenAI/LLM certificate program.
6
1
22
2,064
16 Oct 2025
Meet LLaMAX2: a strong multilingual LLM which excels on 17 language's translation and reasoning! (it is actually based on QWen3 but since there is a prior LLaMAX model, we just reuse the name convention). as always, feedback is welcome
15 Oct 2025
Replying to @t_feyuan
Welcome to use our models. More Details: 🎉 Paper: LLaMAX2: Your Translation-Enhanced Model also Performs Well in Reasoning (huggingface.co/papers/2510.0…) 🎉 Code: github.com/CONE-MT/LLaMAX2.0 🎉 Model: huggingface.co/collections/L…
1
1
9
2,269
16 Oct 2025
Excited and Congratulations to my colleague Maarten Sap for winning the prestigious Packard Fellowship for Science and Engineering! #CMU #LTI
I’m ✨ super excited and grateful ✨to announce that I'm part of the 2025 class of #PackardFellows (packard.org/2025fellows). The Packard Foundation and this fellowship will allow me to explore exciting research directions towards culturally responsible and safe AI 🌍🌈
1
5
2,370
Lei Li retweeted
Can AI develop methods like a seasoned statistical geneticist? 🤔 In 8 hrs, our new method TusoAI improve two popular tools in genetics: scDRS ( 40% power) & pgBoost ( 11% enrichment). Preprint: arxiv.org/abs/2509.23986 Great work by @AlistairTurcan with @KexinHuang5 @lileics
4
22
76
20,803
26 Aug 2025
Come join us on 9/12 at CMU AI for Science workshop to present and discuss about how modern generative AI and foundation models accelerate scientific discoveries. We have an outstanding lineup of speakers and various poster/panel/lab/social activities. cmu-ai-for-science-workshop.…
26 Aug 2025
📢 We're thrilled to announce the CMU AI for Science Workshop on Sept 12 at CUC-MPW! Featuring an amazing lineup of speakers: - Akari Asai (AI2/CMU) - Gabe Gomes (CMU) - Chenglei Si (Stanford) - Keyon Vafa (Harvard) Join us on campus, submit your poster & register here: cmu-ai-for-science-workshop.… Questions? Feel free to email: cmu-ai-for-science-workshop@andrew.cmu.edu We look forward to see you there!🤗
1,213
25 Aug 2025
Wonderful results of benchmarking LLM on MCP use from @michaelqshieh 👍
Introducing MCPMark, a collaboration with @EvalSysOrg and @lobehub! We created a challenging benchmark to stress-test MCP use in comprehensive contexts. - 127 high-quality data samples created by experts. - GPT-5 takes the current lead and achieves a Pass@1 of 46.96% while the other models fall in the range of 10-30%. - Diverse test cases on Notion, Github, Filesystem, Playwright (browser), and Postgres. 9🧵s ahead
1
8
1,669
17 Aug 2025
Congratulations to AI2 @allen_ai on getting major support from @NSF and @nvidia to advance AI for scientific discovery, which is major area modern generative AI and foundation models can accelerate the progress!
14 Aug 2025
With fresh support of $75M from @NSF and $77M from @NVIDIA, we’re set to scale our open model ecosystem, bolster the infrastructure behind it, and fast‑track reproducible AI research to unlock the next wave of scientific discovery. 💡
1
43
7,135
18 Jul 2025
The show is on. Welcome to 2025 Generative AI for Biology workshop. 7 invited talks a panel with 5 panelists 14 spotlight talks 121 poster presentations! Huge thanks to the workshop sponsors: Genesis Therapeutics, Genbio AI, and Tencent! genbio-workshop.github.io/20…
1
2
6
1,681
18 Jul 2025
We have an excellent lineup of distinguished speakers at the Gen AI for Bio workshop! Join us in the East Exhibition Hall A on July 18, starting at 8:45am. #GenBio2025 #ICML2025
Hope to see you all tomorrow at the GenAI & Bio workshop!! #ICML2025 Schedule: genbio-workshop.github.io/20…
5
1,048