Assistant Professor @Northwestern; CS Ph.D. @ASU DMML; Reliable and Efficient AI; Formerly @GoogleDeepMind @MSFTResearch, @Amazon Alexa AI;

Joined March 2018
3 Photos and videos
Kaize Ding retweeted
A Survey of Large Language Models for Text-Guided Molecular Discovery: from Molecule Generation to Optimization 1.This is the first focused survey on using large language models (LLMs) for molecule generation and optimization, introducing a novel taxonomy based on learning paradigms—covering both tuning-free (e.g., zero-shot, in-context learning) and tuning-based (e.g., supervised fine-tuning, preference tuning) methods. 2.The survey highlights how LLMs are uniquely positioned for molecular discovery due to their emergent capabilities—such as in-context learning, reasoning, and instruction following—which allow them to generalize across diverse chemical tasks without task-specific retraining. 3.In molecule generation, LLMs are deployed via prompting strategies (e.g., LLM4GraphGen, MolReGPT) or adapted through supervised datasets (e.g., Mol-Instructions, LlaSMol, ChatMol). Preference-tuned models like SmileyLlama and Mol-MoE show improved fidelity to molecular constraints. 4.For molecule optimization, the review examines how LLMs refine existing molecules through goal-directed editing. Strategies include zero-shot optimization (LLM-MDE), retrieval-augmented prompting (ChatDrug), and evolution-based in-context learning (MOLLM, LLM-EO). 5.The survey identifies a trend toward hybrid frameworks combining fine-tuned worker models with external reasoning agents (e.g., MultiMol, DrugAssist), often leveraging GPT-4o or domain-specific scoring functions to enhance candidate selection and validation. 6.Multi-modal modeling is a growing focus, with models like UniMoT and Molx-Enhanced LLM incorporating graph or 3D inputs into LLMs via specialized tokenizers and embedding schemes, enabling structurally-aware generation and optimization. 7.Benchmarking frameworks are categorized into structure-based (validity, uniqueness, diversity) and property-based (LogP, QED, synthetic accessibility, Pareto-optimality) metrics. The paper also provides a detailed summary of standard datasets for pretraining and evaluation. 8.The survey emphasizes the limitations of current LLMs: hallucinations, lack of transparency, and domain-incoherent outputs. Future work should prioritize trustworthy generation, interpretability, and error-aware prompting to enhance reliability. 9.Emerging directions include LLM-driven agent frameworks that integrate external tools (e.g., retrosynthesis engines, docking software) for iterative design, as well as cross-modal models that jointly encode chemical topology, text, and spatial information. 10.A continuously updated repository of LLM-centric molecular research is provided at github, making this survey a central resource for the field. 💻Code: github.com/REAL-Lab-NU/Aweso… 📜Paper: arxiv.org/abs/2505.16094 #LLM #MoleculeGeneration #MolecularOptimization #DrugDiscovery #ChemLLM #AI4Science #InContextLearning #SMILES #MolecularDesign #LargeLanguageModels
3
9
1,557
17 May 2025
📣 We are seeking exceptional postdoctoral candidates on AI4Health at Northwestern University! Please share with anyone who might be interested in this exciting opportunity! #Postdoc #AI4Health #ML #AI #LLM #MedicalAI #Northwestern
1
4
924
17 May 2025
The research fellow will be working on a collaborative project and jointly advised by me, Prof. Noelle Samia, and Prof. Bonnie Martin-Harris.
1
457
Kaize Ding retweeted
10 Nov 2024
📢 New paper: AskGNN - Making LLMs graph-aware through in-context learning! Our GNN-powered retriever learning-to-retrieve approach enables LLMs to process graph data effectively. No fine-tuning needed. 7 LLMs tested, 3 tasks, strong results. 🔗 arxiv.org/abs/2410.07074

1
1
1
2,429
Kaize Ding retweeted
🚨 Call for Workshop Papers at #KDD2024 🚨 Submit your paper to the KDD’24 Workshop on Resource-Efficient Learning for Knowledge Discovery 📆 June 30 chuxuzhang.github.io/RelKD/

2
3
3,371
Kaize Ding retweeted
***Internship Opportunity*** We're hiring interns. Come, join us in building an ML platform reinvented for real-time. You'll gain hands-on experience in building ML systems from the ground up. 💸 Stipend: ₹1L/month 🗓️ Start Date: May/June 2024 📍 Location: Virtual 🚀 Career Path: We'll roll out PPOs to top performers Open Roles: 1. ML 2. Infra 3. DevOps 4. Streaming Systems 5. UI/UX If you're interested, please fill out this form: forms.gle/SQxiXALXzztQyWbQA #internship #hiring

108
125
811
254,258
Kaize Ding retweeted
We heard that you missed the benefits of the standard registration. Don't worry, the deadline has been extended to February 18th. Register today: wsdm-conference.org/2024/reg… #WSDM2024 #WSDMCUP2024 #sigkdd #sigmod #sigir #sigweb #acm
4
5
2,596
14 Dec 2023
📢 Interested in statistical machine learning and data science? Don't forget to submit your application to our Ph.D. program before Jan 5th! Details can be found at: statistics.northwestern.edu/… If you are also attending NeurIPS, feel free to talk to me!

1
18
2,710
Kaize Ding retweeted
#WSDM2024 is open for registration. Make sure to grab your spot soon. Early registration ends December 17th! Come learn from industry and academia experts such as @Google's VP Elizabeth Hamon Reid from and Nicolas Cristin (@nc2y). Register now at wsdm-conference.org/2024/reg…
6
11
3,883
Kaize Ding retweeted
🎉 Exciting News! 📢 Our paper on "GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs" has been accepted at #EMNLP findings! 📚🔍 Thanks for the co-authors @kaize0409, Kyumin Lee. State tuned for the preprint and code.
1
1
10
2,127
Kaize Ding retweeted
Working with Prof. Liu and Prof. Wang (my former phd and visiting phd students) to organize a special issue on Data Centric AI. Mathematics is a very decent journal. We are looking forward to your discussions on such interesting topics.
📢 #Mathematics New Special Issue open for submission! ✨Title: Advanced Research in Data-Centric #AI 🔗 Details: buff.ly/44Lv3EY #data_science #graph_mining #statistical_machine_learning @MDPIOpenAccess @ComSciMath_Mdpi
1
2
2,438
Kaize Ding retweeted
We are pleased to announce the call for Workshop Proposals for the #WSDM2024, which will take place for the first time in LATAM at Mérida, México wsdm-conference.org/2024/cal… Proposals Due: October 5, 2023 Acceptance Notifications: November 2, 2023
8
13
5,063
Heading to Long Beach for KDD'23! This time I will present our recent work "Learning Strong Graph Neural Networks with Weak Information". If you are interested in data-efficient graph learning, you are welcome to join the oral and poster sessions!
3
3
30
3,874
Happy to chat about research if you are around! Also, I'm recruiting students to join my group at Northwestern University. Let me know if you are interested!
2
1,023
Oral: 4:00 pm – 4:20 pm, Tuesday, August 8, Room 201A Poster: 4:00 pm – 4:20 pm, Monday, August 7, Hall A, #386
925
Kaize Ding retweeted
6 Aug 2023
🙋 Welcome to ReIKD @kdd_news !! ⏰Aug 7, 1-5 pm (PDT), 202B. Conversations on generative AI agents🤖 are also super welcome! #KDD2023
The International Workshop on Resource-Efficient Learning for Knowledge Discovery (RelKD 2023 #KDD2023 ) is just around the corner! 🗓️ Monday, August 7, 1:00 pm – 5:00 pm (PDT) 📍 202B, Long Beach Convention & Entertainment Center Check our website: ncsu-dk-lab.github.io/worksh…
1
5
2,407
Kaize Ding retweeted
Announcing the call for proposals #WSDM2024 Cup. #WSDM2024 will take place in Mérida, México wsdm-conference.org/2024/cal… Submission: 8/15/2023 Notification: 8/31/2023 Previous prev partners #Google, #Amazon, #iQiYi, #Microsoft, #Wikimedia, #Adobe, #ByteDance, #Spotify, #Baidu.
4
9
3,540
Proud to be one of the DMMLers and to work with Dr. Liu, well deserved!@liuhuan Regents Professor is an AI explorer of 4 decades news.asu.edu/20230207-univer…

1
26
1,769
Kaize Ding retweeted
Few-shot learning on graphs via pretrained contrastive learning, ft. Zhen Tan, Song Wang, @kaize0409, @LiJundong, Huan Liu
1
3
9
28 Nov 2022
Next week I will be attending two conferences #icdm2022 #neurips2022 back-to-back: Nov 28th - Dec 1st @ICDM and Dec 1st - Dec 3rd @NeurIPS. Looking forward to seeing you around and please check out our works in #GraphML!
1
14
28 Nov 2022
1. Generalized Few-shot Node Classification (public.asu.edu/~kding9/pdf/Z…), Oral Session A2, Nov 29th, 10:30 am -12:00 pm 2. BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs (openreview.net/forum?id=YXvG…), Poster Session 6, Dec 1st, 4:30 pm to 6:00 pm
1