Chair Prof in AI, Associate Prof @iitdelhi; ACM Distinguished Speaker; Lab @lcs2lab; Previously @IIITDelhi @UofMaryland @iitkgp; #NLP #LLMs

Joined October 2014
183 Photos and videos
Pinned Tweet
๐ŸŒŸ ๐€ ๐๐ž๐ฐ T๐ž๐ฑ๐ญ๐›๐จ๐จ๐ค -- ๐ˆ๐ง๐ญ๐ซ๐จ๐๐ฎ๐œ๐ญ๐ข๐จ๐ง ๐ญ๐จ ๐‹๐š๐ซ๐ ๐ž ๐‹๐š๐ง๐ ๐ฎ๐š๐ ๐ž ๐Œ๐จ๐๐ž๐ฅ๐ฌ ๐ŸŒŸ I am excited to share the release of my new textbook, ๐˜๐˜ฏ๐˜ต๐˜ณ๐˜ฐ๐˜ฅ๐˜ถ๐˜ค๐˜ต๐˜ช๐˜ฐ๐˜ฏ ๐˜ต๐˜ฐ ๐˜“๐˜ข๐˜ณ๐˜จ๐˜ฆ ๐˜“๐˜ข๐˜ฏ๐˜จ๐˜ถ๐˜ข๐˜จ๐˜ฆ ๐˜”๐˜ฐ๐˜ฅ๐˜ฆ๐˜ญ๐˜ด (#LLMs) -- Perhaps the first textbook on LLMs. Target Audience: ๐Ÿ‘‰ Students/beginners, Looking for a structured starting point to learn LLMs ๐Ÿ‘‰ Teachers, planning to offer a course on LLMs ๐Ÿ‘‰ Industry professional, seeking to deepen their understanding of LLMs Explore the Book: ๐Ÿ”— Book Website: tanmoychak.com/llmbook/ ๐Ÿ“‘ Table of Contents: tanmoychak.com/llmbook/toc.pโ€ฆ ๐Ÿ›’ Available on Amazon: amazon.in/dp/936386474X/ Enhance Your Learning Experience: ๐Ÿ‘‰ Slides & Lecture Videos: Chapter-wise resources -- lcs2-iitd.github.io/ELL881-Aโ€ฆ ๐Ÿ‘‰ Exercises & Solutions: Practice with detailed chapter exercises (solutions available on request). ๐Ÿ‘‰ Upcoming @nptel_official Course: Starting January 2025! Preview here: onlinecourses.nptel.ac.in/noโ€ฆ Book Endorsement: ๐Ÿ“– Foreword by Prof. Tim Baldwin @eltimster ๐Ÿ‘ Endorsements from Prof. Iryna Gurevych @IGurevych and Prof. Pushpak Bhattacharyya #LLMs #Textbook @iitdelhi @WileyIndiaPL @lcs2lab
22
77
9,972
Tanmoy Chakraborty retweeted
What makes a good teacher? On-policy distillation has spent the year reinventing loss functions to fix problems that come from one source: the teacher doesn't know the student. New article on why every popular OPD loss has an unbounded advantage and why the fix isn't another loss. Read the full article at - x.com/ayans007/status/206541โ€ฆ w/ @Tanmoy_Chak @lcs2lab #LLM #KnowledgeDistillation

1
2
273
Tanmoy Chakraborty retweeted
Markovian ODE-guided scoring can assess the quality of offline reasoning traces in language models Arghodeep Nandi, Ojasva Saxena, Tanmoy Chakraborty arxiv.org/abs/2603.01580 [๐šŒ๐šœ.๐™ฒ๐™ป]
1
2
89
Tanmoy Chakraborty retweeted
A reporting checklist for large language models in behavioural science dlvr.it/TSxtMM
10
40
7,533
Check out our recent work published in Nature Human Behaviour on responsible LLM checklist: nature.com/articles/s41562-0โ€ฆ @NatureHumBehav

Improving the use of AI in behavioral science: LLMs are used widely in the behavioral sciences. But we have no good standards for how to do so. We introduce a consensus-based reporting checklist to improve transparency, reproducibility and ethical accountability of large-language-model-based research in the behavioural sciences. nature.com/articles/s41562-0โ€ฆ
3
1,043
This is huge. Our PEFT method, MonteCLoRA, has been merged with @huggingface. Do use it. Believe me. It is much much better than LoRA in terms of efficiency and stability.
Excited to share that our work, #MonteCLoRA, has officially been merged into the #HuggingFace PEFT library! ๐Ÿฅณ github.com/huggingface/peft/โ€ฆ Build #peft from source to use it right away! ๐Ÿš€ ๐Ÿ“œ Paper: arxiv.org/abs/2411.04358 ๐Ÿค— Docs: huggingface.co/docs/peft/maiโ€ฆ
20
2,399
Time to celebrate acceptance of two papers in ๐ˆ๐‚๐Œ๐‹'26, including one ๐’๐ฉ๐จ๐ญ๐ฅ๐ข๐ ๐ก๐ญ (top 2.2%) ๐ŸŽ‰ ๐Ÿ‘‰ Polaris: Coupled Orbital Polar Embeddings for Hierarchical Concept Learning ๐Ÿ“” arxiv.org/pdf/2605.00265 โœจ Introduces Polaris -- a hyperspherical embedding framework that decouples semantics from hierarchy using orbital geometry, uncertainty-aware learning, and efficient retrieval. ๐Ÿ‘‰ Linguistic Properties and Model Scale in Brain Encoding: From Small to Compressed Language Models (#๐’๐ฉ๐จ๐ญ๐ฅ๐ข๐ ๐ก๐ญ) ๐Ÿ“” arxiv.org/pdf/2602.07547 โœจ Shows that compact ~3B models can match much larger LLMs in brain alignment, with robustness even under compression. Grateful to all collaborators and students for the amazing work! ๐Ÿš€ @icmlconf @lcs2lab @iitdelhi #ICML26

3
4
84
5,611
Tanmoy Chakraborty retweeted
๐Ÿ‡ง๐Ÿ‡ท #LCS2 goes to #Rio ๐Ÿ‡ง๐Ÿ‡ท Presenting our paper where we move beyond memoryless personalization โ†’ modeling user preferences as action-conditioned geometric walks with memory for better, user-aligned summaries. See you at #Riocentro ๐Ÿš€ #Personalization #RepresentationLearning
Happy to announce that our paper has been accepted to #ICLR2026! ๐ŸŽ‰ ๐Ÿ“œ Beyond Markovian Drifts: Action-Biased Geometric Walks with Memory for Personalized Summarization ๐Ÿ‘ฅ Parthiv Chatterjee, Asish Batha, Tashvi Patel, @sourish_rygbee, @Tanmoy_Chak Congratulations to all authors!
1
3
462
Tanmoy Chakraborty retweeted
๐Ÿšจ CLEF 2026 - CheckThat! Lab We are excited to announce that we are organising a task at this yearโ€™s CheckThat! Lab, which extends the fact-checking pipeline with a new task focused on an important step in professional fact-checking: generating full fact-checking articles ๐Ÿ“ฐ
1
1
5
253
Tanmoy Chakraborty retweeted
๐Ÿšจ Submissions are now open for the Conference for AI Scientists (CAISc) 2026, co-organised by Lossfunk and @bitspilaniindia. Submit to probe what happens when AI systems drive scientific discovery. Submissions are open until May 15! Here is everything you need to know ๐Ÿงต
3
28
102
26,016
Tanmoy Chakraborty retweeted
An AI system MUST be the primary author: that's the only rule! Thrilled to be co-organizing this pioneering conference CAISc 2026! Send in your AI-driven research by May 15th.. @bitspilaniindia @ramgopal_rao @murari_ai @Tanmoy_Chak @palashiitkgp
๐Ÿšจ Submissions are now open for the Conference for AI Scientists (CAISc) 2026, co-organised by Lossfunk and @bitspilaniindia. Submit to probe what happens when AI systems drive scientific discovery. Submissions are open until May 15! Here is everything you need to know ๐Ÿงต
2
13
1,165
Six papers from our lab have been accepted for publication in #ACL2026. The papers cover topics including Interpretability, empowering small VLMs with advanced tool calling, LLM personalisation, and different benchmarking. #nlproc @aclmeeting
1
5
78
3,867
I strongly condemn and protest against rejecting a paper from ACL with such a justification. If I am not mistaken, "Findings" started with the motivation of accommodating such borderline "good" papers. I donโ€™t see any reason behind such a justification, given that ACL does not have any venue constraints (runs in hybrid mode). #ACL2026 #NLProc @aclmeeting
2
4
83
15,609
Our newly introduced ๐†๐”๐ˆ๐ƒ๐„-๐‹๐‹๐Œ -- A reporting checklist for using LLMs in behavioral & social science. Massive collaborative effort led by @stfeuerriegel.
๐Ÿš€Introducing ๐†๐”๐ˆ๐ƒ๐„-๐‹๐‹๐Œ: A reporting checklist for using LLMs in behavioral & social science โœ…GUIDE-LLM is a reporting checklist designed by 80 experts to improve transparency, reproducibility & ethical accountability of LLM-based research ๐Ÿ“„llm-checklist.com
7
1,229
Tanmoy Chakraborty retweeted
2/ The organising committee for CAISc 2026 is led by @paraschopra, @dhruvtrehan9, and @gargdhruv36. We are glad to have @Tanmoy_Chak (IIT Delhi), Palash Goyal (Google Research), Dr Mohan Kankanhalli (NUS AI Institute), Shirish Karande (TCS Research) on our steering committee, and @murari_ai and Pratik Narang as our Program Committee Chairs. Additionally, our program committee for final human review spans CS, Mathematics, electrical engineering, and not just ML.
1
2
23
2,490
Our new study on interpretability explains -- ๐ญ๐ก๐ž ๐๐ก๐ฒ๐ฌ๐ข๐œ๐ฌ ๐จ๐Ÿ ๐Š๐• ๐‚๐š๐œ๐ก๐ž ๐‚๐จ๐ฆ๐ฉ๐ซ๐ž๐ฌ๐ฌ๐ข๐จ๐ง ๐Ÿ๐จ๐ซ ๐‹๐‹๐Œ๐ฌ Pre-print: arxiv.org/abs/2603.01426 As context lengths continue to grow, the KV cache has become the primary memory bottleneck during inference. While many compression techniques report impressive memory savings with minimal drops in benchmark accuracy, we asked a more structural question: ๐Ÿ‘‰ ๐˜ž๐˜ฉ๐˜ข๐˜ต ๐˜ข๐˜ค๐˜ต๐˜ถ๐˜ข๐˜ญ๐˜ญ๐˜บ ๐˜ฉ๐˜ข๐˜ฑ๐˜ฑ๐˜ฆ๐˜ฏ๐˜ด ๐˜ต๐˜ฐ ๐˜ข๐˜ต๐˜ต๐˜ฆ๐˜ฏ๐˜ต๐˜ช๐˜ฐ๐˜ฏ ๐˜ข๐˜ฏ๐˜ฅ ๐˜ณ๐˜ฆ๐˜ข๐˜ด๐˜ฐ๐˜ฏ๐˜ช๐˜ฏ๐˜จ ๐˜ธ๐˜ฉ๐˜ฆ๐˜ฏ ๐˜ธ๐˜ฆ ๐˜ค๐˜ฐ๐˜ฎ๐˜ฑ๐˜ณ๐˜ฆ๐˜ด๐˜ด ๐˜ต๐˜ฉ๐˜ฆ ๐˜’๐˜ ๐˜ค๐˜ข๐˜ค๐˜ฉ๐˜ฆ? We frame KV compression as a ๐œ๐จ๐ง๐ญ๐ซ๐จ๐ฅ๐ฅ๐ž๐ ๐ฉ๐ž๐ซ๐ญ๐ฎ๐ซ๐›๐š๐ญ๐ข๐จ๐ง ๐จ๐Ÿ ๐ญ๐จ๐ค๐ž๐ง-๐ฅ๐ž๐ฏ๐ž๐ฅ ๐ซ๐จ๐ฎ๐ญ๐ข๐ง๐  ๐ข๐ง ๐ฌ๐ž๐ฅ๐Ÿ-๐š๐ญ๐ญ๐ž๐ง๐ญ๐ข๐จ๐ง. Rather than evaluating only final task accuracy, we design synthetic datasets to probe: (1) Multi-entity tracking, (2) Coreference resolution, and (3) Multi-hop reasoning. This setup allows us to disentangle three critical dimensions: Information Retention, Accessibility, and Utilisation. Our findings reveal an interesting pattern: ๐Ÿ‘‰ ๐Œ๐จ๐๐ž๐ซ๐š๐ญ๐ž ๐œ๐จ๐ฆ๐ฉ๐ซ๐ž๐ฌ๐ฌ๐ข๐จ๐ง often preserves surface-level accuracy despite substantial internal representational degradation โ€” suggesting significant redundancy in current models. ๐Ÿ‘‰ ๐๐ž๐š๐ซ ๐ž๐ฑ๐ญ๐ซ๐ž๐ฆ๐ž ๐œ๐จ๐ฆ๐ฉ๐ซ๐ž๐ฌ๐ฌ๐ข๐จ๐ง, we observe a sharp "safety cliff" in hallucinations, driven by global erasure of answer-critical tokens. ๐Ÿ‘‰ We also uncover a second failure mode -- ๐ซ๐ž๐ฉ๐ซ๐ž๐ฌ๐ž๐ง๐ญ๐š๐ญ๐ข๐จ๐ง๐š๐ฅ ๐ซ๐ข๐ ๐ข๐๐ข๐ญ๐ฒ -- where tokens remain present, but routing flexibility collapses. These results suggest that evaluating compression solely through downstream accuracy can mask stronger structural effects on reasoning. Understanding these internal dynamics is crucial as we move toward longer-context and more memory-efficient LLMs. Brilliant work by Ayan Sengupta and Samhruth Ananthanarayanan. #ScienceofLLMs #Interpretability #KVCache #ModelCompression
1
5
58
3,700
Tanmoy Chakraborty retweeted
๐ŸŽ‰ New Paper Alert ๐ŸŽ‰ We're excited to share that our paper "Here for You: A Co-Designed Mental Health Screening App for Indian University Students " has been accepted for publication in #JMIR #FormativeResearch! ๐ŸŒŸ @Tanmoy_Chak preprints.jmir.org/preprint/โ€ฆ

1
1
3
450
Tanmoy Chakraborty retweeted
Happy to announce that our paper has been accepted to #ICLR2026! ๐ŸŽ‰ ๐Ÿ“œ Beyond Markovian Drifts: Action-Biased Geometric Walks with Memory for Personalized Summarization ๐Ÿ‘ฅ Parthiv Chatterjee, Asish Batha, Tashvi Patel, @sourish_rygbee, @Tanmoy_Chak Congratulations to all authors!
1
4
822
Amidst the #ICLR26 chaos, at least one ray of hope finally materialised. No. I am not coming to Rio ๐Ÿ™ƒ @iclr_conf @sourish_rygbee @lcs2lab @iitdelhi
1
1
47
3,325