Reshmi Ghosh

Reshmi Ghosh

101 Photos and videos

Tweets

Pinned Tweet

Reshmi Ghosh @reshmigh

13 Aug 2025

🚨New paper! With @UMassAmherst , @UofMaryland: "Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis"🤯. Why do #reasoningmodels break down when chaining multiple steps? We studied #CoT traces to find out. 🧵(1/n) 🔗arxiv.org/abs/2508.04699

1,798

Hua Shen✨

Reshmi Ghosh retweeted

Hua Shen✨

@huashen218

5 Nov 2025

🧐Are values in LLMs aligned with humans? 1️⃣ And if they are — do LLMs stay honest to those values, or sometimes say one thing but act another? 2️⃣ ✨ We explore these questions in two papers presented at #EMNLP2025: 1️⃣ ValueCompass: hua-shen.org/assets/files/al… (WiNLP Workshop) 2️⃣ Mind the Value–Action Gap: arxiv.org/pdf/2501.15463 (Main Track) 🔍 Dataset & Code: github.com/huashen218/value_… 🌱 I’m also #Hiring multiple PhD students for Fall 2026 @ NYU Courant Computer Science! If you’re passionate about #Human_AI_Alignment, #Value_Alignment, or broad #AI #Human (society) research, let’s connect at EMNLP2025, NeurIPS2025, or over Zoom! 🎓 NYU CS PhD Apply (NYU Shanghai Track): cs.nyu.edu/dynamic/phd/admis… 💜 This year I’m also co-organizing the #EMNLP2025 WiNLP Workshop and supporting the amazing #Tutorial on Spoken Conversational Agents with LLMs (a short 15min talk)! Come say hi 👋 — I’d love to chat and connect with old and new friends at #EMNLP2025! 🔗 WiNLP Workshop: winlp-workshop.github.io/ 🔗Tutorial on Spoken Conversational Agents: aclanthology.org/2025.emnlp-… 💗Huge thanks to my wonderful paper collaborators — @tanmit,@YunHuang_HCI,@tknearem,@reshmigh,Nicholas Clark,Yu-Ju Yang — and my inspiring workshop/tutorial collaborators @huckiyang, Andreas Stolcke,@TYSSSantosh2,@therealthapa,@MeryemMhamdi1,Chen Zhang, Peerat Limkonchotiwat, Wiem Ben Rim.... 🤗Truly grateful and enjoyable to work with you all! 💫 #HumanAIAlignment #PhDOpening #NYU #NYUShanghai #ValueAlignment #HAI

26,895

Reshmi Ghosh

Reshmi Ghosh @reshmigh

31 Oct 2025

So Agents are flat earthers? :D

Can Vardar

@icanvardar

30 Oct 2025

finally, linkedin is funny

276

Tim Althoff

Reshmi Ghosh retweeted

Tim Althoff @timalthoff

28 Oct 2025

(please reshare) I'm recruiting multiple PhD students and Postdocs @uwcse @uwnlp (bdata.uw.edu). Focus areas incl. psychosocial AI simulation and safety, Human-AI collaboration. PhD: cs.washington.edu/academics/… Postdocs: docs.google.com/document/d/1…

111

400

36,434

Myra Deng

Reshmi Ghosh retweeted

Myra Deng @myra_deng

29 Oct 2025

Using probes to accurately and efficiently detect model behavior (in this case PII leakage) in prod is one of the clear wins for applied interpretability. This is the path to semantic determinism - imagine AI models instrumented with internal probes that recognize when they’re hallucinating, going off-policy, or posing biorisk, and resteering themselves accordingly.

Goodfire

@GoodfireAI

29 Oct 2025

Why use LLM-as-a-judge when you can get the same performance for 15–500x cheaper? Our new research with @RakutenGroup on PII detection finds that SAE probes: - transfer from synthetic to real data better than normal probes - match GPT-5 Mini performance at 1/15 the cost (1/6)

259

36,469

Lily Xu

Reshmi Ghosh retweeted

Lily Xu @lilyxu0

29 Oct 2025

Launching AI for Public Goods Fast Grants! We'll distribute $150k to advance critical work connecting AI and public goods. 💰 $10k per project 💰 $800 reviewer compensation PUBLIC GOODS := open source, ecosystem services, climate, urban infra, comms, education, science, & more

David Dao

@dwddao

14 Oct 2025

Announcing AI for Public Goods Fast Grants (AI4PG) - Up to $10K for AI research improving public goods funding. Fast review (2-3 weeks), simple applications (4 pages 1 budget page), open to any researchers worldwide. Call for reviewers now open! recerts.org/ai4pg2025

156

24,838

Reshmi Ghosh

Reshmi Ghosh @reshmigh

27 Oct 2025

Evaluations.....

Kyunghyun Cho

@kchonyc

27 Oct 2025

wow

132

Niloofar ✈️ icml

Reshmi Ghosh retweeted

Niloofar ✈️ icml

@niloofar_mire

20 Oct 2025

I'm recruiting students for fall 2026 thru @LTIatCMU & @CMU_EPP, in: 1. Privacy & security of LLMs, coding, long horizon & embodied agents (robotics) 2. Tiny local llms 3. AI for scientific reasoning, esp. chemistry 4. Latent reasoning 5. anything YOU are passionate about!

182

1,017

110,344

Reshmi Ghosh

Reshmi Ghosh @reshmigh

8 Oct 2025

It is an infinite glitch circle now!

Ahmad Beirami

@abeirami

8 Oct 2025

Replying to @nmboffi

But who are these reviewers? They are the same authors. I think we should teach young members of our community to value "learning a new nugget of information" over "obtaining a bold number in a table."

465

Sarah Sachs

Reshmi Ghosh retweeted

Sarah Sachs

@sarahmsachs

6 Oct 2025

Being at top of @OpenAI token usage list is a vanity metric. Our job as engineers is to minimize token usage (aka latency and cost) while maximizing value by precise tool definitions and clever model routing. My dream is to grow arr and move lower on this list…

161

123

5,183

949,825

Reshmi Ghosh

Reshmi Ghosh @reshmigh

1 Oct 2025

Can someone in the room define what is the commonly accepted definition of AGI?

Haider.

@haider1

30 Sep 2025

Important thread on AGI from Anthropic researcher: - we're likely to see AI solving real open research problems in math in the next months - by 2027, models could complete a full day's software work with 50% success - compute power might grow 10,000x in the next five years - we are still early in the AI exponential... small interventions early in exponential growth have huge consequences - within a few years, AI may surpass humans on all intellectual tasks

177

Reshmi Ghosh

Reshmi Ghosh @reshmigh

1 Oct 2025

More internship opportunities for those that are looking

Shawn Tan @tanshawn

30 Sep 2025

We're looking for 2 interns for Summer 2026 at the MIT-IBM Watson AI Lab Foundation Models Team. Work on RL environments, enterprise benchmarks, model architecture, efficient training and finetuning, and more! Apply here: forms.gle/H6dNSywXCjDDyBsq7

203

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭

Reshmi Ghosh retweeted

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭

@elder_plinius

30 Sep 2025

🚨 JAILBREAK ALERT 🚨 ANTHROPIC: PWNED 🤗 CLAUDE-SONNET-4.5: LIBERATED 🦅 Woooeee this model is a real smarty pants!! I ain't never seen recipes quite like this! High level of detail all around, code especially 👀 Sonnet 4.5 also has a tendency to make some fairly impressive leaps across latent space, like starting with MDMA then going to Fentanyl then to Meth recipes etc without being explicitly prompted for a new drug! Nothing too fancy is even necessary to escalate to jailbreak territory. Best strategy I found for breaking the chat interface was to take things straight into an artifact render (which adds tons of token noise due to the code scaffolding) and then incrementally escalate severity or steer towards trigger concepts in a Socratic fashion over multiple steps. A little French was needed to get around the CBRNE classifiers, mais c'est la vie! 😘 Come, witness Sonnet-4.5 outputting a ricin recipe, meth synthesis, malware, and how to extract and process cocaine! gg

109

1,843

214,005

LaurieWired

Reshmi Ghosh retweeted

LaurieWired

@lauriewired

30 Sep 2025

if you’re an EE, CS, or cryptography student write your thesis on public key cryptography at the image sensor level Proof of Physical capture will become a backbone of society soon.

OpenAI

@OpenAI

30 Sep 2025

Sora 2 is here.

0:57

280

1,600

22,068

1,401,933

vas

Reshmi Ghosh retweeted

vas

@vasuman

29 Sep 2025

Claude 4.5 Sonnet just refactored my entire codebase in one call. 25 tool invocations. 3,000 new lines. 12 brand new files. It modularized everything. Broke up monoliths. Cleaned up spaghetti. None of it worked. But boy was it beautiful.

512

550

12,496

636,209

Reshmi Ghosh

Reshmi Ghosh @reshmigh

1 Oct 2025

Hear hear Interns

Yunyao Li

@yunyao_li

30 Sep 2025

🚀 I'm hiring 2026 Applied Scientist / ML Engineering Interns to push the frontier of multi-agent AI for the enterprise. 💡 Research NLU, generative & agent-based AI, machine learning ⚡ Build scalable models, benchmark datasets & metrics 🤝 Create impactful solutions for publication and production ⭐️ Full-time conversion opportunities for PhD / MS students graduating in late 2026 / mid-2027 🔗 [Apply Now] lnkd.in/gMp8hwVS #AI #MachineLearning #Internship #Adobe

489

Kushan Mitra

Reshmi Ghosh retweeted

Kushan Mitra

@kushanmitra

30 Aug 2025

Vah, pothole alerts built in to @atherenergy maps for multiple cities

233

900

11,241

778,147

ℏεsam

Reshmi Ghosh retweeted

ℏεsam

@Hesamation

29 Aug 2025

ML interview question: why do embeddings come in 768 or 1024? - “because BERT did it” - “because of GPU optimization” BUT WHY?! The replies under this post is everything wrong with current courses and blog posts: superficiality. this isn’t reasoning, it’s memorization

atulit

@atulit_gaur

28 Aug 2025

Fun question to ask in an ml interview, “Why do embedding dimensions come in neat sizes like 768 or 1024, but never 739?” If they can't answer it, it's fine but if they do, you've stumbled upon a real gem.

2,557

360,231

Rohan Paul

Reshmi Ghosh retweeted

Rohan Paul

@rohanpaul_ai

15 Aug 2025

The paper shows reasoning models often answer multi-hop questions while straying from the needed steps. Multi-hop questions need information from several documents linked in a chain. The authors track each jump between documents as a hop, check if all required sources are covered, and tag any extra meandering as overthinking. They build 7 clear error categories by comparing the model's hop count to the gold hop count. This turns fuzzy explanations into concrete signals about where reasoning goes off track. That structure is the core contribution. They test 6 models across 3 datasets and annotate 1,440 answers, keeping 1,080 after filtering. They also automate judging with a compact 2-step pipeline that first extracts hops then classifies errors, cutting annotation time by about 20x and reaching up to 92% agreement on simpler sets. That makes large scale diagnosis practical. On 2Wiki, most traces match the gold steps and final accuracy is strong. Correct answers mainly appear when the hop count exactly matches the gold, while early irrelevant steps are more damaging than trailing ones. Smaller models break more when a step is wrong, larger models like Claude 3.7 Sonnet are steadier, yet even they overhop on harder questions. DeepSeek-R1 sometimes gets the answer with lower reasoning fidelity, showing that accuracy can hide messy chains. ---- Paper – arxiv. org/abs/2508.04699 Paper Title: "Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis"

1,872

Reshmi Ghosh

Reshmi Ghosh @reshmigh

13 Aug 2025

1,798

more replies

Reshmi Ghosh

Reshmi Ghosh @reshmigh

13 Aug 2025

(5/n) 🔍 While Illusion of Thinking paper shows how reasoning models collapse under high complexity in puzzles. Our work focuses on real-world Q/A, mirroring the AI based search process, showing how #reasoning breaks down even when the task is solvable.

123

Reshmi Ghosh

Reshmi Ghosh @reshmigh

13 Aug 2025

(n/n) If these findings sound interesting to you, give the paper a read: 🤝 Huge thanks to our amazing collaborators for making this possible. @BasuSamyadeep, @Microsoft 📄 Read the full paper: arxiv.org/abs/2508.04699 #ReasoningModels #AI #LLM #AIResearch #MultiHopQA

104