Joined April 2011
108 Photos and videos
Pinned Tweet
30 Jul 2024
Interested in Autonomous coding agents? Check out AppWorld, a labor of love from @harsh3vedi et al. Provides an execution environment & a challenging benchmark for interactive coding agents on complex everyday tasks over common apps. @stonybrooknlp #ACL2024 @aclmeeting #NLProc.
πŸ”₯ Autonomous AI Assistants (e.g., #googleio2024, #WWDC24) and coding agents (e.g., #Devin, #SWEAgent) have garnered a lot of attention recently. We can envision coding agents autonomously completing complex day-to-day tasks across apps using APIs on our behalf. But how can we develop & benchmark them in a rigorous & reproducible manner? πŸš€ Introducing AppWorld: 🌎a simulated world environment where agents can write code to interact with many apps via APIs on behalf of people πŸ“Ša benchmark of complex tasks defined on it, and πŸ§ͺa robust evaluation framework for assessing agent’s goal completion. πŸ“’ To appear as an #ACL2024 paper πŸŒŽπŸ’»πŸ§‘β€πŸ€β€πŸ§‘ β€œAppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents” #NLProc #ai #AIagents πŸ“œ arxiv.org/abs/2407.18901 (paper) 🌐 appworld.dev for code, blog, data (tasks, APIs, trajectories) explorer, interactive playground, leaderboard & more!
1
5
15
2,679
Niranjan retweeted
ChatGPT several times where's best to go for spring break? It recommends Barcelona almost every time. This isn't a fluke. RL training rewards one best answer, so the model learns to commit to one mode and repeat it. Meet Multi-Answer RL: a simple RL method that trains LMs to reason through and output a distribution of answers in a single generation. [1/N]
22
72
452
99,416
Niranjan retweeted
🚨New paper on AI & Copyright πŸ‘¨β€βš–οΈCourts have credited LLM companies' claims that safety alignment prevents reproduction of copyrighted expression. But what if fine-tuning on a simple writing task ruins it all? Worse : Fine-tuning on a single author's books (e.g., Murakami) unlocks verbatim recall of copyrighted books from 30 unrelated authors, sometimes as high as 90%. Joint work with @niloofar_mire (@LTIatCMU), Jane Ginsburg ( @ColumbiaLaw) and my amazing PhD student @irisiris_l (@sbucompsc ) (1/n)🧡
15
154
394
115,804
Niranjan retweeted
Slightly late but excited to share that I defended my PhD @stonybrooknlp in Dec ’25. Grateful for my time at there, thankful to people I met along the way & extremely happy about the work I did with everyone! Excited for my next adventure as a postdoc @osunlp @hhsun1 @ysu_nlp
7
8
57
10,380
Niranjan retweeted
McGill University (@mcgillu) has many open faculty and postdoctoral positions with generous funding packages, thanks to Impact grants, which are investing $2 billion to attract global talent to Canada πŸ‡¨πŸ‡¦πŸ‡¨πŸ‡¦πŸ‡¨πŸ‡¦. Associate/Full Professor: $8 million startup package Assistant Professor: $600K startup package Postdoc: $70K (starting salary) If you are interested and work in the space of AI/ML/NLP/LLMs, please reach out to me. #AI #NLProc #ML
45
296
1,345
195,277
Niranjan retweeted
Deeply happy and honored to be elected as an ACL Fellow -- and to be a part of the respected cohort of this past years' fellows (congrats everyone)! πŸ™ All the credit (and sincere gratitude) to all my amazing students, postdocs, collaborators, mentors, and family! πŸ€—πŸ’™
25
34
178
43,637
Niranjan retweeted
5 Dec 2025
Replying to @nikita_soni_
@nikita_soni_ and co are organizing a great shared task at SemEval 2026. Make sure to submit your systems!
Replying to @SemEvalWorkshop
@SemEvalWorkshop 2026 Task2 Live!! Go participate #NLProc @aclmeeting Build personalized, ecologically valid, emotionally intelligent systems! Rare opportunity to study lived emotions via self-reported & longitudinal experiences Evaluation begins Jan 10 tinyurl.com/semeval26task2
1
1
244
Niranjan retweeted
Replying to @SemEvalWorkshop
@SemEvalWorkshop 2026 Task2 Live!! Go participate #NLProc @aclmeeting Build personalized, ecologically valid, emotionally intelligent systems! Rare opportunity to study lived emotions via self-reported & longitudinal experiences Evaluation begins Jan 10 tinyurl.com/semeval26task2
1
6
8
896
Niranjan retweeted
25 Nov 2025
1/ Hiring PhD students at CMU SCS (LTI/MLD) for Fall 2026 (Deadline 12/10) πŸŽ“ I work on open, reliable LMs: augmented LMs & agents (RAG, tool use, deep research), safety (hallucinations, copyright), and AI for science, code & multilinguality & open to bold new ideas! FAQ in 🧡
19
120
643
147,992
Niranjan retweeted
Announcing 63 #AmazonResearchAwards recipients from 41 universities in 8 countries across 5 research areas. Awardees have access to 700 Amazon public datasets and can utilize AWS AI/ML services and tools through AWS Promotional Credits. amazon.science/research-area…
3
18
14,879
Niranjan retweeted
I am recruiting students at TTIC! Apply by Dec 9
24 Nov 2025
Pursue a fully-funded and personalized PhD in #computerscience with world-class faculty as mentors, a 4:1 student-to-faculty ratio, and in the vibrant city of Chicago. Learn more and apply by Dec. 9 (NO FEE): buff.ly/Qw8njWB
8
52
175
29,655
13 Nov 2025
If you are interested in SLM based multi-agent systems check this paper led by @bsbijoy2050. This @aaclmeeting paper shows how you can train efficient and effective SLM based small agents on the AppWorld benchmark. Work supported by a @SUNY-@IBM collaboration grant.
2
4
476
13 Nov 2025
It was a joy to have been part of your journey @harsh3vedi. Learnt so much from you and through your projects. Looking forward to seeing the next phase in your journey.
🚨 Late Life update πŸŽ“ I defended my thesis (AppWorld, IRCoT, MuSiQue, DiRe, TeaBReaC) & joined @allen_ai as a research scientist earlier this year πŸ™ Deeply grateful to my awesome advisor @b_niranjan mentors @tusharkhot @Ashish_S_AI, committee members @HAndySchwartz @OwenRambow @sameer_, many collaborators, @stonybrooknlp labmates, friends & family 🀝If you want to collaborate, DMs are open! I’m interested in (tool-use, coding, web) agents and environments 🌎 We've many exciting releases on the AppWorld front coming up. Stay tuned! Or DM if you can help! πŸ™‚
3
1,060
Niranjan retweeted
🚨 Late Life update πŸŽ“ I defended my thesis (AppWorld, IRCoT, MuSiQue, DiRe, TeaBReaC) & joined @allen_ai as a research scientist earlier this year πŸ™ Deeply grateful to my awesome advisor @b_niranjan mentors @tusharkhot @Ashish_S_AI, committee members @HAndySchwartz @OwenRambow @sameer_, many collaborators, @stonybrooknlp labmates, friends & family 🀝If you want to collaborate, DMs are open! I’m interested in (tool-use, coding, web) agents and environments 🌎 We've many exciting releases on the AppWorld front coming up. Stay tuned! Or DM if you can help! πŸ™‚
11
5
162
13,790
22 Sep 2025
Thanks @gregd_nlp for visiting @sbucompsc and the broader NLP and AI group here. And welcome to right coast :-).
Very excited to have @gregd_nlp chat about LLM Reasoning beyond scale at @sbucompsc #NLProc
3
13
2,419
Niranjan retweeted
Very excited to have @gregd_nlp chat about LLM Reasoning beyond scale at @sbucompsc #NLProc
2
38
4,557
Niranjan retweeted
Excited to share that QUDsim has been accepted to #COLM2025!! πŸŽ‰πŸŽ‰
Have that eerie feeling of dΓ©jΓ  vu when reading model-generated text πŸ‘€, but can’t pinpoint the specific words or phrases πŸ‘€? ✨We introduce QUDsim, to quantify discourse similarities beyond lexical, syntactic, and content overlap.
1
5
18
1,445
Niranjan retweeted
🚨 New paper alert! 🚨 πŸ“’ Announcing MuSciClaims, a multimodal claim verification benchmark for heterogeneous, information-rich scientific figures. Such verification is crucial for evaluating scientific hypotheses & ensuring trust and reproducibility in results. 1/6 #NLProc
1
5
20
2,245
Niranjan retweeted
Happy to share that our paper titled "Teaching an Old LLM Secure Coding: Localized Preference Optimization on Distilled Preferences" has been accepted to ACL 2025 Main Conference Track. πŸ₯³πŸŽ‰
1
3
10
417