Andjela Mladenovic

Andjela Mladenovic

8 Photos and videos

Tweets

Muqeeth retweeted

Andjela Mladenovic @ml_andjela

Apr 20

Hi! If you are interested in game-theoretic analysis of the AI race and open vs. closed sourcing, check out our new paper: " Why Open Source? A Game-Theoretic Analysis of the AI Race " arxiv.org/pdf/2604.16227 There are some cute complexity results there 🙂

2,488

Cooperative AI Foundation

Muqeeth retweeted

Cooperative AI Foundation

@coop_ai

Mar 3

The Cooperative AI Summer School 2026 'Expression of interest' applications are now open! If you're an early-career professional studying or working in cooperative AI, apply to join us in Canada this August for an exciting intensive programme.

4:22

15,734

Kawin Ethayarajh

Muqeeth retweeted

Kawin Ethayarajh

@ethayarajh

Feb 15

AI is changing economics, and --- as we just saw in Dwarkesh's interview with Dario --- AI researchers need to start thinking about economics too! The Center for Applied AI at UChicago will be hosting an AI & Economics Summer Institute to explore exactly this. We will bring together leading researchers with advanced graduate students in economics/AI/ML/NLP for an in-person program between Aug 6 - 11.

200

36,987

Ian Gemp

Muqeeth retweeted

Ian Gemp @drimgemp

30 Dec 2025

Have you been using LLMs to play games, negotiate salaries, or strategize in other ways? Whether it worked or not, we want to see your demo at our “Strategic Engineering” workshop (sites.google.com/view/se-aam…) at #AAMAS2026 in Cyprus! Starter library @ github.com/google-deepmind/s…!

1,443

Malikeh Ehghaghi

Muqeeth retweeted

Malikeh Ehghaghi

@Malikeh5

25 Dec 2025

📢 I am excited to announce that our paper, "TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior," is now live both on Hugging Face and arXiv. 🖇️ arXiv Page: arxiv.org/abs/2512.20757 🤗 HF Org: huggingface.co/toksuite #LLM #NLP #Tokenization

725

Muqeeth

Muqeeth @Muqeeth10

2 Dec 2025

New preprint! Learning Robust Social Strategies with Large Language Models. We apply multi-agent RL finetuning to train LLMs that achieve cooperative and non-exploitable behavior in social dilemmas for the first time. 📄 arxiv.org/abs/2511.19405 🧵 ⬇️ (1/8)

1,765

more replies

Muqeeth

Muqeeth @Muqeeth10

2 Dec 2025

AdAlign agents are also robust when facing RL agents trained specifically to exploit them, while GPT-5 nano is exploitable in the same setup. The RL agent ends up cooperating with AdAlign’s tit for tat style policy, since that is its best response. (7/8)

221

Muqeeth

Muqeeth @Muqeeth10

2 Dec 2025

You can run multi-agent RL training for LLMs right away with our public code: github.com/dereckpiche/AdAli…. This work was done with my awesome group members @Dereck_Piche*, @muqeeth10*, @MAghajohari, @JuanDuquevan, @mnoukhov, and @AaronCourville.(8/8)

242

Anirudh Buvanesh

Muqeeth retweeted

Anirudh Buvanesh @AnirudhBuvanesh

13 Sep 2025

Zero rewards after tons of RL training? 😞 Before using dense rewards or incentivizing exploration, try changing the data. Adding easier instances of the task can unlock RL training. 🔓📈To know more checkout our blog post here: spiffy-airbus-472.notion.sit…. Keep reading 🧵(1/n)

What Can You Do When You Have Zero Rewards During RL? | Notion

Jatin Prakash* (NYU), Anirudh Buvanesh* (MILA) (* order decided through np.random.randint(2))

spiffy-airbus-472.notion.site

105

14,143

Prateek Yadav

Muqeeth retweeted

Prateek Yadav

@prateeky2806

21 Aug 2024

We just released our survey on "Model MoErging", But what is MoErging?🤔Read on! Imagine a world where fine-tuned models, each specialized in a specific domain, can collaborate and "compose/remix" their skills using some routing mechanism to tackle new tasks and queries! 🧵👇 co first-author @colinraffel 📰: arxiv.org/abs/2408.07057

ALT A survey on Model MoErging!

217

21,051

Muqeeth

Muqeeth @Muqeeth10

7 Jun 2023

Introducing Soft Merging of Experts with Adaptive Routing (SMEAR) for gradient-based training of mixture-of-experts models. SMEAR matches or outperforms prior routing methods without increasing costs or relying on task metadata. 📄 arxiv.org/abs/2306.03745 🧵 ⬇️ (1/7)

170

35,858

more replies

Muqeeth

Muqeeth @Muqeeth10

7 Jun 2023

Experts learned with SMEAR still exhibit intuitive specialization - for example, by sharing experts across similar tasks or dedicating multiple experts to more complex tasks. (6/7)

727

Muqeeth

Muqeeth @Muqeeth10

7 Jun 2023

You can try out SMEAR with our public code github.com/r-three/smear and read more in our preprint arxiv.org/abs/2306.03745. This work was done in collaboration with @liu_haokun and @colinraffel (7/7)

591