Foundations of Cooperative AI Lab (FOCAL) at CMU (@FOCAL_lab)

22 Nov 2023

We are recruiting postdocs at the Foundations of Cooperative AI Lab (@FOCAL_lab) at @CarnegieMellon (cmu.edu)! Please retweet / share / send great applicants our way! For different positions please reach out. @SCSatCMU @CSDatCMU @mldcmu apply.interfolio.com/136899

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

13h

One of my open math problems apparently got resolved by ChatGPT 5.5 Pro (Ryan O'Donnell prompted it better than I did!), though the proof was so hard for me to read that it seemed easier to just prove it myself. More thoughts on implications for math here: aifails.substack.com/p/even-…

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

Jun 12

Replying to @FOCAL_lab @EmanuelTewolde

arxiv.org/abs/2604.15267

CoopEval: Benchmarking Cooperation-Sustaining Mechanisms and LLM...

It is increasingly important that LLM agents interact effectively and safely with other goal-pursuing agents, yet, recent works report the opposite trend: LLMs with stronger reasoning capabilities...

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

Jun 12

Replying to @FOCAL_lab

@FOCAL_lab member @EmanuelTewolde is presenting his CoopEval work at EPFL on Monday (14:30) and it will be on zoom! (paper link in comment) memento.epfl.ch/event/ai-cen…

Jiayuan Liu

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 15

(1/4) Can remembering more of the past make AI agents less cooperative? In our new paper, we study LLM agents in repeated social dilemmas. The key variable is not how many rounds they play, but how much prior interaction history they can access when making each decision.

Jiayuan Liu

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 15

(2/4) Surprisingly, longer recall often degrades cooperation. Across 7 LLMs and 4 repeated social dilemma games, agents with longer histories often shift away from forward-looking cooperation and toward retrospective grievance-tracking.

Jiayuan Liu

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 15

(3/4) The mechanism is not just “too much context.” It is what the agents remember: replacing histories with synthetic cooperative records restores cooperation, and ablating explicit CoT reasoning often reduces the collapse. We call this the memory curse.

Jiayuan Liu

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 15

(4/4) Paper link: arxiv.org/abs/2605.08060 Big thanks to my collaborators! @Jack_Litq, @zzzoooeee321, Xin Luo, Haoxuan Zeng, @EmanuelTewolde, Tai Sing Lee, @tonghanwang, @ckingsford, @conitzer

The Memory Curse: How Expanded Recall Erodes Cooperative Intent in...

Context window expansion is often treated as a straightforward capability upgrade for LLMs, but we find it systematically fails in multi-agent social dilemmas. Across 7 LLMs and 4 games over 500...

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 11

In this paper led by Jerick Shi @Jerick1380, we propose a taxonomy of LLM deception. arxiv.org/abs/2604.04788

From Hallucination to Scheming: A Unified Taxonomy and Benchmark...

Large language models (LLMs) produce systematically misleading outputs, from hallucinated citations to strategic deception of evaluators, yet these phenomena are studied by separate communities...

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 7

(3/3) Also, I don't understand how some people think AGI is just around the corner but the risks are easily manageable! Of course their positions may not be captured accurately here.

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 7

(2/3) I'm sure we all have thoughts on our descriptions -- I certainly worry about many other AI risks current and future in addition to scaled misinformation, and I actually think the world is too focused on LLMs-as-chatbots -- but still impressive.

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 7

(1/3) Wow, I (and many friends) made it onto this "Mapping AI" map! Does anyone know the methodology? Who is behind it? mapping-ai.org/map

Map—Mapping AI

Interactive stakeholder map of the U.S. AI policy landscape.

mapping-ai.org

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 2

(3/3) You can play with interleaving tokens yourself here! (... works better with better models...) cs.cmu.edu/~focal/alternatin…

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 2

(2/3) We've also been interested in interleaving tokens for philosophical reasons. This chapter based on a talk I gave at a Duke conference about tests of consciousness discusses how coherent LLM text doesn't necessarily come from any clear unit entity. cs.cmu.edu/~conitzer/LLMcons…

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

May 2

(1/3) In this new paper led by Jiayuan Liu, we show that interleaving tokens from multiple LLMs can be (somewhat) robust, even when a *majority* of the LLMs are corrupted (unlike if they all vote over tokens)! arxiv.org/abs/2604.17139

The Consensus Trap: Rescuing Multi-Agent LLMs from Adversarial...

Multi-agent large language model (LLM) architectures increasingly rely on response-level aggregation, such as Majority Voting (MAJ), to raise reasoning ceilings. However, in open environments,...

Jerick Shi

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

@Jerick1380

Apr 28

After about a year of work, I defended my MSCS thesis at CMU:
Title: The Structure of Deception: How LLM Agents Lie, Break Promises, and Exploit Trust in Multi-Agent Settings
Core claim: LLM deception in multi-agent settings isn't one phenomenon. It's a family of structurally distinct failure modes, each shaped by different features of the interaction. Some look like premeditated false commitments. Others look like strategic silence that message-level classifiers can't see at all. Aggregate lying rates hide this, and current monitoring approaches each fail against different parts of it.
I would like to deeply thank my advisors @conitzer and @ZhijingJin, @AdtRaghunathan for being part of the committee, and everyone in the @JinesisLab for all their time and effort shaping this work.
Recording: youtu.be/Z3Q9AkriPxg
@MPI_IS @ELLISforEurope @UofTCompSci @VectorInst @TorontoSRI @CIFAR_News @JinesisLab @EuroSafeAI @ELLISInst_Tue @CarnegieMellon @SCSatCMU #AIAgents #AISafety #MultiAgentAI

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

Apr 23

(4/4) Emin Berker is presenting (on Saturday at this time) "Designing Rules to Pick a Rule: Aggregation by Consistency" (an unusual approach to social choice where we let the choice of rule depend on the input (!)) arxiv.org/abs/2508.17177

Designing Rules to Pick a Rule: Aggregation by Consistency

Given a set of items and a set of evaluators who all individually rank them, how do we aggregate these evaluations into a single societal ranking? Work in social choice and statistics has produced...

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

Apr 23

(3/4) Ioannis Anagnostides (maybe with help from Emanuel Tewolde) is presenting (right now!) "Convergence of Regret Matching in Potential Games and Constrained Optimization" (studying the properties of regret matching beyond zero-sum games) arxiv.org/abs/2510.17067

Convergence of Regret Matching in Potential Games and Constrained...

Regret matching (RM) -- and its modern variants -- is a foundational online algorithm that has been at the heart of many AI breakthrough results in solving benchmark zero-sum games, such as poker....

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted

Apr 23

(2/4) Cyrus Cousins is presenting (right now!) "Towards Cognitively-Faithful Decision-Making Models to Improve AI Alignment" (arguing for the importance of ensuring that models learned from people's decisions align with their cognitive processes) arxiv.org/abs/2509.04445

Towards Cognitively-Faithful Decision-Making Models to Improve AI Alignment

Recent AI trends seek to align AI models to learned human-centric objectives, such as personal preferences, utility, or societal values. Using standard preference elicitation methods, researchers...

Vincent Conitzer

Foundations of Cooperative AI Lab (FOCAL) at CMU retweeted