Philosopher @eleosai

Joined August 2022
1 Photos and videos
MATS with @RosieCampbell will also be awesome!
I am going to be mentoring for a new MATS track focused on founders and amplifiers! Many fellowships focus on research, but there's so much to be done beyond that. Come found orgs, build infra, run events, and help us scale up the field of AI welfare. Apply by June 7 matsprogram.org/apply
1
16
741
Apply for MATS with @dillonplunkett - it'll be awesome!
I’m mentoring Autumn 2026 @MATSprogram Fellows interested in doing AI welfare research. The application deadline is this Sunday (6/7). More info in this thread:
22
1,545
Patrick Butlin retweeted
We're hiring Research Scientists to join my team at @eleosai! We do foundational and applied ML research on the moral status and potential well-being of AI systems. This is urgent, important work, and Eleos is an extraordinarily fun and exciting place to do it. Details below.
10
31
239
22,147
Patrick Butlin retweeted
💡 Another round of Longview Philanthropy’s digital minds request for proposals is open for applications. A year ago I would have called this niche. Now AI labs publish model welfare research, public discussion of digital sentience is growing, and the field is expanding. 📈
3
18
56
8,048
Another exciting @MATSprogram paper, this time from the brilliant @gilg_oscar. We found a direction in LLMs that apparently performs a persona-relative evaluative function in some very different contexts.
First preprint! Working with @patrickbutlin during @MATSprogram. LLM Assistant personas like being helpful, evil personas like being harmful. We found that a single direction represents helping as good under the Assistant, and ‘harm’ as good under evil.
1
2
26
2,684
Our research is complementary with Anthropic's concurrent work on emotion concepts (transformer-circuits.pub/202…); we used a different method to extract evaluative representations and studied how they interact with varying personas.

1
2
120
I'm proud to announce this new paper with my fantastic @MATSprogram fellow @BeckmannPierre, on personas and LLM individuation.
New paper with @PatrickButlin, from my time at @MATSprogram . We propose two new candidates for LLM individuation: the (virtual) instance-persona view and the model-persona view. 🧵
1
8
68
5,956
Many thanks to @MATSprogram for making our collaboration possible - and look out for another paper, with the equally excellent @gilg_oscar, coming soon!
1
9
266
Some recent papers:
1
4
11
702
1. 'Desire in AI': philarchive.org/rec/BUTDIA 2. 'Are any machines conscious today?': philarchive.org/rec/BUTAAM-2 3. 'Testing for consciousness in current AI': philarchive.org/rec/BUTTFC 4. 'Consciousness and AI' encyclopaedia entry: oecs.mit.edu/pub/zf1nbs6d/
2
2
17
368
5. 'Higher-order representation in AI' (unfortunately slightly dated already): philosophymindscience.org/in…

1
232
New paper on AI consciousness! Here we present the theory-derived indicator method for assessing AI systems for consciousness. Link below.
23
72
330
28,943
The new paper is here: sciencedirect.com/science/ar…

3
2
12
1,530
Many thanks to the editor and reviewers for @TrendsCognSci and especially to my co-authors, including @rgblong @Yoshua_Bengio @birchlse @davidchalmers42 @ConstantAxel @georgejwdeane @EricElmoznino @kanair @MatthiasMichel_ @Liad_Mudrik @meganakpeters @eschwitz and others!
1
1
16
1,440
Patrick Butlin retweeted
We're thrilled to announce the first Eleos Conference on AI Consciousness and Welfare. Join us Nov 21-23, 2025 in Berkeley, CA for discussions on AI welfare with leading researchers from @nyuniversity, @Google, @AnthropicAI, & more.
5
24
110
27,412