Assistant prof @MIT_CSAIL, research @OpenAI. PhD in computer science @Stanford

Joined September 2007
1 Photos and videos
Mitchell Gordon retweeted
We're back! The MIT HCI group has grown, and we couldn't be more excited. A huge welcome to our newest faculty (@mitchellgordon, @huangcza, @ZanaBucinca & @jas_x_flowers) & students joining @arvindsatya1, @karger, Stefanie Mueller, Rob Miller & Daniel Jackson. Give us a follow!
2
13
66
16,009
Mitchell Gordon retweeted
excited to share that i'll be pursuing my phd in computer science at @MIT_CSAIL starting this fall 🥳🎓 i'm so grateful to be coadvised by the literal dream team: @jacobandreas, @bakkermichiel and @mitchellgordon 🙌
21
14
357
17,883
Mitchell Gordon retweeted
Sycophancy, disempowerment, homogenization of thought: lots to be grim about for what AI is doing to us, the collapse of our subjectivity into a machine "objectivity". But a lot of AI's value seems to come precisely from scaling this objectivity. How do we make sense of this?
1
3
23
1,238
Mitchell Gordon retweeted
“Should I fear death?” Ask an LLM and you get one answer or a big bag, but little visibility into the decisions and assumptions that produced them. We built the "conceptual multiverse": a system that makes those decisions transparent and intervenable. multiverse.csail.mit.edu
1
9
39
6,530
Mitchell Gordon retweeted
recently, i’ve been thinking about ways to design ai systems to be more compatible with slow thinking 🐌. you can check out the full blogpost here 🤗: jennyhuang19.github.io/slow-…
4
21
169
11,952
Mitchell Gordon retweeted
Last day to apply to the OpenAI safety fellowship! It’s a chance to work with some of my favorite people on some of the most important, interesting, and consequential questions in AI
Apr 6
Introducing the OpenAI Safety Fellowship, a new program supporting independent research on AI safety and alignment—and the next generation of talent. openai.com/index/introducing…
2
5
37
14,859
Mitchell Gordon retweeted
MIT postdoc opportunity! We're hiring a human-AI interaction postdoc (HCI ML/RL) to train agents that deepen how people think and collaborate - rewarded by how humans actually build skill together. With @arvindsatya1 @ZanaBucinca, me & more! Apply by May 1 tinyurl.com/4jsr8ee9
3
18
121
13,186
Mitchell Gordon retweeted
Apr 6
Introducing the OpenAI Safety Fellowship, a new program supporting independent research on AI safety and alignment—and the next generation of talent. openai.com/index/introducing…
384
293
2,666
948,198
Mitchell Gordon retweeted
🚨MIT Postdoc Opportunity! We're looking for someone with an HCI ML/RL background to work with us on agents that promote metacognition and sociality—trained with ethnographic rewards! w/@mitchellgordon,@zanabucinca & colleagues in sociology anthropology tinyurl.com/4jsr8ee9
1
29
125
14,610
Mitchell Gordon retweeted
I’m looking for someone who’s excited to be on the operational end of AI safety research problems. This role sits at the intersection of research and execution: working with academic researchers, 3p evaluators, and internal partners to help shape AI safety in practice.
20
17
250
61,934
Mitchell Gordon retweeted
We’re committing $7.5M to @AISecurityInst’s Alignment Project to fund independent research on mitigations for safety and security risks from misaligned AI. openai.com/index/advancing-i…
215
80
755
124,601
Congrats to the Simile team! Some of the best people I know, working on one of the most interesting problems.
8
1,009
Mitchell Gordon retweeted
New on the OpenAI alignment blog! We prototype a method for eliciting the values that drive preferences over model responses, and release CoVal, an experimental dataset we built with it. Details in thread 👇
1
8
50
17,999
Mitchell Gordon retweeted
New paper out with @Scale_AI! Introducing MoReBench - the first-ever benchmark to evaluate procedural moral reasoning in LLMs. MoReBench focuses on how LLMs reason, not just what they decide. We reveal surprising gaps in frontier models' moral reasoning that scaling laws & existing benchmarks miss entirely, and encourage more research around CoT monitoring and robust capability building. This collaboration spanned @UW @nyuniversity @harvard @stanford @mit @cais & more 🧠⚖️
5
22
126
16,867
Mitchell Gordon retweeted
✨Tutorial Materials Now Available! We’re truly grateful for the hundreds (maybe thousands!) of wonderful attendees who joined our #NeurIPS Human–AI Alignment Tutorial 💗 -- Thank you all for your enthusiasm, thoughtful questions, and all the inspiring follow-up conversations 🤗! As many of you requested during #NeurIPS, we would love to share with you the full tutorial video and all slides below provided by our amazing speakers @mitchellgordon @adamfungi @Yoshua_Bengio: 📺 Tutorial Recording: neurips.cc/virtual/2025/loc/… 📕All Slides: hai-alignment-course.github.… We’d also love to hear more of your questions and feedback — and hope these resources spark new ideas and collaborations in Human–AI Alignment research🔥!
19 Nov 2025
🚀 Thrilled to announce our upcoming #NeurIPS2025 Tutorial on Human–AI Alignment: Foundations, Methods, Practice, and Challenges! 🗓️ Dec 2, 09:30–12:00 PST 📍 Exhibit Hall F, San Diego Convention Center 🔗 NeurIPS program: neurips.cc/virtual/2025/loc/… 👉 Tutorial Website: hai-alignment-course.github.… With an incredible lineup of speakers — @mitchellgordon, @adamfungi, @Yoshua_Bengio — we’ll dive into: * Human-in-the-loop AI & Value Alignment * Collective Alignment * Sociotechnical Evaluation and Oversight * A Safety Argument for the Scientist AI 🌟 An exceptional interdisciplinary expert panel -- featuring insights from @dawnsongtweets, @eegilbert, @monojitchou, and @hannahrosekirk! 👫 Welcome to join us for an exciting and engaging session — let’s shape the future of Human–AI Alignment together! #NeurIPS2025 #HAIAlignment #ValueAlignment #CollectiveAlignment #AISafety #ResponsibleAI
4
13
79
11,902
Mitchell Gordon retweeted
19 Nov 2025
🚀 Thrilled to announce our upcoming #NeurIPS2025 Tutorial on Human–AI Alignment: Foundations, Methods, Practice, and Challenges! 🗓️ Dec 2, 09:30–12:00 PST 📍 Exhibit Hall F, San Diego Convention Center 🔗 NeurIPS program: neurips.cc/virtual/2025/loc/… 👉 Tutorial Website: hai-alignment-course.github.… With an incredible lineup of speakers — @mitchellgordon, @adamfungi, @Yoshua_Bengio — we’ll dive into: * Human-in-the-loop AI & Value Alignment * Collective Alignment * Sociotechnical Evaluation and Oversight * A Safety Argument for the Scientist AI 🌟 An exceptional interdisciplinary expert panel -- featuring insights from @dawnsongtweets, @eegilbert, @monojitchou, and @hannahrosekirk! 👫 Welcome to join us for an exciting and engaging session — let’s shape the future of Human–AI Alignment together! #NeurIPS2025 #HAIAlignment #ValueAlignment #CollectiveAlignment #AISafety #ResponsibleAI
Thrilled to share that our paper “Towards Bidirectional Human-AI Alignment” has been accepted to #NeurIPS2025 (Position Track)! 🎉 👫<>🤖We argue for an explicit reflection on what we mean by “alignment”, and to take into account the bidirectional, dynamic interactions between humans and AI to achieve truly responsible and safe AI systems. 🧠 if you’re generally interested in “alignment”, don’t miss our #NeurIPS2025 Tutorial on “Human-AI Alignment: Foundations, Methods, Practice, and Challenges” , with amazing @mitchellgordon & @adamfungi — more details coming soon! - 💎 NeurIPS 2025 Position Paper: arxiv.org/pdf/2406.09264 - 📚 NeurIPS 2025 Tutorial: neurips.cc/virtual/2025/tuto… 💗 Huge thanks to our incredible co-authors — this was our 3rd resubmission — your persistent support and encouragement made it happen! Big thanks to everyone in our ICLR & CHI 2025 BiAlign workshops — your enthusiasm keeps us believing we’re doing something right for our community.🙏 ☕️👯‍♀️I’m attending #COLM2025 at Montreal this week, happy to chat more if you’re around! Also, we (w/ multiple co-authors) will present our #BiAlign paper in-person @SanDiego -- catch us at #NeurIPS2025, we’d love to hear your thoughts and join discussions!
2
12
107
40,213
Mitchell Gordon retweeted
No single person or institution should define ideal AI behavior for everyone.  Today, we’re sharing early results from collective alignment, a research effort where we asked the public about how models should behave by default.  Blog here: openai.com/index/collective-…
72
89
540
181,953
Mitchell Gordon retweeted
2 May 2025
We’ve spent the last few days doing a deep dive on what went wrong with last week’s GPT-4o update in ChatGPT. Expanding on what we missed with sycophancy and the changes we’re going to make in the future: openai.com/index/expanding-o…
483
523
4,710
2,200,872