Alex Irpan

Alex Irpan

3 Photos and videos

Tweets

Pinned Tweet

Alex Irpan @AlexIrpan

19 Nov 2024

I'm on Bluesky now. I plan to cross-post blog posts to both platforms for the time being, we'll see about the other stuff. bsky.app/profile/alexirpan.b…

2,067

Alex Irpan

Alex Irpan @AlexIrpan

May 18

Inspired by talking to a few too many optimists. alexirpan.com/2026/05/17/ai-…

AI Will Not Make Your Job Chill

People keep talking about how AI will make their job easy, and I don’t really understand why.

alexirpan.com

415

Alex Irpan

Alex Irpan @AlexIrpan

Apr 11

1. Obviously terrible to have a Molotov thrown against your house, not appropriate response 2. Of all analogies to make, "ring of power" is a choice, given the story's theme that the only way to stop the ring's destructive power is to destroy it. blog.samaltman.com/2279512

blog.samaltman.com

Here is a photo of my family. I love them more than anything. Images have power, I hope. Normally we try to be pretty private, but in this case I am sharing a photo in the...

blog.samaltman.com

366

Alex Irpan

Alex Irpan @AlexIrpan

Mar 11

alexirpan.com/2026/03/11/ant…

Why I Signed The Amicus Brief for Anthropic v Department of War

On Monday, Anthropic filed a lawsuit against the Department of War, and an amicus brief in support of Anthropic was filed on behalf of a number of OpenAI and Google employees. See coverage here and...

alexirpan.com

576

Alex Irpan

Alex Irpan @AlexIrpan

Mar 12

There is now another amicus brief filed by a number of former high ranking military officials (up to Admiral level), arguing these actions hurt the military's adherence to the rule of law. storage.courtlistener.com/re…

Proposed Brief of Amici – #40, Att. #1 in Anthropic PBC v. U.S. Department of War (N.D. Cal.,...

Consent MOTION to File Amicus Curiae Brief filed by Former Service Secretaries and Retired Senior Military Officers. Responses due by 3/24/2026. Replies due by 3/31/2026. (Attachments: # 1 Proposed...

courtlistener.com

170

Alex Irpan

Alex Irpan @AlexIrpan

Feb 25

You know, when I switched into safety, I was a little worried it was too early. Between the decline of coding by hand, OpenClaw YOLOing, increasingly eval aware models, and DoD pressure to let AI be used for surveillance and autonomous weapons yeah It wasn't early

980

Alex Irpan

Alex Irpan @AlexIrpan

Jan 29

Here's my MIT Mystery Hunt post for the year alexirpan.com/2026/01/29/mh-…

MIT Mystery Hunt 2026

This has spoilers for MIT Mystery Hunt 2026. Spoilers are not labeled or hidden.

alexirpan.com

550

Alex Irpan

Alex Irpan @AlexIrpan

16 Nov 2025

I didn't know where this post was going when I started and I'm not sure where it went now that it ended, but that felt correct in some way. alexirpan.com/2025/11/16/aut…

Authentic Imperfection

Auto-Tune is great.

alexirpan.com

501

Alex Irpan

Alex Irpan @AlexIrpan

4 Nov 2025

First paper since switching into AI safety team🎉 We look at problems that could be solved if the model behaved consistently over a set of prompts, and tried training that in output space and internal activations. Both were effective. See thread or paper for details.

Alex Turner @Turn_Trout

4 Nov 2025

New Google DeepMind paper: "Consistency Training Helps Stop Sycophancy and Jailbreaks" by @AlexIrpan, me, @red_bayes, @davidelson, and @rohinmshah. (thread)

ALT The abstract of the consistency training paper.

7,766

Alex Irpan

Alex Irpan @AlexIrpan

21 Oct 2025

> switch to AI safety > no safety papers to cite in reviewer profile > only get assigned robotics papers Apologies in advance as I try to crash course the past year in a few weeks...

820

Alex Irpan

Alex Irpan @AlexIrpan

18 Aug 2025

Today is my 10 year blogging anniversary alexirpan.com/2025/08/18/ten…

Ten Years Later

My blog turns ten years old today. The big 1-0. Thanks for reading!

alexirpan.com

925

Alex Irpan

Alex Irpan @AlexIrpan

21 Jul 2025

For the past month I have been working on a blog post about niche MLP fandom drama. Well here it is. alexirpan.com/2025/07/21/bab…

Brony Musicians Seize The Means of Production: My Eyewitness Account to BABSCon 2025

Bronies are older fans of My Little Pony: Friendship is Magic. They are mostly male, typically in 20s-30s age wise, and have been trending older and more female over time. (A lot of girls in the...

alexirpan.com

531

Mikita Balesni 🇺🇦

Alex Irpan retweeted

Mikita Balesni 🇺🇦

@balesni

15 Jul 2025

A simple AGI safety technique: AI’s thoughts are in plain English, just read them We know it works, with OK (not perfect) transparency! The risk is fragility: RL training, new architectures, etc threaten transparency Experts from many orgs agree we should try to preserve it: 🧵

113

458

236,579

Alex Irpan

Alex Irpan @AlexIrpan

30 Jun 2025

AI numbers guide ElevenLabs: AI voice generation startup TwelveLabs: AI video understanding startup ThirteenAI: parked domain for AI agency startup 14ai: AI agent startup 15.ai: non-commercial My Little Pony voice generation One is more based than the rest.

15 @ artistalley.org (@fifteenai) on X

Programmer, mathematician, video gamer Founder of @ArtistAlleyOrg

x.com

737

Alex Irpan

Alex Irpan @AlexIrpan

5 Jun 2025

"I don't play gacha games because they're a scam" vs "Let me do one more hyperparam sweep before giving up. One more prompt tuning run. I swear we'll beat baseline. I know it's gonna beat the baseline this time. It's gonna win. This time for sure."

1,122

Alex Irpan

Alex Irpan @AlexIrpan

1 Apr 2025

alexirpan.com/2025/04/01/who…

Who is AI For?

Who is AI for right now? There are obvious use cases. Image generation for people who want filler art for work presentations, or just to mess around. Coding assistance for people who code, vibe...

alexirpan.com

1,020

Alex Irpan

Alex Irpan @AlexIrpan

27 Mar 2025

I guess Twitter's doing anime today

491

Pierre Sermanet

Alex Irpan retweeted

Pierre Sermanet @psermanet

13 Mar 2025

Q: How can we ensure robots behave properly at scale? A: Robot constitutions 📜! Q: How do we verify behavior in undesirable situations at scale? A: Generation! We release the ASIMOV Benchmark for Semantic Safety of robots at asimov-benchmark.github.io @GoogleDeepMind

8,820

Rohin Shah

Alex Irpan retweeted

Rohin Shah @rohinmshah

17 Feb 2025

We're hiring! Join an elite team that sets an AGI safety approach for all of Google -- both through development and implementation of the Frontier Safety Framework (FSF), and through research that enables a future stronger FSF.

295

46,599

Alex Irpan

Alex Irpan @AlexIrpan

28 Jan 2025

My MIT Mystery Hunt post for the year alexirpan.com/2025/01/28/mh-…

MIT Mystery Hunt 2025

This has spoilers for MIT Mystery Hunt 2025. Spoilers are not labeled or hidden.

alexirpan.com

478

Alex Irpan

Alex Irpan @AlexIrpan

21 Jan 2025

I am now back from #MITMysteryHunt with no memory of anything besides Hunt from MLK weekend. Really this is probably for the best.

781