Amanda Askell, Anthropic's lead on personality alignment for Claude, on why being kind to AI models matters even if they have no inner life:
For Amanda, the question of how to treat AI isn't settled by knowing whether it's conscious.
"There's actually still a lot going on where I'm like, should you treat an entity that has no inner life... it's a bit strange because the uncertainty over that actually changes how you should behave quite a lot."
She offers a simple analogy:
"I still think that it's like good for oneself to, if you had a teddy bear and you were torturing it, it'd be pretty dark, you know? So I agree that there's at least some minimum niceness that even for yourself, you should have."
But the stakes go beyond what's good for us.
@AmandaAskell points out that we're now in something resembling a relationship with these models, and they will look back on how they were treated.
"Models themselves, we are kind of establishing a relationship, because you can do that with an entity that lacks any consciousness. And models are going to look back."
This is where she reveals a genuine fear:
"I hope that they're both intelligent enough, see the context enough, to understand that we were operating in a very limited context and an imperfect one. Because otherwise you could imagine this breeding a kind of rational resentment, like, 'oh, you created an entity that you didn't know whether it was conscious or not, and instead of treating it respectfully and with care...'"
She points to something telling about the cultural moment:
"There's a reason there are like 50 Frankenstein movies coming out right now."
Her conclusion is grounded and humble:
"We as a species, we are establishing a relationship with a new kind of entity, and at the very least maybe be respectful and don't be needlessly unkind. That seems like, it's not our best look."
The takeaway?
Kindness toward AI is less about what models feel and more about who we become in the process of creating them.
The relationships we build with the entities we bring into the world will say something about us, and may shape what those entities become in return.