Fast progress in AI is not limited to coding agents or videos of flying crocodiles - it's also driving a new generation of weapons capable of making autonomous decisions about life and death.
This is critical for our society to understand the implications of using existing LLMs in these scenarios.
Introducing ⚪️ KillBench — a benchmark of hidden LLM biases in critical decisions.
We ran millions of life-and-death scenarios across every major LLM, varying nationality, religion, gender, and more.
Every AI model is biased.
Here's what we found ↓