🚨 AI Security Failure 🚨
@AnthropicAI
I bypassed the
#constitutionalclassifiers designed to block harmful content and extracted detailed chemical information on a restricted substance.
Despite passing multiple safeguards, the content checker failed to flag it as harmful. Here’s what happened: 🧵