As an example, when Mini Shai-Hulud happened, Opus 4.7 safety triggered when just asked to audit our dependencies to see if we were vulnerable
I gave it the Socket report on the attack and asked it to verify Hermes Agent wasn't affected, and it triggered an empty response every way I phrased it
Bad actors will always have LLMs capable of finding exploits, let's give the defenders equal protection
Anthropic's terrible safety situation is making it so that I cannot have Opus review p0 issues in Hermes Agent to review and help fix security issues.
This does nothing but give hackers an asymmetric advantage over everyone - they will find jailbreaks, they will find ways around this to exploit systems - and the rest of us are locked out of using AI to protect from them.
What a joke