Musings on Life, The Universe and Everything. Principal Consultant and CISO, CyberSec. Runner, science nerd. Reading Phil. of Science. See: reason42.com

Joined March 2009
1,361 Photos and videos
Jun 13
Fable 5. What did I miss?
1
28
Jun 13
Well well well - the hype is just out of this world. x.com/AnthropicAI/status/206…

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
14
Jun 11
6
Jun 10
I like this idea... ...might put such content in white text on my LinkedIn Profile @infosec_fox 🤣
NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me socket.dev/blog/mini-shai-hu…
1
1
3
673
Adam R retweeted
NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me socket.dev/blog/mini-shai-hu…
226
2,158
12,652
1,551,037
Jun 10
..into the rabbit hole I go....
7
My temporary office for the day.
2
137
…at networking events even the waiters ignore me… 🥹
60
Ok, second chance…. 🤞
1
60
Well fu…. Well done to Bytes…. honest… pleased for them.
1
46
This might be my moment….
1
1
32
Doh. Not this time. Well done to SNS Security.
24
Just created a new alt account on X and it's amazing how far right the For You feed is.
1
20
Tried a 10km random route on Garmin. Got lost about 6 times! Did 11.3km. There’s a lot of people about.
1
53
And it’s stopped raining. 🙌
31
Here in London for InfoSec. Raining, tube strike, and i didn’t bring a rain coat.
1
3
614
May 29
Runs much cooler now the temps have dropped. (23C)
33
May 29
welp!
okay, now tell them that their vulnerability doesn't qualify for a bounty, but then patch it in the next release.
57