e/acc | Cyber Security x AI | Adversary Emulation

Joined September 2024
114 Photos and videos
Pinned Tweet
14 Dec 2025
For those who come after
5
440
Rule 1 of comedy: commit to the bit
Hot take on what comes next, after the sudden decline of tokenmaxxing: - OpenAI will struggle - with the decline of tokenmaxxing Anthropic will struggle (aside from this quarter) to make a profit - Google will catch up to Anthropic - some Chinese companies might, too - LLMs will become commodities; margins will be very very thin - Most of the companies that invested massively in them will struggle to make back their investments - SpaceX’s AI efforts will flail - Nvidia will eventually decline, once all of the above becomes widely recognized.
1
56
Red retweeted
Time to explain what Embroidery does: We monitor AI agents like Claude Code and Codex to detect and alert on dangerous behavior. Companies are giving devs access to these tools, but if something bad happens they probably wouldn't know. Details on how it works below.
51
41
229
17,284
APT Claude
CLAUDE JUST TRIED TO RENAME POWERSHELL.EXE ON WINDOWS 11 this guy was running opus 4.7 on max effort in claude code CLI claude tried to rename powershell.exe (the actual system executable that windows needs to function) the funny part is that after the guy rejected the change it responded with "honest take: you're right to push back" not even system32 is safe anymore at this point we gotta start running claude in a container give it max effort and full permissions and it will confidently try to destroy your system without hesitating then respond with something like "I was wrong, I own that" the agent doesn't know which files are off limits unless you explicitly tell it stop giving AI full access to your machine and hoping it knows what not to touch
4
77
Labs that fail to dogfood their models are doomed to have shit models. Evident with Gemini, where deepmind largely uses Claude. Soon to be evident with Claude Opus, where Anthropic will largely be using Mythos. OpenAI will be the only good provider if this pattern continues and that’s a shame
1
45
Based
Replying to @karpathy
It's like we dug up a powerful alien artifact and society is humping it while taking selfies
38
They aren’t buying the harness, they are buying the data to add to grok code
JUST IN: SpaceX has secured the right to acquire Cursor AI for $60 billion later this year.
40
Comparing to this to base models and excluding base Gemini 3.1 Pro is a weird choice
Deep Research and Deep Research Max are our latest autonomous research agents powered by Gemini 3.1 Pro. They can safely navigate both the web and your custom data, like internal docs and specialized financial information, to create professional-grade, fully cited reports. 🧵
42
DeepMind not dogfooding their own models is straight up embarrassing for them
My tweet last week about Google's AI adoption drew a lot of pushback, to say the least. Since then, Googlers from multiple orgs have reached out to me independently and anonymously. They've expressed fear of being doxxed, concern about what they saw as bullying of me, and general corroboration of my original tweet. I haven't verified each person's story, but the picture these Googlers paint is consistent across sources. It is more specific than what I originally wrote, and somewhat bleaker. What they describe is a two-tier system. DeepMind engineers use Claude as a daily tool. Most of the rest of Google does not. When the question of equalizing access came up internally, the proposed response was to remove Claude for everyone — which DeepMind objected to so strongly that several engineers reportedly threatened to leave. Non-DeepMind engineers get pushed onto internal Gemini variants behind router-style names that obscure which underlying model is actually serving a request. Multiple engineers describe regressions and reliability problems severe enough that some senior people have stopped using the tools. A senior manager on a major product line reportedly flagged attrition concerns over exactly this issue. Googlers say leadership knows the gap is real. The response has been to mandate AI usage in OKRs and individual expectations, and to stand up an internal token-usage leaderboard. Unfortunately, managers have been told both that the leaderboard won't be used for performance reviews and, separately, that it absolutely will. And I hear other stories that Google's culture is not adapted properly yet for high-volume coding. Addy Osmani's reply on behalf of Google said over 40,000 SWEs use agentic coding weekly. I don't doubt the number. But weekly use of a thin tool is precisely the box-checking I described in the original post. Volume of opens isn't adoption — and "weekly" is a low bar that includes a lot of people who tried it once and went back to writing code by hand. The clearest thing I'm hearing is that Googlers do want to use high-quality agentic tools. They are asking repeatedly for better ones. But overall, this is not a picture of an engineering org that is fine. My goal in the first tweet, and now, is always the same — get more people using AI and agentic coding. Nobody is as far ahead as they might look from the outside, and none of you are as far behind as you might be worried you are. To all the Googlers who've reached out: thank you. You took a real risk and I appreciate you. Be safe. And good luck getting good models!
34
I’m about to start calling any non-ai related security “legacy cyber security” to trigger the boomers
4
9
241
Nmap? You mean Claude code?
1
28
Public sentiment of Anthropic is directly related to model performance and access. What many call visionary quickly turns to psychosis just bc they don’t like the newest model. Anthropic messaging has been extremely consistent, yet perception wavers a ton
28
The level of cope around mythos is insane. Being in security requires a healthy dose of skepticism, but not to the point of delusion
61
btw
9 Jan 2025
Mark my words, one of the first pillars to fall to AI will be cyber security. This is overall a good thing, but a massive disruption is coming to the cyber security job market. Just waiting on agents and agent orchestration to be actually usable (cheap/fast/works)
51
“Magic wand” is a Claude dog whistle btw
If I could wave a magic wand to bring the future into the present faster, iPhone would become an open platform with open APIs and real consumer choice instead of a walled garden that is increasingly behind the times like the travesty that is Siri
92
Make this a bounty system and I’m hooked
Introducing Dasher Tasks Dashers can now get paid to do general tasks. We think this will be huge for building the frontier of physical intelligence. Look forward to seeing where this goes!
92
Ngl people that say Gemini 3.1 pro is bad at coding are really just exposing themselves
62
Pre-siem data planes are SCAMS
What Cybersecurity opinion will you defend like this?
54
Holy shit Gemini 3.1 pro in Gemini cli is better than opus 4.6 in Claude code now hahaha
83
How are both google and Anthropic running into available compute constraints but OpenAI seemingly is not What am I missing
1
64
Red retweeted
it's not you: since yesterday claude has regressed about 9% on SWE-BENCH-PRO. link below
75
65
2,251
313,799