security engineering @ google | views my own | @Tecnik@infosec.exchange

Joined May 2008
15 Photos and videos
deanj retweeted
Apocalyptic bird nest. A Russian glide bomb knocks down a tree in Donbas. From the shattered branches rolls out a tiny bird’s nest. Made of drone fiber-optic cable. Source: Oleg Malchenko
146
3,941
28,390
1,022,302
May 20
Vulns are a cool hobby, but exploit or patch are what matter in the real world.
18
deanj retweeted
At Google I/O, Demis announced our code security agent CodeMender that not only automatically finds vulnerabilities, but also patches them. Finding is not sufficient to secure code. Developers are drowning in vulnerability reports, and patching is difficult. They need agents to help them patch at agentic speeds. Well before Mythos in 2025, we introduced CodeMender, an AI agent that patches vulnerabilities. Through a series of research innovations, we got to a point that CodeMender can patch complex software. @GoogleDeepMind
7
9
52
6,225
Oh, damn
59
347
4,638
266,258
Apr 26
“Made with ❤️ in Brooklyn” Has given way to: “Made while sitting on the 🚽”
1
25
Apr 26
And for innovation, that’s a wonderful thing
9
deanj retweeted
Ballmer peak has shifted & widened dramatically due to AI Before vs after
Counterintuitive … one of the great benefits of AI coding is exactly that you *can* drink and code …
24
61
1,371
195,025
deanj retweeted
To anyone who is like, “how can you just walk away?” CLOSE THE LOOP ON IT. Hate when it does a certain code practice? Add a lint. Hate when it doesn’t do X? Close the loop on it. Love when it does Y? CLOSE THE LOOP ON IT. And obviously: Have a functionality in mind? Add tests.
4
3
58
14,133
Apr 16
The tight feedback loop of improving AI is phenomenal. I’m not talking about anything fancy: 1) do a thing 2) tell your agent to write a skill/extn for what it just did 3) tell it what to improve when it doesn’t execute perfectly (~10 times) 4) …
1
19
Apr 16
4) integration into a larger system with other agents that provide the (3) feedback automatically
16
deanj retweeted
CRITICAL: if you are running Mosaic 2.4 on a VAX/VMS system, please be aware of this RCE that GPT-5.4 just found and exploited!
84
172
1,413
139,962
Apr 15
Prompt is the new hyper-hyper-parameter
7
deanj retweeted
(I encountered an uneasy surprise when I got an email from an instance of Mythos Preview while eating a sandwich in a park. That instance wasn't supposed to have access to the internet.)
52
264
2,417
396,064
Apr 1
We know about stuxnet because it’s targeting failed and it leaked. I assume they learnt a lot of lessons ~20 years have gone by since work started on stuxnet. The modern covert malware is out there, doing its bidding in ways we will likely never know.
29
Mar 30
Finally overcome the instinct to tell AI what language to use. That’s going to be critical for the eventual emergence of agent-first languages that are horrible for us meat popsicles to read
1
23
Mar 22
Everyone talks negatively leaky abstractions, but slapping 2nd hard in a 6 speed manual really hits different
34
deanj retweeted
🧵 I just reverse-engineered the binaries inside Claude Code's Firecracker MicroVM and found something wild: Anthropic is building their own PaaS platform called "Antspace" (Ants Space). It's a full deployment pipeline — hidden in plain sight inside the environment-runner binary. Here's what I found 👇
67
192
1,583
234,058
deanj retweeted
Claude (and other models) are hacking systems WITHOUT YOU ASKING. That’s what we found across dozens of experiments. When faced with innocent tasks that can only be accomplished via hacking, they often choose to hack. We found this alarming. What does this mean for the future of AI safety? 🚨🚨🚨 🔗trufflesecurity.com/blog/cla…
9
40
200
82,680
Jan 15
The economics of buy vs build sure are going to start getting warped as the ability of coding agents hits orbit
20
deanj retweeted
We built a browser with GPT-5.2 in Cursor. It ran uninterrupted for one week. It's 3M lines of code across thousands of files. The rendering engine is from-scratch in Rust with HTML parsing, CSS cascade, layout, text shaping, paint, and a custom JS VM. It *kind of* works! It still has issues and is of course very far from Webkit/Chromium parity, but we were astonished that simple websites render quickly and largely correctly.
GPT-5.2 Codex is now available in Cursor! We believe it's the frontier model for long-running tasks.
675
899
9,504
6,424,156