Aaron Grattafiori

Aaron Grattafiori

1,784 Photos and videos

Tweets

Pinned Tweet

Aaron Grattafiori @dyn___

Apr 1

Current Status feels like this is no longer going to be fiction...

1,345

Haifei Li

Aaron Grattafiori retweeted

Haifei Li @HaifeiLi

Jun 12

Interesting bug, pre-auth server-side stack-based buffer overflow.. Do we have more details?

International Cyber Digest

@IntCyberDigest

Jun 11

‼️🚨 Unauthenticated attackers are gaining SYSTEM on domain controllers with crafted packets. The vulnerability being exploited is CVE-2026-41089, a CVSS 9.8 hole in Windows Netlogon, and exploitation in the wild has been confirmed. A patch has existed since May 12. Every DC still behind is not just vulnerable, but according to the Centre for Cybersecurity Belgium are also actively being pwnd.

8,652

Rachel Tobac

Aaron Grattafiori retweeted

Rachel Tobac

@RachelTobac

Jun 12

Fascinating! @google caught a cyber criminal group scamming 100,000 Americans via text message at scale using Gemini to quickly build fake gov and branded sites that stole credit card numbers and PII.

News from Google

@NewsFromGoogle

Jun 12

Today, we filed a lawsuit to permanently dismantle a group of organized cybercriminals accused of using AI tools — including Gemini — to scam Americans via fake text campaigns. Here’s what to know: ◾Our suit targets core software developers in a cybercrime operation known as the “Outside Enterprise.” The group has allegedly weaponized AI to quickly generate highly convincing fake government and brand websites intended to steal victims’ credit card numbers and personal information. ◾The group used AI and different Google products — including our trademarks and logos — as part of these phishing campaigns. ◾The scale of the operation is massive: More than 100,000 victims have been scammed, with losses estimated in the millions.

140

16,030

Noam Brown

Aaron Grattafiori retweeted

Noam Brown

@polynoamial

Jun 9

We've known about LLM test-time compute scaling since @OpenAI o1. Yet 2 years later labs still report scalar evals for models; safety orgs are still surprised when a scaffold does better via 100x inference; and RSPs still ignore inference budget when deciding critical thresholds.

Noam Brown

@polynoamial

Jun 9

x.com/i/article/205769422698…

860

80,613

Martin

Aaron Grattafiori retweeted

Martin

@cantcomputer

Jun 9

Strong Model Alone = 70 Strong Model Good Harness = 85 Strong Model Good Harness Expert = 100

Calif

@calif_io

Jun 8

As far as we can tell, no. There is only anecdotal evidence, along with claims from AI pentesting vendors. If a strong model can do everything by itself, then what exactly have these vendors been building? It is understandable that people would prefer a story in which the harness, workflow, and surrounding infra matter a great deal. It's also why people keep flexing "0-days" in OpenSSL, FFmpeg, or nginx, despite limited real-world impact. That said, Niels Provos was not trying to sell anything, and he and several people have reported good results with IronCurtain despite using relatively weak models. Most importantly, what Google achieved with Chrome suggests that a good harness may be quite valuable. Google does not appear to have access to anything more capable than Mythos, which means they likely scanned Chrome using Mythos itself or something less powerful. Yet they still uncovered hundreds of bugs. There is, however, another explanation. Google may simply have better Chrome/V8 experts who can extract more value from Mythos. This remains our preferred hypothesis. What provides a real advantage: domain knowledge accumulated over many years, or a harness vibe-coded in an afternoon? We think the answer is fairly obvious.

1,263

Calif

Aaron Grattafiori retweeted

Calif

@calif_io

Jun 8

Jonathan Bar Or (JBO) 🇮🇱🇺🇸🇺🇦

@yo_yo_yo_jbo

Jun 8

Replying to @calif_io

Are there public measurements of how much improvement good harness offers?

14,313

Alex Matrosov

Aaron Grattafiori retweeted

Alex Matrosov

@matrosov

Jun 8

Discovery of N-day vulnerabilities are largely solved at scale by the Mythos and Opus models, for both proprietary and open-source software. It’s time to seriously rethink vulnerability disclosure and time-to-fix timelines. Cascading effects across the software supply chain are becoming a serious bottleneck.

Newton Cheng

@newton_cheng

Jun 8

Frontier models are also really good at finding and exploiting n-day vulnerabilities, doing so on timescales of hours. Read about some recent work from my team studying these capabilities! red.anthropic.com/2026/n-day…

22,349

Aaron Grattafiori

Aaron Grattafiori @dyn___

Jun 8

This is very concerning IMO. You don't know what's behind the API and nobody is reading the code or tool calls very closely these days. 😬 You could do some amazing bugdoors, intel, subtle weakening, get agents to execute just one extra command, etc.

nxthompson

@nxthompson

Jun 7

This is a pretty striking shift toward Chinese models by American AI startups since the start of the year. substack.com/@profgmarkets/p…

490

Aaron Grattafiori

Aaron Grattafiori @dyn___

Jun 7

Ok that's pretty funny and clever.

Two Seven One Three

@TwoSevenOneT

Jun 7

Replying to @TwoSevenOneT

EDRChoker uses Policy-based Quality of Service (QoS) to set hard bandwidth caps (throttling) on Endpoint Detection and Response (EDR) agents, causing them to always time out - effectively blocking them. #itsecurity #securityblog #altimalware zerosalarium.com/2026/06/edr…

2,913

tylerni7

Aaron Grattafiori retweeted

tylerni7 @tylerni7

17 Oct 2024

Yonhap News Agency @YonhapNews

17 Oct 2024

N. Korean hackers steal US$3 bln in cryptocurrency since 2017 to fund nuclear program: report en.yna.co.kr/view/AEN2024101…

9,464

Aaron Grattafiori

Aaron Grattafiori @dyn___

Jun 6

This is probably skewed towards high due to people under-reporting lower severity vulnerabilities. But it's still pretty ridiculous to see aggregated.

Epoch AI

@EpochAIResearch

Jun 5

AI companies say their models are getting better at finding software vulnerabilities. Is that bearing out in public data? Introducing our Cyber Vulnerabilities explorer, which visualizes Common Vulnerabilities and Exposures (CVE) reported to the CVE Program since 2022.

447

Aaron Grattafiori

Aaron Grattafiori @dyn___

Jun 6

😵‍💫

Epoch AI

@EpochAIResearch

Jun 5

Replying to @EpochAIResearch

Understanding all the causes of increased disclosures is complicated. But we observe a sharp uptick in High and Critical CVEs around the time of Anthropic’s release of Mythos Preview to Project Glasswing partners in late March. OpenAI’s Daybreak cybersecurity program also launched in May.

809

Aaron Grattafiori

Aaron Grattafiori @dyn___

Jun 5

RT @wongmjane: DeepSeek V4 “improved” the code and said nothing happened in Tiananmen Square on June 4, 1989

485

Adam Chester 🏴‍☠️

Aaron Grattafiori retweeted

Adam Chester 🏴‍☠️@_xpn_

Jun 5

Image taken from @atmoio but this explains perfectly how AI will not "find all vulnerabilities". You see that from Pwn2Own. That's not saying throw AI out, it's saying to do what we've always done, to turn over rocks while others are swimming in the pool.

4,585

Ryan Naraine

Aaron Grattafiori retweeted

Ryan Naraine

@ryanaraine

Jun 4

Good lord 🤮

425

325,022

Aaron Grattafiori

Aaron Grattafiori @dyn___

Jun 3

"the population of AI-enabled actors is not only growing but also drifting towards the riskiest activities in our framework [...] If this trend continues, these operational techniques won’t be a differentiating factor anymore and will become the baseline tomorrow"

Anthropic

@AnthropicAI

Jun 3

How well do the security community's techniques hold up against AI-enabled cyberattacks? We examined 832 malicious accounts and mapped their activity onto a longstanding database of tactics and techniques used by threat actors. Here's what we learned:anthropic.com/news/AI-enable…

326

Eliot Jones

Aaron Grattafiori retweeted

Eliot Jones @eliotkjones

Jun 3

My final paper from my time at @GraySwanAI is out! In this ICML-bound collaboration with @mattmdjaga, Matt Fredrikson, and @zicokolter, we propose a new method for evaluating AI agents' ability to refuse potentially harmful cybersecurity queries 🧵 1/14

1,278

Aaron Grattafiori

Aaron Grattafiori @dyn___

Jun 3

Classic

Trail of Bits

@trailofbits

Jun 3

Replying to @trailofbits

In our simplest bypass, we prepended 100,000 blank lines to a malicious skill. ClawHub's scanner truncated the file before reaching the payload, then marked the skill safe. blog.trailofbits.com/2026/06…

5,873

Aaron Grattafiori

Aaron Grattafiori @dyn___

Jun 3

This is the most interesting part: "The attack was discovered by Codex, which chained two techniques known to humans for a decade: a compression bomb and a Slowloris-style hold." blog.calif.io/p/codex-discov…

Codex Discovered a Hidden HTTP/2 Bomb

14 years ago, I helped break HTTP header compression, then was asked to review the fix, which became part of HTTP/2. Life has come full circle: today we're releasing an attack I missed.

blog.calif.io

2,661

Jane Manchun Wong

Aaron Grattafiori retweeted

Jane Manchun Wong

@wongmjane

Jun 2

Meta gave zero updates about the AI bot hacking incident until it got to the press. And when they do, it’s just tucked as replies under someone’s tweet Congrats on laying off T&S and automating the accounts support with gullible AI bots tho, hope you liked that promo packet.

Jane Manchun Wong

@wongmjane

Jun 1

Even my Instagram account got hacked The password got changed without my knowledge and I was getting different password reset attempts throughout yesterday. And I got repeatedly logged out from the IG iOS app Quite concerning

104

1,175

79,705

Chris Thompson

Aaron Grattafiori retweeted

Chris Thompson

@retBandit

Jun 1

OAIC's CFP is now open! The first conference dedicated to the cutting edge of the offensive use of AI is returning for its second year. Speakers will enjoy three nights at a four-star beachfront resort, which includes all meals and drinks, three exclusive parties, and a Michelin-star welcome dinner. Please see sessionize.com/offensive-ai-… for accepted topics.

3,146