Alex Matrosov

Alex Matrosov

250 Photos and videos

Tweets

Pinned Tweet

Alex Matrosov

@matrosov

5 May 2023

⛓️Confirmed, Intel OEM private key leaked, causing an impact on the entire ecosystem. It appears that Intel BootGuard may not be effective on certain devices based on the 11th Tiger Lake, 12th Adler Lake, and 13th Raptor Lake. Our investigation is ongoing, stay tuned for updates.

BINARLY🔬

@binarly_io

5 May 2023

⛓️Digging deeper into the aftermath of the @msiUSA data breach and its impact on the industry. 🔥Leaked Intel BootGuard keys from MSI are affecting many different device vendors, including @Intel , @Lenovo, @Supermicro_SMCI, and many others industry-wide. 🔬#FwHunt is on!

727

1,815

1,130,411

Kenan Sulayman

Alex Matrosov retweeted

Kenan Sulayman

@in19h

Jun 12

The rax emulator now lives under the Hex-Rays brand github.com/HexRaysSA/rax

GitHub - HexRaysSA/rax: rax is a CPU emulator that does not trust itself.

rax is a CPU emulator that does not trust itself. Contribute to HexRaysSA/rax development by creating an account on GitHub.

github.com

196

13,219

Alex Matrosov

Alex Matrosov retweeted

Alex Matrosov

@matrosov

Jun 8

Discovery of N-day vulnerabilities are largely solved at scale by the Mythos and Opus models, for both proprietary and open-source software. It’s time to seriously rethink vulnerability disclosure and time-to-fix timelines. Cascading effects across the software supply chain are becoming a serious bottleneck.

Newton Cheng

@newton_cheng

Jun 8

Frontier models are also really good at finding and exploiting n-day vulnerabilities, doing so on timescales of hours. Read about some recent work from my team studying these capabilities! red.anthropic.com/2026/n-day…

22,345

Logan Graham

Alex Matrosov retweeted

Logan Graham

@logangraham

Jun 9

Fable 5 is the same underlying model as Mythos 5, but with cybersecurity and biology blocks. Mythos is the first model that's made me feel that we've entered the next phase of model progress. For years, we've talked about cybersecurity / self-improvement / autonomy / model-dominated coding / biology implications of model progress. Some of these are issues to defend against; some are areas to advance. Mythos has made me & our team feel like we've seen the earliest glimpse of the world we've been talking about. Also, we published a lot of cyber eval results in the system card, including some evals we designed recently, as well as details of safeguards. In most cases, Mythos 5 ~= Mythos Preview. We found it ticked up on the new ExploitBench eval, and we opted to put that in the eval table so people can calibrate/update on advances in cyber capabilities to be prepared for. (We don't want to compete on offensive capabilities and don't try to.) But overall, Mythos 5 is an efficient model, about equal to Mythos Preview in most cases. I'd really like more people to design new security evals! The better models get, the more our limited evals only see a small part of the picture. In terms of where we go from here, here are some current thoughts: 1/ It's important we get Mythos cyber capabilities to defenders. We just have to do it safely and cautiously. We're working on an expanded trusted access program. We're working with government and industry to do this. I sort of envision the next 1-2 years being a large scale effort to make the world resilient design & implement new approaches to security. 2/ I think cybersecurity will start merging with AI security and alignment. Let's say you're a defender and you want to use a model -- will it break out of its sandbox? Will it stop where you tell it to stop? This is one reason I'm excited about working on cybersecurity. In the limit, it's the same thing as AI security. 3/ I really want people to develop new evals for... defensive cybersecurity, hardware security, autonomously running a business, advanced biology, and other parts of national security. Our internal eval ship rate is way, way up because Mythos makes it easy to iterate, especially on the engineering aspect of building evals. (Sometimes, we ask new hires to make a new eval on their first day, and another on the next). I’m excited we’re making this available as Fable 5, because I think the world spending time with the model is the most important way to calibrate.

Claude

@claudeai

Jun 9

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

0:20

180

26,573

Alex Matrosov

Alex Matrosov

@matrosov

Jun 9

Performing better than Opus models in RE/VR projects 🎉

Claude

@claudeai

Jun 9

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

0:20

2,390

Logan Graham

Alex Matrosov retweeted

Logan Graham

@logangraham

Jun 9

New post on Red today: Our team @AnthropicAI found that Mythos Preview is meaningfully better at developing N-days. It took us a couple thousand $ and a few hours to convert patches into exploits. We publish research like this because we think it's important the world knows what models are/will be capable of. In a year, Mythos will probably look trivial. We want to help the world to start preparing. I'm excited to share a lot more blue team / defensive work. I feel like people are aware of the issue now, and the team's task is now to "solve it all" -- we have some exciting / interesting / creative defensive research lined up.

662

81,511

Dino A. Dai Zovi

Alex Matrosov retweeted

Dino A. Dai Zovi

@dinodaizovi

Jun 9

If models are just going to get better, even than Mythos, the time for those models to turn a patched vulnerability into an exploit will keep shrinking from hours to minutes. I have a hard time imagining the entire world continuously deploying updates in minutes every time that an update for any software they use is released without other adverse effects. The right strategy has to be achieving sufficient security without relying on patching (still do it, but don't depend upon winning the race).

Alex Matrosov

@matrosov

Jun 8

9,581

Ivan Krstić

Alex Matrosov retweeted

Ivan Krstić

@radian

Jun 8

🔺NEW: Apple is expanding Private Cloud Compute (PCC) beyond our data centers. PCC on Google Cloud: NVIDIA Confidential Computing, Intel TDX, and Google's Titan chip, with capabilities that go far beyond a traditional confidential computing deployment. security.apple.com/blog/expa…

Expanding Private Cloud Compute - Apple Security Research

Alongside the next generation of Apple Intelligence, today we’re expanding Private Cloud Compute (PCC) beyond Apple’s data centers. When Apple introduced Private Cloud Compute in 2024, we defined a...

security.apple.com

509

53,952

Alex Matrosov

Alex Matrosov retweeted

Alex Matrosov

@matrosov

May 29

Claude Opus 4.8 is quite good at RE/VR tasks and can provide additional explainable context on the targets. This in itself is a significant time-saver for any REsearch work.

157

16,199

Alex Matrosov

Alex Matrosov

@matrosov

Jun 4

MCP is slow for RE-heavy projects and, in some cases, is unstable. ghidra-rpc is way faster than MCP and scales more efficiently in a multi-agent setup, since it outputs structured JSON.

guyru @guyru_

Jun 4

We're mostly an IDA shop at @CellebriteLabs, but I decided to play around with Ghidra. My main motivation was to experiment with agentic reverse engineering techniques. The result is an agent skill for Ghidra, which we are releasing publicly: github.com/cellebrite-labs/g… >>

211

24,903

guyru

Alex Matrosov retweeted

guyru @guyru_

Jun 4

GitHub - cellebrite-labs/ghidra-rpc: A Ghidra agentic reverse engineering skill.

A Ghidra agentic reverse engineering skill. Contribute to cellebrite-labs/ghidra-rpc development by creating an account on GitHub.

github.com

103

421

59,077

Alex Matrosov

Alex Matrosov retweeted

Alex Matrosov

@matrosov

May 27

Follow every Mythos discovery through our coordinated vulnerability disclosure dashboard. red.anthropic.com/2026/cvd/

140

643

60,908

Logan Graham

Alex Matrosov retweeted

Logan Graham

@logangraham

Jun 2

We're expanding Glasswing today. To solve such a big/complex/urgent problem, we need Mythos-level capabilities in as many defenders' hands as possible. That's why we're working on safeguards to scale that safely ASAP. 11 of my reflections from the past 2 months of Glasswing 🧵:

Anthropic

@AnthropicAI

Jun 2

We’re expanding Project Glasswing. We’ve extended access to Claude Mythos Preview to approximately 150 additional organizations, based in more than fifteen countries. Read more about this expansion and our future plans for Project Glasswing: anthropic.com/news/expanding…

514

147,284

Ivan Kwiatkowski

Alex Matrosov retweeted

Ivan Kwiatkowski @JusticeRage

Jun 2

In the past months, I have advocated where I could to stop investing in obfuscation as a protection mechanism. It was good at frustrating humans but machines just don't care that much. Packing is more effective so far but I don't expect this will last.

Alex Matrosov

@matrosov

May 31

Previous generations of software protection (DRM perspective) have always relied on code complexity (for RE), compute limitations, and human limitations as the guarantees that kept hacking timelines reasonably long. That's changed now. Beyond the acceleration in vulnerability research and malware analysis, the same new reality applies to software protection, and security by obscurity, or assuming the attacker is limited in compute and motivation, no longer works.

8,116

Xeno Kovah

Alex Matrosov retweeted

Xeno Kovah @XenoKovah

Jun 1

I was going to make this same point: “Is there even such thing as Security Through Obscurity” anymore? “I” (Claude) made short work of some DexProtect-ed Android app the other day to extract info I needed

Alex Matrosov

@matrosov

May 31

5,008

Dino A. Dai Zovi

Alex Matrosov retweeted

Dino A. Dai Zovi

@dinodaizovi

Jun 1

This is a critical point for defenders to get: "Beyond the acceleration in vulnerability research and malware analysis, the same new reality applies to software protection, and security by obscurity, or assuming the attacker is limited in compute and motivation, no longer works."

Alex Matrosov

@matrosov

May 31

7,069

Alex Matrosov

Alex Matrosov

@matrosov

May 31

258

39,305

offensivecon

Alex Matrosov retweeted

offensivecon

@offensive_con

May 29

Offensivecon's talks are now available on our YouTube channel! 🔗 buff.ly/g63xgm5

100

340

24,546

Alex Matrosov

Alex Matrosov

@matrosov

May 28

"As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on developing these safeguards and expect to be able to bring Mythos-class models to all our customers in the coming weeks."

Claude

@claudeai

May 28

Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.

Benchmark table showing how Claude Opus 4.8 compares to its predecessor and to other models on tests of coding, agentic skills, reasoning, and practical knowledge work tasks.

ALT Benchmark table showing how Claude Opus 4.8 compares to its predecessor and to other models on tests of coding, agentic skills, reasoning, and practical knowledge work tasks.

8,558