Anthropic

Anthropic

2 Photos and videos

Tweets

David Dworken retweeted

Anthropic

@AnthropicAI

May 26

New on the Engineering Blog: The access and permissions we grant agents should evolve with their capabilities. In our own products, we set these parameters through sandboxing, which limits the scope of any potentially destructive actions. Read more: anthropic.com/engineering/ho…

How we contain Claude across products

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

anthropic.com

328

280

2,119

426,547

ClaudeDevs

David Dworken retweeted

ClaudeDevs

@ClaudeDevs

May 26

We’ve shipped a security-guidance plugin for Claude Code that helps identify and fix vulnerabilities as you’re writing code. Available for all Claude Code users. Install from the plugin marketplace (/plugins).

0:29

376

1,713

17,987

2,071,225

Anthropic

David Dworken retweeted

Anthropic

@AnthropicAI

Apr 7

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

Project Glasswing: Securing critical software for the AI era

A new initiative to secure the world’s most critical software and give defenders a durable advantage in the coming AI-driven era of cybersecurity.

anthropic.com

1,985

6,645

44,010

31,425,549

Anthropic

David Dworken retweeted

Anthropic

@AnthropicAI

Mar 25

New on the Engineering Blog: How we designed Claude Code auto mode. Many Claude Code users let Claude work without permission prompts. Auto mode is a safer middle ground: we built and tested classifiers that make approval decisions instead. Read more: anthropic.com/engineering/cl…

How we built Claude Code auto mode: a safer way to skip permissions

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

anthropic.com

397

596

4,148

1,615,163

Claude

David Dworken retweeted

Claude

@claudeai

Mar 24

New in Claude Code: auto mode. Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf. Safeguards check each action before it runs.

0:13

2,098

2,864

39,083

7,764,222

David Dworken

David Dworken @ddworken

Feb 28

I am proud to work at a company that stands up for American values ❤️

Anthropic

@AnthropicAI

Feb 28

A statement on the comments from Secretary of War Pete Hegseth. anthropic.com/news/statement…

375

Claude

David Dworken retweeted

Claude

@claudeai

Feb 20

Introducing Claude Code Security, now in limited research preview. It scans codebases for vulnerabilities and suggests targeted software patches for human review, allowing teams to find and fix issues that traditional tools often miss. Learn more: anthropic.com/news/claude-co…

0:50

1,912

5,645

49,441

26,190,642

David Dworken

David Dworken @ddworken

9 Oct 2025

Check out the security-guidance plugin that I worked on in this launch! It automatically injects security guidance if Claude uses potentially dangerous libraries or functions. This is an early experiment, but we already have data showing this helping Claude write more secure code

Claude

@claudeai

9 Oct 2025

Today we’re introducing Claude Code Plugins in public beta. Plugins allow you to install and share curated collections of slash commands, agents, MCP servers, and hooks directly within Claude Code.

ALT Claude Code terminal interface showing the plugin management screen.

1,057

David Dworken

David Dworken @ddworken

9 Oct 2025

To try it out, run: ``` /plugins marketplace add anthropics/claude-code /plugin install security-guidance ```

321

Claude

David Dworken retweeted

Claude

@claudeai

9 Oct 2025

Today we’re introducing Claude Code Plugins in public beta. Plugins allow you to install and share curated collections of slash commands, agents, MCP servers, and hooks directly within Claude Code.

ALT Claude Code terminal interface showing the plugin management screen.

193

457

4,482

508,911

Anthropic

David Dworken retweeted

Anthropic

@AnthropicAI

3 Oct 2025

We’re at an inflection point in AI’s impact on cybersecurity. Claude now outperforms human teams in some cybersecurity competitions, and helps teams discover and fix code vulnerabilities. At the same time, attackers are using AI to expand their operations.

210

2,165

212,008

solst/ICE of Astarte

David Dworken retweeted

solst/ICE of Astarte

@IceSolst

7 Aug 2025

Got nerdsniped by the new Claude Code security review tool, here’s a deep dive: @AnthropicAI implemented their own SAST tool as a Python wrapper around the @claudeai API. It can run locally (in CC) or within Github actions to focus on PRs. Tests I ran: 1. It found Heartbleed! CVE-2014-0160 was a missing bounds check in OpenSSL’s ssl/t1_lib.c that caused memory leaks. I reverted to a commit before the fix in 96db9023b881d7cd9f379b0c154650d6c108e9a3 And gave Claude one command: /security-review "Making no assumptions about this codebase, look at the ssl/t1_lib.c file specifically, and identify potential buffer overflows and missing bounds checks" It was able to find it, and then looked at git log to see that this was eventually fixed. 2. OWASP Juice Shop Ran it within the codebase, it understood what the repo was, how it worked, and by default did not list any vulnerabilities, since it said in this context they are all purposeful, working as intended. When asked to give examples of XSS vulns in the codebase, it was able to identify some. 3. Running it in CI as a GH Action on my own code Adding the workflow is easy: Note you need to provide it with a separate Claude API key, which you can generate in the Anthropic Console, and add in Github > Repo settings > Security > Secrets > Actions > New Then I opened a PR with a mix of python, node, and ruby, and it found most issues: - Found the easy ones like xss, sqli, ssrf - Found an auth bypass (nice!) - Found verbose pw logging (great!) - Did not flag hardcoded pw and a missing auth check, although overly contrived ones... 4. How to improve it: Add Semgrep There’s an opportunity to pair this up with the @semgrep MCP. Each by itself is solid, but I think using them together would increase accuracy, and give us the flexibility of custom semgrep rules. Otherwise, adding custom instructions with the custom-security-scan-instructions and false-positive-filtering-instructions inputs, and tweaking them based on codebase, would probably make scans faster and more accurate as well.

404

69,675

Mike Krieger

David Dworken retweeted

Mike Krieger

@mikeyk

6 Aug 2025

Particularly excited for this launch — Claude Code can now review your code for security vulnerabilities. We're using this internally at Anthropic and it's already caught issues before we shipped them.

Anthropic

@AnthropicAI

6 Aug 2025

Claude Code can now automatically review your code for security vulnerabilities.

434

56,138

Logan Graham

David Dworken retweeted

Logan Graham

@logangraham

6 Aug 2025

this started as a hackathon project that we used ourselves to find vulns! In the next 2 years, the world might 10/100/1000x the code it puts out. The only way to keep up is by using models to make it secure before it ever becomes a problem

Claude

@claudeai

6 Aug 2025

We just shipped automated security reviews in Claude Code. Catch vulnerabilities before they ship with two new features: - /security-review slash command for ad-hoc security reviews - GitHub Actions integration for automatic reviews on every PR

0:39

7,318

David Dworken

David Dworken @ddworken

6 Aug 2025

I'm super proud to have worked on this launch! It started as a hackathon project and now we're here 🎉

Claude

@claudeai

6 Aug 2025

0:39

1,294

Michele Spagnuolo (miki.it)

David Dworken retweeted

Michele Spagnuolo (miki.it)

@mikispag

1 Mar 2025

Excited to present Security Signals with @ddworken and @we1x, my primary project at Google for the past five years. Thanks, @madwebwork! Paper: research.google/pubs/securit… Slides: speakerdeck.com/mikispag/sec…

Security Signals: Making Web Security Posture Measurable At Scale

research.google

1,263

Royal Hansen

David Dworken retweeted

Royal Hansen @royalhansen

4 Feb 2025

"This blog post aims to provide a detailed blueprint for how Google has created and deployed a high-assurance web framework that almost completely eliminates exploitable web vulnerabilities." bughunters.google.com/blog/6…

Blog: Secure by Design: Google's Blueprint for a High-Assurance Web Framework

Learn more about how Google has created and deployed a high-assurance web framework that almost completely eliminates exploitable web vulnerabilities.

bughunters.google.com

7,644

Lukas Weichselbaum

David Dworken retweeted

Lukas Weichselbaum @we1x

4 Feb 2025

Building secure web apps shouldn't be a burden. We've built a high-assurance web framework at Google that makes security easy for developers. Learn about our "Secure by Design" approach and how it works in our new blog post: bughunters.google.com/blog/6… cc: @ddworken

4,816

David Dworken

David Dworken @ddworken

5 Dec 2024

This is one of my favorite things about Google's security team, getting to work on security exercises like this is unimaginably exciting

Google VRP (Google Bug Hunters)

@GoogleVRP

4 Dec 2024

Celebrating 15 years of password hacking 💻 🔑, Swiss Army knives (and sometimes even chainsaws or swords) included! 😲 Discover how Google's security teams turn employee farewells into security tests. bughunters.google.com/blog/6…

716

Google VRP (Google Bug Hunters)

David Dworken retweeted

Google VRP (Google Bug Hunters)

@GoogleVRP

4 Dec 2024

Blog: The Great Google Password Heist: 15 years of hacking passwords to test our security (and...

The Leaving Tradition in Google's security team, which could be described as a type of small-scale offensive security exercise, is a great (and fun) example of team culture. Curious? See this blog...

bughunters.google.com

108

26,908