Tweets

Kyle Bhiro retweeted

Josh Kotrous

@kotro___

Apr 30

Last week I demoed Apex, our open source AI pentesting agent, at AI Agents Demo Night in NYC at The Refinery at Domino. Live on stage, we hacked a financial institution in under 3 minutes.

Apex doesn't just scan for textbook vulnerabilities. It digs into your infrastructure, finds what's exposed, maps out business logic flows that attackers could abuse, and exploits novel attack paths autonomously. I showed it discover an FTP server, identify write access, and deface the target site, all live in front of a packed room. No scripts. No playbooks. Just a prompt and a target.

This is what attackers can do now. The question is whether you find the holes first.

Demoed alongside @cognition, @clay, @justworks, @normativeai, North Cloud, and @trywindmill. Huge thanks to @TechNYC, @obviously_nyc, The Refinery at Domino, and our team at @runpensar for putting this together.

Apex is open source. Link below, go break some things (legally).

1,945

Kyle Bhiro retweeted

Kerem Proulx ⌘

@ProulxKerem

Apr 20

x.com/i/article/204624460873…

341

Kyle Bhiro retweeted

Kerem Proulx ⌘

@ProulxKerem

Apr 15

If you are an open source maintainer and are worried about what's going on in security - we @runpensar want to sponsor continuously securing your project. Reach out to me via DM or email us at team(at)pensarai(dot)com

Amjad Masad

@amasad

Apr 15

If finding security flaws is fully automated with frontier models à la Mythos, then GitHub should have a metric, like stars, showing how much compute is spent securing/hardening an open-source package.

Example: 📦 linus/linux ⭐️ 200k 🦾 $239M

Only way OSS can be trusted.

743

Kyle Bhiro retweeted

Kerem Proulx ⌘

@ProulxKerem

Mar 31

This will keep happening with increased frequency. So many hypergrowth startups are relying on AI to build faster, accumulating financial crisis levels of security debt (we’ll hire someone to secure it later!). AI-native startups are now a critical and least defended node in the enterprise attack surface.

Dominic Alvieri

@AlvieriD

Mar 31

Mercor AI has allegedly been breached by Lapsus. 939GB of source code, 4TB of data in total. All data from their Tailscale VPN. @mercor_ai

571

Kyle Bhiro retweeted

Zion

@BlasianHokage

Mar 31

Automated security companies are going to go crazy after Axios, Mercor, and now Claude Code. @runpensar is one I can think of.

Chaofan Shou

@Fried_rice

Mar 31

Claude code source code has been leaked via a map file in their npm registry! Code: pub-aea8527898604c1bbb12468b…

1,322

Kyle Bhiro retweeted

Feross

@feross

Mar 31

🚨 CRITICAL: Active supply chain attack on axios -- one of npm's most depended-on packages.

The latest axios@1.14.1 now pulls in plain-crypto-js@4.2.1, a package that did not exist before today. This is a live compromise. This is textbook supply chain installer malware. axios has 100M weekly downloads. Every npm install pulling the latest version is potentially compromised right now.

Socket AI analysis confirms this is malware. plain-crypto-js is an obfuscated dropper/loader that:
• Deobfuscates embedded payloads and operational strings at runtime
• Dynamically loads fs, os, and execSync to evade static analysis
• Executes decoded shell commands
• Stages and copies payload files into OS temp and Windows ProgramData directories
• Deletes and renames artifacts post-execution to destroy forensic evidence

If you use axios, pin your version immediately and audit your lockfiles. Do not upgrade.

541

4,026

16,168

12,403,713
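The "audit your lockfiles" advice in the tweet above could be sketched roughly as follows. This is a minimal illustration, not an official tool: it assumes an npm `package-lock.json` in the v2/v3 layout (a top-level `packages` map keyed by `node_modules/...` paths), and the `FLAGGED` set and `flagged_packages` helper are hypothetical names that simply encode the two versions named in the tweet.

```python
import json
from pathlib import Path

# Versions named in the tweet above (assumption: no other versions are affected).
FLAGGED = {("axios", "1.14.1"), ("plain-crypto-js", "4.2.1")}


def flagged_packages(lock: dict) -> list[str]:
    """Return 'name@version' for each flagged entry in a v2/v3 npm lockfile."""
    hits = []
    for path, meta in lock.get("packages", {}).items():
        if not path:  # the empty key describes the root project, not a dependency
            continue
        # Nested installs look like "node_modules/a/node_modules/b"; keep the last name.
        name = path.rsplit("node_modules/", 1)[-1]
        version = meta.get("version", "")
        if (name, version) in FLAGGED:
            hits.append(f"{name}@{version}")
    return sorted(hits)


if __name__ == "__main__":
    lock_path = Path("package-lock.json")
    if lock_path.exists():
        for hit in flagged_packages(json.loads(lock_path.read_text())):
            print("FLAGGED:", hit)
```

Run from a project root, the script prints one line per flagged entry and nothing if the lockfile is clean. It does not replace a proper advisory feed or scanner.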

Kyle Bhiro retweeted

Josh Kotrous

@kotro___

Mar 23

We've been quiet the last few months. That was intentional. We've been working directly with real companies, real systems, and real constraints - making sure what we're building doesn't just work in controlled environments, but is mission-critical ready.

Today, we're showing what we've been building. Introducing Pensar Apex - an AI-powered penetration testing agent that runs directly in your terminal. This isn't a wrapper or a chatbot. It's an autonomous agent that explores an application like a real tester, reasons about vulnerabilities, and chains multi-step attack paths. All from a single command.

We've been dogfooding Apex on our own codebase for months, and enterprise customers have been running our cloud-hosted version against their environments. The results have sharpened the product considerably - nothing teaches you what "reliable" actually means like staking your own security on it.

But the real breakthrough wasn't just building the agent - it was building a reliable validation system around it. One that forces the agent to deterministically verify its findings, continuously test its own hypotheses, and prove exploitability before reporting anything. Because agents are easy to demo, trustworthy agents are hard to build. That shift changed everything for us. Less guessing, more proving. Less noise, more signal.

And via our cloud hosted offering, it can slot directly into your CI/CD pipeline - giving you continuous, validated pentesting results on every commit. Not periodic assessments that go stale the moment code changes. Continuous proof that your application holds up, running alongside your tests.

This is what we think the new paradigm looks like: pentesting that lives in your development workflow, not outside of it. If you're a developer, you can run a pentest in minutes. If you're a security engineer, you can push it much further. Try it, break it, and tell us where it falls short. We've got a lot more coming.

477

Kyle Bhiro retweeted

Kerem Proulx ⌘

@ProulxKerem

Mar 19

Our autonomous pentesting agent just outperformed the two most popular open source offensive security agents on a benchmark of 60 modern, defense-enabled web apps. Battle-tested in production against our customers' environments from startups to financial institutions, Apex consistently finds and exploits critical vulnerabilities other agents and humans miss. Today we're releasing it open source alongside our internal benchmarks.

295

1,987,110

Kyle Bhiro retweeted

Josh Kotrous

@kotro___

Mar 17

There’s been a lot of criticism of MCP lately, and I've felt the sentiment myself. But the discussion is circling a deeper shift that APIs are becoming the UX for agents. Humans tolerate messy APIs because we read docs, infer intent, and adapt. Agents don’t. They rely almost entirely on the semantic structure you expose. So the real design question becomes "how much meaning lives in your schema?" The better the interface communicates the system, the less intelligence the agent needs to use it.

319
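To make "how much meaning lives in your schema?" from the tweet above concrete, here is a small illustrative sketch. Everything in it is hypothetical: the two tool definitions describe an imagined refund endpoint, and `undocumented_fields` is a made-up helper that only reports which parts of a JSON-Schema-style tool definition give an agent no description to read.

```python
# Hypothetical tool schemas for the same imagined "refund" endpoint.
TERSE = {
    "name": "refund",
    "parameters": {
        "type": "object",
        "properties": {"id": {"type": "string"}, "amt": {"type": "number"}},
    },
}

DESCRIPTIVE = {
    "name": "refund_order",
    "description": "Refund part or all of a paid order. Fails if the order is unpaid.",
    "parameters": {
        "type": "object",
        "properties": {
            "order_id": {
                "type": "string",
                "description": "ID of the paid order to refund.",
            },
            "amount_cents": {
                "type": "integer",
                "minimum": 1,
                "description": "Refund amount in cents; must not exceed the order total.",
            },
        },
        "required": ["order_id", "amount_cents"],
    },
}


def undocumented_fields(schema: dict) -> list[str]:
    """List the parts of a tool schema that carry no description."""
    # "<tool>" marks a missing top-level description for the tool itself.
    missing = [] if schema.get("description") else ["<tool>"]
    props = schema.get("parameters", {}).get("properties", {})
    missing += [name for name, spec in props.items() if not spec.get("description")]
    return missing


print(undocumented_fields(TERSE))
print(undocumented_fields(DESCRIPTIVE))
```

A human could guess that `amt` is a currency amount and look up the unit in the docs. An agent only sees the schema, so the second definition leaves it far less to infer.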

Kyle Bhiro retweeted

Kerem Proulx ⌘

@ProulxKerem

Mar 11

x.com/i/article/203178897148…

449

Kyle Bhiro retweeted

Insecure Agents Podcast

@insecureagents

Mar 9

We’re talking about sandboxes and security today at @daytonaio Compute! Great to chat with @shcallaway on how his new company @sazabi is using sandboxes to build the future of AI native observability

2,259

Kyle Bhiro retweeted

Michelle Fang 🌁

@michelleefang

Feb 9

Tuesday 2/10
‣ Galentine's Vibe Coding Night with v0 luma.com/b9vx5saz @ceciaramitaro @maddiedreese
‣ Hacking Agents February Meetup luma.com/utvcobkp @ratothebec
‣ Zero to Enterprise luma.com/h7fta8dx @runpensar @kylebhiro
‣ Women in AI: San Francisco Happy Hour hosted by Anything luma.com/mo4kyop9 @zariazinn @anything
‣ AI Journal Club for Researchers ft. Ben Coleman (Google Deepmind) luma.com/uglaf7c5 @workato

Galentine's Vibe Coding Night with v0 · Luma

Grab your laptop and your favorite galentines — we're building together! Join us for an evening of vibe coding, live demos, and networking in San Francisco.…

luma.com

2,804

Kyle Bhiro retweeted

adam 🇺🇸

@personofswag

Feb 8

“did opus-4.6 fast brutally frame mog gpt-5.3-codex?”

1,971

53,226

Kyle Bhiro retweeted

Allie Howe

@vtahowe

Feb 6

If you want to see where AI security risk exists in OpenClaw or your agents, go check out app.vairde.ai 👀👀👀👀 WE LAUNCHED 👀👀👀👀

16,775

Kyle Bhiro retweeted

gaut

@0xgaut

Jan 15

pulling up your 15 claude code tabs in the morning

163

650

8,483

341,438

Kyle Bhiro

@kylebhiro

Jan 14

Micro Center, Brooklyn

156

Kyle Bhiro retweeted

Pensar

@runpensar

Jan 12

@AirMacNair24 has joined Pensar as Head of Growth! Joe’s experience spans sales, customer success, account management, and strategy. In his previous role, he was #1 in pentest sales across early stage, mid-market and enterprise accounts. Joe understands every stage of the pentest lifecycle, from initial scoping to final reporting and compliance readiness. His end-to-end knowledge positions him perfectly to help organizations leverage Pensar’s on-demand pentests for their security needs. Welcome, Joe!

195

Kyle Bhiro retweeted

Kyle Ryan

@kyjry

Jan 6

I gave Apex, our pentesting agent, the Playwright MCP and ran it against internal benchmarks.

It registered an account on its own: it hit auth-protected endpoints, found the signup flow, created credentials, and continued the pentest authenticated.

No instruction to do this. It just figured out that's what it needed to continue the attack chain.

1,725

Kyle Bhiro retweeted

Insecure Agents Podcast

@insecureagents

24 Dec 2025

Shai Hulud 2.0 was a wake-up call. Hear from @Feross, Founder & CEO of @SocketSecurity, on supply chain attacks and what's next. Full episode is out now!

2,035

Kyle Bhiro retweeted

Insecure Agents Podcast

@insecureagents

23 Dec 2025

Speed, security and statefulness for AI code. Ep. 19 @ivanburazin, Co-Founder and CEO of @daytonaio is out now!

7,200