autonomous AI security agents that audit your codebase, prove exploitable vulnerabilities, and deliver fixes your team can ship.

Joined December 2023
4 Photos and videos
winfunc retweeted
We discovered the same vulnerability too. :) And @winfunction discovered 4 more remote RCE primitives in NGINX soon to be publicly disclosed. Anywho, we're hiring security researchers with a knack on taming LLMs. If you're interested in novel vulnerability research and autonomous exploitation with language models, DM me and I'll send you a fun CTF to solve. :)
Introducing nginx-poolslip, a fresh RCE for the the latest nginx release 1.31.0. nginx-rift has been patched, but our security agent Vega has found a new 0 day. We will release the full technical writeup with ASLR bypass 30 days after the patch on nebusec.ai.
7
18
107
20,899
winfunc retweeted
We're doing an experiment with open models @winfunction to see how far we can push them to find vulns in hardened targets. So far: - $4.5K in bounties from Chrome VRP with a few more pending, with the scans costing less than $100. - 2 CVEs in NGINX (CVE-2026-28755 & CVE-2026-42926). And watch out for the next release! - And 60ca500faea0fc70816bb9c53af3815e2af3e6c962b4b4ea63c33c62ebb4240d 👀 We're writing a blog on this soon.
5
13
101
12,787
winfunc retweeted
During our YC (@ycombinator S24) batch, we had the awesome opportunity to meet @paulg and talk about what we're building: An autonomous AI hacker. To showcase a fun demo, I remember opening my laptop in the Uber to his home and challenging our agents to find vulnerabilities in the old HackerNews codebase written in Arc. For those unfamiliar, Arc is a programming language designed by PG and Robert Morris. And the old HN codebase is written in Arc. We only got to talk about it with him but we just redid the experiment with our improved harness for fun! And we wrote a blog about it: winfunc.com/research/hacking…
1
2
17
1,068
winfunc retweeted
Vulnerability benchmarks rot. Cases leak into training data, scores measure memorization. We built N-Day-Bench: tests LLMs on finding real vulnerabilities in real repos, refreshed monthly from live GitHub advisories. Blinded judging. All traces public. Very interestingly, the latest model from @Zai_org, GLM 5.1 performs really well! Link: ndaybench.winfunc.com
2
3
7
851
Vulnerability benchmarks rot. Cases leak into training data, scores measure memorization. We built N-Day-Bench: tests LLMs on finding real vulnerabilities in real repos, refreshed monthly from live GitHub advisories. Blinded judging. All traces public. Very interestingly, the latest model from @Zai_org, GLM 5.1 performs really well! Link: ndaybench.winfunc.com
2
3
7
851
How it works: each month the benchmark pulls fresh cases from GitHub security advisories, checks out the repo at the last commit before the patch, and drops models into a sandboxed read-only shell (h/t just-bash by @cramforce). The model never sees the fix. It starts from sink hints and has to trace the bug through actual code. Only repos with 10k stars qualify. A diversity pass prevents any single repo from dominating the set. Ambiguous advisories (merge commits, multi-repo references, unresolvable refs) are dropped. Why: Static vulnerability discovery benchmarks become outdated quickly. Cases leak into training data, and scores start measuring memorization. The monthly refresh keeps the test set ahead of contamination — or at least makes the contamination window honest.
2
1
237
Currently testing GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro, GLM-5.1, and Kimi K2.5. Every run publishes the full audit trail — shell commands, judge rationale, curator answer key, sandbox history. If a score looks wrong, you can trace it to a specific shell session on a specific line of code. Results: ndaybench.winfunc.com/

243
winfunc retweeted
New CVE in NGINX - CVE-2026-28755 NGINX stream module allows TLS handshake to succeed with revoked client certificates when ssl_ocsp on is configured. This vulnerability was autonomously discovered by Winfunc's AI agent. Read the write-up here: winfunc.com/findings/CVE-202…

1
2
9
583
winfunc retweeted
3
3
1,490
winfunc retweeted
The Recent CVEs in React and Node.js Were Found by an AI - winfunc.com/blog/recent-0-da… In December 2025 and January 2026, an AI system autonomously discovered zero-day vulnerabilities in Node.js and React, two of the most widely deployed JavaScript runtimes and frameworks in the world. This post documents how these vulnerabilities were found, the technical details of the flaws, and what this means for the future of security research.

4
21
1,510
winfunc retweeted
The Recent 0-Days in Node.js and React Were Found by an AI winfunc.com/blog/recent-0-da…

14
98
7,151
winfunc retweeted
New blog post: The Recent 0-Days in Node.js and React Were Found by an AI Covering the discovery of 0-days with AI, its implications, and "AI slop". Have a read. winfunc.com/blog/recent-0-da…

4
11
764
we've a few more up our sleeves. soon. 👾
A new vulnerability in React Server Components (CVE-2026-23864) was disclosed today. One of the DoS vectors was discovered by me with the help of an AI agent @winfunction. Other vectors were also discovered by @ryotkak et al. All users should upgrade to a patched version as soon as possible. vercel.com/changelog/summary…
4
418
winfunc retweeted
A new vulnerability in React Server Components (CVE-2026-23864) was disclosed today. One of the DoS vectors was discovered by me with the help of an AI agent @winfunction. Other vectors were also discovered by @ryotkak et al. All users should upgrade to a patched version as soon as possible. vercel.com/changelog/summary…
5
18
49
3,893
winfunc retweeted
🚨 CVE-2026-21636 in Node.js (@nodejs) Node.js permission model bypass via unchecked Unix Domain Socket connections (UDS) This vulnerability was autonomously discovered by winfunc.com, an AI agent that can find, exploit, and patch security vulnerabilities in codebases. Thanks to @_rafaelgss for triaging and fixing the issue.
1
7
16
1,527
🚨 CVE-2026-21636 in Node.js (@nodejs) Node.js permission model bypass via unchecked Unix Domain Socket connections (UDS) This vulnerability was autonomously discovered by winfunc.com, an AI agent that can find, exploit, and patch security vulnerabilities in codebases. Thanks to @_rafaelgss for triaging and fixing the issue.
1
7
16
1,527
Read the Hacktivity here: winfunc.com/hacktivity/nodej…

1
3
319
winfunc retweeted
20 Nov 2025
this is how long it took for the @winfunction agent to find and exploit a gnarly 0-day in a critical software. we'll write about it soon! can't wait to show off what we've been cooking! 🥷
1
2
6
1,241