We evaluated the @Tenzai_Labs AI hacker across six major CTF competitions designed for humans.
Result: top 1% performance, outperforming 125,000 human hackers across domains: web hacking, AI hacking, and low-level system hacking.
We wanted to see what @Tenzai_Labs's hacking agent is really capable of in the most complicated and competitive environments, where excelling requires solving increasingly difficult challenges.
The results we achieved surprised even me. This is incredible evidence of what AI agents with the right harness can do, and I expect it to only get better from here.
blog.tenzai.com/tenzais-ai-h…