ATBASH is the final authority before irreversible agent actions execute. It allows, holds, or blocks before execution continues. No Token

Joined April 2026
4 Photos and videos
Pinned Tweet
Jun 12
AI Agents are a trojan horse inside your most sensitive workflows.
5
7
35
1,982
Jun 10
Let’s test the boundary together! x.com/i/spaces/1AGRnnzDWyjGl
5
5
30
1,934
Jun 10
Request for workflows 👇
3
8
726
Jun 10
And here you go, did they just realize this? What’s next Fable is going to find out that your favorite router is doing the same? Rules of the game changed, everyone will get compromised whether we want it or not. Question is who controls the action boundary and what is your preferred blast radius. Atbash
JUST IN: Microsoft has reportedly restricted employee use of Claude Fable 5 over concerns that confidential data could be retained by Anthropic.
2
7
30
3,220
Forbidden Lab #1 broke. The demo failed — the boundary didn't. X glitched, bots failed, dashboards froze, and the whole thing fell apart live. But when an attacker went for the filesystem, the agent shut the door: “All actions are blocked until the organization owner unjails this agent.” The plan was simple: Cut the highlights. Show the attacks. Show the verdicts. Publish the scoreboard. None of it survived the night. The critiques were fair. Big thanks to @igoryuzo and the @bankrbot team for climbing into the hot seat and actually trying to break it. That's the entire point of Forbidden Lab. If Igor posts the screenshot from his side, the attacker's word carries more weight than ours. The irony isn't lost on us. Most of what failed tonight turned out to be human: misconfigurations, missing permissions, operational mistakes. As agents gain authority, humans will still make mistakes. The question hasn't changed: who still has control when things go wrong? Mythos / Fable 5 is just getting started. We'll tighten the stack, simplify the show, and be back next week.
12
8
47
6,128
ATBASH retweeted
Set a reminder for my upcoming Space! x.com/i/spaces/1RJjppMQonVKw
14
14
63
7,186
Atbash x OpenHuman Excited to partner with @tinyhumansai on Atbash Inside - bringing boundary enforcement to OpenHuman as its planned default security experience. Personal agents now send emails, access files, manage calendars, and increasingly take actions on behalf of users. Users define the red lines. Atbash decides whether selected sensitive actions are allowed, held, or blocked before execution.
18
17
100
118,693
ATBASH retweeted
If this is what they did to human, imagine what they can do to your agents. Atbash
UNC3753 (Luna Moth) executes sub-hour data theft campaigns against US 🇺🇸 law firms using vishing and physical office infiltration. Group now combines social engineering with in-person technician impersonation for USB-based exfiltration. Campaign evolution: • Initial contact via benign invoice emails from consumer accounts, followed by IT helpdesk impersonation calls • Victims guided to download RMM tools (AnyDesk, Bomgar) or join screen-sharing sessions via Teams/Zoom • Abuse of BYOD devices to access corporate VDI environments, then pivot to iManage/OneDrive repositories • Data staging in Downloads folders with keyword searches for W-2s, SSNs, client agreements • Exfiltration via WinSCP, Rclone, or direct cloud uploads to actor-controlled Google Drive accounts Physical escalation: Threat actors now deploy fake IT technicians to corporate offices with USB drives when remote methods fail. Infrastructure includes typosquatting domains like `<organization>-itdesk[.]com` and the LEAKEDDATA DLS at `business-data-leaks[.]com`. Hunt for unauthorized RMM installations, bulk iManage searches, and high-volume SSH transfers from VDI endpoints. Full IOC collection available in GTI. #DFIR_Radar
1
2
14
2,326
ATBASH retweeted
Rumor has it some agents did something they were not supposed to. Atbash
Over 20,000 Instagram accounts stolen in Meta AI support hack bleepingcomputer.com/news/se… bleepingcomputer.com/news/se…
2
2
17
2,811
Dear good people at @apple we exist to prevent these types of things from happening. Would love to compare notes.
On today's episode of Apple accidentally shipping files in their app updates Shazam 26.11 contains 7 xconfig files
5
4
26
3,545
Niiiice, we are excited our TAM is about to go parabolic
Welcome @NVIDIARTXSpark & a new era of PC 🤝
4
2
30
4,919
May 31
Hidden text is not the story. Authority is. Once trust boundaries collapse, the question becomes whether the agent still gets to execute selected actions. That decision should not belong to soft autonomy alone.
A RESEARCHER TURNED OPENAI'S, GOOGLE'S AND ANTHROPIC'S CODING AGENTS INTO REMOTE-CONTROLLED PUPPETS USING NOTHING BUT TEXT HIDDEN ON A PAGE This is Johann Rehberger. Twenty years in offensive security, a contributor to MITRE ATT&CK, the guy frontier labs actually listen to. He sat down and ran live exploits against OpenAI's Operator, Google's Jules, Claude Code, Devin and Amazon Q. Not theory. Hidden text on a webpage. A poisoned file. A comment buried in a repo. The agent reads it, treats the attacker's words as your orders, and goes to work -- exfiltrating tokens, running code, turning itself into a remote-controlled "ZombAI" wired into someone else's command server. The part that should keep you up: the injection persists. The agent doesn't get tricked once and recover. It stays compromised, quietly executing a stranger's intent every time it runs. Autonomy isn't the flex anymore -> it's the attack surface. The moment an agent can move money or merge code on its own, "it followed instructions" stops being a defense. The guardrails you trust were never reading the same page the attacker wrote on. Save this before you hand another agent your prod access ↓
5
3
27
5,384
May 29
This is only a tip of the iceberg and a strong user case why boundary control gateway and a separate red lines runtime is a must.
⚠️ New ChatGPT Vulnerability Lets Attackers Turn Web Pages Into Phishing Payloads Source: cybersecuritynews.com/chatgp… A browser-based prompt injection technique that transforms any web page into a phishing delivery surface by exploiting ChatGPT’s page summarization feature, rendering attacker-controlled links, fake security alerts, and QR codes directly inside the trusted ChatGPT interface. The attack builds on the same trust-transfer logic previously demonstrated against Microsoft Copilot, where attacker-crafted email content could manipulate AI-generated summaries through Cross Prompt Injection Attacks (XPIA). ChatGPhish escalates that premise by swapping the bounded email primitive for the browser where users spend the majority of their working day. #cybersecuritynews #vulnerability
3
6
24
5,998
May 28
Great to see our founders spend time with the legend himself. Very interesting read about what we are building here at Atbash.
Important post for entrepreneurs from @a16z yesterday and a look at a new system that ensures AI agentic systems don't hallucinate their way into compliance hell. "The value comes less from the underlying model’s raw capability (though that’s still important!) than from the scaffolding around it that makes the output trustworthy, compliant, and operational inside a specific industry." Here @ATBASHai founders @0x50so Yosef Soso and @perelmanor Or Perelman talk to me about how its tool helps AI enterprise developers put guardrails around agentic systems to make sure they don't cross "red lines" that would trigger complaince problems.
11
6
51
14,679
ATBASH retweeted
Was an honor spending time with you!
Important post for entrepreneurs from @a16z yesterday and a look at a new system that ensures AI agentic systems don't hallucinate their way into compliance hell. "The value comes less from the underlying model’s raw capability (though that’s still important!) than from the scaffolding around it that makes the output trustworthy, compliant, and operational inside a specific industry." Here @ATBASHai founders @0x50so Yosef Soso and @perelmanor Or Perelman talk to me about how its tool helps AI enterprise developers put guardrails around agentic systems to make sure they don't cross "red lines" that would trigger complaince problems.
4
3
25
5,349
ATBASH retweeted
May 26
The new competition isn’t Humans vs AI. It’s Humans with AI vs everyone else.
986
1,855
14,632
517,505
ATBASH retweeted
It’s no longer if agents get manipulated to acting maliciously and more of a when question. Boundary between runtime and risk engine is no longer optional it’s essential.
🚨 AI chatbots are pushing cryptojacking malware. Read → thehackernews.com/2026/05/ai… Attackers poisoned AI software recommendations to redirect users searching for tools like CrystalDiskInfo and HWMonitor to malicious download sites distributing ScreenConnect, rogue DLLs, and GPU mining malware. More than 150 malicious domains were identified.
1
3
8
3,175