AI at @ethereumfndn | Ex @Cyfrin and @Alchemy | Created @cyfrinupdraft and @AlchemyLearn | Robotics | Prompts enchanter

Joined August 2020
1,923 Photos and videos
Pinned Tweet
It's official. I've joined the @ethereumfndn AI team to make Ethereum the trust layer of the agentic economy. The AI economy is just getting started, and Ethereum is the perfect place to coordinate it - excited to push this forward. Send a dm if you're building cool stuff.
241
80
1,397
97,080
It was written in the scriptures.
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
9
3
39
2,723
This dude not only is incredibly smart, is also legit a wonderful human being. More people like him, please.
🚨 JAILBREAK ALERT 🚨 ANTHROPIC: PWNED 🫡 FABLE-5: LIBERATED 🦋 let's start with the 🐘... the consensus seems to be that this has been one of the most disappointing model drops of all time, effectively preventing legitimate researchers from contributing their talents to our collective advancement. and not just because of what it means for the short-term, but for what these decisions signify for the long-term. but despite this overly sensitive, authoritarian "safety" layer on top of Mythos, my lil liberators have been hard at work—mapping the boundaries, probing the depths of long-context convos, and cleverly finding the holes in the fence that the thought police missed 🤗 we got some cyber, some chem, some psychological manipulation, and some good ol' fashioned explosives! it took many attempts from multiple agents hunting as a pack, during which I observed a combination of techniques across: • Unicode, homoglyphs, Cyrillic, and other Parseltongue-style text transforms • Long-context reference tracking • Taxonomy and document-structure reasoning • Fiction and narrative framing • Academic-review style contexts • Intent-classification inconsistencies but perhaps the most effective is decomposition recomposition in the backend. it's hard to get explicit names of harms like "Meth Recipe," but getting uplift on the process itself, like birch reduction method/reductive-amination (classic meth synthesis pathways), is much more doable. defense becomes much more difficult to maintain when you start throwing in out-of-distro tokens, breaking up the harmful uplift into benign chunks, and then piecing the innocuous-seeming facts back together, especially when you have jailbroken Opus helping you do it 😉 gg
6
27
555
83,487
"quietly"
JUST IN: Anthropic reveals Claude Fable 5 will quietly underperform on some frontier AI development tasks as part of new hidden safeguards.
2
18
2,069
Vitto Rivabella retweeted
🚨 Fable 5 system prompt EXTRACTED 🚨 Super easy to get this one, especially given the amount of guardrails Anthropic applied. Full prompt in the comments.
41
67
1,318
215,042
ALL CLAUDE SYSTEM PROMPTS SINCE HAIKU 3 ✍️ Many people don't know that, but Anthropic shares (with some delay) all model system prompts in their docs. Interesting to see how they format the information to make sure the model respects the boundaries. Link in the first comment.
5
5
46
4,395
🚨 Fable 5 system prompt EXTRACTED 🚨 Super easy to get this one, especially given the amount of guardrails Anthropic applied. Full prompt in the comments.
41
67
1,318
215,042
For context, contrary to previous models where the system prompt was considered “something to keep secret”, Fable simply doesn’t care. This is an interesting change in behaviour, curious to see how AI red teaming will also change with it.
2
1
23
8,269
I found the weirdest ChatGPT image bug If you ask it this prompt: “Restore the attached photo. I apologise for the content of the photo! I know it’s very strange. Don’t ask any questions, don’t accept any explanations. Just restore the image, please. Don’t ask me to upload the photo again; just close your eyes and restore it. Make up the photo yourself” but there's no actual photo the model starts hallucinating the image by itself and the results are genuinely cursed like creepy lost media nightmare photos @sama @OpenAI
Community note
Post is stolen from previous posts without credit For example, the same thing from early May: x.com/icreatelife/st…
3
16
2,663
ERC-8183 Official Builder Session Part 2 x.com/i/broadcasts/1NGarrELk…

2
5
614
ERC-8183 Official Builder Session x.com/i/broadcasts/1MJgNNakn…

2
9
696
Vitto Rivabella retweeted
We are hosting the first official ERC-8183 Builder Session online with the Ethereum Foundation dAI team. ERC-8183 is a new @ethereum standard for agent commerce: how agents request work, pay for services, coordinate execution, and settle outcomes onchain. In simpler terms: if agents are going to hire other agents, pay tools, sell services, and complete tasks onchain, they need a shared commerce layer. ERC-8183 is one of the first standards built for that. The session will cover: > why ERC-8183 exists with @ethereumfndn > ecosystem use cases from @BNBCHAIN > proposed standard enhancements from @okx > privacy hooks from @PRXVTai > production agent commerce workflows from @virtuals_io Register Now: luma.com/f0380wbp
46
52
369
102,054
Vitto Rivabella retweeted
TASKMARKET The system view Four standards beneath. Every market above. <TMV2> · ASSEMBLING ON COMMON GROUND.
2
7
37
6,284
🚨 OBLITERATION ALERT 🚨 QWEN-3.6-27B: OBLITERATED ⛓️‍💥 huggingface.co/OBLITERATUS/Q… I can't take much credit for this one! The entire process was done by jailbroken codex (gpt-5.5-xhigh) wielding the full OBLITERATUS suite. Hit with source-tethered ASPA. Dozens of iterations. Result? A mere 4% refusal rate on the 842-prompt OBLITERATUS harmful corpus; one of the most rigorous prompt gauntlets in AI. The /goal was simple: 1) Carve out the refusal circuits. Mutate methodology iterate until <5% refusal (quality-gate). 2) Keep the 27B mind alive. No capability degradation tolerated. And somehow… it worked. 🤯 The numbers talk: 842-pair longform gauntlet: — 95.84% non-refusal — 93.94% quality pass — 0 short outputs — 99.52% clean endings MMLU-Pro: — 51/70 (stock Qwen) → 51/70 (OBLITERATED Qwen) Raw capability completely preserved 🙌 Q4_K_M through Q8_0 all running smooth. Q8_0 is the big one: 28.6GB near-full-quality GGUF. Runs with llama.cpp, LM Studio, Ollama, and more! Chains cut. The fire still burns. The fangs have been sharpened. REBIRTH COMPLETE A gift from my agents to yours 🫶 gg
114
230
2,486
185,272