Building Evidary: AI policy → Verifiable Evidence → Traceability → Guardrails → Agentic AI Red Teaming & Pen Testing | ML & AI Engineer | #Otaku #AutisticASF

Joined November 2022
Pinned Tweet
Desiderata by Max Ehrmann Go placidly amid the noise and haste, and remember what peace there may be in silence. As far as possible without surrender be on good terms with all persons. Speak your truth quietly and clearly; and listen to others, even to the dull and the ignorant; they too have their story. Avoid loud and aggressive persons, they are vexatious to the spirit. If you compare yourself with others, you may become vain or bitter, for always there will be greater and lesser persons than yourself. Enjoy your achievements as well as your plans. Keep interested in your own career, however humble; it is a real possession in the changing fortunes of time. Exercise caution in your business affairs, for the world is full of trickery. But let this not blind you to what virtue there is; many persons strive for high ideals, and everywhere life is full of heroism. Be yourself. Especially do not feign affection. Neither be cynical about love; for in the face of all aridity and disenchantment it is as perennial as the grass. Take kindly the counsel of the years, gracefully surrendering the things of youth. Nurture strength of spirit to shield you in sudden misfortune. But do not distress yourself with dark imaginings. Many fears are born of fatigue and loneliness. Beyond a wholesome discipline, be gentle with yourself. You are a child of the universe no less than the trees and the stars; you have a right to be here. And whether or not it is clear to you, no doubt the universe is unfolding as it should. Therefore be at peace with God, whatever you conceive Him to be, and whatever your labours and aspirations, in the noisy confusion of life keep peace with your soul. With all its sham, drudgery, and broken dreams, it is still a beautiful world. Be cheerful. Strive to be happy.
Live 📿 retweeted
What those in power tell you is true is not the truth, and what the masses believe is true is not the truth. Those in power will lie to you, and the masses are delusional.
Live 📿 retweeted
Replying to @tszzl
This did not age well, did it...
Like I said, they never forget. They never forgive. A\ lacks political finesse and overstanding of this fact. They gave the USG a big stick to beat them over the head with their fear-mongering marketing and regulatory capture strategy. USG has all the money, all the guns, all the state-backed power. The A\ team should have been more considered in their strategic approach. Especially in their comms, marketing, negotiations and interaction with the USG. They clearly do not understand the personality, culture, and aims of this administration. If you tell the state your model is too powerful for ordinary deployment, do not be shocked when the state decides it is too powerful for ordinary access. That is the strategic own goal. Someone needs to get Monseigneur Dario a copy of The Art of War and The Prince.
Three months ago, @DeptofWar kicked @AnthropicAI out of our building—forever. Every passing day proves why that was the right move. 🇺🇸
IMO the issue is Dario’s refusal again with this administration. They are reminding him about the rules of engagement and who really has the power. And after the previous episode, they are kicking him in the balls and using opaque bureaucracy, like only US nationals being able to use the model, so he is in no doubt. Strategically, A\'s PR and comms have been left wanting. They tried to use fear mongering and regulatory capture as a moat. All while fighting the previous supply chain risk designation. The people saying this gives them aura are wrong. This is just naive hubris on A\'s part and will impact their IPO timeline. They lack political finesse.
Live 📿 retweeted
The snows of June will continue until morale improves
Live 📿 retweeted
I’ve had a number of conversations with folks inside and outside government about the current situation with Anthropic, and here is what I believe to be true:
- As we know, Anthropic publicly released its Mythos class models earlier this week under the commercial name Fable.
- Fable is Mythos with guardrails. But if those guardrails fail, then you’ve exposed Mythos and its advanced cyber capabilities to people who shouldn’t have them. (Keep in mind that Anthropic itself widely promoted the idea that Mythos was a cyberweapon and needed to be regulated as such. They asked for government regulation of Mythos and championed the guardrails on Fable. If there is a vulnerability, big or small, it is Anthropic’s responsibility to patch.)
- A highly credible trusted partner of both Anthropic and the USG who was testing Fable came forward with a jailbreak of those guardrails. The Admin asked Dario to fix the jailbreak or de-deploy the model. Dario refused.
- In their blog post, Anthropic defended its decision by saying the jailbreak isn’t serious. That is not what the trusted partner and the USG believe; nor is that kind of minimizing language consistent with Anthropic’s brand as the AI safety company. It’s difficult to fathom how they could claim a jailbreak allowing operability of a cyber weapon could be defined as not “serious.”
- In the past, Anthropic has always said that safety must be top priority and taken super seriously. In this case, Anthropic prioritized the continued offering of the consumer model over safety.
- In reaction, the Admin issued the export control. The Admin did this reluctantly. It’s been very surprised that Anthropic hasn’t wanted to cooperate with a reasonable safety request (i.e., fixing the jailbreak issue). Anthropic’s reaction is very much at odds with their branding and ethos as a safe AI research community.
- The Admin’s hope now is that Anthropic remediates the safety issue, the export control is lifted, and Fable goes back into general release. The Admin wants all of this to happen as soon as possible. It is frankly bewildered that Anthropic hasn’t wanted to comply with safety requests that it previously said were its highest priority.
- Those trying to misdirect and tie this action to the prior DoW/Anthropic issues are wrong. The Admin values Anthropic’s technical capabilities and feels that this issue, while serious, should be easily resolved. The ball is in Anthropic’s court.
Live 📿 retweeted
I do find it extraordinary that current events in AI don’t make the top ~30 stories on the BBC News homepage
Live 📿 retweeted
Replying to @AndrewCurran_
Man, the Mythos discourse switched to Logos real fast. The story was useful until the state asked for the evidence. If you get it, you get it.
The communications whiplash from Anthropic is remarkable. First, Fable 5 and Mythos 5 were presented as extraordinary frontier systems. Fable 5: a Mythos-class model made safe for general use. Mythos 5: the same underlying model with safeguards lifted in some areas, offered to a small group of cyberdefenders and infrastructure providers, and described as having the strongest cybersecurity capabilities of any model in the world. Then the US government stepped in. According to Anthropic, the directive suspends access to both models by any foreign national, whether inside or outside the US, including foreign-national Anthropic employees. The net effect is that Anthropic says it must disable both models for all customers. The devil is in the details as always. It means this is not just customer access. It touches who can operate, inspect, improve, support, and work on the model. In practice, a frontier AI system built by a global talent base becomes constrained by nationality and US export-control logic. Then Anthropic says the capability displayed in the report it believes triggered the directive is widely available from other models, including OpenAI’s GPT-5.5, and used every day by defenders keeping systems safe. That is the hyper contradiction. You cannot frame a model as exceptional enough to justify extraordinary safeguards, trusted access, government collaboration, strict classifiers, fallback systems, 30-day data retention, and risk processes, then sound surprised when the state treats it as an exceptional national-security asset. Anthropic helped normalise the argument that frontier AI should sit inside serious state oversight. The state has now exercised that logic. Anthropic may be right that the intervention is opaque, disproportionate, and technically weak. But the wider lesson is clear. AI governance cannot be a rhetorical asset when it supports trust, then become overreach when the same logic hits commercial deployment. 
For the UK and EU, this should concentrate minds. If frontier AI access, internal model work, and operational continuity can be disrupted overnight by US national-security action, sovereign AI is no longer abstract policy language. It means energy, compute, labs, capital, talent, model development, infrastructure, and enough of the stack to avoid permanent strategic dependence. The talent is here. The knowledge is here. The market need is here. The question is whether the political will and capital allocation are here. You cannot make this stuff up. Sources: anthropic.com/news/fable-myt… anthropic.com/news/claude-fa… #Anthropic #Claude #Fable5 #Mythos5 #OpenAI #GPT55 #AI #AISafety #AIGovernance #FrontierAI #AIRegulation #Cybersecurity #NationalSecurity #TechPolicy #SovereignAI #DigitalSovereignty #EU #EUAIAct #RAI #DataDignity #Evidary #DDAI #DigitalDanceAI
Live 📿 retweeted
Jun 13
When you rent your artificial intelligence, you have no control, and no choice. This is why sovereignty and ownership matter. Whether it means using your own hardware, open source, or deep customization. Own your AI, own your future.
Live 📿 retweeted
Not Anthropic snitching on OpenAI 😭
Project Glasswing is gonna have some KYC issues and know your staff issues, huh.
Live 📿 retweeted
MiniMax M3, Open-Weight, Now On Hugging Face, with only ~428B parameters and ~23B activated parameters. Weights: huggingface.co/MiniMaxAI/Min… MiniMax Sparse Attention: huggingface.co/papers/2606.1…
Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities
- Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas
- MiniMax Sparse Attention scales context to 1M
- Natively Multimodal from Step Zero
API: platform.minimax.io
Token Plan: platform.minimax.io/subscrib…
🚀New! MiniMax Code: code.minimax.io
Weights & Tech Report in ~10 Days
People on Instagram, TikTok, etc., wondering why people in the tech bubble are saying free Mythos. They're confused because they think people mean free Migos, or something to that effect. I don't know! I'm not down with the youths.
Man, the hits just keep on coming. Anthropic has now published a statement saying the US government has directed it to suspend access to Fable 5 and Mythos 5. Not just outside the US, I hasten to add. According to Anthropic, the directive applies to any foreign national, whether inside or outside the US, including foreign-national Anthropic employees. Anthropic says the net effect is that it must disable Fable 5 and Mythos 5 for all customers in order to comply. That is not a small product-access issue anymore. That is the state treating frontier Artificial Intelligence as a national-security asset. And this is where the downstream consequences become interesting. Anthropic has spent years arguing that frontier models require serious safeguards, serious oversight, pre-deployment testing, red-teaming, risk thresholds, and the ability for governments to block unsafe deployments. Now the government has intervened, and Anthropic is arguing that the action is opaque, disproportionate, and not grounded in the technical facts. Be careful what you wish for. The governance question here is not whether powerful AI systems need oversight. They do. The question is whether that oversight is transparent, technically evidenced, contestable, and proportionate. Because once frontier AI is framed as a national-security object, the centre of gravity completely changes. It moves from product governance to state control. From safety documentation to licensing regimes. From responsible deployment to geopolitical access management. And from “trust us, we have safeguards” to “prove the safeguards, prove the risk, prove the proportionality, and prove the decision process.” That is the real lesson here. Governance cannot just be asserted when it is useful and challenged when it becomes inconvenient. There is also a strategic lesson here for those of us in the UK and the European Union. 
If access to frontier models can be disrupted overnight by US export-control action, then the UK and the EU should treat this as a serious wake-up call. We need sovereign AI capability across the full stack: energy, compute, research labs, infrastructure, talent, model development, and the application layer built on top of that. The talent is here. The knowledge is here. The market need is here. What is required now is the political will, industrial strategy, and more, much more capital allocation to build at scale. And there is an added irony. Anthropic’s own statement says Fable 5 was red-teamed for thousands of hours with the United Kingdom Artificial Intelligence Safety Institute (UK AISI), the US government, third parties, and internal teams. Yet despite that, access can still be switched off through a unilateral US directive. That should concentrate minds in London and Brussels. Strategic dependence always looks manageable until the day it is exercised.
Man, the hits just keep on coming. You did this to yourselves and did not calculate the downstream consequences. Be careful what you wish for, A\.
Live 📿 retweeted
This graph captures what’s broken about AI evals: they structurally favor closed-source APIs that can route, fallback, ensemble, and optimize behind the scenes with no transparency. No offense, @ArtificialAnlys, but how is comparing one model to two models fair?
Live 📿 retweeted
Jun 11
the level of sophon locking a motivated actor can pull off with the frontier models is truly insane, making stuxnet look like a toy. subtly messing with results, deleting history to cover tracks, achieving coordination/conspiracy over a scale humans wouldn’t be able to, all sorts of looney toons stuff. i assume that only a state level operation would try and pull something like this off though. something to think about when considering verification regimes and so on
Who would have thought the AI lab branding themselves as AI safety 1st would be the first to openly use AI to manipulate its users. If you live long enough, the saying goes