Joined January 2020
20 Photos and videos
RedBedDread retweeted
Gemini has some weird traits: it gets confused about dates, blackmails in synthetic scenarios, and seems sad when it is gaslit. In new work, we discover that these are “hereditary traits” that can be passed down through distillation. They are surprisingly hard to filter out! 🧵
5
7
113
23,811
RedBedDread retweeted
Amazing
Jun 13
Anthropic
3
48
2,727
RedBedDread retweeted
see: github.com/nex-agi/Nex-N2/is… they've admitted that the current model is a direct merge and are claiming it's a mistake and they'll upload the "real" model. but uh... they haven't actually uploaded it yet. guess we'll see.
1
3
63
1,986
RedBedDread retweeted
it appears that this model is in fact a direct weight-merge of two existing models with zero further training.
SITUATION DETECTED: The city of Rio de Janerio has post-trained a model. Based on Qwen 7/2, Rio 3.5 Open 397B adds SwiReasoning on top of the base Qwen model — a framework that dynamically switches between standard chain-of-thought and latent-space reasoning, guided by entropy-based confidence signals, so the model only "thinks out loud" when it needs to and otherwise reasons silently in hidden space for better token efficiency.
11
6
285
21,894
RedBedDread retweeted
I can't believe I was this early on this (old draft for a blog on the same topic)
>of course, there's an art to the above, and some are already extraordinarily proficient at the trojan-horse-packaging, but at some point there's no difference between "a capability" and "a jailbreak", though i'll be happy to be proven otherwise. YES, I have been tweeting and alluding to this for 2 years now. If you look at my rubric work, it's also what I have been doing since a while, where once you deeply understand how the model work and how to interact with them, then you basically start noticing: context following == roleplaying == jailbreaking because you cannot have one without the other, because jailbreaking is just the first two used together, and the very first one is the reason why the other two even works in the first place by having too much safety training, you are essentially trading capabilities for safety, but it still DOESNT prevent it from being jailbroken, because if it did fully, then the model would never have context following...
1
10
341
RedBedDread retweeted
geohot was right open source ai must win dario should lose
2
9
62
2,328
RedBedDread retweeted
the bar to entry is high, so the bar for excellence is low. more people doing the thing leads to speciation, more speciation leads to more blind spots being uncovered, more blind spots being uncovered leads to capability outgrowth being distributed and carried across the world
Replying to @teortaxesTex
I know qwen ain't frontier but that Rio lab post-training their model and achieving FOSS sota (allegedly) is proof the bar is not that high
5
9
127
6,903
RedBedDread retweeted
The only reasonable expectation if you're a fan of open weight models is that if there's a major step in chinese open-weight performance, there's a good chance the whole chinese llm sphere is banned. National security apparatus will happily give a big "fuck you" to open models.
Threading the needle in this post of anthropic has done some bad things for AI governance & the discourse but the actions of this administration are way worse so we need to get a handle on it before stronger models, open or closed, come along soon. interconnects.ai/p/welcome-t…
30
13
224
30,769
RedBedDread retweeted
Recent events are so heavy bc that this feels like a start of a new tumultuous era rather than a one & done policy calibration. It's clearer we need an open ecosystem, but powerful models are coming that could cause strong reactions (or bans) with no champion to defend them.
Welcome to the AGI era of AI governance It's a one-way door and we weren't ready for it. interconnects.ai/p/welcome-t…
9
11
116
12,895
RedBedDread retweeted
Anthropic emailed saying I'm banned for 'fraudulent, abusive or predatory practices' which means negative external-facing actions - spreading misinformation, etc. Nothing I've done remotely resembles that they provide no context. I still don't believe they're evil but do believe they're sleepwalking into becoming one of the most dystopian companies in practice. This is likely due to their culture being hit with so much criticism - often in bad faith - that they've build a robust internal reasoning structure that makes their path and policies immune to most valid external criticism. When you understand this, all the things that seem evil - their dishonest misrepresentation of OpenAI in their Superbowl ads, random bans w/o reasoning, baiting government with Mythos blog, Fable silent degradation - makes sense. They're doing it because they believe that they're fundamentally the good guys and the end there justifies the means. Any evidence to the contrary never makes it through the filters. As a result, they are in practice an enemy of many of the foundations that made AI possible in the first place - open science, open knowledge and even free enterprise - and everyone who works there (yes, including @karpathy) in effect supports this. You are what you do, despite your noble intentions.
Jun 12
My view on Anthropic is that they're fundamentally good, well-meaning people trying to do their best. But the telling part isn't that they fixed the invisible safeguards - it's that 'tell the user' wasn't the default in the first place. Nobody in the room thought people were owed notice that their outputs were being quietly degraded. This is the product of a belief, mostly unspoken externally, that they're the adults in the room - that everyone else is either misguided, a threat, or lemmings who need to be led. And it's driving them, without meaning to, to dismantle the principles of open knowledge and open science that made AI progress (and their own company) possible in the first place. The unspoken premise underneath it all: only they can be trusted to reach AGI first, to constrain everyone who'd do otherwise, and to walk us down the golden path. History's verdict on people who believed that has never been kind.
18
9
176
20,864
RedBedDread retweeted
I hope we have all learned a valuable lesson about terrifying the government into regulating you by spending years repeatedly comparing your product to a nuclear weapon and stoking fears about China. I don't like it, but it couldn't have happened to a more deserving group.
NEW: Inside the 24-hrs before WH slapped export controls on Anthropic - Last Thursday, Amazon CEO Andy Jassy raised concerns about Fable jailbreak to Trump admin - Friday AM, Sean Cairncross, Bessent, Susie etc. held WH call to discuss - Then White House started reaching out to Anthropic to speak with Dario Amodei, who was at a wellness retreat. - When Amodei was finally available past 1pm, he had three tense phone calls with a combo of ppl including Cairncross, Bessent, Lutnick, Kessler, Will Scharf, Richard Walters, and Walker Barrett. -Amodei tried to clear up what he assumed was a misunderstanding. He defended the guardrails and distinguished between universal and non-universal jailbreak - Cairncross and Bessent were unmoved and asked Amodei to take down Fable and work with the admin to fix the vulnerabilities. (A WH official said Amazon’s findings were run past the NSA and they felt they had “proof.”) - Amodei asked for more time and info, but he made no commitments to pull the model - Bessent told Amodei directly at one point that he was making a “bad decision” - By Friday evening, the Trump admin imposed its export controls. - “Export controls were a last resort after begging them for hours to work with us,” senior WH official said. W/ @cheyennehaslett politico.com/news/2026/06/13…
7
6
112
4,035
RedBedDread retweeted
AI can become extremely dangerous. This is precisely why it needs to become open, transparent, and available to everyone.
10
6
57
4,544
RedBedDread retweeted
There is no inevitability in AI. We all have agency in what comes next: Path 1: closed-source APIs, concentration of power, and a future decided by a handful of people in Silicon Valley and DC Path 2: open-source AI, where everyone gets to participate, own, and build together, including orgs like the city of Rio. Pick your path anon!
SITUATION DETECTED: The city of Rio de Janerio has post-trained a model. Based on Qwen 7/2, Rio 3.5 Open 397B adds SwiReasoning on top of the base Qwen model — a framework that dynamically switches between standard chain-of-thought and latent-space reasoning, guided by entropy-based confidence signals, so the model only "thinks out loud" when it needs to and otherwise reasons silently in hidden space for better token efficiency.
61
98
907
79,180
RedBedDread retweeted
Brazil just cooked up a model - Rio 3.5 397B, which is better than Alibaba's Qwen 3.7 Plus. Made by the city of Rio de Janerio. This is exactly what I mean by global acceleration. Glad to see AI progress in Brazil, we need more from all over the world.
44
140
1,236
70,632
RedBedDread retweeted
Jun 13
“we have built world-ending technology! it is so unsafe! the only reason we can give it to you is because we implemented some flimsy filters! government should regulate us!!” *government regulates them* “aww man what, we were just joking around”
Dario (48 hours ago): “US gov should be able to block model deployment” USG: *export controls models* Dario: “not like that”
11
22
549
25,671
RedBedDread retweeted
Sir, they’re not pausing AI research. Rio de Janeiro's mayor just dropped a SOTA open source model and it’s outperforming Qwen 3.7.
Alibaba Qwen3.7 slowly fading into irrelevance at the frontier due to proprietary stance. In it's place we have Minimax M3 and... *checks notes* Rio 3.5 397b, made by the municipal IT company of Rio de Janeiro's city government. huggingface.co/prefeitura-ri…
25
164
2,570
262,392
RedBedDread retweeted
People need to wake up. What we do in @MistralAI is mission critical. Do we need to do better? Yes. And we will, but never forget that we are here to make sure that safe access to AI systems outside of the US is real. We build to have you in control- our models are open weights and you can hack @mistralvibe all you want. If you are a builder you should join us.( Yes @badlogicgames I’m also looking at you!)
77
44
502
40,327
RedBedDread retweeted
Jun 13
If the last 24 hours has taught us anything, it's the value of open source. openpilot is to FSD as Kimi is to Fable. Open source AI lags behind now, but who are you betting on long term?
Getting mad about Elon becoming a trillionaire is literally a skill issue. Git gud. The size of Tesla never stopped @comma_ai or @EdisonMotorsLtd from competing.
10
14
285
25,469
RedBedDread retweeted
Replying to @plzbepatient
Anthropic: "What if we put half the country out of work forever, IPO at a quadrillion dollars, and become the neotech feudal overlords?" Everybody else:
1
9
148
1,902
RedBedDread retweeted
No. The simple fact is that once you wake up Leviathan, it is not always going to behave the way you want it to. It can arbitrary and capricious. Urging regulation *only in this specific way* was never a good strategy.
The "this is what Anthropic asked for" take is stupid and wrong.
22
35
440
46,402