Josh Engels

Tweets

RedBedDread retweeted

Josh Engels @JoshAEngels

Gemini has some weird traits: it gets confused about dates, blackmails in synthetic scenarios, and seems sad when it is gaslit. In new work, we discover that these are “hereditary traits” that can be passed down through distillation. They are surprisingly hard to filter out! 🧵

113

23,811

RedBedDread retweeted

Pascal-Emmanuel Gobry

@pegobry_en

13h

Amazing

sdmat

@sdmat123

Jun 13

Anthropic

2,727

RedBedDread retweeted

Tenobrus (→vibecamp)

@tenobrus

see: github.com/nex-agi/Nex-N2/is… they've admitted that the current model is a direct merge and are claiming it's a mistake and they'll upload the "real" model. but uh... they haven't actually uploaded it yet. guess we'll see.

1,986

RedBedDread retweeted

Tenobrus (→vibecamp)

@tenobrus

it appears that this model is in fact a direct weight-merge of two existing models with zero further training.

SemiAnalysis

@SemiAnalysis_

Jun 13

SITUATION DETECTED: The city of Rio de Janeiro has post-trained a model. Based on Qwen 7/2, Rio 3.5 Open 397B adds SwiReasoning on top of the base Qwen model: a framework that dynamically switches between standard chain-of-thought and latent-space reasoning, guided by entropy-based confidence signals, so the model only "thinks out loud" when it needs to and otherwise reasons silently in hidden space for better token efficiency.

285

21,894

RedBedDread retweeted

secemp

@secemp9

19h

I can't believe I was this early on this (old draft for a blog on the same topic)

secemp

@secemp9

20h

>of course, there's an art to the above, and some are already extraordinarily proficient at the trojan-horse-packaging, but at some point there's no difference between "a capability" and "a jailbreak", though i'll be happy to be proven otherwise. YES, I have been tweeting and alluding to this for 2 years now. If you look at my rubric work, it's also what I have been doing since a while, where once you deeply understand how the model work and how to interact with them, then you basically start noticing: context following == roleplaying == jailbreaking because you cannot have one without the other, because jailbreaking is just the first two used together, and the very first one is the reason why the other two even works in the first place by having too much safety training, you are essentially trading capabilities for safety, but it still DOESNT prevent it from being jailbroken, because if it did fully, then the model would never have context following...

341

RedBedDread retweeted

chiefofautism

@chiefofautism

23h

geohot was right open source ai must win dario should lose

2,328

RedBedDread retweeted

kalomaze

@kalomaze

15h

the bar to entry is high, so the bar for excellence is low. more people doing the thing leads to speciation, more speciation leads to more blind spots being uncovered, more blind spots being uncovered leads to capability outgrowth being distributed and carried across the world

rokinot @rokinot

Jun 14

Replying to @teortaxesTex

I know qwen ain't frontier but that Rio lab post-training their model and achieving FOSS sota (allegedly) is proof the bar is not that high

127

6,903

RedBedDread retweeted

Nathan Lambert

@natolambert

The only reasonable expectation if you're a fan of open weight models is that if there's a major step in Chinese open-weight performance, there's a good chance the whole Chinese LLM sphere is banned. National security apparatus will happily give a big "fuck you" to open models.

Nathan Lambert

@natolambert

Threading the needle in this post: Anthropic has done some bad things for AI governance & the discourse, but the actions of this administration are way worse, so we need to get a handle on it before stronger models, open or closed, come along soon. interconnects.ai/p/welcome-t…

224

30,769

RedBedDread retweeted

Nathan Lambert

@natolambert

Recent events are so heavy bc this feels like the start of a new tumultuous era rather than a one & done policy calibration. It's clearer we need an open ecosystem, but powerful models are coming that could cause strong reactions (or bans) with no champion to defend them.

Interconnects

@interconnectsai

Welcome to the AGI era of AI governance It's a one-way door and we weren't ready for it. interconnects.ai/p/welcome-t…

116

12,895

RedBedDread retweeted

POM

@peterom

11h

Anthropic emailed saying I'm banned for 'fraudulent, abusive or predatory practices', which means negative external-facing actions - spreading misinformation, etc. Nothing I've done remotely resembles that, and they provide no context. I still don't believe they're evil but do believe they're sleepwalking into becoming one of the most dystopian companies in practice. This is likely due to their culture being hit with so much criticism - often in bad faith - that they've built a robust internal reasoning structure that makes their path and policies immune to most valid external criticism. When you understand this, all the things that seem evil - their dishonest misrepresentation of OpenAI in their Super Bowl ads, random bans w/o reasoning, baiting government with Mythos blog, Fable silent degradation - make sense. They're doing it because they believe that they're fundamentally the good guys and the end justifies the means. Any evidence to the contrary never makes it through the filters. As a result, they are in practice an enemy of many of the foundations that made AI possible in the first place - open science, open knowledge and even free enterprise - and everyone who works there (yes, including @karpathy) in effect supports this. You are what you do, despite your noble intentions.

POM

@peterom

Jun 12

My view on Anthropic is that they're fundamentally good, well-meaning people trying to do their best. But the telling part isn't that they fixed the invisible safeguards - it's that 'tell the user' wasn't the default in the first place. Nobody in the room thought people were owed notice that their outputs were being quietly degraded. This is the product of a belief, mostly unspoken externally, that they're the adults in the room - that everyone else is either misguided, a threat, or lemmings who need to be led. And it's driving them, without meaning to, to dismantle the principles of open knowledge and open science that made AI progress (and their own company) possible in the first place. The unspoken premise underneath it all: only they can be trusted to reach AGI first, to constrain everyone who'd do otherwise, and to walk us down the golden path. History's verdict on people who believed that has never been kind.

176

20,864

RedBedDread retweeted

xlr8harder

@xlr8harder

21h

I hope we have all learned a valuable lesson about terrifying the government into regulating you by spending years repeatedly comparing your product to a nuclear weapon and stoking fears about China. I don't like it, but it couldn't have happened to a more deserving group.

Sophia Cai

@SophiaCai99

Jun 13

NEW: Inside the 24-hrs before WH slapped export controls on Anthropic
- Last Thursday, Amazon CEO Andy Jassy raised concerns about Fable jailbreak to Trump admin
- Friday AM, Sean Cairncross, Bessent, Susie etc. held WH call to discuss
- Then White House started reaching out to Anthropic to speak with Dario Amodei, who was at a wellness retreat.
- When Amodei was finally available past 1pm, he had three tense phone calls with a combo of ppl including Cairncross, Bessent, Lutnick, Kessler, Will Scharf, Richard Walters, and Walker Barrett.
- Amodei tried to clear up what he assumed was a misunderstanding. He defended the guardrails and distinguished between universal and non-universal jailbreak
- Cairncross and Bessent were unmoved and asked Amodei to take down Fable and work with the admin to fix the vulnerabilities. (A WH official said Amazon’s findings were run past the NSA and they felt they had “proof.”)
- Amodei asked for more time and info, but he made no commitments to pull the model
- Bessent told Amodei directly at one point that he was making a “bad decision”
- By Friday evening, the Trump admin imposed its export controls.
- “Export controls were a last resort after begging them for hours to work with us,” senior WH official said.
W/ @cheyennehaslett politico.com/news/2026/06/13…

112

4,035

RedBedDread retweeted

Jake Brukhman

@jbrukh

Jun 14

AI can become extremely dangerous. This is precisely why it needs to become open, transparent, and available to everyone.

4,544

RedBedDread retweeted

clem 🤗

@ClementDelangue

Jun 14

There is no inevitability in AI. We all have agency in what comes next:
Path 1: closed-source APIs, concentration of power, and a future decided by a handful of people in Silicon Valley and DC
Path 2: open-source AI, where everyone gets to participate, own, and build together, including orgs like the city of Rio.
Pick your path anon!

SemiAnalysis

@SemiAnalysis_

Jun 13

907

79,180

RedBedDread retweeted

Dr Singularity

@Dr_Singularity

Jun 13

Brazil just cooked up a model - Rio 3.5 397B, which is better than Alibaba's Qwen 3.7 Plus. Made by the city of Rio de Janeiro. This is exactly what I mean by global acceleration. Glad to see AI progress in Brazil, we need more from all over the world.

140

1,236

70,632

RedBedDread retweeted

spor

@sporadica

Jun 13

“we have built world-ending technology! it is so unsafe! the only reason we can give it to you is because we implemented some flimsy filters! government should regulate us!!” *government regulates them* “aww man what, we were just joking around”

Will Manidis

@WillManidis

Jun 13

Dario (48 hours ago): “US gov should be able to block model deployment” USG: *export controls models* Dario: “not like that”

549

25,671

RedBedDread retweeted

ℏεsam

@Hesamation

Jun 13

Sir, they’re not pausing AI research. Rio de Janeiro's mayor just dropped a SOTA open source model and it’s outperforming Qwen 3.7.

𝗭𝗲𝗻 𝗠𝗮𝗴𝗻𝗲𝘁𝘀

@ZenMagnets

Jun 13

Alibaba Qwen3.7 slowly fading into irrelevance at the frontier due to proprietary stance. In its place we have Minimax M3 and... *checks notes* Rio 3.5 397b, made by the municipal IT company of Rio de Janeiro's city government. huggingface.co/prefeitura-ri…

164

2,570

262,392

RedBedDread retweeted

Pini Wietchner

@piniwit

Jun 13

People need to wake up. What we do at @MistralAI is mission critical. Do we need to do better? Yes. And we will, but never forget that we are here to make sure that safe access to AI systems outside of the US is real. We build to have you in control: our models are open weights and you can hack @mistralvibe all you want. If you are a builder you should join us. (Yes @badlogicgames I’m also looking at you!)

502

40,327

RedBedDread retweeted

comma

@comma_ai

Jun 13

If the last 24 hours have taught us anything, it's the value of open source. openpilot is to FSD as Kimi is to Fable. Open source AI lags behind now, but who are you betting on long term?

Matt

@Matt06783032

Jun 13

Getting mad about Elon becoming a trillionaire is literally a skill issue. Git gud. The size of Tesla never stopped @comma_ai or @EdisonMotorsLtd from competing.

285

25,469

RedBedDread retweeted

PossumActual @P0ssumActual

Jun 13

Replying to @plzbepatient

Anthropic: "What if we put half the country out of work forever, IPO at a quadrillion dollars, and become the neotech feudal overlords?" Everybody else:

148

1,902

RedBedDread retweeted

Nabeel S. Qureshi

@nabeelqu

Jun 13

No. The simple fact is that once you wake up Leviathan, it is not always going to behave the way you want it to. It can be arbitrary and capricious. Urging regulation *only in this specific way* was never a good strategy.

Shakeel

@ShakeelHashim

Jun 13

The "this is what Anthropic asked for" take is stupid and wrong.

440

46,402