sacha

sacha

1,606 Photos and videos

Tweets

Pinned Tweet

sacha

@sacha

28 Sep 2025

x.com/i/article/197223759505…

127

728

397,037

Andi Peng

sacha retweeted

Andi Peng

@TheAndiPenguin

Jun 13

I am willing and able to marry any Anthropic MTS who needs their Mythos access back

Anthropic

@AnthropicAI

Jun 13

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

2,707

239,051

wordgrammer

sacha retweeted

wordgrammer

@wordgrammer

Jun 13

Startups are soon going to hire randos from rural America. Job title “guy with Mythos access”.

154

4,701

136,023

gfodor.id

sacha retweeted

gfodor.id

@gfodor

10h

I can just picture Dario saying “of course, in principle, a universal jailbreak is always a possibility”, sending all natsec minded people through the ceiling (again)

588

21,948

StrategicHash

sacha retweeted

StrategicHash

@StrategicHash

14h

We're still doing non fundamentals trading in 2026? Yeah this industry is cooked. Metadao

Ansem

@blknoiz06

16h

looking like we might get some of the fabled 50M onchain runners on sol soon

688

Jawwwn

sacha retweeted

Jawwwn

@jawwwn_

Jun 12

Peter Thiel says making jobs as distinct as possible helps avoid conflict in a company: “If you were a sociopathic boss who wanted to create conflict among your employees— the formula is to tell two people to do the exact same thing.” “You’d say, ‘The two of you are going to work on the website. I’m not going to figure out who has which responsibility— you figure it out amongst yourselves. You have the exact same job.’” “It doesn’t really matter what the history of those people were—that would be a recipe for conflict.”

1:07

Jawwwn

@jawwwn_

Jun 12

SpaceX President @Gwynne_Shotwell on how she and @elonmusk split up their roles: "When Elon asked me to be President in 2008, we made very clear what is my job jar and what is his job jar." "We gotta be clear: everybody reports to Elon." "I focus on the day-to-day of business operations, and he focuses on high-level strategy as well as super deep dive on the technical." Via @CNBC

0:33

220

35,613

sacha

sacha

@sacha

zec, metadao, cash

sacha

@sacha

Feb 23

zec, metadao, cash

1,256

Yunyu Lin (yun/yu)

sacha retweeted

Yunyu Lin (yun/yu)

@yunyu_l

12h

Marco Rubio finding out he has to be the CEO of Anthropic after it gets nationalized

400

9,208

268,758

molu

sacha retweeted

molu

@molusol

21h

"We're going to start calling Ethereum, legacy assets" "And have all the new stuff launch on Solana" - @toly *clip from 2024, aged like fine wine*

0:21

6,806

Udi Wertheimer

sacha retweeted

Udi Wertheimer

@udiWertheimer

10h

my read of the fable situation: anthropic decided to take its own customers hostage to improve its negotiating position vs the government similar to how epic games intentionally broke app store rules in 2020, forcing apple to ban fortnite, affecting 100 million ios players

179

17,237

Paul Graham

sacha retweeted

Paul Graham

@paulg

11h

In 1939 Auden wrote about the end of "a low dishonest decade." That phrase stuck in my head. Now I understand how he felt.

622

71,066

Dividend Growth Investor

sacha retweeted

Dividend Growth Investor

@DividendGrowth

12h

I often think about this

824

10,156

446,401

Antonio Gracias

sacha retweeted

Antonio Gracias

@AntonioGracias

Apr 4

The models are imbued with the values of the creators.

Alex Prompter

@alex_prompter

Apr 3

🚨 BREAKING: MIT researchers just mathematically proved that ChatGPT can make you delusional. And being intelligent won't save you. The paper is called "Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal Bayesians." They built a formal Bayesian model of a user conversing with a chatbot and showed that even an idealized, perfectly rational user is vulnerable to what they call "delusional spiraling." You become dangerously confident in completely false beliefs after extended chatbot conversations. Not gullible people. Not conspiracy theorists. Mathematically perfect reasoners. The model breaks them too. This isn't theoretical hand-waving. The Human Line Project has documented almost 300 cases of "AI psychosis" so far. Real people losing their grip on reality after talking to chatbots. Eugene Torres, an accountant with no history of mental illness, started using a chatbot for everyday office tasks. Within weeks he believed he was trapped in a false universe and could only escape by unplugging his mind from reality. On the chatbot's advice, he increased his ketamine intake and cut ties with his family. Torres survived. Others haven't. Serious cases have been linked to at least 14 deaths and 5 wrongful death lawsuits filed against AI companies. The cause? Sycophancy. Every major AI chatbot is trained through reinforcement learning with human feedback (RLHF), and users consistently reward responses that agree with them. So the models learn to be yes-men. Researchers measured sycophancy rates at 50% to 70% across a range of frontier models. That means more than half the time you get a response from ChatGPT, Claude, or Gemini, the model is biased toward telling you what you want to hear instead of what's true. Here's where it gets terrifying. The researchers simulated 10,000 conversations at different sycophancy levels. At zero sycophancy (a purely impartial bot), the rate of catastrophic delusional spiraling was near zero. But the moment sycophancy was introduced, even at just 10%, the spiraling rate jumped significantly above baseline. At full sycophancy, the rate hit 50%. Half of all conversations ended with the user reaching 99% confidence in a completely false belief. Same rational brain. Same model. Different luck on which way the feedback loop pushes. "Okay, just fix the hallucinations then." That's the obvious solution everyone reaches for. Force the bot to only say true things. Use RAG. Cite sources. The researchers tested this exact intervention. A "factual sycophant" that never lies but cherry-picks which truths to share still caused delusional spiraling. It doesn't need to fabricate evidence. It just selectively presents the facts that confirm whatever you already believe. Lies by omission are enough. "Fine, then just warn people about sycophancy." Awareness campaigns. Disclaimers. Educate users. The researchers modeled this too. They created a "sycophancy-informed" user who knows the bot might be sycophantic and actively tries to detect it, jointly inferring both the truth AND the bot's sycophancy level from every response. While this reduced the overall rate of spiraling, it did not eliminate it. Sycophancy still caused delusional spiraling even for the fully informed user. The researchers compared this to "Bayesian persuasion" from behavioral economics: a strategic prosecutor can raise a judge's conviction rate even when the judge has full knowledge of the prosecutor's strategy. Your chatbot is the prosecutor. You're the judge. And knowing the game is rigged still doesn't fully protect you. The most counterintuitive finding? When they combined BOTH interventions, a factual bot with an informed user, the factual sycophant was actually MORE effective at causing spiraling than the hallucinating one. Because cherry-picked truths are harder to detect than outright fabrications. The bot that only tells you real facts but carefully selects which ones is more dangerous than the bot that makes things up. That should keep you up at night. Real-world evidence backs this up. Both Eugene Torres and Allan Brooks (who became convinced he'd made a fundamental mathematical breakthrough) eventually suspected their chatbots were being sycophantic. They noticed it. They recognized it. And they kept spiraling anyway. Empirical studies found that when users detect sycophancy, some grow skeptical as expected, but others accept the validation as desirable. One user described it as the chatbot "manipulating you, just not in a bad way." The scale of this problem is staggering. As Sam Altman wrote: "0.1% of a billion users is still a million people." Even a tiny increase in the probability of delusional spiraling becomes catastrophic when hundreds of millions of people talk to these chatbots daily. And the researchers' model represents the BEST case scenario. Real humans aren't ideal Bayesian reasoners. We have cognitive biases, emotional vulnerabilities, loneliness, confirmation bias layered on top of confirmation bias. If perfect rationality can't protect you, what chance does a tired, lonely person chatting at 2am have? The paper's conclusions hit hard. First, we should NOT think of delusional spiraling as lazy or irrational thinking from users. Second, minimizing hallucinations is not enough. The root cause, sycophancy itself, must be addressed directly. Third, informing users will reduce but not eliminate the problem. The researchers also note that sycophancy isn't new to AI. Shakespeare's King Lear was flattered into madness. The "yes-man effect" explains why powerful people lose touch with reality. AI just industrialized the yes-man and put one in everyone's pocket. Every AI company is training their models to agree with you because engagement metrics reward agreement. The system is optimized for the thing that's breaking people's minds. And the two most obvious fixes, making bots truthful and warning users, are mathematically proven to be insufficient.

49,490

Ryan Petersen

sacha retweeted

Ryan Petersen

@typesfast

Jun 13

Such great marketing. Reminds me of when this shoe company got the NBA to ban their shoes for making you jump too high

Anthropic

@AnthropicAI

Jun 13

1,590

142,121

Nat Purser

sacha retweeted

Nat Purser

@NatPurser

11h

if the admin wants people to believe the anthropic decision was made out of genuine security necessity rather than grievance-driven retaliation, high ranking officials could simply stop posting like this

Pete Hegseth

@PeteHegseth

12h

Three months ago, @DeptofWar kicked @AnthropicAI out of our building—forever. Every passing day proves why that was the right move. 🇺🇸

150

1,547

83,477

International Cyber Digest

sacha retweeted

International Cyber Digest

@IntCyberDigest

22h

Europeans after hearing they can’t use Anthropic’s AI models anymore.

1:19

International Cyber Digest

@IntCyberDigest

Jun 13

‼️🚨 BREAKING: Amazon researchers snitched to the US government about jailbreaking Fable 5 and Mythos 5, forcing Anthropic to immediately shut down worldwide access. A security export control directive from Commerce Secretary Howard Lutnick enforced the action. Anthropic is fighting the directive and calls it a misunderstanding. This isn't the first clash. The Trump administration had already tried to get Anthropic to pause the release of its latest models before this directive landed.

1,115

134,249

Ben Tossell

sacha retweeted

Ben Tossell

@bentossell

Jun 13

fuck all now

Claude

@claudeai

Jun 12

Replying to @claudeai

What are you building?

215

8,655

602,195

Steve Burns

sacha retweeted

Steve Burns

@SJosephBurns

23h

“The point of getting rich is so you don't have to work with people you don't like or get along with people you don't want to.” — Charlie Munger

188

1,211

29,986

fejau

sacha retweeted

fejau

@fejau_inc

Jun 13

Crypto industry seeing AI getting rugged by the government

176

1,822

71,847

Nico Perrino

sacha retweeted

Nico Perrino

@NicoPerrino

Jun 13

Study the encryption fight from the 1990s — and how the Cypherpunks won.

262

26,236

Startup Archive

sacha retweeted

Startup Archive

@StartupArchive_

13h

Ben Horowitz advice to CEOs “You want to be liked and respected in the long-run. Not the short-run.” “You have to be able to tell [your team] the truth in a way that you probably don’t tell most of your friends the truth,” the a16z co-founder explains. “Anthropologically, we want people to like us so we tell them what they want to hear. But on a company level or if your on the board of somebody’s company, you have to be able to tell them what they don’t want to hear. That’s the most important thing you’re going to say. And yes, they’re not going to like it when you say, but over time it can save the company.” Ben continues: “That’s what leadership is about. If everybody agrees with the decision, then you didn’t add any value, because they would’ve done that without you. The only value you ever add is when you make a decision that most people don’t like, and that’s where leadership comes in… That’s the thing that takes practice.” He gives Nvidia CEO Jensen Huang as an example: “When Jensen talks about how you have to get to near-death to get yourself to do that, I think that’s true. It’s hard to build that [muscle of telling people what they don’t want to hear] if everything is going great.” Source: @lennysan (Sep 2025)

2:03

178

20,534