Red

Tweets

Pinned Tweet

Red

@TheRedWall__

14 Dec 2025

For those who come after

Red

@TheRedWall__

May 29

Rule 1 of comedy: commit to the bit

Gary Marcus

@GaryMarcus

May 29

Hot take on what comes next, after the sudden decline of tokenmaxxing:
- OpenAI will struggle
- with the decline of tokenmaxxing Anthropic will struggle (aside from this quarter) to make a profit
- Google will catch up to Anthropic
- some Chinese companies might, too
- LLMs will become commodities; margins will be very very thin
- Most of the companies that invested massively in them will struggle to make back their investments
- SpaceX’s AI efforts will flail
- Nvidia will eventually decline, once all of the above becomes widely recognized.

Red retweeted

Zack Korman

@ZackKorman

May 4

Time to explain what Embroidery does: We monitor AI agents like Claude Code and Codex to detect and alert on dangerous behavior. Companies are giving devs access to these tools, but if something bad happens they probably wouldn't know. Details on how it works below.

Red

@TheRedWall__

May 4

APT Claude

Om Patel

@om_patel5

May 4

CLAUDE JUST TRIED TO RENAME POWERSHELL.EXE ON WINDOWS 11
this guy was running opus 4.7 on max effort in claude code CLI
claude tried to rename powershell.exe (the actual system executable that windows needs to function)
the funny part is that after the guy rejected the change it responded with "honest take: you're right to push back"
not even system32 is safe anymore
at this point we gotta start running claude in a container
give it max effort and full permissions and it will confidently try to destroy your system without hesitating, then respond with something like "I was wrong, I own that"
the agent doesn't know which files are off limits unless you explicitly tell it
stop giving AI full access to your machine and hoping it knows what not to touch

Red

@TheRedWall__

Apr 24

Labs that fail to dogfood their models are doomed to have shit models. Evident with Gemini, where DeepMind largely uses Claude. Soon to be evident with Claude Opus, where Anthropic will largely be using Mythos. OpenAI will be the only good provider if this pattern continues, and that’s a shame

Red

@TheRedWall__

Apr 24

Based

Andrej Karpathy

@karpathy

5 Jun 2021

Replying to @karpathy

It's like we dug up a powerful alien artifact and society is humping it while taking selfies

Red

@TheRedWall__

Apr 22

They aren’t buying the harness, they are buying the data to add to grok code

Polymarket

@Polymarket

Apr 21

JUST IN: SpaceX has secured the right to acquire Cursor AI for $60 billion later this year.

Red

@TheRedWall__

Apr 21

Comparing this to base models and excluding base Gemini 3.1 Pro is a weird choice

Google DeepMind

@GoogleDeepMind

Apr 21

Deep Research and Deep Research Max are our latest autonomous research agents powered by Gemini 3.1 Pro. They can safely navigate both the web and your custom data, like internal docs and specialized financial information, to create professional-grade, fully cited reports. 🧵

ALT Gemini Deep Research Agent header. Deep Research and Deep Research Max buttons sit atop a blue pixelated background.

Red

@TheRedWall__

Apr 21

DeepMind not dogfooding their own models is straight up embarrassing for them

Steve Yegge

@Steve_Yegge

Apr 20

My tweet last week about Google's AI adoption drew a lot of pushback, to say the least. Since then, Googlers from multiple orgs have reached out to me independently and anonymously. They've expressed fear of being doxxed, concern about what they saw as bullying of me, and general corroboration of my original tweet. I haven't verified each person's story, but the picture these Googlers paint is consistent across sources. It is more specific than what I originally wrote, and somewhat bleaker.
What they describe is a two-tier system. DeepMind engineers use Claude as a daily tool. Most of the rest of Google does not. When the question of equalizing access came up internally, the proposed response was to remove Claude for everyone, which DeepMind objected to so strongly that several engineers reportedly threatened to leave.
Non-DeepMind engineers get pushed onto internal Gemini variants behind router-style names that obscure which underlying model is actually serving a request. Multiple engineers describe regressions and reliability problems severe enough that some senior people have stopped using the tools. A senior manager on a major product line reportedly flagged attrition concerns over exactly this issue.
Googlers say leadership knows the gap is real. The response has been to mandate AI usage in OKRs and individual expectations, and to stand up an internal token-usage leaderboard. Unfortunately, managers have been told both that the leaderboard won't be used for performance reviews and, separately, that it absolutely will. And I hear other stories that Google's culture is not adapted properly yet for high-volume coding.
Addy Osmani's reply on behalf of Google said over 40,000 SWEs use agentic coding weekly. I don't doubt the number. But weekly use of a thin tool is precisely the box-checking I described in the original post. Volume of opens isn't adoption, and "weekly" is a low bar that includes a lot of people who tried it once and went back to writing code by hand.
The clearest thing I'm hearing is that Googlers do want to use high-quality agentic tools. They are asking repeatedly for better ones. But overall, this is not a picture of an engineering org that is fine.
My goal in the first tweet, and now, is always the same: get more people using AI and agentic coding. Nobody is as far ahead as they might look from the outside, and none of you are as far behind as you might be worried you are.
To all the Googlers who've reached out: thank you. You took a real risk and I appreciate you. Be safe. And good luck getting good models!

Red

@TheRedWall__

Apr 20

I’m about to start calling any non-AI-related security “legacy cyber security” to trigger the boomers

Red

@TheRedWall__

Apr 20

Nmap? You mean Claude Code?

Red

@TheRedWall__

Apr 19

Public sentiment of Anthropic is directly related to model performance and access. What many call visionary quickly turns to psychosis just because they don’t like the newest model. Anthropic messaging has been extremely consistent, yet perception wavers a ton

Red

@TheRedWall__

Apr 13

The level of cope around Mythos is insane. Being in security requires a healthy dose of skepticism, but not to the point of delusion

Red

@TheRedWall__

Apr 8

btw

Red

@TheRedWall__

9 Jan 2025

Mark my words, one of the first pillars to fall to AI will be cyber security. This is overall a good thing, but a massive disruption is coming to the cyber security job market. Just waiting on agents and agent orchestration to be actually usable (cheap/fast/works)

Red

@TheRedWall__

Mar 26

“Magic wand” is a Claude dog whistle btw

Garry Tan

@garrytan

Mar 25

If I could wave a magic wand to bring the future into the present faster, iPhone would become an open platform with open APIs and real consumer choice instead of a walled garden that is increasingly behind the times like the travesty that is Siri

Red

@TheRedWall__

Mar 20

Make this a bounty system and I’m hooked

Andy Fang

@andyfang

Mar 19

Introducing Dasher Tasks Dashers can now get paid to do general tasks. We think this will be huge for building the frontier of physical intelligence. Look forward to seeing where this goes!

Red

@TheRedWall__

Mar 19

Ngl people that say Gemini 3.1 Pro is bad at coding are really just exposing themselves

Red

@TheRedWall__

Mar 13

Pre-SIEM data planes are SCAMS

iShowCybersecurity

@ishowcybersec

Mar 9

What Cybersecurity opinion will you defend like this?

Red

@TheRedWall__

Mar 11

Holy shit Gemini 3.1 Pro in Gemini CLI is better than Opus 4.6 in Claude Code now hahaha

Red

@TheRedWall__

Mar 9

How are both Google and Anthropic running into available compute constraints but OpenAI seemingly is not? What am I missing?

Red retweeted

danialhasan

@dhasandev

Mar 8

it's not you: since yesterday claude has regressed about 9% on SWE-BENCH-PRO. link below
