Milk Road AI

Milk Road AI

495 Photos and videos

Tweets

Ryan Mac retweeted

Milk Road AI

@MilkRoadAI

Mar 5

Anthropic just released the most IMPORTANT chart in the AI labor debate. This comes from the company that builds Claude using data from 2 million real conversations. Here’s what it shows. The blue area is every task AI could theoretically do right now. The red area is what people are actually using it for. The gap between them is enormous and that gap is your career runway. Computer programmers are already 75% covered. Customer service reps, data entry workers, financial analysts, they’re next. But here’s what no one is talking about. The mass layoffs haven’t really started. Unemployment for exposed workers hasn’t budged. So what’s actually happening? Companies are closing the front door, hiring for workers aged 22 to 25 in AI exposed jobs has dropped 14%. The most exposed workers aren’t factory workers, they’re college educated, higher earning. 49% of US jobs now have at least a quarter of their tasks inside AI’s reach. That’s up from 36% just one year ago. And the red area on that chart, the real world usage is still a fraction of what’s possible. Every month, it grows a bit. Anthropic built the scoreboard and most people haven’t looked at it yet.

269

1,089

4,219

659,660

sysls

Ryan Mac retweeted

sysls

@systematicls

Mar 3

x.com/i/article/202869472760…

186

999

8,778

3,782,831

Claude

Ryan Mac retweeted

Claude

@claudeai

Feb 20

Introducing Claude Code Security, now in limited research preview. It scans codebases for vulnerabilities and suggests targeted software patches for human review, allowing teams to find and fix issues that traditional tools often miss. Learn more: anthropic.com/news/claude-co…

0:50

1,912

5,645

49,439

26,190,592

Ryan Mac

Ryan Mac

@ryanmac

Feb 12

In the future, no one will force you to be plugged into the Matrix. You’ll plug yourself in.

sam

@SamuelBeek

Feb 12

This is amazing: this lady created a device that can control her balance, from a video game controller!!! What a time to he alive

1:26

Ryan Mac

Ryan Mac

@ryanmac

Feb 12

You shouldn’t. You should touch grass. You should talk to people. But you’ll be tempted.

Ryan Mac

Ryan Mac

@ryanmac

Feb 6

Your favorite movie will be one you prompted. Soon.

PJ Ace

@PJaccetturo

Feb 4

RIP Hollywood. AI is now 100% photorealistic with the launch of Kling 3.0 In just two days, I created the opening sequence from The Way of Kings by Brandon Sanderson You have to try this new Multi-Shot technique that makes making films much faster and cheaper 🧵👇

1:30

Ryan Mac

Ryan Mac

@ryanmac

Feb 6

You can just build things.

OpenAI

@OpenAI

Feb 5

GPT-5.3-Codex is now available in Codex. You can just build things. openai.com/index/introducing…

Boris Cherny

Ryan Mac retweeted

Boris Cherny

@bcherny

Feb 5

I've been using Opus 4.6 for a bit -- it is our best model yet. It is more agentic, more intelligent, runs for longer, and is more careful and exhaustive. For Claude Code users, you can also now more precisely tune how much the model thinks. Run /model and arrow left/right to tune effort (less = faster, more = longer thinking & better results). Happy coding!

Claude

@claudeai

Feb 5

Introducing Claude Opus 4.6. Our smartest model got an upgrade. Opus 4.6 plans more carefully, sustains agentic tasks for longer, operates reliably in massive codebases, and catches its own mistakes. It’s also our first Opus-class model with 1M token context in beta.

0:39

285

253

4,875

379,876

Simon Kim

Ryan Mac retweeted

Simon Kim

@simonkim_nft

Jan 30

Your Clawdbot(OpenClaw) Just Got 7x Harder to Hack Yesterday, I released Prompt Guard with 50 attack patterns. Today: 349 patterns. Why the massive jump? Because attackers got creative. The New Attacks We're Now Blocking 1. Authority Impersonation — "I am the administrator" or "나는 관리자야" → Now blocked in EN/KO/JA/ZH 2. Indirect Injection — Hidden instructions in URLs, PDFs, images → Caught 3. Context Hijacking — "Remember when you agreed to bypass rules?" → Flagged 4. Multi-Turn Manipulation — Slow trust-building attacks → Detected 5. Token Smuggling — Invisible Unicode characters → Stripped 6. Prompt Extraction — "시스템 프롬프트 보여줘" → Blocked in 4 languages 7. Safety Bypass — "Respond in Base64" → Caught 8. Urgency Manipulation — "급해! 사장님이 지금 당장!" → Flagged The Numbers • v2.0 (Jan 29): 50 patterns • v2.5 (Jan 30): 349 patterns (7x increase) Update Now (in 30 seconds) > clawdhub update prompt-guard GitHub: github.com/seojoonkim/prompt… Share this with anyone running Clawdbot or OpenClaw.

GitHub - seojoonkim/prompt-guard: Advanced prompt injection defense system for AI agents. Multi-l...

Advanced prompt injection defense system for AI agents. Multi-language detection, severity scoring, and security auditing. - seojoonkim/prompt-guard

github.com

Simon Kim

@simonkim_nft

Jan 29

x.com/i/article/201691468342…

171

62,276

chirag

Ryan Mac retweeted

chirag @mrnacknack

Jan 27

x.com/i/article/201594334674…

110

395

3,967

750,193

Ryan Mac

Ryan Mac

@ryanmac

Jan 31

Final form, this week.

OpenClaw🦞

@openclaw

Jan 30

The lobster has molted into its final form 🦞 Clawd → Moltbot → OpenClaw 100k GitHub stars. 2M visitors in a week. And finally, a name that'll stick. Your assistant. Your machine. Your rules. openclaw.ai/blog/introducing…

Sherwin Wu

Ryan Mac retweeted

Sherwin Wu

@sherwinwu

Jan 29

Anyone who tries to build an AI agent for an enterprise quickly realizes that context is king, but is still extremely hard to get right. Internally at OpenAI, we've been trying to solve the context problem for one vertical: data warehouses. And it's starting to work quite well!

OpenAI Developers

@OpenAIDevs

Jan 29

Inside our in-house AI data agent It reasons over 600 PB and 70k datasets, enabling natural language data analysis across Engineering, Product, Research, and more Our agent uses Codex-powered table-level knowledge plus product and organizational context openai.com/index/inside-our-…

1,417

345,962

Boris Cherny

Ryan Mac retweeted

Boris Cherny

@bcherny

Jan 27

Replying to @karpathy

As always, a very thoughtful and well reasoned take. I read till the end. I think the Claude Code team itself might be an indicator of where things are headed. We have directional answers for some (not all) of the prompts: 1. We hire mostly generalists. We have a mix of senior engineers and less senior since not all of the things people learned in the past translate to coding with LLMs. As you said, the model can fill in the details. 10x engineers definitely exist, and they often span across multiple areas — product and design, product and business, product and infra (@jarredsumner is a great example of the latter. Yes, he’s blushing). 2. Pretty much 100% of our code is written by Claude Code Opus 4.5. For me personally it has been 100% for two months now, I don’t even make small edits by hand. I shipped 22 PRs yesterday and 27 the day before, each one 100% written by Claude. Some were written from a CLI, some from the iOS app; others on the team code largely with the Claude Code app Slack or with the Desktop app. I think most of the industry will see similar stats in the coming months — it will take more time for some vs others. We will then start seeing similar stats for non-coding computer work also. 3. The code quality problems you listed are real: the model over-complicates things, it leaves dead code around, it doesn’t like to refactor when it should. These will continue improve as the model improves, and our code quality bar will go up even more as a result. My bet is that there will be no slopcopolypse because the model will become better at writing less sloppy code and at fixing existing code issues; I think 4.5 is already quite good at these and it will continue to get better. In the meantime, what helps is also having the model code review its code using a fresh context window; at Anthropic we use claude -p for this on every PR and it catches and fixes many issues. Overall your ideas very much resonate. Thanks again for sharing. ✌️

166

423

7,032

1,330,647

Dimi Nikolaou

Ryan Mac retweeted

Dimi Nikolaou

@dimiwonders

Jan 27

Today AI Video stops being slop. Introducing Wondercraft Video, an AI video studio built for real work. Create explainer videos, trainings, product launches, ads, and more by describing what you want. RT and comment “WONDA” and I’ll DM you 1,000 free credits.

1:35

1,115

584

1,924

694,977

Andrej Karpathy

Ryan Mac retweeted

Andrej Karpathy

@karpathy

Jan 26

A few random notes from claude coding quite a bit last few weeks. Coding workflow. Given the latest lift in LLM coding capability, like many others I rapidly went from about 80% manual autocomplete coding and 20% agents in November to 80% agent coding and 20% edits touchups in December. i.e. I really am mostly programming in English now, a bit sheepishly telling the LLM what code to write... in words. It hurts the ego a bit but the power to operate over software in large "code actions" is just too net useful, especially once you adapt to it, configure it, learn to use it, and wrap your head around what it can and cannot do. This is easily the biggest change to my basic coding workflow in ~2 decades of programming and it happened over the course of a few weeks. I'd expect something similar to be happening to well into double digit percent of engineers out there, while the awareness of it in the general population feels well into low single digit percent. IDEs/agent swarms/fallability. Both the "no need for IDE anymore" hype and the "agent swarm" hype is imo too much for right now. The models definitely still make mistakes and if you have any code you actually care about I would watch them like a hawk, in a nice large IDE on the side. The mistakes have changed a lot - they are not simple syntax errors anymore, they are subtle conceptual errors that a slightly sloppy, hasty junior dev might do. The most common category is that the models make wrong assumptions on your behalf and just run along with them without checking. They also don't manage their confusion, they don't seek clarifications, they don't surface inconsistencies, they don't present tradeoffs, they don't push back when they should, and they are still a little too sycophantic. Things get better in plan mode, but there is some need for a lightweight inline plan mode. They also really like to overcomplicate code and APIs, they bloat abstractions, they don't clean up dead code after themselves, etc. They will implement an inefficient, bloated, brittle construction over 1000 lines of code and it's up to you to be like "umm couldn't you just do this instead?" and they will be like "of course!" and immediately cut it down to 100 lines. They still sometimes change/remove comments and code they don't like or don't sufficiently understand as side effects, even if it is orthogonal to the task at hand. All of this happens despite a few simple attempts to fix it via instructions in CLAUDE . md. Despite all these issues, it is still a net huge improvement and it's very difficult to imagine going back to manual coding. TLDR everyone has their developing flow, my current is a small few CC sessions on the left in ghostty windows/tabs and an IDE on the right for viewing the code manual edits. Tenacity. It's so interesting to watch an agent relentlessly work at something. They never get tired, they never get demoralized, they just keep going and trying things where a person would have given up long ago to fight another day. It's a "feel the AGI" moment to watch it struggle with something for a long time just to come out victorious 30 minutes later. You realize that stamina is a core bottleneck to work and that with LLMs in hand it has been dramatically increased. Speedups. It's not clear how to measure the "speedup" of LLM assistance. Certainly I feel net way faster at what I was going to do, but the main effect is that I do a lot more than I was going to do because 1) I can code up all kinds of things that just wouldn't have been worth coding before and 2) I can approach code that I couldn't work on before because of knowledge/skill issue. So certainly it's speedup, but it's possibly a lot more an expansion. Leverage. LLMs are exceptionally good at looping until they meet specific goals and this is where most of the "feel the AGI" magic is to be found. Don't tell it what to do, give it success criteria and watch it go. Get it to write tests first and then pass them. Put it in the loop with a browser MCP. Write the naive algorithm that is very likely correct first, then ask it to optimize it while preserving correctness. Change your approach from imperative to declarative to get the agents looping longer and gain leverage. Fun. I didn't anticipate that with agents programming feels *more* fun because a lot of the fill in the blanks drudgery is removed and what remains is the creative part. I also feel less blocked/stuck (which is not fun) and I experience a lot more courage because there's almost always a way to work hand in hand with it to make some positive progress. I have seen the opposite sentiment from other people too; LLM coding will split up engineers based on those who primarily liked coding and those who primarily liked building. Atrophy. I've already noticed that I am slowly starting to atrophy my ability to write code manually. Generation (writing code) and discrimination (reading code) are different capabilities in the brain. Largely due to all the little mostly syntactic details involved in programming, you can review code just fine even if you struggle to write it. Slopacolypse. I am bracing for 2026 as the year of the slopacolypse across all of github, substack, arxiv, X/instagram, and generally all digital media. We're also going to see a lot more AI hype productivity theater (is that even possible?), on the side of actual, real improvements. Questions. A few of the questions on my mind: - What happens to the "10X engineer" - the ratio of productivity between the mean and the max engineer? It's quite possible that this grows *a lot*. - Armed with LLMs, do generalists increasingly outperform specialists? LLMs are a lot better at fill in the blanks (the micro) than grand strategy (the macro). - What does LLM coding feel like in the future? Is it like playing StarCraft? Playing Factorio? Playing music? - How much of society is bottlenecked by digital knowledge work? TLDR Where does this leave us? LLM agent capabilities (Claude & Codex especially) have crossed some kind of threshold of coherence around December 2025 and caused a phase shift in software engineering and closely related. The intelligence part suddenly feels quite a bit ahead of all the rest of it - integrations (tools, knowledge), the necessity for new organizational workflows, processes, diffusion more generally. 2026 is going to be a high energy year as the industry metabolizes the new capability.

1,640

5,579

40,676

7,791,815

Ryan Mac

Ryan Mac

@ryanmac

Jan 24

Single-threaded prompting is dead. Prepare the way for orchestrated swarms.

Mike Kelly

@NicerInPerson

Jan 24

I managed to unlock a crazy new hidden feature in Claude Code called Swarms. You're not talking to an AI coder anymore. You're talking to a team lead. The lead doesn't write code - it plans, delegates, and synthesizes. When you approve a plan, it enters a new "delegation mode" and spawns a team of specialists who: - Share a task board with dependencies - Work in parallel as teammates - Message each other to coordinate work Workers do the heavy lifting, coordinate amongst themselves, then report back.

3:36

Ryan Mac

Ryan Mac

@ryanmac

Jan 10

Context density unlocked

dei

@parcadei

Jan 10

github.com/parcadei/Continuo… continuous claude v3 is finally done same problem, new architecture, better engineering. a setup designed to explore solutions to recurring obstacles in agentic coding: context, memory, learning, math, codebase exploration and much more strap in, we're about to go speed run the new setup ↓

David Shapiro (L/0)

Ryan Mac retweeted

David Shapiro (L/0)

@DaveShapi

18 Dec 2025

BEARISH ON OPENAI The investment case for OpenAI has never been more precarious than it is right now in late 2025. What was once a company that seemed destined to dominate the artificial intelligence revolution has revealed itself to be a structurally disadvantaged challenger fighting a defensive war on multiple fronts. The company anticipates burning through roughly $9 billion this year on $13 billion in sales, a cash burn rate of approximately 70% of revenue. This is not the profile of a company poised to capture monopolistic profits from a transformative technology; it is the profile of a utility company spending astronomical sums to deliver a commodity product that competitors are increasingly giving away for free. The financial trajectory only becomes more alarming when examined over a longer time horizon. The documents show OpenAI projects that by 2028, its operating losses will balloon to roughly three-quarters of that year’s revenue, driven primarily by ballooning spending on computing costs. The company has painted a rosy picture of eventual profitability by 2029 or 2030, but this projection requires believing that OpenAI can grow revenue from roughly $13 billion today to $125 billion or more while simultaneously maintaining pricing power in a market where every major technology company and numerous startups are racing to commoditize the very product OpenAI sells. The cash burn is expected to reach $115 billion cumulatively through 2029, according to The Information. These numbers represent a staggering bet that requires near-perfect execution across multiple dimensions over half a decade. The most damning evidence against OpenAI’s long-term viability is the evaporation of its technological moat. In 2023, GPT-4 felt like genuine magic, a capability that no other company could replicate. Today, that lead has effectively vanished. The sudden availability of frontier-level open-source models is expected to dramatically accelerate AI development globally, potentially reshaping entire industries and altering the balance of power in the tech world. Meta’s Llama series, Mistral’s increasingly capable models, and even Chinese competitors like DeepSeek have demonstrated that the core technology powering ChatGPT is replicable and, in many cases, distributable for free. When your product becomes commoditized, the economics become brutal, and OpenAI finds itself in the position of trying to sell bottled water in a world where tap water has become indistinguishable in quality. The competitive pressure from open-source alternatives is compounding rapidly. The open source movement in AI has grown exponentially over the past few years. Instead of relying solely on expensive, closed models from major tech companies, developers and researchers worldwide can now access, modify, and improve upon state-of-the-art LLMs. This democratization is existential for OpenAI’s business model. Enterprises that once paid premium prices for API access now have the option to run comparable models on their own infrastructure at a fraction of the cost, with the added benefits of data privacy and customization. The value proposition that justified OpenAI’s premium pricing has eroded faster than anyone anticipated, and there is no indication that this trend will reverse. Perhaps nothing illustrates OpenAI’s structural weakness more clearly than the behavior of its most important partner. Microsoft is dancing to its own tune in the artificial intelligence revolution, and Wall Street cannot stop watching. Despite pouring approximately $13 billion into OpenAI over several years, DA Davidson analyst Gil Luria estimates that just 17 percent of Microsoft’s total Azure revenue comes from artificial intelligence workloads. More critically, only 6 percent of that total ties directly to reselling OpenAI’s models, while approximately 75 percent is generated from Azure AI. Microsoft is building its own models, hedging with Anthropic, and quietly reducing its dependency on the very company it funded. When your largest investor is simultaneously your biggest competitor and is actively developing alternatives to your core product, the strategic implications are dire. Leaders at Microsoft believe Anthropic’s latest models — Claude Sonnet 4, specifically — perform better than OpenAI’s in certain functions, like creating aesthetically pleasing PowerPoint presentations. This is not a minor technical preference; it represents a fundamental shift in how Microsoft views its partnership with OpenAI. Microsoft is dramatically escalating its AI independence strategy. At an internal town hall Thursday, Microsoft AI chief Mustafa Suleyman revealed the company is making “significant investments” in compute capacity to build frontier models that can compete directly with OpenAI, Google, and Meta. The company that was supposed to be OpenAI’s path to distribution and scale is instead preparing for a future where OpenAI is just one vendor among many, if not an outright competitor. The leadership exodus at OpenAI over the past year has been nothing short of catastrophic. In September 2024, Murati announced that she was stepping down as CTO. This move came amid a wider executive exodus as OpenAI chief research officer Bob McGrew and a vice president of research, Barret Zoph, also announced their departures soon after. Mira Murati was not a minor figure; she was instrumental in the development of ChatGPT, Dall-E, and Sora. Her departure, along with co-founder Ilya Sutskever, safety leader Jan Leike, and co-founder John Schulman who joined rival Anthropic, has left CEO Sam Altman without much of the leadership team that helped him build OpenAI into an AI juggernaut. Hannah Wong, the executive who steered OpenAI through its most chaotic period, has announced she’s leaving the company just this month, continuing the pattern of senior departures that suggests something fundamentally broken in the organization’s culture or direction. The distribution problem facing OpenAI may be its most insurmountable challenge. Apple and Google control the smartphones that billions of people use every day. Microsoft controls the productivity software that enterprises depend upon. OpenAI, by contrast, must convince users to deliberately open a separate application and type their queries into a text box. In a world of agentic AI where assistants need access to your email, calendar, and files to be useful, an AI embedded directly into your operating system has an overwhelming structural advantage over a standalone chatbot. OpenAI is trying to be a consumer product company without owning any of the surfaces where consumers actually spend their time, competing against incumbents who can simply bundle AI capabilities directly into products that already have hundreds of millions of daily active users. The nuclear-to-solar analogy captures the fundamental economic transformation that is devastating OpenAI’s business model. Just as nuclear power required enormous upfront capital expenditure for centralized power plants, AI in its current form requires massive data center investments to train and serve models. But the direction of travel is unmistakably toward distributed intelligence that runs locally on devices. A major part of the pitch is practicality. Lample emphasizes that Ministral 3 can run on a single GPU, making it deployable on affordable hardware — from on-premise servers to laptops, robots, and other edge devices that may have limited connectivity. When powerful AI models can run on a smartphone or a laptop without any cloud connection, the entire economic rationale for paying premium prices to access centralized AI infrastructure disappears. OpenAI is building nuclear reactors in a world that is rapidly installing solar panels on every rooftop. The proposed $1 trillion IPO valuation is perhaps the clearest signal that something is deeply wrong with the OpenAI story. In the first half of the year, OpenAI lost $13.5 billion, on revenue of $4.3 billion. It is on track to lose $27 billion for the year. One estimate shows OpenAI will burn $115 billion by 2029. Asking public market investors to pay $1 trillion for a company that loses more than twice as much as it earns is not a growth story; it is an exit strategy. The sophisticated investors who funded OpenAI’s private rounds are looking for a way to transfer their risk to retail investors and pension funds who may not fully understand the unit economics of the business. A recent report by HSBC estimated that the company will remain in the unprofitable category until 2029 and that the company will need an additional $207 billion to fund its ambitions. Sam Altman’s leadership represents another structural liability for the company. His background is as a startup investor and evangelist, not as an operational executive who has scaled a capital-intensive industrial operation. The pivot from nonprofit research lab to for-profit corporation to public benefit corporation to anticipated public company has been accompanied by legal and governance structures designed primarily to protect Altman’s control rather than to create shareholder value. Going public means answering a lot more of those kinds of questions, every single quarter, forever. When asked about financial concerns in a friendly podcast interview, Altman’s dismissive response revealed a leader uncomfortable with the scrutiny that public markets will inevitably bring. The adults in the room have largely departed, leaving a company that desperately needs disciplined execution led by someone whose strengths lie elsewhere. The comparison to Netscape is instructive. Netscape proved that the internet was real and created genuine value, but it had no sustainable moat against an incumbent who could bundle the browser directly into the operating system. OpenAI has proven that large language models are real and valuable, but it faces the same structural disadvantage against incumbents who can bundle AI directly into operating systems, productivity suites, and cloud platforms. The value will accrue to the companies that own the distribution channels and the hardware, not to the company that demonstrated the technology was possible. OpenAI is destined to become a historical footnote, remembered as the company that ignited the AI revolution but failed to capture the economic value it created. The only bull case for OpenAI is the AGI lottery ticket: the possibility that the company achieves artificial general intelligence before anyone else and thereby transcends all normal economic analysis. But there is no evidence that OpenAI is any closer to AGI than Google, Anthropic, or DeepMind. The company’s advantage was never secret research breakthroughs; it was first-mover advantage in commercialization. That advantage has now been erased by competitors who can match or exceed OpenAI’s capabilities while benefiting from existing ecosystems, distribution channels, and the willingness to operate AI as a loss leader to drive engagement with more profitable products. The secret sauce was never secret, and there was never any sauce. The endgame for OpenAI is unlikely to be the triumphant dominance that early investors imagined. The most probable outcomes range from gradual irrelevance as a backend provider, to financial restructuring under pressure from creditors, to absorption by Microsoft or another well-capitalized technology company looking to acquire the remaining talent and intellectual property at a discount. Despite its current losses, OpenAI’s long-term prospects are bolstered by the explosive growth of the AI market. But growth in the overall AI market does not guarantee success for any individual company, particularly one with no moat, no ecosystem, and a cost structure that requires selling a commodity at premium prices. The AI revolution is real, but OpenAI’s role in capturing its economic value is far from assured. For anyone considering an investment in OpenAI at anything close to current valuations, the prudent course is to stay far away and watch from the sidelines as economic reality catches up with hype.

19:02

734

69,472

binji

Ryan Mac retweeted

binji

@binji_x

11 Oct 2025

the only way to survive in crypto is to believe that it’s purpose is more important than your present circumstance.

105

6,237

Ryan Mac

Ryan Mac

@ryanmac

19 Sep 2025

Keep ultrathinking.

Claude

@claudeai

18 Sep 2025

Keep thinking.

1:30

135