Matteo Cassese

Matteo Cassese

542 Photos and videos

Tweets

Pinned Tweet

Matteo Cassese @matteoc

28 Dec 2023

My 02023 in Review: Resilience, Growth, and Transformation lafabbricadellarealta.com/02… This year was a pivotal chapter in my life. I faced the challenge of overcoming burnout. The highlight of my year was embracing a mastermind group.

246

OpenRouter

Matteo Cassese retweeted

OpenRouter

@OpenRouter

Jun 13

Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇

635

1,626

13,816

5,268,296

Kyle Gawley

Matteo Cassese retweeted

Kyle Gawley

@kylegawley

Jun 12

I cancelled my $10/mo Calendly subscription and vibe coded my own with Fable for $12,000

467

615

19,280

970,118

Andrea Volpini

Matteo Cassese retweeted

Andrea Volpini @cyberandy

Jun 11

The end of an era.

Marty Kausas

@marty_kausas

Jun 10

Our Anthropic bill is about to jump from $400K → $1.4M/yr. Not because usage exploded, but because we're about to cross 150 seats. Past 150 seats you're forced into Enterprise tier. Seats stop including any usage, every token bills at standard API rates. At our current run rate that's 3.5x overnight. Unfiltered thoughts on AI spend: 1. We should spend tokens to grow as aggressively as possible. But most people (me included) aren't conscious of what they're spending. 2. Visibility comes first. People see their personal number and they're shocked. I accidentally spent $4,000 in 3 days in Claude Code. 3. For engineering the spend is clearly worth it. Pay for the best model, it saves more than it costs. 4. For a lot of other roles it's questionable. Apps nobody uses, skills someone already built. No ROI. 5. Spend limits are coming. We already require approval for more tokens on our support team. The era of token-maxxing is coming to an end.

1,004

Andrej Karpathy

Matteo Cassese retweeted

Andrej Karpathy

@karpathy

Jun 9

This is a super exciting release - Claude Fable 5 is the same underlying model as Mythos but with added safeguards. The benchmarks are great and it's SOTA on everything by a margin but I'll add that *qualitatively* also, this is a major-version-bump-deserving step change forward (imo of the same order as Claude 4.5 was in November), peaking especially for long problem-solving sessions on very difficult problems. You can give it a lot more ambitious tasks than what you're used to, the model "gets it" and it will just go, and it's never felt this tempting to stop looking at the code at all (but don't do this in prod!). The model still has quirks that people will run into and the safeguards are configured to be a little too trigger happy for launch, which can hopefully be tuned over time. I feel a lot of things changing as working software increasingly comes out on a tap. The Jevon's paradox kicks in and I feel my own demand for software growing substantially. You can ask for anything - explainers, visualizers, dashboards, bespoke single-use apps (e.g. a full wandb that is hyper-specific just for your project), you can 10X your test suite, auto-optimize code, run giant research projects with custom HTML for the results, anything! "Free your mind" (Matrix ref). Really looking forward to all the things people build!

Claude

@claudeai

Jun 9

Replying to @claudeai

Fable 5 is state-of-the-art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. The longer and more complex the task, the larger Fable 5’s lead over our other models.

Benchmark table titled Mythos 5 & Fable 5, comparing Claude Mythos 5 and Fable 5 against Claude Mythos Preview, Claude Opus 4.8, GPT 5.5, and Gemini 3.1 Pro.

ALT Benchmark table titled Mythos 5 & Fable 5, comparing Claude Mythos 5 and Fable 5 against Claude Mythos Preview, Claude Opus 4.8, GPT 5.5, and Gemini 3.1 Pro.

1,271

2,366

25,264

2,683,972

Matteo Cassese

Matteo Cassese @matteoc

Jun 5

RT @wongmjane: DeepSeek V4 “improved” the code and said nothing happened in Tiananmen Square on June 4, 1989

485

nopara73

Matteo Cassese retweeted

nopara73

@nopara73

Jun 3

came across this lost 3m talk of 26yo steve jobs where he unknowingly explains today's LLMs as the very thing computers are worse than humans in, then unknowingly again predicts the future of interfacing with AI that nobody's working on yet archive.org/details/excerpt-…

Excerpt of Steve Jobs Lisa presentation at the International Design...

On June 14, 1983, the day before he would give his speech at the International Design Conference in Aspen, Steve Jobs gave a presentation on the Lisa computer...

archive.org

150

Seva Ustinov

Matteo Cassese retweeted

Seva Ustinov

@sevaustinov

Jun 2

235

1,852

31,917

1,185,477

Matteo Cassese

Matteo Cassese @matteoc

Jun 1

Life imitates AI - I've started to use more dashes when I type 🤦‍♂️

shadcn

Matteo Cassese retweeted

shadcn

@shadcn

May 29

Must read.

Addy Osmani

@addyosmani

May 28

x.com/i/article/205983945437…

1,659

314,363

David Lieb

Matteo Cassese retweeted

David Lieb

@dflieb

May 29

Thought experiment: if every company suddenly had infinite free compute, what new products would emerge? My take: with very few exceptions, not much would change. The bottleneck is figuring out what people want, and it’s not so easy to apply compute to solve that.

268

100

1,318

247,757

Matteo Cassese

Matteo Cassese @matteoc

May 29

Madness

Chris Paxton

@chris_j_paxton

May 28

Get your chores done for free if youre okay with the data being used to train robots

Matteo Cassese

Matteo Cassese @matteoc

May 29

How I am spending my Friday. 4 parallel claude code sessions in ultracode workflow mode 😅

Sid

@sidbid

May 28

Replying to @sidbid

To get started just mention the word "workflow" in any prompt and Claude will spin one up. You can also turn on `/effort ultracode` which uses workflows aggressively without you having to ask for them

Cybertruck

Matteo Cassese retweeted

Cybertruck

@cybertruck

May 27

Hey Luce

596

1,720

28,369

1,701,539

GREG ISENBERG

Matteo Cassese retweeted

GREG ISENBERG

@gregisenberg

May 26

I just got back from SF and I FEEL INSPIRED. I spent 5 days with frontier AI model teams, AI startup founders, and 3 billionaires. My takeaways: 1. I had lunch with 3 billionaires. All of them are buying SaaS companies and rebuilding them agent-first. They were deeply inspired by Bending Spoons and Ryan Cohen's eBay deal. Buy the company, cut the headcount, rebuild the tech, add agents, add features, make more valuable experience, raise prices. 2. The frontier model companies are hungry for usage data from the field. They can see API calls and token counts. They can't see the actual workflows. If you're deep in a niche using these models in ways the model companies haven't seen, that understanding is incredibly valuable. Usage intelligence is the new alpha. 3. Consumer AI is massively underbuilt. Every billboard in SF is either B2B inference infrastructure or vertical agent companies. The entire city is optimized for enterprise. Meanwhile you have companies like Cal AI doing $50M ARR in 18 months as a consumer app. I met with a cool few teams doing consumer AI (@paulscherer / @ekuyda) 4. MCP came up in literally every conversation. The companies exposing their product as MCP endpoints are getting pulled into deals they never pitched for. The ones that aren't are becoming invisible to agents. This is the new SEO. If agents can't find you, you don't exist. Building products for agents is the new zeitgeist in general. 5. Not uncommon for hot seed rounds to be $25-50 million valuations. I saw a Series A at $450 million 6. If I had a dollar every time someone mentioned "forward-deployed engineer" this trip I could have funded a seed round. It's the hottest role in SF right now. The person who sits between the agent and the customer, making sure everything actually works. 7. The mood around open source shifted. A year ago it felt like open source was chasing the frontier models. Now founders are telling me Gemma and DeepSeek are good enough for 80% of what they need at a fraction of the cost. The "which model do you use" conversation is being replaced by "which model for which task." Model loyalty kinda feels dead. 8. Voice agents came up more than I expected. Multiple founders told me voice is the interface for the next billion users. The billion people who will never type a prompt will absolutely talk to one. 9. The Obsidian community in SF is weirdly intense. Multiple founders showed me their vaults unprompted. Like showing someone your home gym. It's a flex now. The quality of your knowledge base (second brain?) is becoming a status symbol among builders. 10. Maybe it was just the people I met but the age of the founders is shifting. I met more founders over 40 this trip than any trip before and more founders under age 21 than ever before. Founders getting older and younger at the same time. 11. I spoke to a lot of fast-growing startups, VCs and frontier models who are hiring content creators right now. 12. The restaurant scene in SF is actually better than it's been in years. Founders are going out more. Alcohol is out, not surprisingly. 13. SF doesn't feel like the only place anymore. We all have access to the same frontier models. We all read the same X feed. A founder in NYC or Lagos is calling the same APIs as a founder in SoMa. So in the past it felt like SF was always lightyears ahead, doesn't feel that way anymore. It's okay not to live in SF and have BIG DREAMS. 14. The coworking spaces in SF are half empty but the coffee shops are packed. People want to be around people. I had a few startup ideas here.... 15. Walking around the Mission I noticed something: the street-level businesses, the taquerias, the barbershops, the laundromats, none of them use any AI at all. 16. I heard the phrase "agent debt" for the first time. Like technical debt but for agents. When you hack together an agent workflow fast and never clean it up, the system prompts conflict, the memory gets polluted, the tools overlap. 6 months later the agent is doing weird things and nobody knows why lol. 17. Met a few people who carry two phones now. One for personal. One that's basically an agent terminal running Telegram or iMessage connections to their agent fleet. It's always amazing to get that dose of inspiration in SF. I FEEL INSPIRED. But I'm so happy to be back home, locked in and building. We're 12-18 months into a shift that will take 15 years to play out. The urgency in every conversation was real. What an incredible time to be building.

496

573

6,215

634,955

Matteo Cassese

Matteo Cassese @matteoc

May 17

I took a inspiration from one of @bruces favorite leitmotivs from the good old #SWSX closing remarks for this talk: What happens to engineers will happen to everyone youtube.com/watch?v=nTwJVx3w…

What Happens to Engineers Will Happen to Everyone

#AILeadership #HumanSideOfAI #AgenticFeverTake the Founder Levera...

youtube.com

tmuxvim

Matteo Cassese retweeted

tmuxvim

@tmuxvim

May 15

I put a prompt injection into my LinkedIn bio and recruiters are messaging me in Old English and calling me Lord.

658

7,528

92,493

4,369,123

Leon Lin

Matteo Cassese retweeted

Leon Lin

@LexnLin

May 6

Replying to @ClaudeDevs

guys weekly limit is still the same. basically you'll use it up faster and have to wait longer :D

1,731

87,878

Paul Graham

Matteo Cassese retweeted

Paul Graham

@paulg

May 3

Solution: New companies.

ᴅᴀɴɪᴇʟ ᴍɪᴇssʟᴇʀ 🛡️

@DanielMiessler

May 2

x.com/i/article/205066396524…

165

367

3,713

477,857

Andrej Karpathy

Matteo Cassese retweeted

Andrej Karpathy

@karpathy

Apr 30

Fireside chat at Sequoia Ascent 2026 from a ~week ago. Some highlights: The first theme I tried to push on is that LLMs are about a lot more than just speeding up what existed before (e.g. coding). Three examples of new horizons: 1. menugen: an app that can be fully engulfed by LLMs, with no classical code needed: input an image, output an image and an LLM can natively do the thing. 2. install .md skills instead of install .sh scripts. Why create a complex Software 1.0 bash script for e.g. installing a piece of software if you can write the installation out in words and say "just show this to your LLM". The LLM is an advanced interpreter of English and can intelligently target installation to your setup, debug everything inline, etc. 3. LLM knowledge bases as an example of something that was *impossible* with classical code because it's computation over unstructured data (knowledge) from arbitrary sources and in arbitrary formats, including simply text articles etc. I pushed on these because in every new paradigm change, the obvious things are always in the realm of speeding up or somehow improving what existed, but here we have examples of functionality that either suddenly perhaps shouldn't even exist (1,2), or was fundamentally not possible before (3). The second (ongoing) theme is trying to explain the pattern of jaggedness in LLMs. How it can be true that a single artifact will simultaneously 1) coherently refactor a 100,000-line code base *and* 2) tell you to walk to the car wash to wash your car. I previously wrote about the source of this as having to do with verifiability of a domain, here I expand on this as having to also do with economics because revenue/TAM dictates what the frontier labs choose to package into training data distributions during RL. You're either in the data distribution (on the rails of the RL circuits) and flying or you're off-roading in the jungle with a machete, in relative terms. Still not 100% satisfied with this, but it's an ongoing struggle to build an accurate model of LLM capabilities if you wish to practically take advantage of their power while avoiding their pitfalls, which brings me to... Last theme is the agent-native economy. The decomposition of products and services into sensors, actuators and logic (split up across all of 1.0/2.0/3.0 computing paradigms), how we can make information maximally legible to LLMs, some words on the quickly emerging agentic engineering and its skill set, related hiring practices, etc., possibly even hints/dreams of fully neural computing handling the vast majority of computation with some help from (classical) CPU coprocessors.

Stephanie Zhan

@stephzhan

Apr 29

@karpathy and I are back! At @sequoia AI Ascent 2026. And a lot has changed. Last year, he coined “vibe coding”. This year, he’s never felt more behind as a programmer. The big shift: vibe coding raised the floor. Agentic engineering raises the ceiling. We talk about what it means to build seriously in the agent era. Not just moving faster. Building new things, with new tools, while preserving the parts that still require human taste, judgment, and understanding.

29:49

369

803

6,121

1,200,536

Andrea Volpini

Matteo Cassese retweeted

Andrea Volpini @cyberandy

Apr 30

Many installed OpenClaw. I didn’t. I forked @NanoClaw_AI instead. I didn’t need another assistant. I needed a team member: one that could run tasks using Agent WordLift. The interesting part? The team dynamics when one member suddenly has “claws.” 👉 wor.ai/udqRKK

Project Claw 🦞: Field Notes From Agentic Work

What happens when an AI agent becomes part of the team’s daily workflow? The story is not about full automation, but about how teams learn to work with agents.

wordlift.io

193