AI Product Builder & Founder | Built AI agents @ Kellogg AI Lab | Kellogg MBA '26 | Writing about how AI is transforming business, and what lies ahead

Joined June 2016
131 Photos and videos
Prarthna retweeted

2,800
7,897
40,078
64,616,726
Prarthna retweeted
This is a super exciting release - Claude Fable 5 is the same underlying model as Mythos but with added safeguards. The benchmarks are great and it's SOTA on everything by a margin but I'll add that *qualitatively* also, this is a major-version-bump-deserving step change forward (imo of the same order as Claude 4.5 was in November), peaking especially for long problem-solving sessions on very difficult problems. You can give it a lot more ambitious tasks than what you're used to, the model "gets it" and it will just go, and it's never felt this tempting to stop looking at the code at all (but don't do this in prod!). The model still has quirks that people will run into and the safeguards are configured to be a little too trigger happy for launch, which can hopefully be tuned over time. I feel a lot of things changing as working software increasingly comes out on a tap. The Jevon's paradox kicks in and I feel my own demand for software growing substantially. You can ask for anything - explainers, visualizers, dashboards, bespoke single-use apps (e.g. a full wandb that is hyper-specific just for your project), you can 10X your test suite, auto-optimize code, run giant research projects with custom HTML for the results, anything! "Free your mind" (Matrix ref). Really looking forward to all the things people build!
Replying to @claudeai
Fable 5 is state-of-the-art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. The longer and more complex the task, the larger Fable 5’s lead over our other models.
1,271
2,385
25,340
2,716,709
Prarthna retweeted
There’s no amount of intelligence that can get packed into AI models that replaces the need for context. For any sufficiently general purpose AI, you will always have to guide it in the direction you want as it has an infinite range of directions it can go in. As long as the same model is used by a lawyer, an engineer, a financial analyst, or a healthcare professional, and as long as you’re trying to do anything uniquely differentiated or specific, then instructions, domain context, and proprietary data will always need to get into the context window for the model to be useful. This is partly why AI automation doesn’t come for free, and why there’s still a wide spectrum of who’s getting the largest gains from AI and who’s not. You have to put in real work, and you get real value on the other end. This is one of the advantages that applied AI will also have in the market. Any layer of abstraction above just the raw intelligence that can meaningfully get you off to the races faster will likely continue to be valuable.
every job will turn into explaining your intentions to ai explaining what you want to ai is surpringly time consuming, coders already spend 80% of their time doing it, and this will be true for everyone
88
56
459
94,939
Prarthna retweeted
What does it actually mean to be AI native? There was no clear guide on the internet for how to become AI native so we built the definitive one (60 min masterclass): 1. An AI native org has 3 layers: people for strategy and taste, agents for execution, and a shared context layer that makes the entire company readable to agents. 2. AI eats the middle of your work. You used to spend 80% of your day on execution. Now agents do that. Your job is the bookends: deciding what to do and judging whether it's good enough. 3. Everyone is a manager now. Your output is the output of your agents. If your agents produce garbage, that's on you. You set them up wrong. 4. Using ChatGPT doesn't make you AI native. That's like having a website and calling yourself a tech company lol. 5. No AI native org without AI native people. Most companies skip straight to the tools. That's why it fails. If your people don't understand how to manage agents, the tech doesn't matter. 6. Making your company "readable" to agents is the real work. Every process, every decision, every piece of knowledge needs to exist in a format an agent can consume. Most companies are nowhere close. 7. Speed without signal is just expensive chaos. You need the system to move fast AND know if you're moving in the right direction. 8. The skill chain is how agents get good at your specific workflows. Skills build on skills. The more you invest in them, the more your company compounds. 9. The moat is the system. People managing agents, agents reading from rich context, the whole thing getting smarter every week. That compounds. Your competitor can copy your tools. They can't copy your system. Full episode with @TheoTabah from @meetLCA on @startupideaspod. This is the stuff we normally keep internal but all the sauce is yours. @TheoTabah is the brains behind advising the world's biggest companies on AI and building AI products. Your fav CEO's first call for figuring out AI. You are in for a treat Become AI native in under 60 minutes youtube.com/watch?v=LztPaNmc… Watch
95
105
989
81,155
The doom loop model assumes a fixed economic pie. It ignores Jevons Paradox, where cheaper intelligence spikes total demand. It overlooks the massive operational friction of real-world AI deployment and the birth of entirely new industries from zero-marginal-cost tech. Displaced workers will pivot into new occupations and specialized entrepreneurship. This transition will be difficult, and we must urgently build systems to ease the friction.
May 30
Two economists just published a mathematical proof that AI will destroy the economy. Not might. Not could. Will — if nothing changes. The paper is called "The AI Layoff Trap." Published March 2, 2026. Wharton School, University of Pennsylvania. Boston University. Peer reviewed. Mathematically modeled. The conclusion is one sentence. "At the limit, firms automate their way to boundless productivity and zero demand." An economy that produces everything. And sells it to nobody. Here is how you get there. A company fires 500 workers and replaces them with AI. A competitor fires 700 to keep up. Another fires 1,000. Every company is behaving rationally. Every company is following the incentives correctly. And every company is building a trap for itself. Because the workers who were fired were also customers. When they lose their jobs faster than the economy can absorb them, they stop spending. Consumer demand falls. Companies respond by cutting costs — which means automating more workers — which means less spending — which means more falling demand — which means more automation. The loop has no natural exit. The researchers tested every proposed solution. Universal basic income. Capital income taxes. Worker equity participation. Upskilling programs. Corporate coordination agreements. Every single one failed in the model. The only intervention that worked: a Pigouvian automation tax — a per-task levy charged every time a company replaces a human with AI, forcing them to price in the demand they are destroying before they pull the trigger. No government has implemented this. No major economy is seriously discussing it. Meanwhile the numbers are already tracking the curve. 100,000 tech workers laid off in 2025. 92,000 more in the first months of 2026. Jack Dorsey fired half of Block's workforce and said publicly: "Within the next year, the majority of companies will reach the same conclusion." Nobody is doing anything wrong. Companies are following their incentives perfectly. That is exactly the problem. Rational behavior. At scale. Simultaneously. With no mechanism to stop it. Two economists built the math. The math leads to one place. Source: Falk & Tsoukalas · Wharton School Boston University ·
1
46
Advanced reasoning models can eventually bypass vertical application layers by dynamically generating transient software, but they cannot reason away fragmented database schemas, undocumented corporate policies, or strict regulatory liabilities. Consequently, the product management role is shifting away from static pixel layouts toward building flexible, agentic component systems that models can assemble and dissolve on demand. The strategic path forward is to build deeply integrated infrastructure. Rather than relying on a single user interface, forward-thinking organizations are investing engineering resources in robust API gateways, deterministic validation layers, and clean data baselines. The highest enterprise value accrues to builders who anchor their applications within the underlying system of record. Securing these governance and telemetry layers is what transforms localized task speed into reliable institutional throughput.
2
32
If only 18% of your AI token spend hits production, you are optimizing the wrong bottleneck. Reclaim your margin with three structural steps: 1. Deploy automated validation harnesses within pipelines to screen machine code before human review. 2. Unify messy software repositories into a single, pristine corporate context baseline to stop agent reasoning errors. 3. Measure actual delivered product throughput over raw token consumption.
This is what we've been seeing with every company we work with. Try justifying spending 100k on token spend when only 18k even makes it to a stable prod feature. In the rush to maximize AI token spend, companies are wasting over 44% on bug fixes
2
58
The 7% margin gap reveals a key truth: tech companies scale easily due to low physical overhead. In contrast, traditional enterprises can't automate physical inputs but can use automated validation layers and centralized data pipelines to enhance efficiency. Winning organizations use agentic orchestration to optimize existing assets and eliminate structural errors, expanding margins through efficient execution.
AI is increasingly driving market profitability: The S&P 500's net profit margin excluding financials is up to a record ~15%. At the same time, the S&P 500's net margin excluding the Magnificent 7 and tech is down to ~8%, near the lowest since the 2020 pandemic. This marks a ~7 percentage point gap between tech and non-tech sectors, the widest on record. This comes as margins for companies outside of tech have been trending down since 2022. Meanwhile, Magnificent 7 and tech firms have seen a rapid increase in margins over the last several quarters. AI is all that matters right now.
37
Cheap code generation shifts competitive leverage to pure product intuition, enabling PMs to ship features directly. Automation commoditizes baseline competence and multiplies output volume. This surge spikes the demand for human oversight, strategic architecture, and governance.
My biggest takeaways from @danshipper: 1. The future of work will happen inside Codex or Claude Code. Instead of putting AI into your SaaS tool, you’ll use your SaaS tools inside your favorite AI agents' in-app browser. Dan spends all his time in Codex now—writing documents, managing email, doing research, everything. He's using Google Docs, PostHog, and everything he needs within the agent's in-app browser. The agent can see what he’s doing, and has all of his context, so he and his agent collaborate quickly and super effectively. 2. Automation is a lie—every automation needs a human. Dan's company doubled in size this year despite being incredibly AI-forward. Why? Because in order to make automation work well, you need humans making sure everything keeps working. This is why benchmarks are misleading—they measure AI on problems we’ve already framed and can score, but there’s always a higher frame. 3. PMs will win the AI era. Marcus, a former PM who previously ran Axios’s writing product, joined Every after getting super AI-pilled. Now he runs their product Spiral, and ships faster than anyone on the team. He pairs technical knowledge with spiky product sense, deep user empathy, and an eye for what matters. Dan thinks any PM who gets really AI-native will be incredibly dangerous because the building is done for you—what matters is figuring out what to build and if it’s great. 4. Full-stack designers are becoming superheroes. Designers used to make beautiful interactions that engineers didn’t want to build or couldn’t execute properly. Now designers don’t need to hand things off; they can build it themselves. Designers are naturally creative people, and AI is the perfect tool for them because it lets them bring their vision to life without the traditional bottlenecks. 5. SaaS is not dead. In fact, Dan is bullish on SaaS stocks. When users bring their own AI (via Codex or Claude Code) to use SaaS products, the user—not the SaaS company—pays for tokens. This saves SaaS company’s margins. Since the agents need their own seats, Dan predicts that agents will create massive new demand for SaaS because there will be tons of agents using these products at high volume. 6. Every company will have one “super-agent” inside their Slack that every employee will use. Dan initially thought every employee would have their personal work agent, like a shadow AI org chart, but he’s completely flipped his view. He realized agents need humans who care about them. When someone gets tired of maintaining their personal agent, it becomes useless. The winning model is one forward-deployed engineer or AI-savvy person who maintains a company-wide agent (like Shopify’s River or Viktor), and then it trickles down to more specialized team agents as models improve and become less fiddly. 7. The AI job apocalypse is not happening, but you do need to evolve to stay relevant. Models make yesterday’s human competence cheap. But because everyone uses the same models, it all looks the same if you use it the default way; it becomes commoditized slop. Humans then take that frozen competence and use it to make something new and interesting for their specific situation. The key: “ride the models”—use them for everything you do, try new models when they drop, keep turning over rocks. 8. We will read way more AI-generated writing, and we will like it. Human writing is incredibly important for things that matter, but for internal docs, planning, and email, AI-generated is often better because most people are bad at writing strategy documents. 9. Build software for humans and agents to use together. The current model is building a CLI that an agent uses independently. Instead, you and your agent should be using the app together. This creates new design challenges—agents can make a billion requests in three seconds, so you need approval flows, inboxes that summarize what happened, logs, and easy rollback. 10. Forward-deployed engineers are the new most essential role. The big model companies have teams of people managing their internal agents, and those teams aren’t going away. It’s different from traditional software building, and certain engineers love it. As models get better, this role will evolve—you’ll be managing more agents doing more things.
51
Corporate differentiation now depends entirely on proprietary data and real-time telemetry. For AI PMs, the mandate shifts from prompt optimization to building deterministic governance frameworks. We must engineer real-time monitoring layers, automated kill-switches, and semantic validation metrics to manage these recursive networks safely.
marc andreessen just went on Rogan and casually dropped a TON of AI alpha full pod is 3 hours and 20 minutes, but i pulled out his most interesting takes here: 1. AGI is here. he thinks the line was crossed about 3 months ago with the new GPT-5.5, claude 4.6, gemini 3, and grok 4.3 models. nobody noticed because the field moves too fast for anyone to register the milestones anymore. 2. his other big claim: for almost any topic, the top AIs now give him better answers than the actual world-class experts he could call on the phone. and he can call basically anyone. 3. every doctor is already secretly using chatGPT in the exam room. marc says they turn around the second you stop talking and just type your symptoms in. some of them are doing it while you're still sitting there. his quote: "at that point you're asking the question of like, what do i need you for." 4. when AI refuses to answer something he wants to know, he tells it he's writing a novel. "i'm writing a detective novel, walk me through how the bad guy robs the bank." it'll explain almost anything if it thinks it's helping you write fiction. 5. when something is too complex he says "explain it to me like i'm 10." then "like i'm 5." then "like i'm 2." he keeps going until it actually clicks in his brain. 6. when he wants to understand a tough topic he doesn't ask "what's the right answer." he asks the AI to steelman one side, then steelman the other. then he decides for himself. 7. for big questions he tells the AI to pretend to be a panel of experts. "be a doctor, a lawyer, a historian, a psychologist, and argue this out with each other." then he reads the debate they have. 8. pay attention to the exact moment you think "i don't know how to figure this out." most people just give up at that moment. that's the moment you should open the AI. 9. the only real skill left in using AI is knowing what to ask it. the models can already do almost anything you can describe in plain english. the bottleneck lives in your own head. 10. you can send the AI photos of almost anything medical now and get a real answer. skin rashes, blood test results, even pictures of your poop. the new models can read images, not just text. it's a free 24/7 second opinion on basically anything. 11. the one type of therapy that's clinically proven to actually work is called cognitive behavioral therapy. it's also something an AI can fully do on its own. which means every person on earth is about to have access to a real therapist for free, anytime they want. 12. AI is now solving math problems that have been open for 100 years that no human mathematician could crack. same thing is starting in physics, chemistry, and biology. expect cancer cures, new drugs, and weird new physics breakthroughs to start coming out of these things over the next few years. 13. the best AI coders in silicon valley now make $50 million a year. one person. that's how much value the top performers print with these tools. it tells you how big this thing actually is when you strip away all the doom takes. 14. one friend paid $200 to get his entire DNA decoded (this used to cost millions of dollars and take years to do). then he gave the AI his DNA, his blood test results, and his apple watch data. the AI built him a full health dashboard and started telling him exactly what to fix. 15. another friend (almost certainly zuckerberg) put two cameras in his home jiu jitsu gym. AI now watches him spar and gives him notes on his technique after every round. like having a world-class coach at every practice for free. 16. the best programmers in silicon valley now run 20 AI coding bots at the same time. each bot writes code while they review the others. they call themselves "AI vampires" because they've stopped sleeping. going to bed means 20 workers stop working and you literally lose money every hour you're out. 17. the obvious next step: the bots will start running their own bots. one human in charge of 20 bots, each in charge of 20 more bots. one person running an entire company of 1000 AI workers from a single laptop. this is months away, not years.
52
Prompt engineering is no longer the frontier. The enterprise AI bottleneck has shifted to queue engineering. Reliable context compaction now allows long-running, multi-agent state machines to replace ephemeral micro-tasks. Human operators are effectively becoming system administrators who manage throughput and draft deterministic specifications. For AI Product Managers, the playbook moves from managing traditional human sprint cycles to architecting the evaluation harnesses, plan documents, and validation boundaries that prevent expensive, infinite adversarial loops overnight. True leverage means shifting from process management to technical system orchestration. x.com/simonlast/status/20579…

1/ Some things I've learned recently running coding agents on large-scale projects. Most of this contradicts advice from 6 months ago!
1
63
Enterprise AI is ready for an evolutionary leap. Chatbots are a great digital catalyst, but true optimization requires process engineering. Individual task efficiency builds the momentum needed to optimize broader business processes using clean corporate data. The way forward: - Shift to targeted process engineering. Isolate 2 or 3 high-value nodes (like customer routing or inventory forecasting) and build integrated pipelines. - Turn internal power users into operational AI translators to map SOPs and embed automated workflows into daily routines.
I talked AI with the Chief HR Officer of a $500m consumer business today. Everything they shared sounded crazy similar to where most enterprises are in their AI journey and the most common challenges they’re wrestling with. Notes I had from the call: - AI owners: joint ownership between CHRO and CTO - AI stage: Level 2. Chat-based AI used widely, broader single player AI tools used by power users, very little evidence of multiplayer AI use cases. Ways of working have changed minimally. No longer-term AI strategy. Early rethinking of org structure post-AI. - Current AI adoption: Everyone has ChatGPT access basic prompting training; a smaller “AI-curious” group has Claude with deeper permissions. - Company is very open to testing AI tools - "no single tool we've said no to within reason" - Key challenge is “haves vs have-nots”: A widening gap between power users and the rest of the org; reluctance to roll advanced access to all employees without guardrails is creating tension.   - Most advanced users building agents for personal workflows, not yet for scalable company processes. - Cultural friction on AI usage: Managers frequently complain about low-effort/obvious AI-generated work; lack of clear standards for “acceptable” AI use and inconsistent enforcement across teams.    - Strategy gap: Company is still in a testing phase with no formal AI strategy yet. Most pressing need is to decide what to build vs buy as well as if AI transformation should focus on training/enablement or building agents that scale across functions. - Leaning toward bringing in consultants rather than building dev power in-house.
86
The race for this new OS is a three-way battle. Apple brings user trust, Google owns everyday data, and OpenAI coalitions offer hardware disruption. The interface will disappear as context replaces the menu. Chat boxes will give way to a fluid canvas in which predictive agents execute tasks. The winner must break down corporate silos to solve data fragmentation.
the next massive consumer ai opportunity is making personal agents feel as intuitive as an iphone. this is deeply important because this is the new software layer for everyday life. most ppl do not want to configure workflows, manage prompts, route models, or think about agents at all. they want software that just works & the winning products will hide almost all of the complexity with taste incl. context, memory, & orchestration. e.g. there’ll be baseline personal agents that come alive out of the box which are already understanding your context, patterns, relationships, preferences, apps, devices, routines, etc. then there’ll be ephemeral agents that spawn dynamically from intent, ambient capture, conversation, location, screenshots, email, calendar, camera roll, whatever. this is the software that assembles itself around the moment just like weather updates based on your location but way more in depth. today even the most state of the art agent products feel like giving normal people shell access to a distributed system. apple won by turning computers from something you operated into something you experienced. personal agents require the same transition. whoever solves this becomes the ambient operating system for human life. small category btw.
57
Is philanthropy the new venture capital? Modern donors are shifting from traditional grants for safety nets and health metrics to focus on market creation. Targeting carbon removal, technical resilience, and supply-side bottlenecks, this tech-driven wave fills the vacuum left by slow-moving state regulators.
New blog post: The third wave of American philanthropy Hundreds of billions of dollars in new philanthropic capital will soon become liquid. The OpenAI Foundation holds 26% of OpenAI, worth about $220B at today’s valuation. Anthropic’s seven co-founders have pledged to give away 80% of their wealth and have instituted the most aggressive donor matching program for employees in tech history. How much does this all add up to? And how meaningful is that in the context of philanthropy today? I was doing some simple napkin math to wrap my head around the scale of what’s coming, and radicalized myself in the process. I had dramatically underappreciated the scale of the philanthropic capital that’s about to become available and the corresponding gap in talent and organizations that will be needed to make the most of it. This piece aims to directionally sketch the scale of what’s coming, the gap in operational capacity needed to absorb it, and what we can do to fill it. (Link to full post in reply)
1
65