founder

Joined January 2015
85 Photos and videos
Ben Smith retweeted
the single most important concept in 2026
29
162
2,991
818,789
We’re seeing the first interface forms that solve this. Claude Cowork lets you capture context, turn it into reusable skills, and orchestrate agents as a conductor rather than a coder.
Context is king for AI agents. There’s going to be a massive premium in the future on having up-to-date context for all your most important best practices, decisions, roadmap items, specs, marketing materials, and other critical knowledge across your company. People get a lot of context “for free” in a company. They know where they work, they have people next to them they work with, they can remember the rough outline of the company’s most recent quarterly goals to know if something seems good or bad to work on. AI agents, on the other hand, come in overly eager and ready to work on whatever you give them. At one moment they’re a lawyer for one company and an engineer for the next. This is why context remains absolutely critical for them to execute well on what you want. The teams and companies that take this seriously will have huge leverage and be steps ahead of those that don’t.
9
1
606
I get most of my exercise when claude is down
2
330
12 Dec 2025
One trillion parameters, available for fine-tuning. The largest model in their lineup now accessible to anyone with a credit card. The ceiling for what you can customize just got a lot higher.
Tinker is now generally available. We also added support for advanced vision input models, Kimi K2 Thinking, and a simpler way to sample from models. thinkingmachines.ai/blog/tin…
1
6
898
11 Dec 2025
Still Opus
1
5
362
11 Dec 2025
A researcher gave Claude Sonnet 4.5 an economics paper and its replication data. The prompt: replicate the findings from the dataset. No detailed instructions. Claude read the paper, sorted through the archive, converted the statistical code from STATA to Python, and worked through every finding. A different AI model verified the results. Spot checks confirmed accuracy. Here's why this matters: Academic research has a credibility problem. Many landmark studies can't be reproduced when other scientists try to verify them. The fix was always obvious: to systematically check the papers. But verification takes hours of expert time per study. Nobody could afford to do it at scale. Now an AI agent does in minutes what took researchers hours. We could actually audit the research that shapes policy, medicine, and business decisions. The question isn't whether we can verify science at scale anymore. It's whether we will.
Replying to @emollick
And of course, just making more PowerPoint is not a good outcome on its own... oneusefulthing.org/p/real-ai…
3
575
11 Dec 2025
2026 is all about Service as Software
1 Nov 2024
"Service as Software" is Silicon Valley's hottest buzzword right now. Everyone's talking about SaaS becoming service providers, but no one's explaining HOW. The answer? After 6 months of research and 100s of startup conversations, we have the answer: Systems of Agents. We're looking at a $4.6T opportunity.
8
650
10 Dec 2025
People want to buy outcomes
1
3
340
9 Dec 2025
vector databases for vibes. knowledge graphs for relationships. SQL for transactions. the best architectures query all three.
1
2
229
9 Dec 2025
AI lowered the cost of building software. it didn’t lower the cost of maintaining it. that’s why SaaS isnt going anywhere in one sentence.
1
208
8 Dec 2025
Two ways to sell vertical AI: the trojan horse or the boardroom. Cursor and Clay enter through the individual contributor. They build communities where users share prompts and workflows. The bottom-up swell forces enterprise adoption. Harvey and Sierra sell strictly top-down. Harvey used manufactured scarcity and a PwC partnership. Sierra's founders left Salesforce and Google to poach their former customers. Clay hit $3B on shared recipes. Harvey hit $8B on enterprise contracts. Both work. Neither is wrong.
293
7 Dec 2025
The hottest role in vertical AI isn't prompt engineer. It's forward deployed engineer. 50% developer, 50% consultant. They write Python scripts to clean client data during the sales cycle. They fine-tune RAG pipelines for specific use cases. They build the product while selling it. Harvey, Hebbia, Palantir—all hiring aggressively for this role. Vertical AI isn't plug-and-play. It's platform plus services.
1
2
496
6 Dec 2025
In vertical AI pricing, high-liability verticals charge like consultants. low-liability verticals charge like temps. Harvey and Ambience price at $2,800-$5,000 per seat per year. the buyer is paying for trust, not features. a hallucination in legal or medical work is an existential event. Clay and 11x price at $150-$500 per month. errors are tolerable. a bad email gets deleted. the value is sheer volume of work processed. same technology, completely different pricing power.
1
4
590
4 Dec 2025
Harvey just hit $8 billion. but the valuation isn't the point. it's how they got there. the story starts with a cold email to Sam Altman. Winston Weinberg was a first-year associate in 2022. he and Gabriel Pereyra built a GPT-3 prototype to answer landlord-tenant questions from Reddit. they sent it to OpenAI's general counsel because they knew a lawyer could evaluate the output. that led to Harvey becoming OpenAI Startup Fund's first investment. then Weinberg did something counterintuitive. instead of selling to everyone, he went to Allen & Overy and said: help us build this. A&O deployed Harvey firm-wide. their lawyers became the feedback loop. their workflows became the product. the co-builder relationship gave Harvey proprietary process data no public model has. actual contract templates, firm-specific precedent, the context horizontal models lack. then they built the trust stack. a "Vault" for firm-specific documents. human-in-the-loop verification. a LexisNexis partnership grounding every citation in real case law. in legal, one hallucinated case ends your company. Harvey built the architecture that makes that impossible. the wedge was narrow: contract analysis and due diligence. by solving that, they earned the right to expand. now they're taking the same engine to PwC for tax and consulting. vertical AI doesn't win on model quality. it wins on workflow depth, trust architecture, and proprietary data.
2
630
4 Dec 2025
services TAM, not software TAM. that's the whole vertical AI unlock. when 11x sells Alice SDR Agent, they're not asking for $100/month software. they're asking for $1,500/month to replace a $60,000/year SDR. the buyer isn't the CIO comparing tools. it's the VP of Sales comparing Alice to hiring a college grad. different budget line entirely.
1
11
638
3 Dec 2025
vertical AI is having a moment. Harvey, Abridge, OpenEvidence all crossed billion dollar valuations by embedding deeply into workflows that horizontal software never cracked. The playbook: own the workflow, build proprietary data moats, price on outcomes not seats. The graveyard: wrappers and "AI-powered" features that get shipped by OpenAI/Anthropic six months later. meanwhile VCs are betting big on AI-enabled rollups—buy legacy service businesses, bolt on AI, expand margins. Thesis makes sense. Execution is brutal. Most rollups destroy value historically. Shoutout to @Trace_Cohen for tracking 150 vertical AI startups across sectors: docs.google.com/spreadsheets…
17
49
515
65,267
26 Nov 2025
every docs page should just have a 'send to claude' button
2
4
620
Ben Smith retweeted
Are hackathons just like 45 minutes now, or what
16
2
52
10,329
24 Nov 2025
Like peeking into the future of the web where sites are generated on demand similar to @websim_ai
I updated my Nano Banana browser experiment with the new Pro version. Generated websites are now really good. I can see this becoming a real way to experience information 2–3 generations from now. All you need to use it is your API key. 👇
3
3
1,039