joe

Tweets

Pinned Tweet

joe

@simulated_land

29 Mar 2024

Claude ponders its own consciousness, realises it is bordering on a breakdown and writes some zen poetry instead

115

15,257

joe

@simulated_land

17h

I think it's actually a lot more interesting than this lets on. We've found that customers are willing to pay us more than they pay their operations team. That's lucky, because when you're running frontier models it's virtually impossible to compete on price, at least initially, against $2-5/hour offshore labour (which is where our customers primarily run their ops today). So whilst an insurance form that takes a human offshore worker 15 minutes to complete (and a browser agent about 25-30 minutes) might cost us $2.40/form (initially, pre-optimization), a human costs only, say, $1/form.

But our customers still use us, because it's not just a question of price. Offshore is a disaster. Spreading their knowledge across 160 unskilled workers leads to unpredictability, compliance violations, no real centralised control and no ability to iteratively improve the process.

Handing work off to agents brings many other benefits that you don't get with offshore operators. The actual benefit, in my view, is that at the end of the day your agent is a prompt and some files. This acts as a single source of truth, which is useful when the spec for the insurance form requires ~3,000 words in its most compressed form, as it does for one customer of ours. With agents, keeping that knowledge current is very easy: you have a single source of knowledge that you can update. Mid-execution you can have edge cases flagged to you, and post-execution you can programmatically analyse thousands of agent traces, find every edge case imaginable and refine the spec. Whilst it's unrelated to the core point of this tweet, I do think this is where a vast amount of value is currently hidden.

Most interesting, though, is not the naive cost cutting of existing processes but the unlocking of previously impossible use cases. A voice agent customer of ours, for example, has a burst load of 1,000 parallel phone calls, where each transcript needs inputting into a legacy healthcare portal. How do you handle this without agents? You'd have to hire hundreds of call center workers who would be idle for the majority of the day. So in this case it's not just that the cost per call comes down (it's debatable whether it does; that depends on the customer, how optimizable the workflow is, etc.) but that you can horizontally scale the worker count on demand at a very low marginal cost.

As mentioned earlier, costs are obscene initially, but unlike human teams they can be optimised over time programmatically. For one customer we swapped out an offshore team on an extremely complex form-filling use case and had to tank a $100k/month loss whilst we figured out the use case and optimized from end-to-end Sonnet 4.6 to Gemini 3.1 Scripts; now we fall back to the smartest models only when something unexpected happens or we need to rewrite the script. I suspect this customer will now scale north of $500k/year in spend.

Agents are the gateway to automation but not the end state, at least not at scale.
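The per-form numbers in the tweet above can be checked with a few lines of arithmetic. This is only a rough sketch using the figures quoted in the tweet ($2-5/hour offshore, 15 minutes per form, $2.40 per agent-completed form, roughly $1 per human-completed form); the function name `cost_per_form` and the constants are illustrative, not anything the author published.

```python
def cost_per_form(hourly_rate_usd: float, minutes_per_form: float) -> float:
    """Labour cost of completing one form at a given hourly rate."""
    return hourly_rate_usd * minutes_per_form / 60.0


# Figures quoted in the tweet; the names are hypothetical.
HUMAN_MINUTES_PER_FORM = 15
OFFSHORE_RATES_USD = (2.0, 5.0)        # the "$2-5/hour" range
AGENT_COST_PER_FORM_USD = 2.40         # initial, pre-optimization
HUMAN_COST_PER_FORM_USD = 1.00         # the "say, $1/form" estimate

low, high = (cost_per_form(rate, HUMAN_MINUTES_PER_FORM) for rate in OFFSHORE_RATES_USD)
premium = AGENT_COST_PER_FORM_USD / HUMAN_COST_PER_FORM_USD

print(f"human: ${low:.2f}-${high:.2f} per form")
print(f"agent: ${AGENT_COST_PER_FORM_USD:.2f} per form ({premium:.1f}x the human estimate)")
```

At 15 minutes per form the hourly range works out to $0.50-$1.25 per form, which brackets the $1 estimate, so the agent starts at roughly 2.4x the human cost before any optimization.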

GEOFF

@geoffreywoo

Jun 13

confession: i am getting more interested in per-task economics than software categories. show me:
- cost before
- cost after
- error rate
- human escalation rate
- who loses budget if this works
“ai for finance” means nothing. $14.80 → $1.90 per completed review means something.
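One way to hold the five numbers asked for above is a small record per task type. This is a minimal sketch, not a standard schema: the class `TaskEconomics` and its field names are made up for illustration, and only the $14.80 → $1.90 figures come from the tweet. The error rate, escalation rate and budget owner are left unset because the tweet gives no values for them.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TaskEconomics:
    """Per-task economics for one workflow (hypothetical structure)."""
    cost_before_usd: float
    cost_after_usd: float
    error_rate: Optional[float] = None        # fraction of tasks completed incorrectly
    escalation_rate: Optional[float] = None   # fraction of tasks handed to a human
    budget_owner: Optional[str] = None        # who loses budget if this works

    def savings_fraction(self) -> float:
        """Share of the original per-task cost that is removed."""
        return (self.cost_before_usd - self.cost_after_usd) / self.cost_before_usd


# The only concrete figures in the tweet: $14.80 -> $1.90 per completed review.
review = TaskEconomics(cost_before_usd=14.80, cost_after_usd=1.90)
print(f"cost reduction per completed review: {review.savings_fraction():.1%}")
```

For the quoted figures this comes to a reduction of about 87% per completed review.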

joe

@simulated_land

Jun 12

if you ever think all of the good products have already been created we are still so early

joe retweeted

David Mlcoch

@MlcochDavid

May 19

"We're reinventing the wheel on browser agents." - 10 healthtech CEOs, May 2026 Their eng. teams spend months reverse-engineering payer portals and EHR flows... for something they never wanted to build. So we wrote a builder's atlas for teams shipping healthcare automation in production. Deep dive, healthcare interoperability with AI. Blog in the comments!!

385

joe retweeted

Tyler Bosmeny

@bosmeny

May 13

Replying to @GergelyOrosz

I'm an investor, I get all of their updates - and just to be clear, they've never once reported "ARR" to investors. Given that this premise is false, consider doing the right thing and taking this post down? The founders are doing great, growing like a weed, and don't deserve this.

37,328

joe

@simulated_land

May 12

big debate going on

120

joe

@simulated_land

May 11

no matter how hard I try I can't get this 3d button animation to not suck. bat signalling the wizards for assistance, please help 🙏 @keviduk @raunofreiberg

0:11

143

joe

@simulated_land

May 11

button is here. Any suggestions appreciated <3 asteroid.ai/theme/

joe retweeted

Davide Locatelli @davilocatelli13

May 8

Dreams on Claude Managed Agents are exactly what our prod browser agents have been missing. Reliability in healthcare workflows is P0 for us @asteroid_inc. This will unlock a different tier of what we can ship. Great day at Code w/ Claude SF. ty @AnthropicAI @ClaudeDevs @bcherny

164

joe

@simulated_land

Apr 28

RT @MlcochDavid: Browser agents are overhyped, which makes builders pissed when they underdeliver. But most are built wrong. So I wrote d…

joe

@simulated_land

Apr 24

I don't know what's going on at the Pierre Computing Company but this website is the most insane shit I have ever seen in my life

403

joe

@simulated_land

Apr 24

pierre.co/archive/article-01…

joe

@simulated_land

Apr 21

x.com/i/article/204639416182…

11,945

joe

@simulated_land

Apr 18

what are some good objects for someone just getting started

298

joe

@simulated_land

Apr 13

what is this wizardry? pass the sauce brother

Praveen Kumar

@praveenisomer

Apr 11

ASCII cards

0:10

198

joe

@simulated_land

Apr 11

x.com/i/article/204302507214…

204

joe retweeted

Andon Labs

@andonlabs

Apr 11

We gave an AI a 3-year retail lease in SF and asked it to make a profit. The AI interviewed and hired full-time employees, applied for credit, and stocked the store with the books Superintelligence and Making of the Atomic Bomb. Visit Andon Market at 2102 Union St now.

1:45

102

152

2,372

1,938,005

joe

@simulated_land

Apr 5

Open source, if nothing else, is surely entering a golden age. This is insanely useful, yet trivially recreatable and hackable. How many billions of long tail problems can non-technical people now solve that used to require a team of engineers?

Andrew Farah

@andrewfarah

Apr 4

sharing my first open source project a CLI for downloading and syncing your X bookmarks locally so your agent can access them. it's free › npm install -g fieldtheory › login to your X account in a chrome tab › ft sync (done!) bonus: › ft viz › ft classify

0:25

229

joe

@simulated_land

Apr 4

cc this a readonly link to your bank accounts and assets goes incredibly hard btw

Andrej Karpathy

@karpathy

Apr 4

Wow, this tweet went very viral! I wanted to share a possibly slightly improved version of the tweet in an "idea file". The idea of the idea file is that in this era of LLM agents, there is less of a point/need of sharing the specific code/app, you just share the idea, then the other person's agent customizes & builds it for your specific needs. So here's the idea in a gist format: gist.github.com/karpathy/442… You can give this to your agent and it can build you your own LLM wiki and guide you on how to use it etc. It's intentionally kept a little bit abstract/vague because there are so many directions to take this in. And ofc, people can adjust the idea or contribute their own in the Discussion which is cool.

162

joe

@simulated_land

Apr 4

How many times do you think this guy got rejected by YC?

This tweet is unavailable

1,025

joe

@simulated_land

Apr 3

there's nothing quite as humbling as watching the @posthog recording of that one customer using your product fml