Joined September 2009
646 Photos and videos
Pinned Tweet
I'm building my second brain, One POC at a time: damien-henry.com Chat chatwithpg.com Learn fix-typos.com Native UI sqsquare.com Agentic UI mindcache.dev Short Term Memory dh7.github.io/own-your-data/ Long Term Memory
3
1,008
Damien Henry retweeted
.@dh7net, SVP of Image Research, said it best: "The HF infra is a no-brainer." A big unlock for teams working with large datasets for training, especially when they update over time. Read how Jasper used @huggingface as the creation and storage backbone for MONET: huggingface.co/storage/testi…
2
11
24
15,409
Very true! It's not because agents can do anything that they should!
Token costs are why there will be no saas apocalypse / good dev tools are cached intelligence for agents! The popular theory goes: agents can write code, so they'll just rebuild every tool from scratch and hit raw APIs. no more dev tools, no more CLIs, no more software layers. just agents and endpoints! We just tested this and the data says the opposite. We benchmarked Claude Code and Codex on real Hugging Face Hub tasks (~1,000 graded runs), with two setups: the agent-optimized hf CLI vs the agent hand-rolling curl or SDK calls from scratch. Hand-rolling burns up to 6x more tokens on multi-step tasks and fails more often (84% vs 94% task success). And that's just dropping one abstraction layer. It would obviously be orders of magnitude more tokens and a dramatically higher failure rate if the agent tried to bypass HF altogether and rebuild model hosting, versioning, and distribution from scratch. Every time an agent re-derives a workflow from raw API calls, you pay for that reasoning in tokens. every single run. a good CLI compresses that entire chain into a few high-level commands the agent can't get wrong. In a world where everyone is complaining tokens are too expensive, abstraction is leverage: thousands of hours of design decisions your agent doesn't have to re-reason about at inference time. Good tools are cached intelligence for agents! So no, agents won't rebuild everything from scratch. they'll gravitate to the most token-efficient tools, because that's what their owners pay for. The software that survives won't just be accessible to agents, it will be accurate and cheap for them to drive. We're seeing it happen with HF, which is becoming the platform for agents to use AI: ~49M requests in just two months, and growing fast! huggingface.co/blog/hf-cli-f…
1
357
Damien Henry retweeted
Token costs are why there will be no saas apocalypse / good dev tools are cached intelligence for agents! The popular theory goes: agents can write code, so they'll just rebuild every tool from scratch and hit raw APIs. no more dev tools, no more CLIs, no more software layers. just agents and endpoints! We just tested this and the data says the opposite. We benchmarked Claude Code and Codex on real Hugging Face Hub tasks (~1,000 graded runs), with two setups: the agent-optimized hf CLI vs the agent hand-rolling curl or SDK calls from scratch. Hand-rolling burns up to 6x more tokens on multi-step tasks and fails more often (84% vs 94% task success). And that's just dropping one abstraction layer. It would obviously be orders of magnitude more tokens and a dramatically higher failure rate if the agent tried to bypass HF altogether and rebuild model hosting, versioning, and distribution from scratch. Every time an agent re-derives a workflow from raw API calls, you pay for that reasoning in tokens. every single run. a good CLI compresses that entire chain into a few high-level commands the agent can't get wrong. In a world where everyone is complaining tokens are too expensive, abstraction is leverage: thousands of hours of design decisions your agent doesn't have to re-reason about at inference time. Good tools are cached intelligence for agents! So no, agents won't rebuild everything from scratch. they'll gravitate to the most token-efficient tools, because that's what their owners pay for. The software that survives won't just be accessible to agents, it will be accurate and cheap for them to drive. We're seeing it happen with HF, which is becoming the platform for agents to use AI: ~49M requests in just two months, and growing fast! huggingface.co/blog/hf-cli-f…
95
94
544
115,079
Damien Henry retweeted
Full text
12
105
465
32,377
. @NotionHQ Thumbs up and downs for every block, please.
66
I had the chance to work with people with world-class "taste," and I can confirm their talent is supported by an enormous amount of work. The part that looks like "pure taste" is just the part that I was not able to explain. It doesn't mean there was no explanation.
the cost of shipping code went to zero taste didn't but "taste" sounds mystical and unfixable, so nobody teaches it. here's the unmystical version: taste is just an eval you haven't written down yet how you choose what to measure is what matters 1/8
180
GPT 4o is back?
Jun 3
It's time to fly.
209
This!
Feels quite magical to be able to clone a 68 TB dataset to my private HF training bucket while I only have a 4TB local disk, all of that in less than a minute thanks to HF infra optimizations & xet dedup!
270
If you have not played with an image UMAP before you should try this one!
Replying to @CChadebec
We are also releasing an interactive UMAP visualization tool, which allows users to navigate the dataset and a retrieval space to extract specific sub-distributions. 🤗 UMAP: huggingface.co/spaces/jasper… 🤗 Retrieval space: huggingface.co/spaces/jasper…
1
4
395
This is one of the coolest things you'll see today! And there are multiple interactive demos to play with as well!
📢 New @heyjasper release ! 📢 MONET 🌸 : An Apache2.0 deduped and recaptioned dataset of 105M samples unlocking reproducible text-to-image research. Nano T2I 🖌️ : A codebase to train your own T2I model 🤗 @huggingface: huggingface.co/datasets/jasp… 💻: github.com/gojasper/nano-t2i Very excited about this new release, pushing the boundaries of open and reproducible T2I research. Congrats to the team! Benjamin Aubin Gonzalo Quintana @onurxtasar @UlaLaParis @_jeev2 @dh7net @clipdropapp @heyjasperai
2
3
291
The very best feature of "agents" is that they can stay on hold. You can restart something started a long time ago, or build on it, without any loss. Ex.: * Find the right thread * "Can you process this file like you did last time?" Every thread is de facto a specialized agent.
1
95
I guess I need to teach my agent to make opengraph previews a bit less dry...
1
1
126
Everything will be free for humans and expensive for agents.
1
151
This!
The argument for having ownership over your own data is so much easier when you focus on the utility you gain rather than ideology. Reasoning over your own data should be permissionless 🦅
418
I guess everyone is converging to the same setup. That's almost the same thing I vibecoded. BTW that's clearly not the end game.
I'm late to the party, but cmux is great. github.com/manaflow-ai/cmux current split: codex mac app: knowledege work, learning, reading cmux codex cli: coding
426
The weirdness is already here – it's just not evenly distributed.
Spinning Mt Fuji Like A Beyblade 🤯
389
Home lab update: the various computers in my basement can now run several Claude & Codex sessions in parallel and be controlled from my laptops.
1
262
Ok, chatting with your Pi agent over WhatsApp is cool. But the truth is that WhatsApp is a terrible UI for interacting with an agent. Who is making a mobile app to connect to a local agents in a more elegant way?
1
2
327
Adding a "Human checked" column for most of my Notion databases now.
1
87