Damien Henry

Tweets

Pinned Tweet

Damien Henry @dh7net

Feb 4

I'm building my second brain, One POC at a time: damien-henry.com Chat chatwithpg.com Learn fix-typos.com Native UI sqsquare.com Agentic UI mindcache.dev Short Term Memory dh7.github.io/own-your-data/ Long Term Memory

1,008

Damien Henry retweeted

Jasper

@heyjasperai

Jun 11

.@dh7net, SVP of Image Research, said it best: "The HF infra is a no-brainer." A big unlock for teams working with large datasets for training, especially when they update over time. Read how Jasper used @huggingface as the creation and storage backbone for MONET: huggingface.co/storage/testi…

15,409

Damien Henry @dh7net

Jun 5

Very true! Just because agents can do anything doesn't mean they should!

clem 🤗

@ClementDelangue

Jun 5

Token costs are why there will be no saas apocalypse / good dev tools are cached intelligence for agents!
The popular theory goes: agents can write code, so they'll just rebuild every tool from scratch and hit raw APIs. no more dev tools, no more CLIs, no more software layers. just agents and endpoints!
We just tested this and the data says the opposite.
We benchmarked Claude Code and Codex on real Hugging Face Hub tasks (~1,000 graded runs), with two setups: the agent-optimized hf CLI vs the agent hand-rolling curl or SDK calls from scratch. Hand-rolling burns up to 6x more tokens on multi-step tasks and fails more often (84% vs 94% task success).
And that's just dropping one abstraction layer. It would obviously be orders of magnitude more tokens and a dramatically higher failure rate if the agent tried to bypass HF altogether and rebuild model hosting, versioning, and distribution from scratch.
Every time an agent re-derives a workflow from raw API calls, you pay for that reasoning in tokens. every single run. a good CLI compresses that entire chain into a few high-level commands the agent can't get wrong.
In a world where everyone is complaining tokens are too expensive, abstraction is leverage: thousands of hours of design decisions your agent doesn't have to re-reason about at inference time. Good tools are cached intelligence for agents!
So no, agents won't rebuild everything from scratch. they'll gravitate to the most token-efficient tools, because that's what their owners pay for. The software that survives won't just be accessible to agents, it will be accurate and cheap for them to drive.
We're seeing it happen with HF, which is becoming the platform for agents to use AI: ~49M requests in just two months, and growing fast!
huggingface.co/blog/hf-cli-f…

357

Damien Henry retweeted

clem 🤗

@ClementDelangue

Jun 5

544

115,079

Damien Henry retweeted

Michał Podlewski

@trajektoriePL

Jun 4

105

465

32,377

Damien Henry @dh7net

Jun 5

.@NotionHQ Thumbs up and thumbs down for every block, please.

Damien Henry @dh7net

Jun 5

I had the chance to work with people with world-class "taste," and I can confirm their talent is supported by an enormous amount of work. The part that looks like "pure taste" is just the part that I was not able to explain. It doesn't mean there was no explanation.

Diana

@sdianahu

Jun 4

the cost of shipping code went to zero
taste didn't
but "taste" sounds mystical and unfixable, so nobody teaches it.
here's the unmystical version: taste is just an eval you haven't written down yet
how you choose what to measure is what matters
1/8

180

Damien Henry @dh7net

Jun 4

GPT 4o is back?

OpenAI

@OpenAI

Jun 3

It's time to fly.

209

Damien Henry @dh7net

Jun 1

This!

clem 🤗

@ClementDelangue

Jun 1

Feels quite magical to be able to clone a 68 TB dataset to my private HF training bucket while I only have a 4TB local disk, all of that in less than a minute thanks to HF infra optimizations & xet dedup!

270

Damien Henry @dh7net

May 28

If you have not played with an image UMAP before you should try this one!

Clément Chadebec

@CChadebec

May 28

Replying to @CChadebec

We are also releasing an interactive UMAP visualization tool, which allows users to navigate the dataset and a retrieval space to extract specific sub-distributions. 🤗 UMAP: huggingface.co/spaces/jasper… 🤗 Retrieval space: huggingface.co/spaces/jasper…

395

Damien Henry @dh7net

May 28

This is one of the coolest things you'll see today! And there are multiple interactive demos to play with as well!

Clément Chadebec

@CChadebec

May 28

📢 New @heyjasper release ! 📢 MONET 🌸 : An Apache2.0 deduped and recaptioned dataset of 105M samples unlocking reproducible text-to-image research. Nano T2I 🖌️ : A codebase to train your own T2I model 🤗 @huggingface: huggingface.co/datasets/jasp… 💻: github.com/gojasper/nano-t2i Very excited about this new release, pushing the boundaries of open and reproducible T2I research. Congrats to the team! Benjamin Aubin Gonzalo Quintana @onurxtasar @UlaLaParis @_jeev2 @dh7net @clipdropapp @heyjasperai

291

Damien Henry @dh7net

May 27

The very best feature of "agents" is that they can stay on hold. You can restart something started a long time ago, or build on it, without any loss. Ex.:
* Find the right thread
* "Can you process this file like you did last time?"
Every thread is de facto a specialized agent.

Damien Henry @dh7net

May 24

I didn't find a way to compare generative models side by side easily, so I made this: imagebench.ai/gallery LMKWYT

The only generative image benchmark that shows the images

See the actual generated images side by side. Independent generative image benchmark — every output published, not just scores.

imagebench.ai

327

Damien Henry @dh7net

May 25

I guess I need to teach my agent to make opengraph previews a bit less dry...

126

Damien Henry @dh7net

May 25

Everything will be free for humans and expensive for agents.

151

Damien Henry @dh7net

May 23

This!

Edward Miller

@TweetEdMiller

May 22

The argument for having ownership over your own data is so much easier when you focus on the utility you gain rather than ideology. Reasoning over your own data should be permissionless 🦅

418

Damien Henry @dh7net

May 23

I guess everyone is converging on the same setup. That's almost the same thing I vibecoded. BTW that's clearly not the end game.

Peter Steinberger 🦞

@steipete

May 23

I'm late to the party, but cmux is great. github.com/manaflow-ai/cmux
current split:
codex mac app: knowledge work, learning, reading
cmux codex cli: coding

426

Damien Henry @dh7net

May 23

The weirdness is already here – it's just not evenly distributed.

Zack D. Films

@zackdfilms1

May 22

Spinning Mt Fuji Like A Beyblade 🤯

389

Damien Henry @dh7net

May 12

Home lab update: the various computers in my basement can now run several Claude & Codex sessions in parallel and be controlled from my laptops.

262

Damien Henry @dh7net

May 12

Ok, chatting with your Pi agent over WhatsApp is cool. But the truth is that WhatsApp is a terrible UI for interacting with an agent. Who is making a mobile app to connect to a local agent in a more elegant way?

327

Damien Henry @dh7net

May 12

Adding a "Human checked" column to most of my Notion databases now.