Jeremy Howard

Jeremy Howard

767 Photos and videos

Tweets

Daniel van Strien retweeted

Jeremy Howard

@jeremyphoward

Jun 13

I disagree with this decision and I don't like it. But also... HOW DID ANTHROPIC NOT SEE THIS COMING‽ It is *the* obvious response to "this is too dangerous for anyone except us to use", since that relies on a premise ("we are uniquely good") that almost no-one agrees with.

Anthropic

@AnthropicAI

Jun 13

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

232

206

3,121

207,254

MiniMax (official)

Daniel van Strien retweeted

MiniMax (official)

@MiniMax_AI

Jun 13

M3 would never 🙂‍↔️ As a matter of fact, the weights are now open, too. huggingface.co/MiniMaxAI/Min…

MiniMaxAI/MiniMax-M3 · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

Anthropic

@AnthropicAI

Jun 13

241

472

6,478

500,225

Quentin Gallouédec

Daniel van Strien retweeted

Quentin Gallouédec @QGallouedec

Jun 12

🏃‍♀️💨 hf-sandbox is faster and more reliable than ever! Use it for your agents, RL training, to serve your model, or whatever else you can think of!

0:26

2,561

clem 🤗

Daniel van Strien retweeted

clem 🤗

@ClementDelangue

Jun 9

Super excited to announce that @arcee_ai is the first major American AI lab to replace AWS S3 with Hugging Face for ALL their models and datasets, public AND private 🔥🔥🔥 Multi-million $ partnership to support American open-source AI, let’s go!

512

60,780

Kimi.ai

Daniel van Strien retweeted

Kimi.ai

@Kimi_Moonshot

Jun 12

🔗 Weights & code: huggingface.co/moonshotai/Ki…

moonshotai/Kimi-K2.7-Code · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

802

106,323

Adina Yakup

Daniel van Strien retweeted

Adina Yakup

@AdinaYakup

Jun 11

PP-OCRv6 just released by Baidu @PaddlePaddle ✨ tiny 1.5M / small 7.7M / medium 34.5M ✨ 48 languages ✨ Supports handwritten/printed/industrial/screen and card text ✨ Edge friendly deployment

461

22,418

PaddlePaddle

Daniel van Strien retweeted

PaddlePaddle

@PaddlePaddle

Jun 12

🚀PP-OCRv6 is officially released！ 🔥PaddleOCR’s new OCR model series scales from 1.5M to 34.5M parameters, bringing stronger accuracy, faster inference, and broader deployment options — from browsers and edge devices to servers. 📊What’s new: 🔸Tiny / Small / Medium models: 1.5M, 7.7M, 34.5M params 🔸 4.9% detection accuracy and 5.1% recognition accuracy over PP-OCRv5 🔸Up to 5.2× faster CPU inference with OpenVINO 🔸50 languages in one unified model 🔸New scenarios: PCB, CAD drawings, digital tubes, dot-matrix text 🔸Apache 2.0 open source ✨Lightweight OCR, built for the AI data era. 🔗Try it: 🌐 paddleocr.com 💻 github.com/PaddlePaddle/Padd… 🤗huggingface.co/collections/P… #PaddlePaddle #PaddleOCR #OCR #AI #ComputerVision #OpenSource #EdgeAI

315

23,669

Daniel van Strien

Daniel van Strien

@vanstriendaniel

Jun 12

Yesterday: diffusion LM beats AR on OCR correction. Today, after a suggestion from the @GoogleDeepMind team: bench it against its parameter-matched twin instead (same 26B MoE, same 4B active). The twin wins slightly on quality. The diffusion model is still ~10× faster. Updated scoreboard CER on 19th-c newspaper OCR (lower = better): • OCR input: 0.066 • Gemma-4-E4B: 0.042 (15.3s/passage) • DiffusionGemma: 0.035 (1.7s 🚀) • Gemma-4-26B MoE: 0.027 (16.3s) Equal capacity: diffusion trades accuracy for an order of magnitude of speed. Also tried the suggested sampler fixes (thx @joao_gante!) for the "seeded canvas just copies its input" bug. They free the model from copying but it re-derives the input instead of correcting it. Might still be some tweaking here? Full agent-written lab notebook, scripts runnable straight from the bucket: huggingface.co/buckets/davan…

davanstrien/diffusiongemma-ocr-bench - Storage Bucket

Storage bucket davanstrien/diffusiongemma-ocr-bench on Hugging Face

huggingface.co

2,484

Wauplin

Daniel van Strien retweeted

Wauplin @Wauplin

Jun 11

huggingface_hub v1.19.0 is out 🚀 Three big ones: 🔐 Keyless CI/CD auth (Trusted Publishers) 🖥️ hf:// URIs in the CLI 🌐 Expose ports on Jobs Details below 🧵

10,818

steven

Daniel van Strien retweeted

steven

@Tu7uruu

Jun 11

Happy to announce the launch of the Far-Field ASR Leaderboard! 🎉 While many ASR benchmarks focus on clean speech, real-world applications need to handle noise, reverberation, and distant microphones. This leaderboard makes it easier to evaluate speech recognition models under realistic acoustic conditions and compare their robustness across challenging environments.

125

9,236

Google AI Developers

Daniel van Strien retweeted

Google AI Developers

@googleaidevs

Jun 10

DiffusionGemma, our experimental open model released under an Apache 2.0 license, explores text diffusion, an exceptionally fast approach to text generation. Here’s how DiffusionGemma accelerates development: Faster token output: By shifting the bottleneck from memory bandwidth to raw compute, the model generates up to 4x faster token output on dedicated GPUs Accessible hardware footprint: Activates just 3.8B parameters during inference, fitting comfortably within 24GB-VRAM high-end consumer GPUs when quantized Novel workflows: Parallel token generation enables self-correction, making it ideal for code infilling, in-line editing, and non-linear structures DiffusionGemma prioritizes speed over raw quality and accelerates best on compute-bound hardware (like @NVIDIAAI GPUs). Standard @GoogleGemma 4 remains recommended for production quality and memory-bound devices.

444

116,255

Daniel van Strien

Daniel van Strien

@vanstriendaniel

Jun 11

Can @googlegemma DiffusionGemma help fix broken OCR? In theory, denoising tokens in parallel could work better for OCR correction since context is seen upfront? Pointed it at 19th-century newspaper OCR. It corrected better than the autoregressive baseline — at ~8x the speed.

0:36

410

35,980

more replies

Daniel van Strien

Daniel van Strien

@vanstriendaniel

Jun 11

For me, this was a negative result: 2–5 steps, but it barely edits with 61/75 outputs identical to the noisy input. Real text is off-distribution as noise, so the sampler just accepts it maybe? (You can try this in the demo)

614

Daniel van Strien

Daniel van Strien

@vanstriendaniel

Jun 11

Try it: live demo on @huggingface ZeroGPU with the step-by-step denoising replay, side-by-side diffs vs the human transcription, and the full benchmark table. huggingface.co/spaces/davans…

DiffusionGemma vs Gemma-4 — Post-OCR Correction - a Hugging Face Space by davanstrien

Diffusion vs autoregressive LLM on historical OCR cleanup

huggingface.co

625

Daniel van Strien

Daniel van Strien

@vanstriendaniel

Jun 10

Used an agent for a naughty no no use case while I still can: fine-tuning a domain-specific small VLM to be good at information extraction in the library/archive/historical research domain. Pointed an agent (Opus 4.8, Fabel refused....) at 🇫🇷 🇬🇧 handwritten death records and manuscript-catalogue cards. Resulted in a 4B model (from @numind_ai's NuExtract-3) that even follows schemas it's never seen. Agent found SFT beat GRPO but maybe @AnthropicAI sabotaged my GRPO experiments.... The agent labelled its own training data. All training /evals done via @huggingface Jobs. 🤗 huggingface.co/small-models-… Blog post soon!

small-models-for-glam/index-card-extractor-4b-v0.1 · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

947