Caleb

Caleb

823 Photos and videos

Tweets

Caleb

@calebfahlgren

13h

great listen!

Matan Grinberg

@matanSF

19h

Had a fun conversation with @HarryStebbings on all things Model Independence, Software Factories, pricing wars… And sprinkled in some hot takes that I may or may not regret sharing 😳

0:35

273

Caleb

Caleb

@calebfahlgren

19h

Pro Tip: you can upload your Fable traces to both PRIVATE datasets and buckets on @huggingface

2,173

Caleb

Caleb

@calebfahlgren

19h

ask your agent to upload the jsonl files huggingface.co/datasets?form…

Agent Traces – Hugging Face

Explore datasets powering machine learning.

huggingface.co

302

Elliot Arledge

Caleb retweeted

Elliot Arledge

@elliotarledge

22h

you can distill from hf fable traces btw

736

57,892

MiniMax (official)

Caleb retweeted

MiniMax (official)

@MiniMax_AI

Jun 12

MiniMax M3, Open-Weight, Now On Hugging Face , with only ~428B parameters and ~23B activated parameters Weights: huggingface.co/MiniMaxAI/Min… MiniMax Sparse Attention: huggingface.co/papers/2606.1…

MiniMaxAI/MiniMax-M3 · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

MiniMax (official)

@MiniMax_AI

Jun 1

Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas - MiniMax Sparse Attention scales context to 1M - Natively Multimodal from Step Zero API: platform.minimax.io Token Plan: platform.minimax.io/subscrib… 🚀New! MiniMax Code: code.minimax.io Weights & Tech Report in ~10 Days

114

328

2,750

627,665

Jasper

Caleb retweeted

Jasper

@heyjasperai

Jun 11

.@dh7net, SVP of Image Research, said it best: "The HF infra is a no-brainer." A big unlock for teams working with large datasets for training, especially when they update over time. Read how Jasper used @huggingface as the creation and storage backbone for MONET: huggingface.co/storage/testi…

15,423

json

Caleb retweeted

json

@JsonBasedman

Jun 11

Asked a question so dangerous they sent my ass to haiku

394

8,987

209,344

Leandro von Werra

Caleb retweeted

Leandro von Werra

@lvwerra

Jun 10

Fable is the new leader on CADGenBench! Still long way to go:

0:25

568

36,148

Andi Marafioti

Caleb retweeted

Andi Marafioti

@andimarafioti

Jun 9

ok who put Hugging Face in Rick and Morty

0:14

363

47,604

Caleb

Caleb

@calebfahlgren

Jun 9

Claude 5 Fabel is out with System Card!! LFG!! www-cdn.anthropic.com/d00db5…

788

Caleb

Caleb

@calebfahlgren

Jun 9

Cohere drops North Mini Code - an open weights model optimized for code generation > 30B-A3B > Apache 2.0 Licensed > Available on @huggingface

1,911

Caleb

Caleb

@calebfahlgren

Jun 9

huggingface.co/CohereLabs/No…

CohereLabs/North-Mini-Code-1.0 · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

180

Arcee.ai

Caleb retweeted

Arcee.ai

@arcee_ai

Jun 9

Today we're announcing a multi-million-dollar strategic partnership with @huggingface. The Hub is now the exclusive home for everything we build: every open model we release, every private run, every proprietary dataset, and every agent trace. All of it lives there.

343

32,744

clem 🤗

Caleb retweeted

clem 🤗

@ClementDelangue

Jun 9

Super excited to announce that @arcee_ai is the first major American AI lab to replace AWS S3 with Hugging Face for ALL their models and datasets, public AND private 🔥🔥🔥 Multi-million $ partnership to support American open-source AI, let’s go!

512

60,785

Cognition

Caleb retweeted

Cognition

@cognition

Jun 8

Introducing FrontierCode: a coding eval that raises the bar for difficulty & quality. Each task took 40 hrs of work by leading open-source maintainers. Models write sloppy code that works but isn’t maintainable. Our eval is first to measure: would you actually merge this code?

234

313

4,284

2,507,526

Caleb

Caleb

@calebfahlgren

Jun 8

This is super cool! Benchmark measuring how well agents can use CAD to build a valid, geometrically correct 3D part! We need more benchmarks that model real world tasks like this!

Michael Rabinovich

@MikushRab

Jun 8

Introducing CADGenBench: measure how well AI systems produce engineering-grade 3D parts! While current models can generate 3D parts, they are far from precise enough to build functional parts. We built a benchmark to systematically measure their capabilities on two tasks: 1. Generation from an engineering drawing of a part 2. Editing: given an existing STEP file and a requested change The benchmark is tool-agnostic. It makes no assumptions about how you build the model. You can vary the LLM, and you can vary the environment. Use build123d, Onshape, Autodesk, or a model without an LLM entirely. We open sourced the scoring engine and a reference baseline on top of build123d. A collaboration between Hugging Face and @mecadoinc! Submission space: huggingface.co/spaces/Huggin… Code repository: github.com/huggingface/cadge…

0:43

515

Caleb

Caleb

@calebfahlgren

Jun 8

huggingface.co/spaces/Huggin…

CADGenBench Leaderboard - a Hugging Face Space by HuggingAI4Engineering

Leaderboard for AI-driven CAD generation

huggingface.co

108

Google Gemma

Caleb retweeted

Google Gemma

@googlegemma

Jun 5

We just dropped Gemma 4 Quantization-Aware Training (QAT) checkpoints on Hugging Face! All Gemma 4 model sizes and their drafters are now optimized with QAT to cut memory requirements and maximize on-device performance!

283

2,856

500,324

Quentin Lhoest 🤗

Caleb retweeted

Quentin Lhoest 🤗@lhoestq

Jun 4

Agent traces are the new fuel. Looking fw to announce `trl` official support for agent traces for training💥 (w/ `datasets` v5, coming out tmr?) Pick your local, synthetic, or community traces and train your own specialized Agent 🔜trl sft --dataset-name julien-c/synthtraces

Julien Chaumond

@julien_c

Jun 4

Today I'm launching a new project called SynthTraces 🔥 It is a minimal codebase to generate synthetic coding agent session traces using Pi (from @badlogicgames) I wanted a large number of coding-agent traces, so I built a tiny harness where two models talk to each other: - an open model (served via HF Inference Providers) plays the coding agent. It gets read bash access to a real open source codebase (the huggingface OSS projects) - a small local model (llama.cpp) plays the human user, asking simple questions like "how do I run this?" or "how is CI set up?" The result is more than 2,000 Pi session traces which can be used to train or fine-tune LLMs, and optimize them for Pi 🤯 And ofc everything is published on @huggingface ✅

36,274

Caleb

Caleb

@calebfahlgren

Jun 4

imagine being able to resume one of @karpathy auto-research sessions to learn and try different things!

Caleb

@calebfahlgren

Jun 4

One cool hack of of agent sessions being a json file is you can just resume someone else's trace! Great for extracting valuable context or long compacted sessions! Try the prompt with your agent below!

945