Caleb Eom

Caleb Eom

112 Photos and videos

Tweets

Caleb Eom

@calebfoundry

Jun 12

huge fan of @bycloudai ever since I started my AI YT channel. i feel so honoured to finally meet him in person.. thanks for inspiring me!

124

Caleb Eom

Caleb Eom

@calebfoundry

Jun 11

Nemotron 3 Full Breakdown With the help of Joey Conway from @NVIDIAAI getting into the specifics around why Nemotron 3 is kind of a big deal Biggest headline with Nemotron is: Hybrid Mamba Transformer, Latent MoE, and MTP Hybrid Mamba Transformer essentially attacks right at the Attention mechanism to make the overhead sub-quadratic, but unlike quantizing KV Cache or swapping out attention head, NVIDIA chose Mamba-2 Latent MoE helps further optimize on sparsity by down projecting the dimensions so you're doing less math and less memory movement between HBM and SRAM, you're saving a ton, and NVIDIA made a conscious choice to add more experts given the surplus Finally, MTP or multi token prediction where the model can see future tokens to be more expressive in training and also option to use for speculative decoding during inference Oh, also the model adopts the new OpenMDW 1.1 License

12:12

153

47,360

Caleb Eom

Caleb Eom

@calebfoundry

Jun 8

MiniMax M3 ditched full attention and adopts sparse attention This is yet another trend as more labs are focusing on token efficiency and inference throughput which M3 model demonstrates which cleverly in the M3 architecture in how KV is processed I'm personally impressed by the I/O between HBM and SRAM and how tokens are read in tiles contiguously - not wasting operations. Great work @MiniMax_AI

8:19

598

Caleb Eom

Caleb Eom

@calebfoundry

Jun 5

in case the first message wasn't clear

183

Caleb Eom

Caleb Eom

@calebfoundry

Jun 3

Pi framework that built OpenClaw So many coding agents these days all look the same and feel the same. Pi goes against the current by shedding weights rather than gaining more And harness is changing with ebbs and flow which means being a minimalist adds durability

5:34

302

Caleb Eom

Caleb Eom

@calebfoundry

Jun 1

Thank you for hosting me. And huge thanks to greg and tilde for the interview 🐐

Google Cloud Tech

@GoogleCloudTech

Jun 1

AI is a five-layer cake with an application, model, infrastructure, chip, and energy layer. While most focus on agents, the biggest bottleneck down the line might actually be the energy layer. Hear how @calebfoundry breaks down the full AI stack → goo.gle/3PWdfVn

0:59

346

Caleb Eom

Caleb Eom

@calebfoundry

May 28

Typical day: Gemini, ChatGPT, Claude for research

178

Caleb Eom

Caleb Eom

@calebfoundry

May 27

California weather is not friendly to my hair but here's a quick interview about me and my journey as a content creator covering AI. Shout out to Greg and Tilde for the interview! youtu.be/TjPr_-X0Mko?si=w0Y1…

Mastering AI stacks for software engineers

In this interview from Google I/O, Greg Bagues sits down with Caleb...

youtube.com

251

Caleb Eom

Caleb Eom

@calebfoundry

May 22

Brief history of harness engineering In case the buzzwords around what harness even is and why we're even talking about harness in general, here's a video explaining why the industry evolved from prompt, to context, and now to harness engineering. Enjoy.

7:06

395

Caleb Eom

Caleb Eom

@calebfoundry

May 20

Great chat with @Sam_Witteveen last night. Sam is the 🐐 with insane insights into the AI industry.

629

Caleb Eom

Caleb Eom

@calebfoundry

May 20

Assuming YC team of 2-3 people, you're trading around 3 years of running Codex (~70TPS at $14/M tokens running 24/7) for equity. I wouldn't take the deal.

Tyler Bosmeny

@bosmeny

May 20

A mic drop moment @ycombinator tonight @sama just offered $2M in OpenAI tokens to EVERY YC startup in the current batch in exchange for equity Just like Yuri Milner offering to invest in every startup back when Sam was a YC partner I can't wait to see what's unlocked when you let the most driven, creative and formidable founders tokenmaxx

1,751

Caleb Eom

Caleb Eom

@calebfoundry

May 19

Ran into @twominutepapers at Google IO and Dr. Károly said he's seen my videos 🤯

4,047

Caleb Eom

Caleb Eom

@calebfoundry

May 19

Gemini's new AI glasses encapsulates my cultural identity ( Nano Banana)

284

Caleb Eom

Caleb Eom

@calebfoundry

May 19

Gemini 3.5 Pro not being released in Google IO is somewhat suspect..

132

17,195

Caleb Eom

Caleb Eom

@calebfoundry

May 19

Gemini 3.5 Flash in benchmark

281

Caleb Eom

Caleb Eom

@calebfoundry

May 19

and the plot thickens...

Andrej Karpathy

@karpathy

May 19

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

400

Caleb Eom

Caleb Eom

@calebfoundry

May 19

Google IO about to start 😇

839

Caleb Eom

Caleb Eom

@calebfoundry

May 19

Got here early at Google IO..

620

Captain Insight

Caleb Eom retweeted

Captain Insight

@CaptainInsightX

May 15

OpenAI spent billions on training infrastructure. Two Aussie brothers made AI training 30x faster ~ with $500K total. 🤯 Meet Daniel & Michael Han 🇦🇺 > Brothers from Sydney, Australia > Daniel was an engineer at NVIDIA > Sped up the t-SNE algorithm 2000x. Cut SVD memory in half. > Found and fixed 20 bugs in Meta’s Llama, Google’s Gemma, Mistral, and Phi > Big AI labs missed bugs in their own models. He caught them. > Started Unsloth in December 2023 with his brother Michael > Built tools that make LLM fine-tuning 2-30x faster, with 70-90% less memory Released it 100% open source. Free for everyone. 🚀 > 64,000 GitHub stars > 10 million model downloads every month > NASA and Canva use their code > Raised only $500K total in seed funding > Got into Y Combinator S24 > Led by two brothers with a small team of 8 shipping code While big labs burn billions, they made AI accessible to everyone. Absolute Legends 🐐

133

1,503

63,241

Caleb Eom

Caleb Eom

@calebfoundry

May 16

touching some grass with my homeboy Logan

197