CalebWritesCode YT // Google Developer Expert

Joined March 2025
112 Photos and videos
huge fan of @bycloudai ever since I started my AI YT channel. i feel so honoured to finally meet him in person.. thanks for inspiring me!
1
5
124
Nemotron 3 Full Breakdown With the help of Joey Conway from @NVIDIAAI getting into the specifics around why Nemotron 3 is kind of a big deal Biggest headline with Nemotron is: Hybrid Mamba Transformer, Latent MoE, and MTP Hybrid Mamba Transformer essentially attacks right at the Attention mechanism to make the overhead sub-quadratic, but unlike quantizing KV Cache or swapping out attention head, NVIDIA chose Mamba-2 Latent MoE helps further optimize on sparsity by down projecting the dimensions so you're doing less math and less memory movement between HBM and SRAM, you're saving a ton, and NVIDIA made a conscious choice to add more experts given the surplus Finally, MTP or multi token prediction where the model can see future tokens to be more expressive in training and also option to use for speculative decoding during inference Oh, also the model adopts the new OpenMDW 1.1 License
5
19
153
47,360
MiniMax M3 ditched full attention and adopts sparse attention This is yet another trend as more labs are focusing on token efficiency and inference throughput which M3 model demonstrates which cleverly in the M3 architecture in how KV is processed I'm personally impressed by the I/O between HBM and SRAM and how tokens are read in tiles contiguously - not wasting operations. Great work @MiniMax_AI
1
9
598
in case the first message wasn't clear
5
183
Pi framework that built OpenClaw So many coding agents these days all look the same and feel the same. Pi goes against the current by shedding weights rather than gaining more And harness is changing with ebbs and flow which means being a minimalist adds durability
7
302
Thank you for hosting me. And huge thanks to greg and tilde for the interview 🐐
AI is a five-layer cake with an application, model, infrastructure, chip, and energy layer. While most focus on agents, the biggest bottleneck down the line might actually be the energy layer. Hear how @calebfoundry breaks down the full AI stack → goo.gle/3PWdfVn
10
346
Typical day: Gemini, ChatGPT, Claude for research
1
2
178
California weather is not friendly to my hair but here's a quick interview about me and my journey as a content creator covering AI. Shout out to Greg and Tilde for the interview! youtu.be/TjPr_-X0Mko?si=w0Y1…
1
5
251
Brief history of harness engineering In case the buzzwords around what harness even is and why we're even talking about harness in general, here's a video explaining why the industry evolved from prompt, to context, and now to harness engineering. Enjoy.
1
16
395
Great chat with @Sam_Witteveen last night. Sam is the 🐐 with insane insights into the AI industry.
2
11
629
Assuming YC team of 2-3 people, you're trading around 3 years of running Codex (~70TPS at $14/M tokens running 24/7) for equity. I wouldn't take the deal.
A mic drop moment @ycombinator tonight @sama just offered $2M in OpenAI tokens to EVERY YC startup in the current batch in exchange for equity Just like Yuri Milner offering to invest in every startup back when Sam was a YC partner I can't wait to see what's unlocked when you let the most driven, creative and formidable founders tokenmaxx
1
5
1,751
Ran into @twominutepapers at Google IO and Dr. Károly said he's seen my videos 🤯
1
1
17
4,047
Gemini's new AI glasses encapsulates my cultural identity ( Nano Banana)
3
284
Gemini 3.5 Pro not being released in Google IO is somewhat suspect..
8
132
17,195
Gemini 3.5 Flash in benchmark
2
2
281
and the plot thickens...
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
1
6
400
Google IO about to start 😇
2
25
839
Got here early at Google IO..
1
14
620
Caleb Eom retweeted
OpenAI spent billions on training infrastructure. Two Aussie brothers made AI training 30x faster ~ with $500K total. 🤯 Meet Daniel & Michael Han 🇦🇺 > Brothers from Sydney, Australia > Daniel was an engineer at NVIDIA > Sped up the t-SNE algorithm 2000x. Cut SVD memory in half. > Found and fixed 20 bugs in Meta’s Llama, Google’s Gemma, Mistral, and Phi > Big AI labs missed bugs in their own models. He caught them. > Started Unsloth in December 2023 with his brother Michael > Built tools that make LLM fine-tuning 2-30x faster, with 70-90% less memory Released it 100% open source. Free for everyone. 🚀 > 64,000 GitHub stars > 10 million model downloads every month > NASA and Canva use their code > Raised only $500K total in seed funding > Got into Y Combinator S24 > Led by two brothers with a small team of 8 shipping code While big labs burn billions, they made AI accessible to everyone. Absolute Legends 🐐
65
133
1,503
63,241
touching some grass with my homeboy Logan
1
4
197