Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

2,776 Photos and videos

Tweets

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 13

Last week NVIDIA and Microsoft announced the release of the NVIDIA RTX Spark, and it's really exciting to see that unified memory is finally coming outside of the M-Mac chips, and it seems that NVIDIA is indeed going all in. RTX Spark Announcement: youtube.com/watch?v=11Y3B33o…

Announcing NVIDIA RTX Spark | GTC Taipei 2026 Keynote by CEO Jensen...

NVIDIA CEO Jensen Huang announces the NVIDIA RTX Spark, a new super...

youtube.com

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 12

LLM inference has a routing problem: once every request can depend on cache locality, GPU specialization, and multi-step execution, the router directly affects latency cost. Part 1: modular.com/blog/why-llm-inf… Part 2: modular.com/blog/why-llm-inf… Part 3: modular.com/blog/why-llm-inf…

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 11

Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering. Paper: arxiv.org/abs/2601.14470

Tokenomics: Quantifying Where Tokens Are Used in Agentic Software...

LLM-based Multi-Agent (LLM-MA) systems are increasingly applied to automate complex software engineering tasks such as requirements engineering, code generation, and testing. However, their...

arxiv.org

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 10

Stanford has published their CS336 Undergraduate Module on Language Modeling from Scratch. Quality SotA learning material from Stanfrod? For free? What a time to be alive! Course: cs336.stanford.edu/ Playlist: youtube.com/watch?v=JuoVZkPB… Repo: github.com/stanford-cs336/as…

103

Awesome Machine Learning Repositories

Alejandro Saucedo | KubeCon 2025 AI Day Keynote retweeted

Awesome Machine Learning Repositories @MLRepositories

11 Feb 2023

kompute: General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and op ... Lang: C ⭐️ 1073 #MachineLearning github.com/KomputeProject/ko…

452

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 10

RT @syoyo: llama.cpp でもサポートされたし, Kompute 良さそうかのー🥺 > General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor…

GitHub - KomputeProject/kompute: General purpose GPU compute framework built on Vulkan to support...

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optim...

github.com

Jasmine

Alejandro Saucedo | KubeCon 2025 AI Day Keynote retweeted

Jasmine @chainr3d

25 May 2025

Run GPU computations on all sorts of graphics cards with Kompute. It's cross-platform, super fast, and helps with advanced data tasks. Backed by the Linux Foundation! github.com/KomputeProject/ko…

GitHub - KomputeProject/kompute: General purpose GPU compute framework built on Vulkan to support...

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optim...

github.com

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 5

An interesting release of a new massive open text-to-image dataset: The MONET dataset. Paper: arxiv.org/abs/2605.21272 Dataset: huggingface.co/datasets/jasp…

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 4

I remember when Papers With Code originally came out in 2018 it was a major breakthrough; after the meta acquisition the project slowed and then stopped, but it seems there is an attempt to revive it! paperswithcode.co/

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 3

Open AI & Anthropic have found Market Fit: simonwillison.net/2026/May/2…

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 2

Netflix is sharing their playbook for massive scale LLM fine-tuning infrastructure: netflixtechblog.com/scaling-…

101

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

Jun 1

Let’s take a trip back in time to build like it's 2010! This engineering talk from old-school YouTube-Engineering is truly a masterclass of scaling and learning, and surprisingly the lessons are as valuable today as they were back then. Talk: youtube.com/watch?v=w5WVu624…

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

May 29

NVIDIA just dropped a high efficiency open-source stack for high-resolution image, video, and world-model generation! Repo: github.com/NVlabs/Sana

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

May 28

Skforecast is making time-series foundation models much easier to test in real production forecasting workflows: it's integrating Chronos, TimesFM, Moirai, and TabICL through their new release! skforecast.org/latest/user_g…

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

May 27

Google DeepMind has just released Gemini 3.5 Flash! This is quite interesting to see as a faster agentic execution model across coding, tool use, multimodal understanding and long-horizon workflows. blog.google/innovation-and-a…

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

May 26

Benedict Evans has dropped the 2026 "AI Eats the World" deck, and here's the main highlights: GenAI so far = Huge capex first, unclear value capture, lots of hype, and only later the boring-but-transformational deployment layer. Slides: ben-evans.com/presentations

106

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

May 25

A classic! "Deep Learning Go Brrrr From First Principles" which still brings super relevant advice to AI teams today: DL go Brrr: horace.io/brrr_intro.html

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

May 22

Mozilla shared a great behind-the-scenes look at how they used Claude Mythos Preview and other models to harden Firefox, and what is really interesting is that this was not just "LLM finds bugs" but a proper end-to-end security pipeline: Mozilla Blog: hacks.mozilla.org/2026/05/be…

150

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

May 21

Google DeepMind showcases how they are bringing AlphaEvolve alive as an optimization engine for the expensive parts of ML, science and infrastructure: Announcement: deepmind.google/blog/alphaev…

0:27

Alejandro Saucedo | KubeCon 2025 AI Day Keynote

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

May 20

Stanford has just published a really interesting new dataset of real coding-agent sessions using public GitHub repos with ~6k sessions, 63K user prompts, 355K tool calls. Paper: arxiv.org/abs/2604.20779 Dataset: huggingface.co/datasets/SALT…

135