Pete Cheslock

Pete Cheslock

2,573 Photos and videos

Tweets

Pete Cheslock @petecheslock

May 19

Hey Boston Friends. Join us next week during Boston Tech Week to learn more about llm-d, open source distributed inferencing on kubernetes. Special thanks to @RedHat and @Google for helping to plan and sponsoring this free event! luma.com/eqbc1gxq

Open Source Distributed AI Inference (llm-d/vLLM) Meetup · Luma

Open Source Distributed AI Inference (llm-d/vLLM) Meetup Boston/Cambridge Hosted by Google Cloud, Red Hat AI, and the llm-d Community Date: Thursday, May 28th…

luma.com

105

Pete Cheslock

Pete Cheslock @petecheslock

May 14

Boston Friends! Come and join us for what's shaping up to be a great event! May 28th at 5pm - Google Cambridge office.

llm-d @_llm_d_

May 12

Boston AI Devs! 🏙️ Join the llm-d meetup on May 28 during Boston Tech Week. Hear the latest in LLMs from: 🎙️ Tyler Michael Smith (@RedHatAI) 🎙️ Sean Horgan (@Google) 🎙️ Peter Tanski (@CapitalOne) Huge thanks to @Google for the support! 🎟️ Register: luma.com/eqbc1gxq

144

Pete Cheslock

Pete Cheslock @petecheslock

Apr 29

We’re introducing an agentic OS prototype! Check out the demo and start building. sprou.tt/12sUilhIwa6

Building a hardened, image-based foundation for AI agents

Red Hat's Emerging Technologies team developed a community operating system image for running AI agents: an agentic OS prototype. It is built using fedora-bootc, a community project that allows for...

redhat.com

138

Yuan (Terry) Tang

Pete Cheslock retweeted

Yuan (Terry) Tang

@TerryTangYuan

Apr 14

📢 𝗧𝗵𝗲 𝗦𝘁𝗮𝘁𝗲 𝗼𝗳 𝗠𝗼𝗱𝗲𝗹 𝗦𝗲𝗿𝘃𝗶𝗻𝗴 𝗖𝗼𝗺𝗺𝘂𝗻𝗶𝘁𝗶𝗲𝘀: 𝗔𝗽𝗿𝗶𝗹 𝗘𝗱𝗶𝘁𝗶𝗼𝗻 𝗶𝘀 𝗼𝘂𝘁! Our goal with this newsletter is to give a clear, community-driven view of what’s happening across the model serving ecosystem, including updates from projects like @vllm_project, KServe, @_llm_d_, @kubernetesio, Llama Stack, and more. 👉 Check out the April newsletter here: inferenceops.substack.com/p/… 👉 Subscribe to get future issues in your inbox: inferenceops.substack.com/ 🚀 Thanks to everyone who subscribed so far! Kudos to all contributors to this edition! Francisco Arceo, Pete Cheslock, Jooho Lee, Pierangelo Di Pilato, Nir Rozenbaum, Yuan Tang, Wentao Ye, Sasa Zelenovic

1,203

Red Hat AI

Pete Cheslock retweeted

Red Hat AI

@RedHat_AI

Mar 19

vLLM meetup is coming to Boston on March 31! Workshop evening sessions covering: - @vllm_project update - Model compression and speculative decoding - Agentic AI with vLLM - Distributed inference at scale with @_llm_d_ and Kubernetes Pre-event workshop at 3:30 PM: Deploy Llama 3.1 8B and benchmark llm-d's cache-aware routing live. Shoutout to our sponsors: @RedHat, @IBM, @NVIDIAAI, The Open Accelerator, and @MITIBMLab! Register here 👇 luma.com/4rmkrrb7

vLLM Inference Meetup · Boston · Luma

Deep technical sessions. Live demos. Real conversations. If you're deploying, or scaling LLM inference, this is the room to be in. Join Red Hat AI, IBM,…

luma.com

12,114

Red Hat

Pete Cheslock retweeted

Red Hat

@RedHat

Mar 26

Red Hat is working with industry leaders to develop llm-d, an open-source project that optimizes how models are served to your users. By routing requests to the most efficient GPU and separating prefill from decode, you get faster results for less spend. Check out Pete Cheslock's quick overview of how llm-d is changing the game for Kubernetes-based AI: red.ht/3PbTkkP #KubeCon #CloudNativeCon

0:39

2,143

Pete Cheslock

Pete Cheslock @petecheslock

Mar 26

ICYMI: llm-d is officially a @CNCF Sandbox project! 🚀 We’re evolving #Kubernetes into SOTA AI infrastructure through a powerhouse coalition including @RedHat , @googlecloud , @IBMResearch, @NVIDIA, @MistralAI, @huggingface , and many more. cncf.io/blog/2026/03/24/welc…

Welcome llm-d to the CNCF: Evolving Kubernetes into SOTA AI infrastructure

We are thrilled to announce that llm-d has officially been accepted as a Cloud Native Computing Foundation (CNCF) Sandbox project! As generative AI transitions from research labs to production…

cncf.io

100

Pete Cheslock

Pete Cheslock @petecheslock

Mar 24

Wondering what llm-d is? It's the open source project simplifying LLM deployment! Run any model on any accelerator, on any cloud. #llm-d #OpenSource #AI #Kubernetes #KubeCon

0:39

110

llm-d

Pete Cheslock retweeted

llm-d @_llm_d_

Mar 24

It’s official: llm-d has joined the @CNCF! 🚀 Our mission to evolve Kubernetes into SOTA AI infrastructure just got a massive boost. This milestone belongs to our amazing community. Thank you for building this with us. 💜 We’re just getting started! 🔗 cncf.io/blog/2026/03/24/welc…

Welcome llm-d to the CNCF: Evolving Kubernetes into SOTA AI infrastructure

We are thrilled to announce that llm-d has officially been accepted as a Cloud Native Computing Foundation (CNCF) Sandbox project! As generative AI transitions from research labs to production…

cncf.io

143

9,992

Pete Cheslock

Pete Cheslock @petecheslock

Mar 24

.@Redhat is contributing llm-d (@_llm_d_) to @cloudnativefdn as a Sandbox project. This isn't just a hand-off of code. It’s a commitment to making high-performance #AI serving a core, portable capability of the cloud-native stack. #KubeCon #CloudNativeCon sprou.tt/1tpPWTpSa85

Why we’re contributing llm-d to the CNCF: Standardizing the future of AI

Red Hat is contributing llm-d to the Cloud Native Computing Foundation (CNCF) as a Sandbox project to standardize high-performance, distributed AI inference serving within the cloud-native stack....

redhat.com

Pete Cheslock

Pete Cheslock @petecheslock

Mar 19

For all my local Boston friends. If you are interested in vLLM/llm-d and inference at scale you should join us!

Red Hat AI

@RedHat_AI

Mar 19

135

Yuan (Terry) Tang

Pete Cheslock retweeted

Yuan (Terry) Tang

@TerryTangYuan

Mar 9

📢 𝗧𝗵𝗲 𝗦𝘁𝗮𝘁𝗲 𝗼𝗳 𝗠𝗼𝗱𝗲𝗹 𝗦𝗲𝗿𝘃𝗶𝗻𝗴 𝗖𝗼𝗺𝗺𝘂𝗻𝗶𝘁𝗶𝗲𝘀: 𝗠𝗮𝗿𝗰𝗵 𝗘𝗱𝗶𝘁𝗶𝗼𝗻 𝗶𝘀 𝗼𝘂𝘁! We launched our newsletter publicly last year to share our contributions to upstream communities from our @RedHat_AI teams. We’ve gained over 𝟭𝟯𝟬𝟬 𝘀𝘂𝗯𝘀𝗰𝗿𝗶𝗯𝗲𝗿𝘀! Our goal with this newsletter is to give a clear, community-driven view of what’s happening across the model serving ecosystem, including updates from @vllm_project, KServe, @_llm_d_, @kubernetesio, and Llama Stack. 👉 Check out the March newsletter here: inferenceops.substack.com/p/… 👉 Subscribe to get future issues in your inbox: inferenceops.substack.com/ 🚀 Thanks to everyone who subscribed so far! Kudos to all contributors to this edition! @franciscojarceo, Pete Cheslock, Sean Condon, Jooho Lee, Pierangelo Di Pilato, Ran Pollak, Nir Rozenbaum, @TerryTangYuan, Wentao Ye

833

Red Hat AI

Pete Cheslock retweeted

Red Hat AI

@RedHat_AI

Mar 8

We’ll cover all of this and more during our distributed inference meetup in New York City on March 11, 2026: luma.com/0crwqwg4

Distributed Inference Meetup NYC · Luma

llm-d Distributed Inference Meetup NYC Hosted by Red Hat AI, IBM Research, and AMD, this event takes place on March 11, 2026 in New York City. What to…

luma.com

826

Pete Cheslock

Pete Cheslock @petecheslock

Mar 6

LFG!!!!

Boston Celtics

@celtics

Mar 6

Injury Report Update: Jayson Tatum - AVAILABLE

117

Pete Cheslock

Pete Cheslock @petecheslock

Mar 5

DON'T TOY WITH MY EMOTIONS

Boston Celtics

@celtics

Mar 5

Injury Report for tomorrow vs. DAL: Jayson Tatum - Right Achilles Repair - QUESTIONABLE

Pete Cheslock

Pete Cheslock @petecheslock

Mar 4

If you are in NYC next Wednesday, come and join us to learn how to scale AI Inference on Kubernetes with the llm-d project.

llm-d @_llm_d_

Mar 4

What’s on the agenda for next Wednesday's NYC meetup? 🛠️ Intro to llm-d 0.5 ⚡️ Distributed LLM serving on AMD 🧠 Lessons scaling Wide-EP and MoE 💾 KV-cache offloading & prefix scheduling Join us building the future of open-source inference. Details: luma.com/0crwqwg4

108

llm-d

Pete Cheslock retweeted

llm-d @_llm_d_

Mar 4

Distributed Inference Meetup NYC · Luma

llm-d Distributed Inference Meetup NYC Hosted by Red Hat AI, IBM Research, and AMD, this event takes place on March 11, 2026 in New York City. What to…

luma.com

668

llm-d

Pete Cheslock retweeted

llm-d @_llm_d_

Mar 2

Join us next week in NYC with the llm-d community for a deep dive into distributed inference. We’re talking llm-d 0.5, scaling MoE models, and KV-cache offloading. If you're building LLM infra, don't miss this. 📅 March 11th 📍1 Madison Ave Register: luma.com/0crwqwg4

Distributed Inference Meetup NYC · Luma

llm-d Distributed Inference Meetup NYC Hosted by Red Hat AI, IBM Research, and AMD, this event takes place on March 11, 2026 in New York City. What to…

luma.com

1,063

Pete Cheslock

Pete Cheslock @petecheslock

Feb 13

RT @TerryTangYuan: We'd like to announce that @kubernetesio WG Serving has succeeded and will be disbanded! Thank you everyone who have pa…

Ernesto Rivera

Pete Cheslock retweeted

Ernesto Rivera

@ernestobrivera

Jan 23

Great talk last night by @julianeagu (@QuotientAI), @thejackobrien (Subconscious), and @petecheslock (Red Hat)! LLMs as we know it today must change to meet the capacity we expect of them. Specialized agents, changing their hardware architecture, or funneling proper context!!

606