A new LMCache release is here!
Release highlights:
Wider model support: LMCache now supports hybrid models including DeepSeek V4, Gemma, and Qwen 3.5/3.6 (Mamba/GDN), via per-group block sizes that handle their KV formats.
Run LMCache on CPU-only machines: a new shared-memory (SHM) transfer path for MP mode lets LMCache run across GPUs, CPUs, and other accelerators, including CPU-only deployments.
The new MP coordinator provides a global control plane: server registration, CLI, L2 quota/eviction, and a fleet-wide CacheBlend fingerprint directory.
New backends: NIXL DOCA_MEMOS, Google Cloud Bigtable, Moore Threads MUSA.
Huge thanks to all contributors who made this release possible!
Read more in the release notes: github.com/LMCache/LMCache/r… #LLM #AIInfrastructure #KVCache #LMCache
Tensormesh: From Research to $20M Round
Our CEO & Co-Founder @JunchenJiang sat down with TechBeats pod to talk KV cache, the "Big Data of AI," and how Tensormesh became the first caching-accelerated inference platform for enterprises across the GPU ecosystem.
Watch the full interview:
youtu.be/kNoVF1p5xTA #LLMInference #KVCache #AIInfrastructure
Running open-source models is easy.
Running them well in production isn't.
In today's spotlight, Nick Barcet (@nijaba), Head of GTM @tensormesh, shares what Tensormesh is built to solve, and the people and culture behind the execution.
If you're running OSS models and want dramatically more efficient inference, try @tensormesh with $100 in free GPU credits: lnkd.in/g-gXYtaV #AI #LLM #Inference #kvcache
Want CHEAP GPU cloud?
Wanna store ALL your users' history & docs as KV cache to save cost, but can't get open source to run?
Try TensorMesh SaaS:
$3.09/hr H100
No vendor lock-in
Any open-source model
OpenAI-API compatible
Join the waitlist: tensormesh.ai/beta-waitlist
Interviewing 100 Bay Area startups has always been my dream, and today I'm starting the journey.
Big thanks to @lmcache for inviting me to their lunch meeting and letting me do my first interview. Great team with a great vision; they deserve more exposure.
If you're building something exciting in the Bay Area and want to share your story, let's chat!
@JunchenJiang @nijaba
Who hasn't dreamt of creating software that'd run in space? In <24h, @SpaceX will transport the @Endurance_Cube into orbit. It runs my team's #MicroShift (@openshift for #edge devices) and enables school children to upload code for accessing the sat's sensor data. @redhatopen
ALT Website of the ENDURANCE CubeSat mission (https://endurancein.space/) showing a count-down at 00 days, 23 hours, 59 minutes, and 59 seconds.
Finally the word is out. So much work, but also a ton of fun creating @microshift_io with my team. I'm very grateful to work with so many talented engineers across Red Hat, from our partners, and from the community who contributed. #opensource dev model FTW! @RedHat @redhatopen
Meet Red Hat Device #Edge: an enterprise-ready and supported distribution of @microshift_io. Learn more about the benefits of this solution here: red.ht/3SjKRb4 #KubeCon