Joined March 2022
151 Photos and videos
Pinned Tweet
Apr 30
Runpod Flash is GA ⚡️ Flash is a Python SDK that lets you define infrastructure and deploy AI workloads directly from your terminal. Check it out: github.com/runpod/flash Blog: runpod.io/blog/flash-is-ga
2
2
10
4,879
Jun 12
The most costly inference failures usually aren't in your model. They're at the boundary between two systems that were never designed to talk to each other. Remove the seams and the failures go with them. That's how Scatter Lab runs 1,000 RPS in production on Runpod. runpod.io/case-studies/how-s…
1
151
Jun 11
Getting GPUs right now is already hard. Once you finally have them, you're staring down node networking, cross-machine scaling, and failure recovery. The work nobody warned you about. Compute and software should be designed together. That's what we're building.
2
220
Jun 11
Runpod is now in the @thoughtworks Technology Radar (Vol. 34). Worth exploring for teams that need flexible, cost-effective GPU infrastructure for AI workloads without hyperscaler lock-in. Read the full write-up here: thoughtworks.com/en-us/radar…
1
1
4
124
Jun 8
Creative studios usually scale their ambition to fit their hardware. TOOL flipped it. A render that used to take 27 hours now runs across 27 instances at once on Runpod, then stops when the work is done. Read the full story → runpod.io/case-studies/how-t…
1
232
Jun 5
"We scaled our workloads 10x without worrying about GPU shortages or excessive costs." That's coming from Segmind, a GenAI platform powering visual content generation. Want to know how they did it? Read their full story here: runpod.io/case-studies/how-s…
3
1,354
Jun 4
AI for image generation is shifting faster than most teams realize, and we have the receipts. Our new State of AI report runs on production data from 750,000 developers. Discover all the trends that we found here: runpod.io/the-state-of-ai-pd…
2
278
Jun 3
The bottleneck isn't GPUs anymore. It's the three to five clouds most AI teams stitch together to get a model to production. So as of today, we'll start calling Runpod the AI Developer Cloud. Read Zhen's take on this through the link below. runpod.io/blog/the-chips-got…
1
1
4
485
Jun 1
👏👏Congrats to the @MeckaAI team on their Series A. Great to see what they're building on Runpod.
Today @MeckaAI is announcing $60M in funding to become the data and deployment layer for physical AI This raise will allow us to scale our data infrastructure, invest into new verticals, and deploy robots into the real world
6
812
Jun 1
"The AI market looks nothing like the narrative." Our CTO, Brennen Smith, went on TFiR to talk about what we're actually seeing in production. Some of it will confirm what you've heard. A lot of it won't. Curious what the data actually shows? Find the full interview here: tfir.io/runpod-state-of-ai-b…
1
4
463
May 29
We just launched Multi-Instance GPU (MIG) on Runpod Serverless. It partitions the RTX 6000 Pro into isolated 24 GB instances, each with dedicated memory and compute. So if your workload fits in 24 GB, now you can pay for 24 GB. Read the blog post to get started: runpod.io/blog/multi-instanc…
2
1
10
659
May 28
Most teams running vLLM are using default settings. That's a 2-3x cost penalty with no performance upside. We benchmarked the configurations that actually matter, vLLM and SGLang, and turned them into a playbook. Settings, benchmarks, and copy-paste templates, all that stuff. Get the full playbook here: runpod.io/articles/guides/ll…
1
335
Runpod retweeted
May 18
BREX DATA: Spring 2026’s top 25 fastest-growing software vendors.
14
29
208
130,610
Runpod retweeted
May 6
after extensive research, @runpod labs is happy to present: Poddy, your new pet in codex! npx petdex install poddy
2
2
14
2,075
Runpod retweeted
Apr 24
introducing wandler.ai - inference server based on @huggingface transformers.js - OpenAI-compatible - runs on mac, linux & win via cuda, coreml, dml, webgpu, wasm, cpu - tested llms from @liquidai, @Alibaba_Qwen & @GoogleDeepMind - embeddings & speech-to-text - works with @NousResearch Hermes - built in ts - open source as MIT - the first ever project from @runpod labs github.com/runpod-labs/wandl…
3
3
23
3,842
Runpod retweeted
It is crazy that now using the @googlegemma family of models, autoresearch, Codex or Claude Code, @elves_skill, @UnslothAI, and $20 to spend on @runpod, you can now get an industrial quality model that can be deployed on CPU done over a weekend with minimal observation even needed I actually trained two such models this weekend, one using Gemma 4 and the other using T5 Gemma 2 Not that long ago, these models would’ve been $10k-$50k and would’ve taken several people months The fact that the price and time have dropped so dramatically opens up all sorts of new use-cases that previously wouldn’t have made sense economically Wild times
5
5
41
4,894
Apr 10
Runpod just shipped Cost Centers. Now you get native spend tracking by team, project, or department, right in the console. Label any resource (Pods, Serverless, Network Volumes, Instant Clusters) and get per-label spend breakdowns on your monthly invoice. Unlabeled resources auto-group by creator. If you're running GPU workloads across multiple teams or projects and tired of spreadsheet reconciliation, this is a big one. Now in public beta for Runpod Teams.
2
13
34
2,379
Mar 26
We put a chat interface in the Runpod console. 23 tools across the full REST API. If you can do it in the dashboard, you can ask for it in chat. Find it in the console ;)
1
7
2,058
Mar 26
Runpod is now natively supported in @transformerlab! Add your API key and get experiment tracking, automatic checkpointing with failure recovery, persistent artifact storage, and interactive sessions (Jupyter, VSCode, vLLM) on your Runpod GPUs. Get started 👇
🚀 Support for Runpod is live on Transformer Lab for Teams. Add your Runpod API key and start running workloads on Transformer Lab for Teams using Runpod instances. What you can do: ⚡ Queue workloads to run automatically or reserve an on-demand instance with Jupyter, VSCode and vLLM on dedicated Runpod GPUs 🧪 Submit training and eval jobs with built-in experiment tracking 🔄 Automate checkpointing and failure recovery. If an instance drops, your job restarts from the last saved checkpoint 💾 Store artifacts persistently, so model weights and eval results are accessible after the Runpod instance terminates 🔗 Supports SLURM and SkyPilot so teams that use Runpod alongside on-prem clusters can manage everything from a unified interface Get started here: lab.cloud/blog/runpod-no-gpu…
3
11
2,086