Infrastructure for the interactive era.

Joined March 2026
6 Photos and videos
Jun 11
Denver, thank you πŸ™ Last week's happy hour at CVPR 2026 was everything we hoped for. A room full of sharp people and real conversation about where realtime AI media goes next. Already looking forward to the next one. πŸ‘€
1
17
Jun 4
tonight. 6PM. denver. come say hi and tell us your favorite part of the CVPR so far! if you're on the list, you've got the address. if you're not, shoot us a message and we'll sort it. luma.com/hll4zk65
1
66
uRun retweeted
"Real-time" in marketing materials usually means "we batched it and it returns in 8 seconds." Real-time means under 300ms. That's the threshold UI designers use to decide when to show a loading spinner. There's a big difference. We build for the second one at @urunml.
1
1
53
uRun retweeted
Most providers route every request to a different accelerator. Mostly fine. Until you need a stateful loop. Drift across long workflows is real. At @urunml you're pinned to the same GPU on the same machine for the whole session. Stateful by design.
1
4
207
May 15
heading to @MLSysConf 2026 next week in Bellevue if you're working on inference, real-time systems, or ML infra, come say hi. always up to trade notes on what "production" actually looks like for stateful, interactive AI workloads.
2
54
uRun retweeted
Every modality in AI follows the same arc: single-shot expensive generations to multi-turn cheap interactive loops. Text and image already went through it. Video is next.
1
1
4
83
May 13
Thank you to the early ones. πŸ™ More to come. #WhatCanuRun
1
1
4
74
uRun retweeted
Two years ago, the open problem was getting an AI video model to produce a coherent 5-second clip. Recent techniques like Long Live and self-forcing solved that piece. The new bottleneck is serving it interactively. Labs are chasing the next model. The infra layer underneath is wide open.
1
1
7
123
uRun retweeted
Real-time interactive video is the hardest workload there is. Every frame has to land inside the 300ms human-perception bar. That's why we're starting there with @urunml. The rest is downhill.
1
2
7
569
May 6
Who we most want building on uRun: creative tooling companies and the studios behind tomorrow's video games. They'll go places we can't imagine β†’ urun.sh #AIvideo #GameDev #VFX
13
17
281
uRun retweeted
Replying to @OpenAI
@OpenAI and @Anthropic both charge ~2.5x for "fast mode." The most underrated pricing signal in AI right now.
1
1
3
88
May 4
some snapshots of our launch party @ Joey the Cat in SF last week. skee-ball, open bar, and real-time AI video on every screen. thank you to everyone who came out and pushed the demos somewhere great and weird. #WhatCanuRun β†’ urun.sh
4
7
340
uRun retweeted
Apr 28
Introducing the founding team with three unique angles on the same problem. Keegan ran inference at Luma during the Dream Machine launch. Sean wrote the O'Reilly book on Docker and has our GPU orchestration dialed in. Matt was running low-latency edge inference at AWS in 2017 (back when "real-time AI" meant the cameras at Amazon Go). We built uRun for the infrastructure bottleneck no one else is solving. urun.sh #AIvideo #FounderStory #realtimeAI #VideoInfra #GenerativeAI
13
1
17
308
Apr 27
urun.sh launch party - Wednesday, April 29 Β· 6PM: πŸ•ΉοΈ Arcade games 🍹 Open bar πŸ’» Live demos πŸ₯½ Meta Quest Giveaway Spots are limited - click the link to grab your invite. πŸ‘‰ luma.com/3vemq53b
3
6
185
uRun retweeted
The model moat is shrinking fast. Kimi K2.6 just beat GPT-5.4 and Claude Opus 4.6 on SWE-Bench Pro. But the story isn't the benchmarks - it's the execution layer: β†’ 300 parallel agents β†’ 13 hours autonomous coding β†’ 4,000 tool calls in one run It's no longer intelligence per token. It's tokens per second. Source: kimi.com/blog/kimi-k2-6 #claude #moonshot #OpenSource
1
4
237
uRun retweeted
Reminder 🚨 We're going live on Twitch TODAY at 2pm PT. Come hang and bring your questions πŸ‘‡ twitch.tv/urunml #Inference #Infrastructure #twitch
1
3
242
Apr 16
Imagine exploring this in real time as it generates. That is the infrastructure problem we have been solving. Check us out -> Urun.sh
Today, we released Lyra 2.0, a framework for generating persistent, explorable 3D worlds at scale, from NVIDIA Research. Generating large-scale, complex environments is difficult for AI models. Current models often β€œforget” what spaces look like and lose track of movement over time, causing objects to shift, blur, or appear inconsistent. This prevents them from creating the reliable 3D environments required for downstream simulations. Lyra 2.0 solves these issues by: βœ… Maintaining per-frame 3D geometry to retrieve past frames and establish spatial correspondences βœ… Using self-augmented training to correct its own temporal drifting. Lyra 2.0 turns an image into a 3D world you can walk through, look back, and drop a robot into for real-time rendering, simulation, and immersive applications. ➑️ Learn more: research.nvidia.com/labs/sil… πŸ“„ Read the paper: arxiv.org/abs/2604.13036
2
112