uRun

uRun

6 Photos and videos

Tweets

uRun

@urunml

Jun 11

Denver, thank you 🙏 Last week's happy hour at CVPR 2026 was everything we hoped for. A room full of sharp people and real conversation about where realtime AI media goes next. Already looking forward to the next one. 👀

0:27

uRun

uRun

@urunml

Jun 4

tonight. 6PM. denver. come say hi and tell us your favorite part of the CVPR so far! if you're on the list, you've got the address. if you're not, shoot us a message and we'll sort it. luma.com/hll4zk65

uRun Happy hour @ CVPR 2026 · Luma

uRun Happy Hour @ CVPR 2026 Two days down. You've sat through the talks, debated the posters, and had three conversations that probably should've been…

luma.com

Keegan McCallum

uRun retweeted

Keegan McCallum

@keeganmccallum3

Jun 2

"Real-time" in marketing materials usually means "we batched it and it returns in 8 seconds." Real-time means under 300ms. That's the threshold UI designers use to decide when to show a loading spinner. There's a big difference. We build for the second one at @urunml.

uRun

uRun

@urunml

Jun 1

A few seats left for our happy hour @ CVPR on thursday. come hang out and say hi. Jun 4 @ 6:00 PM in denver. RSVP → luma.com/hll4zk65

uRun Happy hour @ CVPR 2026 · Luma

uRun Happy Hour @ CVPR 2026 Two days down. You've sat through the talks, debated the posters, and had three conversations that probably should've been…

luma.com

Keegan McCallum

uRun retweeted

Keegan McCallum

@keeganmccallum3

May 26

Most providers route every request to a different accelerator. Mostly fine. Until you need a stateful loop. Drift across long workflows is real. At @urunml you're pinned to the same GPU on the same machine for the whole session. Stateful by design.

207

uRun

uRun

@urunml

May 15

heading to @MLSysConf 2026 next week in Bellevue if you're working on inference, real-time systems, or ML infra, come say hi. always up to trade notes on what "production" actually looks like for stateful, interactive AI workloads.

Keegan McCallum

uRun retweeted

Keegan McCallum

@keeganmccallum3

May 14

Every modality in AI follows the same arc: single-shot expensive generations to multi-turn cheap interactive loops. Text and image already went through it. Video is next.

uRun

uRun

@urunml

May 13

Thank you to the early ones. 🙏 More to come. #WhatCanuRun

0:55

Keegan McCallum

uRun retweeted

Keegan McCallum

@keeganmccallum3

May 12

Two years ago, the open problem was getting an AI video model to produce a coherent 5-second clip. Recent techniques like Long Live and self-forcing solved that piece. The new bottleneck is serving it interactively. Labs are chasing the next model. The infra layer underneath is wide open.

123

Keegan McCallum

uRun retweeted

Keegan McCallum

@keeganmccallum3

May 7

Real-time interactive video is the hardest workload there is. Every frame has to land inside the 300ms human-perception bar. That's why we're starting there with @urunml. The rest is downhill.

569

uRun

uRun

@urunml

May 6

Who we most want building on uRun: creative tooling companies and the studios behind tomorrow's video games. They'll go places we can't imagine → urun.sh #AIvideo #GameDev #VFX

0:35

281

Keegan McCallum

uRun retweeted

Keegan McCallum

@keeganmccallum3

May 5

Replying to @OpenAI

@OpenAI and @Anthropic both charge ~2.5x for "fast mode." The most underrated pricing signal in AI right now.

uRun

uRun

@urunml

May 4

some snapshots of our launch party @ Joey the Cat in SF last week. skee-ball, open bar, and real-time AI video on every screen. thank you to everyone who came out and pushed the demos somewhere great and weird. #WhatCanuRun → urun.sh

340

uRun

uRun retweeted

uRun

@urunml

Apr 28

Introducing the founding team with three unique angles on the same problem. Keegan ran inference at Luma during the Dream Machine launch. Sean wrote the O'Reilly book on Docker and has our GPU orchestration dialed in. Matt was running low-latency edge inference at AWS in 2017 (back when "real-time AI" meant the cameras at Amazon Go). We built uRun for the infrastructure bottleneck no one else is solving. urun.sh #AIvideo #FounderStory #realtimeAI #VideoInfra #GenerativeAI

1:27

308

uRun

uRun

@urunml

Apr 27

urun.sh launch party - Wednesday, April 29 · 6PM: 🕹️ Arcade games 🍹 Open bar 💻 Live demos 🥽 Meta Quest Giveaway Spots are limited - click the link to grab your invite. 👉 luma.com/3vemq53b

185

Keegan McCallum

uRun retweeted

Keegan McCallum

@keeganmccallum3

Apr 21

The model moat is shrinking fast. Kimi K2.6 just beat GPT-5.4 and Claude Opus 4.6 on SWE-Bench Pro. But the story isn't the benchmarks - it's the execution layer: → 300 parallel agents → 13 hours autonomous coding → 4,000 tool calls in one run It's no longer intelligence per token. It's tokens per second. Source: kimi.com/blog/kimi-k2-6 #claude #moonshot #OpenSource

237

Keegan McCallum

uRun retweeted

Keegan McCallum

@keeganmccallum3

Apr 17

Reminder 🚨 We're going live on Twitch TODAY at 2pm PT. Come hang and bring your questions 👇 twitch.tv/urunml #Inference #Infrastructure #twitch

urunml - Twitch

Real-time video AI. Interact. Steer. Play.

twitch.tv

242

uRun

uRun

@urunml

Apr 16

Imagine exploring this in real time as it generates. That is the infrastructure problem we have been solving. Check us out -> Urun.sh

uRun

uRun is infrastructure for real-time AI video, built for interactive creation where teams can generate, steer, and evolve scenes with no delays or compromise.

urun.sh

NVIDIA AI Developer

@NVIDIAAIDev

Apr 15

Today, we released Lyra 2.0, a framework for generating persistent, explorable 3D worlds at scale, from NVIDIA Research. Generating large-scale, complex environments is difficult for AI models. Current models often “forget” what spaces look like and lose track of movement over time, causing objects to shift, blur, or appear inconsistent. This prevents them from creating the reliable 3D environments required for downstream simulations. Lyra 2.0 solves these issues by: ✅ Maintaining per-frame 3D geometry to retrieve past frames and establish spatial correspondences ✅ Using self-augmented training to correct its own temporal drifting. Lyra 2.0 turns an image into a 3D world you can walk through, look back, and drop a robot into for real-time rendering, simulation, and immersive applications. ➡️ Learn more: research.nvidia.com/labs/sil… 📄 Read the paper: arxiv.org/abs/2604.13036

0:15

112