RuiningLi

RuiningLi

1 Photos and videos

Tweets

Runjia Li retweeted

RuiningLi

@RayLi234

18h

🚀 Introducing Instruct-Particulate, our new model for inferring articulated structures from static 3D meshes, with significantly improved generalization to novel object categories and support for kinematic prompting. To achieve this, we scaled our training data 40× and redesigned the model to follow kinematic prompts. The result: diverse, realistic, simulator-compatible articulated 3D assets can now be generated directly from real-world images! 🔗 Project page: instruct-particulate.github.… 🤗 Demo: huggingface.co/spaces/rayli/…

0:54

7,414

Kam Woh Ng

Runjia Li retweeted

Kam Woh Ng

@kam_woh

Jun 7

🌙 Open-sourcing Yume -- a programmable, explicit world model on Godot. You build a game by describing the world as JSON -- the things in it the rules for how they behave -- and one fixed engine runs it. github.com/kamwoh/yume

GitHub - kamwoh/yume: A programmable, explicit world model on Godot — worlds are pure JSON run by a...

A programmable, explicit world model on Godot — worlds are pure JSON run by a fixed primitives interpreter engine. Built by Claude, for Claude. - kamwoh/yume

github.com

259,313

AK

Runjia Li retweeted

@_akhaliq

May 28

Gamma-World Generative Multi-Agent World Modeling Beyond Two Players

1:28

105

31,482

Runjia Li

Runjia Li @RunjiaLi

May 19

Impressive!!

Jianyuan

@jianyuan_wang

May 19

Introducing VGGT-Ω: scaling feed-forward reconstruction across static and dynamic scenes, and studying whether the learned geometric representations transfer beyond reconstruction.

0:48

RuiningLi

Runjia Li retweeted

RuiningLi

@RayLi234

May 15

🚀 Introducing Articraft, a coding agent for articulated 3D asset creation. Articraft writes code, executes it, receives validation feedback, and refines the result into simulation-ready 3D assets with parts, joints, and motion. We’re also releasing Articraft-10K: 10,000 articulated objects across 250 categories, unlocking large-scale interactive scenes for robotics simulation and physical AI. 🔗 Project page: articraft3d.github.io/ 💻 Code: github.com/mattzh72/articraf…

0:27

108

746

185,951

Stan Szymanowicz

Runjia Li retweeted

Stan Szymanowicz

@StanSzymanowicz

May 15

We made an interactive client-server viewer for LagerNVS with @JonathonLuiten! You can now interactively explore scenes from just a photo capture - no optimization, no 3D Gaussians, just load your images, run the model on a cloud GPU and stream the renders to your local browser. Check out the video below for some spaces I recently captured in Oxford, London and beyond!

1:39

175

16,746

Google DeepMind

Runjia Li retweeted

Google DeepMind

@GoogleDeepMind

May 12

We’re reimagining a 50-year-old interface - the mouse pointer - with AI. 🖱️ These experimental demos show how people can intuitively direct Gemini on their screens using motion, speech, and natural shorthand to get things done 🧵

0:57

461

1,066

8,608

1,659,151

Xuanchi Ren

Runjia Li retweeted

Xuanchi Ren

@xuanchi13

Apr 15

We scaled up Lyra to generate explorable 3D worlds! 🚀 Introducing Lyra 2.0 — turning a single image into a 3D world you can walk through, look back, and even drop a robot into 🤖 Code and Model available today! 🌐 Website: research.nvidia.com/labs/sil… (1/N)

0:58

122

874

1,145,345

Alexander Pondaven

Runjia Li retweeted

Alexander Pondaven @alexpondaven

Apr 3

Introducing ActionParty: the first video world model that controls up to 7 players simultaneously on the same screen across 46 game environments. We tackle the action binding problem in video diffusion, ensuring each player's action is applied to the right subject. 🧵

0:10

9,503

Wei Yu

Runjia Li retweeted

Wei Yu @GnosisYu

Apr 1

Dropping an exciting new demo of MosaicMem! 👀🔥 A friend brought up a great question: why not combine long-horizon navigation video generation, promptable world events, and scene concatenation? Fair point — so we gave it a shot. 🎬✨ For more technical details, check this thread 🧵👇 x.com/GnosisYu/status/203502… #WorldModel #GenerativeAI #VideoGeneration #InteractiveAI #Genie3 #EmbodiedAI #GameAI

0:31

Wei Yu @GnosisYu

Mar 20

World models have made impressive progress in video generation, yet they still struggle with a fundamental challenge: memory. In long rollouts, the camera trajectory gradually drifts from the user-specified motion and revisited scenes no longer align with earlier observations. These errors accumulate over time, causing the generated world to steadily lose coherence. 🚀Excited to share our solution MosaicMem 🌍🧠 — our new hybrid spatial memory for video world models. Project Page: mosaicmem.github.io/mosaicme… Paper: huggingface.co/papers/2603.1…

1:36

107

8,604

Runjia Li

Runjia Li @RunjiaLi

Mar 16

🎉EgoEdit @Snapchat has been accepted to CVPR 2026! 🏆👻 We are bringing high-quality, real-time editing to egocentric videos. Our massive 100k video dataset and benchmark are ALREADY PUBLIC! 🔓🚀 🏠 Project Page: snap-research.github.io/EgoE… 🤗 Dataset: huggingface.co/datasets/ligu…

0:44

@_akhaliq

9 Dec 2025

EgoEdit Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing

0:40

105

21,815

Runjia Li

Runjia Li @RunjiaLi

Mar 16

The work was done in a joint collaboration with @WilliMenapace during my internship @Snap. Many thanks to @moayedhajiali, @ashmrz10, Chaoyang Wang Arpit Sahni, @isskoro, Aliaksandr Siarohin, @JakabTomas, @han_junlin, @SergeyTulyakov, @philiptorr

368

Runjia Li

Runjia Li @RunjiaLi

Mar 16

Replying to @Snapchat

Many thanks to coauthors! And thank @_akhaliq for posting our paper!

221

AK

Runjia Li retweeted

@_akhaliq

Mar 2

Mode Seeking meets Mean Seeking for Fast Long Video Generation paper: huggingface.co/papers/2602.2…

0:30

121

20,368

AK

Runjia Li retweeted

@_akhaliq

23 Dec 2025

WorldWarp Propagating 3D Geometry with Asynchronous Video Diffusion huggingface.co/papers/2512.1…

Paper page - WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Join the discussion on this paper page

huggingface.co

13,940

Junlin Han

Runjia Li retweeted

Junlin Han @han_junlin

1 Oct 2025

Excited to share our new work: “Learning to See Before Seeing”! 🧠➡️👀 We investigate an interesting phenomeno: how do LLMs, trained only on text, learn about the visual world? Project page: junlinhan.github.io/projects…

158

25,851

Runjia Li

Runjia Li @RunjiaLi

26 Jun 2025

🎉 VMem is officially accepted to ICCV 2025! Excited to chat with everyone in Hawaii about making video generation consistent and interactive with our Surfel-Indexed View Memory 🏝️🎥 Also, huge thanks to my insanely helpful coauthors!

@_akhaliq

24 Jun 2025

VMem Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

1:00

15,063

Tomas Jakab

Runjia Li retweeted

Tomas Jakab @JakabTomas

24 Jun 2025

Excited to share VMem: a novel memory mechanism for consistent video scene generation 🎞️✨ VMem evolves its understanding of scene geometry to retrieve the most relevant past frames, enabling long-term consistency 🌐 v-mem.github.io 🤗 huggingface.co/spaces/liguan… 1/ 🧵

VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

VMem introduces a novel surfel-indexed memory module for consistent autoregressive video scene generation. To appear in ICCV 2025.

v-mem.github.io

@_akhaliq

24 Jun 2025

VMem Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

1:00

15,792

Chuanxia Zheng

Runjia Li retweeted

Chuanxia Zheng @ChuanxiaZ

24 Jun 2025

After two amazing years with @Oxford_VGG, I will be joining @NTUsg as a Nanyang Assistant Professor in Fall 2025! I’ll be leading the Physical Vision Group (physicalvision.github.io) — and we're hiring for next year!🚀 If you're passionate about vision or AI, get in touch!

0:48

240

43,442

AK

Runjia Li retweeted

@_akhaliq

24 Jun 2025

VMem Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

1:00

374

83,564