Seung Wook Kim

Seung Wook Kim

14 Photos and videos

Tweets

Amlan Kar retweeted

Seung Wook Kim @seungkim0123

Jun 13

I am looking for highly motivated students to join me in tackling challenges in spatial generative models and simulators. Please check my website! seung-kim.github.io/seungkim…

KAIST AI @KAIST_AI

Jun 13

📢 Three incoming faculty members at KAIST AI, starting in August 2026✨ Dr. Sehoon Kim from xAI (@sehoonkim418), Dr. Hyunwoo Kim (@hyunw_kim), and Dr. Seung Wook Kim (@seungkim0123), both from NVIDIA, will be joining KAIST AI as Assistant Professors Check their websites below🧵

142

33,645

Jon Barron

Amlan Kar retweeted

Jon Barron

@jon_barron

Jun 12

There's an interesting procedural generation -> generative AI -> procedural generation ouroboros closing in on itself right now. GenAI is basically data driven procedural generation, and now the LLMs are leveraging perlin noise height maps and fractal tree models.

Remi Sebastian Kits

@SebastianKits

Jun 12

Fable 5. No external assets. Three.js. dc5fzrbo8ssfx.cloudfront.net…

1:18

10,536

Ruilong Li

Amlan Kar retweeted

Ruilong Li

@ruilong_li

Jun 3

World models are moving beyond offline generation towards interactive, real-time experiences. Introducing ⚡FlashDreams⚡: an open-source high-performance inference and serving library built for autoregressive world models: 🔥 Up to 3.10× faster LingBot-World inference 🔥 Up to 2.12× faster Self-Forcing inference 🔥 Up to 1.40× faster Wan2.1 inference 🔥 8 integrated models 🔥 Multi-GPU, streaming, low-latency serving 🔥 Agentic skills that teach you how to use it FlashDreams is designed for a new generation of AI systems that continuously evolve over time while responding to user interactions. It powers applications across robotics, autonomous vehicle simulation, gaming, and virtual worlds. Github: github.com/NVIDIA/flashdream… Docs: nvidia.github.io/flashdreams Research page: research.nvidia.com/labs/sil… Join the #flashdreams Discord channel at discord.gg/yTdHDqFP FlashDreams is also the runtime backbone behind NVIDIA OmniDreams (github.com/nv-tlabs/omni-dre…) 1/n #AI #WorldModels #FastInference #PhysicalAI #OpenSource #NVIDIA

0:47

368

87,625

Zian Wang

Amlan Kar retweeted

Zian Wang

@zianwang97

Jun 3

Following recent World-Action Model results in robotics, the same ~2B OmniDreams single-view backbone can be fine-tuned into a driving policy. In preliminary closed-loop results, it reduces collision from 6.9% to 4.2% when compared with Alpamayo 1.5, while having roughly 5x fewer parameters.

0:18

2,588

Sanja Fidler

Amlan Kar retweeted

Sanja Fidler @FidlerSanja

Jun 3

Real time world model NVIDIA OmniDreams now open sourced! If you are at CVPR, we invite you to also check out a live demo you can try out at the NVIDIA booth.

Zian Wang

@zianwang97

Jun 3

🚀 What if physical AI policies could interact with generated worlds in real time? Introducing OmniDreams, a generative world model for closed-loop autonomous vehicle simulation. Tech report, code, models, and data samples are available now. Project: research.nvidia.com/labs/sil… Code: github.com/nv-tlabs/omni-dre… Model: huggingface.co/collections/n… Join the #omnidreams discord channel: discord.gg/bsjzh4uZ

0:16

248

36,645

Zian Wang

Amlan Kar retweeted

Zian Wang

@zianwang97

Jun 3

0:16

261

80,745

Despoina Paschalidou

Amlan Kar retweeted

Despoina Paschalidou

@paschalidoud_1

Jun 3

It’s been a while since I posted here, but I’m very excited to share what our team at @nvidia has been building over the past year! After a year of active development, we’re getting ready to release SIL-Wheel to the world: a one-stop shop platform for data-centric workflows in large-scale video model training. Built by researchers, for researchers, SIL-Wheel brings together search, curation, annotation, evaluation, and analysis for large video datasets in one centralized framework. Want a sneak peek before the official release? Come by the NeXD26 Workshop @CVPR tomorrow at 10:30!🚀

10,703

Xuanchi Ren

Amlan Kar retweeted

Xuanchi Ren

@xuanchi13

May 26

The latent-vs-pixel debate misses the point. GPT Image 2 shows what users notice: pixel-level fidelity. Latent models show what scales: compact semantic structure. We connect them by replacing VAE/RAE decoders with a Pixel Diffusion Decoder. Code and Model available: research.nvidia.com/labs/sil… 🧵(1/N)

0:49

411

668,498

Noam Brown

Amlan Kar retweeted

Noam Brown

@polynoamial

Apr 23

A hill that I will die on: with today's AI models, intelligence is a function of inference compute. Comparing models by a single number hasn't made sense since 2024. What matters is intelligence per token or per $. This is especially true when using it in a product like Codex.

Lisan al Gaib

@scaling01

Apr 23

The GPT-5.5 model family completely dominates the cost-performance frontier on the Artificial Analysis Index

1,340

127,977

Lianghui Zhu

Amlan Kar retweeted

Lianghui Zhu @lianghui_zhu

Apr 19

For a decade, we've made models wider and deeper—but we've barely changed how layers *talk* to each other. Since ResNet's `x F(x)` in 2015, the depth residual has been the only highway for inter-layer communication. It's time to upgrade the staircase. 🧵

238

1,938

188,471

Michał Tyszkiewicz

Amlan Kar retweeted

Michał Tyszkiewicz @jatentaki

Apr 17

Feed-forward 3D reconstruction should not be limited to predicting one Gaussian per pixel. We introduce TokenGS, which uses learnable tokens to decouple the 3D Gaussian prediction from the image resolution and the number of input views. #CVPR2026Highlight [1/6]

0:13

249

45,195

Sandeep Routray

Amlan Kar retweeted

Sandeep Routray @SandeepRoutra11

Apr 17

🚀 Excited to share ViPRA: Video Prediction for Robot Actions 📍 Accepted to #ICLR2026 @iclr_conf 🏆 Best Paper — #NeurIPS2025 Embodied World Models Workshop Robot learning today still needs millions of action labeled videos. Yet videos are abundant — from humans and the web — but lack action labels. Meanwhile, pretrained video models already learn rich dynamics. ViPRA is a recipe for turning pretrained video models into robot policies while enabling robot learning to scale with actionless videos. 🧵 Thread ↓

0:26

268

25,537

Ruilong Li

Amlan Kar retweeted

Ruilong Li

@ruilong_li

Mar 17

Special moment to see something I’ve worked on so closely come to life! Today we announce Alpadreams — a world model that lets you explore ♾endlessly♾️in ⚡real time⚡. Video: me (left) and Alpamayo policy (right) driving in Alpadreams at #GTC26. research.nvidia.com/labs/sil…

0:50

10,156

Zan Gojcic

Amlan Kar retweeted

Zan Gojcic @ZGojcic

Mar 16

A new generation in AV simulation is here! We are announcing AlpaDreams, a real time interactive generative world model for AV simualtion! Just a year ago it took minutes to generate a few seconds of video, today it is real time and interactive! research.nvidia.com/labs/sil…

OmniDreams (formerly AlpaDreams) — NVIDIA SIL

This page has moved. AlpaDreams is now OmniDreams.

research.nvidia.com

106

18,677

Andrej Karpathy

Amlan Kar retweeted

Andrej Karpathy

@karpathy

Mar 7

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then: - the human iterates on the prompt (.md) - the AI agent iterates on the training code (.py) The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. github.com/karpathy/autorese… Part code, part sci-fi, and a pinch of psychosis :)

1,054

3,627

28,327

11,076,853

Jon Barron

Amlan Kar retweeted

Jon Barron

@jon_barron

Mar 3

Replying to @gkopanas

I love that review. I do genuinely think a great way to evaluate research contributions would be to add the new paper to an agent's context window and see what delta the agent can get on some OSS codebase's performance.

1,921

Sven Elflein

Amlan Kar retweeted

Sven Elflein @s_elflein

Feb 27

🚀 Exciting news! We’re introducing VGG-T³: a scalable model for offline feed-forward 3D reconstruction that finally tackles the "quadratic bottleneck." Ever wanted to have VGGT reconstruct a 1,000-image scene in seconds instead of 10 minutes and use it for visual localization?

551

84,124

Xindi Wu

Amlan Kar retweeted

Xindi Wu @cindy_x_wu

Jan 20

New #NVIDIA Paper We introduce Motive, a motion-centric, gradient-based data attribution method that traces which training videos help or hurt video generation. By isolating temporal dynamics from static appearance, Motive identifies which training videos shape motion in video generation. 🔗 research.nvidia.com/labs/sil… 1/10

1:15

120

584

110,663

Or Litany

Amlan Kar retweeted

Or Litany @orlitany

22 Dec 2025

🚗📡Radar is the unsung hero of AV perception: widespread in cars, yet overlooked in simulation. Introducing RadarGen: Realistic radar synthesis from cameras using diffusion. Massive kudos to my fantastic team at @TechnionLive and @NVIDIAAI radargen.github.io/

RadarGen: Automotive Radar Point Cloud Generation from Cameras

We present RadarGen, a diffusion model for synthesizing realistic automotive radar point clouds from multi-view camera imagery. RadarGen adapts efficient image-latent diffusion to the radar domain by...

radargen.github.io

Tomer Borreda @TomerBorreda

22 Dec 2025

📢 RadarGen: Automotive Radar Point Cloud Generation from Cameras Can we generate realistic radar point clouds solely from camera images? 🚗📡 We introduce RadarGen, a diffusion-based framework that synthesizes radar returns aligned with visual scenes. radargen.github.io

5,335

Jack Zhang

Amlan Kar retweeted

Jack Zhang @jackzzhang

15 Dec 2025

Can we apply gradient descent to discrete changes? In our new #SIGGRAPHAsia paper, we show that gradient descent can work on shape grammars, as in CAD and procedural modeling, but only if the grammars are designed correctly!

0:05

262

64,736