Synthetc Data | World Models | Radiance Fields | Computer Vision | GenAI | Chief Evangelist at @LightwheelAI

Joined February 2009
2,214 Photos and videos
Jonathan Stephens retweeted
Got real-time object detection depth sensing working for my iPhone app! You can see how LiDAR object detection gives you how far each detected object is in meters. You can also see the distance heatmap in the top right. Details: - Hardware: iPhone 15 Pro’s LiDAR sensor provides the depth map. - Model: YOLOX-S running on the iPhone’s ANE (Apple Neural Engine) This is a preliminary step before I connect this to my deadlift tracker iPhone app 🙂 @Apple #ai #machinelearning #computervision #iphone
4
2
19
2,275
I’m headed to @AutomateShow 2026 with the @LightwheelAI team! I’m excited to see the state of industrial robotics and automation in the US. If you’re going to be here, reach out and let’s connect in person! #automate2026
2
320
This is a cool project! An omni-model robotics world model. It was conditioned on @NVIDIAAI's Cosmos Predict 2.5. Now it's opensource, you can start using it today! #robotics #PhysicalAI
1/5 🚀 Thrilled to open-source OSCAR 🤖 — an action-conditioned world model for robotics, led by the visiting student in my group @wuzy2115! It generalizes across different robot embodiments with precise action controllability. All trained on a single GH200 GPU, and outperforms existing open-sourced baselines, which have larger model capacity and need more compute. Everything is public, including training data. 📄 Paper: arxiv.org/abs/2606.04463 🌐 Project: wuzy2115.github.io/oscar-pro… 💻 Code: github.com/wuzy2115/oscar-pu… 🤗 Robot data: huggingface.co/datasets/zywu… 🤗 Human data: huggingface.co/datasets/zywu… 🤗 Weights: huggingface.co/zywu2115/OSCA… #Robotics #WorldModels #AI #OpenSource
1
18
4,338
Jonathan Stephens retweeted
this is what 195° field of view looks like. your depth model was trained on 60° WideDepth from ICRA 2026: millimeter-accurate depth ground truth for fisheye cameras across 101 indoor scenes. rendered from high-res lidar, not estimated grouped fisheye, panoramic, and cropped views in fiftyone with depth heatmaps and 3D point clouds backprojected from the ground truth huggingface.co/datasets/Voxe… #ICRA2026 #CVPR2026
9
149
8,139
The dude literally emailed Jensen directly and managed to capture an image set for a NeRF. Epic!!!
In the summer of 2023, I cold emailed Jensen Huang and asked to capture a NeRF of him at SIGGRAPH. He responded in about an hour and said yes. A radiance field is, in the simplest terms, akin to a 3D photograph. A moment in time, so completely reconstructed that you can move through it and see it from angles the original cameras never occupied. NeRFs were the original method. Gaussian splatting, which debuted at that same SIGGRAPH, has since become the dominant form of radiance field. I called my late friend James, who told me we needed to begin practicing immediately. We ran capture after capture for weeks until we consistently got the capture time down to ~30 seconds with one camera. Later, in a hallway at the LA Convention Center during SIGGRAPH, I captured the portrait you're seeing now, a full 360° gaussian splat of Jensen, rendered here as a 2D flythrough. Afterward, I continued the conversation with him and members of his team to make the case for radiance fields as a foundational representation for imaging. To my surprise, they listened. Three years later, NVIDIA has several works, including NuRec, fVDB, 3DGRUT, and gsplat all utilizing radiance fields. The landscape has evolved enough that the reasoning is obvious. Gaussian splatting has begun to ship across some of the world’s largest industries, including autonomous vehicles, AEC, geospatial, media and entertainment, robotics, e-commerce, hospitality. It’s become clear that lifelike 3D is here to stay. And yet I think we will look back and be disappointed by how late we started taking 3D portraits of the people around us, just like how we have sparse 2D photos of our grandparents and great grandparents. We have billions of photographs of the people we know and love, but almost no radiance fields of them. I'll be returning to SIGGRAPH in LA where this was initially captured three years ago, with the landscape looking significantly different. Radiance fields are more under deployed than ever relative to what they can do. I'm excited for the future of imaging, and for 2D to transition into 3D. I have a few things up my sleeve that I think will make that case plainly.
1
1
16
1,793
🤦‍♂️ these posts make me feel old. Books. School. Tears.
how did people even learn to code when there was no docs, no YouTube... nothing?
1
2
416
Looks fine on a social media video, but when he zoomed in at the end the results looked like trash. Details are generalized, everything is soft and synthetic… just learn to take good photos!
Reframing a photo after it’s been taken with Spatial Reframing ✨ The new iOS 27 Apple Intelligence feature demo (Beta) #WWDC #iOS27
2
4
1,140
I loved my coffee meetups at #cvpr2026! Each day had a different mix of visitors. Not everyone even made in the photos as they came and go. Day 3 was all Dev Advocates / Evangelists! We shared notes on what it takes to be successful and best help our audiences. I plan to do this again at SIGGRAPH!
1
12
1,187
I really wished I saw this paper in person!!
Made me laugh way to hard #CVPR2026
3
1,465
Serious question: do I stack them horizontally or vertically? The logo sideways always bugs me. @NVIDIAAIDev just sent me instructions on how to set them up to work as one. Excited to tackle some fun projects!
8
1
24
10,910
I went to a lot of parties at #cvpr2026 but only one had people up and dancing! Huge kudos to my team at @LightwheelAI for throwing one heck of a party! The computer vision community showed up!!
23
2,930
I got to try out the @RealityLabs Aria 2 glasses while at #CVPR2026. I was blown away by the quality of the data streams and that it all processes on board. I’m going to work with our team to pitch a research project for these. #computervision
2
5
29
3,072
This was a cool demo! I’m going to try out the new Echo. Looks very promising.
Having a few hours this afternoon after @CVPR? Consider visiting Red Rocks 20 minutes outside of Denver! For some inspiration, here's a world processed by @SpAItial_AI Echo-2 HQ model from a single phone image! Link to the world in🧵
1
9
3,087
Come find me at Vibe Coffee and Wine this morning! I’ll be here until 9:30 am! #cvpr2026
4
527
It’s great running into old friends and making new friends in the computer vision industry! #cvpr2026 has built a great community of researchers and industry pushing the technology forward.
10
839
We need so much more work in benchmarking! So excited to see this. #robotics
Jun 6
Excited to release v0 of SO-101 Bench, a benchmark with 4 tasks designed to measure various core capabilities of robot foundation models. It includes precise spatial and geometric constraints, novel/unusual objects, temporarily occluded objects, and often requires visuospatial planning. It is presented here as a real-world case study, then a scalable real2sim evaluation suite, all executed on the cheap and open-sourced SO-101 arm!
11
2,578
I had a great time teaching an engaged crowd about how building photorealistic Isaac Sim environments with Gaussian Splatting. So many amazing open source tools @NVIDIARobotics is supporting for building simulation environments. #CVPR2026
1
9
757
Jonathan Stephens retweeted
#CVPR2026 GR3D 🧊 — A single VLM that grounds in 2D, grounds in 3D, and reasons with visual chain-of-thought — all at once. Excited to share our paper, Grounded 3D-Aware Spatial Vision-Language Modeling!
2
38
328
33,852
At Vibe Coffee and Wine, come find me! #cvpr2026
10
484