Joined May 2011
11 Photos and videos
Pinned Tweet
Excited to share our latest work on dynamic 3D reconstruction!
Introducing V-DPM, for 4D reconstruction of in-the-wild videos. We build on top of VGGT, using Dynamic Point Maps for jointly representing 3D and motion. Joint work with: @EldarIsTyping , @LaiZihang , and Andrea Vedaldi. @Oxford_VGG. Check out the demo and code 👇
9
443
This afternoon we will be presenting CoWTracker by amazing Zihang Lai who unfortunately could not be at @CVPR in person. Stop by at poster 594 (15:30 - 17:30) to learn about new SOTA in dense point tracking. Project page: cowtracker.github.io/ @SucarEdgar @Oxford_VGG
9
32
4,356
Eldar Insafutdinov retweeted
28 Nov 2025
My brother is a senior designer at Figma. He is insanely cracked. I sent him this image and asked him what it would take to build it today. I will never forget his answer… "We can't, we don't know how to do it."
This UI still looks better than half the apps today.
592
4,288
54,511
4,264,059
Eldar Insafutdinov retweeted
14 Nov 2025
papers are kind of like movies: the first one is usually the best, and the sequels tend to get more complicated but not really more exciting. But that totally doesn’t apply to the DepthAnything series. @bingyikang's team somehow keeps making things simpler and more scalable each time. in this new version, they basically show that a strong representation encoder plus a depth-ray prediction objective is enough (you see the RAE vibes too, right?) to get solid, general spatial perception across a bunch of tasks. people often say they hate computer vision because it’s messy--too many tasks, too many data types, too many moving parts. but that’s exactly why I love it. I think the biggest AI breakthroughs are going to come quietly from vision and then suddenly leapfrog everything else, changing how AI interacts with the real world and with us. pretty soon we’ll realize vision is not a big list of tasks--it’s a perspective. a perspective about modeling continuous sensory data, building layered representations of the world, and inching toward human-like intelligence. and tbh we’re watching this happen every day, behind all the hype, as all these different '"tasks" slowly start to merge.
14 Nov 2025
After a year of team work, we're thrilled to introduce Depth Anything 3 (DA3)! 🚀 Aiming for human-like spatial perception, DA3 extends monocular depth estimation to any-view scenarios, including single images, multi-view images, and video. In pursuit of minimal modeling, DA3 reveals two key insights: 💎 A plain transformer (e.g., vanilla DINO) is enough. No specialized architecture. ✨ A single depth-ray representation is enough. No complex 3D tasks. Three series of models have been released: the main DA3 series, a monocular metric estimation series, and a monocular depth estimation series. The core team members, aside from me: @HaotongLin, Sili Chen, Jun Hao Liew, @donydchen. 👇(1/n) #DepthAnything3
5
40
514
76,223
Eldar Insafutdinov retweeted
3 Nov 2025
Europe Builds. Others Profit. 3D Gaussian Splatting (3DGS) is the perfect case study. It reflects both Europe’s brilliance and its chronic inability to turn that brilliance into business. Almost everything that made 3DGS possible was born in Europe. From the early breakthroughs in point-based rasterization in Switzerland to the cumulative research from Austria, Greece, and Germany executed in France, Europe built the foundation. No other continent can match that level of scientific collaboration and intellectual strength. The LichtFeld Studio bounty later confirmed it: the biggest performance leaps came straight out of European labs. The science was here. The innovation was here. The talent was here. But the business was not. When 3DGS exploded, my inbox filled with messages from US-based companies, not from Europe. In the United States, Luma AI and Polycam turned the paper into products within weeks. They did not wait for funding programs or EU consortia. They simply built. Then came China, which not only caught up in research but quickly outpaced everyone in commercialization. XGRID, DJI, and many others built thriving businesses around what Europe invented. Today, most 3DGS papers come from Chinese institutions rather than European ones. Meanwhile, the usual giants such as Meta, NVIDIA, Google, Netflix, and Tesla continue to iterate, integrate, and push forward. A thriving ecosystem of startups like World Labs leverages this technology to create new products and markets. The innovation cycle in the United States and China is fast, relentless, and market-driven. Europe, in contrast, remains bureaucratic and slow. We fund excellence and celebrate publications, but we rarely ship, even though some small startups are trying to change the status quo. Our researchers create the breakthroughs; others create the successful products. Until Europe finds a way to bridge the gap between laboratories and markets, it will remain the world’s research and development department: brilliant, underpaid, and underleveraged. Research is Europe’s comfort zone. Execution must become its strength. Video: One of my dynamic 3D Gaussian implementations based on the paper "Representing Long Volumetric Video with Temporal Gaussian Hierarchy."
58
159
1,257
159,204
Eldar Insafutdinov retweeted
We are seeking a full-time Postdoctoral Research Assistant in Computer Vision to join the Visual Geometry Group (University of Oxford) to work on 3D and Spatial AI with Professor Andrea Vedaldi. The post is funded by ERC and is fixed-term for two years with a possible extension.
3
13
53
14,913
Eldar Insafutdinov retweeted
Breaking news: A large-scale, publicly accessible dataset of multi-view OLAT full-body human captures now exists. Accepted at #ICCV2025! A collaboration between MPI-INF and NVIDIA led by Timo Teufel. For @jankautz and me, this is our second joint work. vcai.mpi-inf.mpg.de/projects…
1
24
110
8,127
Eldar Insafutdinov retweeted
Viser completely changed the way we do research. Before viser, it was hard to visualize 3D/4D data, let alone share it. Now it’s all just in a browser! It’s amazingly powerful and looks awesome. It’s how we render our results and videos. We love it and hope you will too!
31 Jul 2025
July has been a big month for Viser! - Released v1.0.0😊 - We did some writing Some demos👇
5
35
344
23,603
Eldar Insafutdinov retweeted
Andrea Vedaldi @ ICVSS 2025
1
18
908
Eldar Insafutdinov retweeted
Very excited about this! Thank you Runjia for your hard work! If haven’t had the chance, try our demo showcasing our geometry-based memory module for interactive video generators 🕹️🎞️ huggingface.co/spaces/liguan…
26 Jun 2025
🎉 VMem is officially accepted to ICCV 2025! Excited to chat with everyone in Hawaii about making video generation consistent and interactive with our Surfel-Indexed View Memory 🏝️🎥 Also, huge thanks to my insanely helpful coauthors!
1
6
488
Eldar Insafutdinov retweeted
Call for papers: The 26th International Conference on Digital Image Computing: Techniques and Applications (@dicta2025) Dates: 3-5 December 2025 Location: Adelaide Convention Centre, Adelaide, Australia Paper Submission: 15 July, 2025 AoE Website: dicta2025.dictaconference.or…

2
4
514
I'm excited for Chuanxia's next step! It was a pleasure to work with him and I couldn't think of a better collaborator and mentor. If you looking for a PhD, don't miss out and apply!
After two amazing years with @Oxford_VGG, I will be joining @NTUsg as a Nanyang Assistant Professor in Fall 2025! I’ll be leading the Physical Vision Group (physicalvision.github.io) — and we're hiring for next year!🚀 If you're passionate about vision or AI, get in touch!
1
1
4
1,428
Eldar Insafutdinov retweeted
Excited to share VMem: a novel memory mechanism for consistent video scene generation 🎞️✨ VMem evolves its understanding of scene geometry to retrieve the most relevant past frames, enabling long-term consistency 🌐 v-mem.github.io 🤗 huggingface.co/spaces/liguan… 1/ 🧵
24 Jun 2025
VMem Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
4
11
58
15,792
Eldar Insafutdinov retweeted
We are presenting Dual Point Maps as a #CVPR highlight tomorrow! Learn about our novel, data-efficient representation for 3D/4D deformable objects—an alternative to classical template shape models. 📍🕑 ExHall D, Poster #100, afternoon session 🌍dualpm.github.io
1
7
34
2,577
Eldar Insafutdinov retweeted
Many Congratulations to @jianyuan_wang, @MinghaoChen23, @n_karaev, Andrea Vedaldi, Christian Rupprecht and @davnov134 for winning the Best Paper Award @CVPR for "VGGT: Visual Geometry Grounded Transformer" 🥇🎉 🙌🙌 #CVPR2025!!!!!!
17
68
489
46,248
Eldar Insafutdinov retweeted
Hi there, 🎉 We are thrilled to introduce Stable Virtual Camera, a generalist diffusion model designed to address the exciting challenge of Novel View Synthesis (NVS). With just one or a few images, it allows you to create a smooth trajectory video from any viewpoint you desire. We’re naming this model in tribute to the Virtual Camera cinematography technology. @StabilityAI 🏠 Project Page: stable-virtual-camera.github… 📄 Paper: stable-virtual-camera.github… 📃 Blog: stability.ai/news/introducin… 💻 Code: github.com/Stability-AI/stab… 🤗 Model Card: huggingface.co/stabilityai/s… 🚀 Gradio Demo: huggingface.co/spaces/stabil… 🎬 Video: youtube.com/channel/UCLLlVDc…
1
27
165
26,781
Check out our work on feed-forward (in a flash) 3D scene reconstruction! Our method transforms a monocular depth estimation network into a generalisable single-view 3D reconstructor. Thanks to @StanSzymanowicz, @ChuanxiaZ, @dylanjcampbell_, J. Henriques, @chrirupp & A. Vedaldi!
Feed-forward 3D Gaussians from @Oxford_VGG strike again! Flash3D has now been accepted to 3DV 2025: it is a method for feed-forward single-view 3D scene reconstruction. Project page: robots.ox.ac.uk/~vgg/researc… Code: github.com/eldar/flash3d Arxiv: arxiv.org/pdf/2406.04343 A 🧵👇
9
453
Eldar Insafutdinov retweeted
Joao Henriques (joao.science) and I are hiring a fully funded PhD student (UK/international) for the FAIR-Oxford program. The student will spend 50% of their time @UniofOxford and 50% @AIatMeta (FAIR), while completing a DPhil (Oxford PhD). Deadline: 2nd of Dec AOE!!

3
42
231
51,560
Eldar Insafutdinov retweeted
Hi! If you found the gsplat library (docs.gsplat.studio) useful, we wrote a whitepaper with benchmarking, conventions, derivations, and new features (great effort led by @_maturk & @ruilong_li 🙌). arxiv.org/abs/2409.06765

2
29
275
30,922