Eldar Insafutdinov

Tweets

Pinned Tweet

Eldar Insafutdinov @EldarIsTyping

Jan 15

Excited to share our latest work on dynamic 3D reconstruction!

Edgar Sucar @SucarEdgar

Jan 15

Introducing V-DPM, for 4D reconstruction of in-the-wild videos. We build on top of VGGT, using Dynamic Point Maps for jointly representing 3D and motion. Joint work with: @EldarIsTyping, @LaiZihang, and Andrea Vedaldi. @Oxford_VGG. Check out the demo and code 👇

3:26

443

Eldar Insafutdinov @EldarIsTyping

Jun 7

This afternoon we will be presenting CoWTracker by the amazing Zihang Lai, who unfortunately could not be at @CVPR in person. Stop by poster 594 (15:30 - 17:30) to learn about the new SOTA in dense point tracking. Project page: cowtracker.github.io/ @SucarEdgar @Oxford_VGG

0:56

4,356

Eldar Insafutdinov retweeted

Jose

@josesaezmerino

28 Nov 2025

My brother is a senior designer at Figma. He is insanely cracked. I sent him this image and asked him what it would take to build it today. I will never forget his answer… "We can't, we don't know how to do it."

rushilshah

@rushilshah_x

27 Nov 2025

This UI still looks better than half the apps today.

592

4,288

54,511

4,264,059

Eldar Insafutdinov retweeted

Saining Xie

@sainingxie

14 Nov 2025

papers are kind of like movies: the first one is usually the best, and the sequels tend to get more complicated but not really more exciting. But that totally doesn’t apply to the DepthAnything series. @bingyikang's team somehow keeps making things simpler and more scalable each time. in this new version, they basically show that a strong representation encoder plus a depth-ray prediction objective is enough (you see the RAE vibes too, right?) to get solid, general spatial perception across a bunch of tasks. people often say they hate computer vision because it’s messy--too many tasks, too many data types, too many moving parts. but that’s exactly why I love it. I think the biggest AI breakthroughs are going to come quietly from vision and then suddenly leapfrog everything else, changing how AI interacts with the real world and with us. pretty soon we’ll realize vision is not a big list of tasks--it’s a perspective. a perspective about modeling continuous sensory data, building layered representations of the world, and inching toward human-like intelligence. and tbh we’re watching this happen every day, behind all the hype, as all these different "tasks" slowly start to merge.

Bingyi Kang

@bingyikang

14 Nov 2025

After a year of teamwork, we're thrilled to introduce Depth Anything 3 (DA3)! 🚀 Aiming for human-like spatial perception, DA3 extends monocular depth estimation to any-view scenarios, including single images, multi-view images, and video.
In pursuit of minimal modeling, DA3 reveals two key insights:
💎 A plain transformer (e.g., vanilla DINO) is enough. No specialized architecture.
✨ A single depth-ray representation is enough. No complex 3D tasks.
Three series of models have been released: the main DA3 series, a monocular metric estimation series, and a monocular depth estimation series.
The core team members, aside from me: @HaotongLin, Sili Chen, Jun Hao Liew, @donydchen. 👇(1/n) #DepthAnything3

0:33

514

76,223

Eldar Insafutdinov retweeted

MrNeRF

@janusch_patas

3 Nov 2025

Europe Builds. Others Profit. 3D Gaussian Splatting (3DGS) is the perfect case study. It reflects both Europe’s brilliance and its chronic inability to turn that brilliance into business. Almost everything that made 3DGS possible was born in Europe. From the early breakthroughs in point-based rasterization in Switzerland to the cumulative research from Austria, Greece, and Germany executed in France, Europe built the foundation. No other continent can match that level of scientific collaboration and intellectual strength. The LichtFeld Studio bounty later confirmed it: the biggest performance leaps came straight out of European labs. The science was here. The innovation was here. The talent was here. But the business was not. When 3DGS exploded, my inbox filled with messages from US-based companies, not from Europe. In the United States, Luma AI and Polycam turned the paper into products within weeks. They did not wait for funding programs or EU consortia. They simply built. Then came China, which not only caught up in research but quickly outpaced everyone in commercialization. XGRID, DJI, and many others built thriving businesses around what Europe invented. Today, most 3DGS papers come from Chinese institutions rather than European ones. Meanwhile, the usual giants such as Meta, NVIDIA, Google, Netflix, and Tesla continue to iterate, integrate, and push forward. A thriving ecosystem of startups like World Labs leverages this technology to create new products and markets. The innovation cycle in the United States and China is fast, relentless, and market-driven. Europe, in contrast, remains bureaucratic and slow. We fund excellence and celebrate publications, but we rarely ship, even though some small startups are trying to change the status quo. Our researchers create the breakthroughs; others create the successful products. 
Until Europe finds a way to bridge the gap between laboratories and markets, it will remain the world’s research and development department: brilliant, underpaid, and underleveraged. Research is Europe’s comfort zone. Execution must become its strength. Video: One of my dynamic 3D Gaussian implementations based on the paper "Representing Long Volumetric Video with Temporal Gaussian Hierarchy."

1:19

159

1,257

159,204

Eldar Insafutdinov retweeted

Visual Geometry Group (VGG) @Oxford_VGG

30 Oct 2025

We are seeking a full-time Postdoctoral Research Assistant in Computer Vision to join the Visual Geometry Group (University of Oxford) to work on 3D and Spatial AI with Professor Andrea Vedaldi. The post is funded by ERC and is fixed-term for two years with a possible extension.

14,913

Eldar Insafutdinov @EldarIsTyping

30 Aug 2025

RT @Oxford_VGG: Many congratulations to Prof. Andrea Vedaldi for winning one of the first Royal Society Faraday Discovery Fellowships from…

Exceptional Oxford researchers awarded first Royal Society Faraday Discovery Fellowships

Three pioneering Oxford researchers are among the first recipients of the Royal Society Faraday Discovery Fellowships, prestigious long-term awards to support exceptional mid-career research leaders...

ox.ac.uk

Eldar Insafutdinov retweeted

Vlad Golyanik @VGolyanik

22 Aug 2025

Breaking news: A large-scale, publicly accessible dataset of multi-view OLAT full-body human captures now exists. Accepted at #ICCV2025! A collaboration between MPI-INF and NVIDIA led by Timo Teufel. For @jankautz and me, this is our second joint work. vcai.mpi-inf.mpg.de/projects…

0:15

110

8,127

Eldar Insafutdinov retweeted

Angjoo Kanazawa @akanazawa

12 Aug 2025

Viser completely changed the way we do research. Before viser, it was hard to visualize 3D/4D data, let alone share it. Now it’s all just in a browser! It’s amazingly powerful and looks awesome. It’s how we render our results and videos. We love it and hope you will too!

Brent Yi @brenthyi

31 Jul 2025

July has been a big month for Viser!
- Released v1.0.0 😊
- We did some writing
Some demos 👇

0:52

344

23,603

Eldar Insafutdinov retweeted

Giovanni M Farinella @GMFarinella

8 Jul 2025

Andrea Vedaldi @ ICVSS 2025

908

Eldar Insafutdinov retweeted

Tomas Jakab @JakabTomas

26 Jun 2025

Very excited about this! Thank you Runjia for your hard work! If you haven’t had the chance, try our demo showcasing our geometry-based memory module for interactive video generators 🕹️🎞️ huggingface.co/spaces/liguan…

V-MEM - a Hugging Face Space by liguang0115

Upload or select an image to navigate through its 3D scene. Control camera movements to move forward, backward, and turn, generating new views in real-time.

huggingface.co

Runjia Li @RunjiaLi

26 Jun 2025

🎉 VMem is officially accepted to ICCV 2025! Excited to chat with everyone in Hawaii about making video generation consistent and interactive with our Surfel-Indexed View Memory 🏝️🎥 Also, huge thanks to my insanely helpful coauthors!

488

Eldar Insafutdinov retweeted

Dylan Campbell @dylanjcampbell_

26 Jun 2025

Call for papers: The 26th International Conference on Digital Image Computing: Techniques and Applications (@dicta2025)
Dates: 3-5 December 2025
Location: Adelaide Convention Centre, Adelaide, Australia
Paper Submission: 15 July, 2025 AoE
Website: dicta2025.dictaconference.or…

514

Eldar Insafutdinov @EldarIsTyping

24 Jun 2025

I'm excited for Chuanxia's next step! It was a pleasure to work with him and I couldn't think of a better collaborator and mentor. If you are looking for a PhD, don't miss out and apply!

Chuanxia Zheng @ChuanxiaZ

24 Jun 2025

After two amazing years with @Oxford_VGG, I will be joining @NTUsg as a Nanyang Assistant Professor in Fall 2025! I’ll be leading the Physical Vision Group (physicalvision.github.io) — and we're hiring for next year!🚀 If you're passionate about vision or AI, get in touch!

0:48

1,428

Eldar Insafutdinov retweeted

Tomas Jakab @JakabTomas

24 Jun 2025

Excited to share VMem: a novel memory mechanism for consistent video scene generation 🎞️✨ VMem evolves its understanding of scene geometry to retrieve the most relevant past frames, enabling long-term consistency 🌐 v-mem.github.io 🤗 huggingface.co/spaces/liguan… 1/ 🧵

VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

VMem introduces a novel surfel-indexed memory module for consistent autoregressive video scene generation. To appear in ICCV 2025.

v-mem.github.io

@_akhaliq

24 Jun 2025

VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

1:00

15,792

Eldar Insafutdinov retweeted

Tomas Jakab @JakabTomas

12 Jun 2025

We are presenting Dual Point Maps as a #CVPR highlight tomorrow! Learn about our novel, data-efficient representation for 3D/4D deformable objects—an alternative to classical template shape models. 📍🕑 ExHall D, Poster #100, afternoon session 🌍dualpm.github.io

0:30

2,577

Eldar Insafutdinov retweeted

Visual Geometry Group (VGG) @Oxford_VGG

13 Jun 2025

Many Congratulations to @jianyuan_wang, @MinghaoChen23, @n_karaev, Andrea Vedaldi, Christian Rupprecht and @davnov134 for winning the Best Paper Award @CVPR for "VGGT: Visual Geometry Grounded Transformer" 🥇🎉 🙌🙌 #CVPR2025!!!!!!

489

46,248

Eldar Insafutdinov retweeted

Jensen Zhou @jensenzhoujh

18 Mar 2025

Hi there, 🎉 We are thrilled to introduce Stable Virtual Camera, a generalist diffusion model designed to address the exciting challenge of Novel View Synthesis (NVS). With just one or a few images, it allows you to create a smooth trajectory video from any viewpoint you desire. We’re naming this model in tribute to the Virtual Camera cinematography technology. @StabilityAI
🏠 Project Page: stable-virtual-camera.github…
📄 Paper: stable-virtual-camera.github…
📃 Blog: stability.ai/news/introducin…
💻 Code: github.com/Stability-AI/stab…
🤗 Model Card: huggingface.co/stabilityai/s…
🚀 Gradio Demo: huggingface.co/spaces/stabil…
🎬 Video: youtube.com/channel/UCLLlVDc…

1:00

165

26,781

Eldar Insafutdinov @EldarIsTyping

25 Nov 2024

Check out our work on feed-forward (in a flash) 3D scene reconstruction! Our method transforms a monocular depth estimation network into a generalisable single-view 3D reconstructor. Thanks to @StanSzymanowicz, @ChuanxiaZ, @dylanjcampbell_, J. Henriques, @chrirupp & A. Vedaldi!

Stan Szymanowicz

@StanSzymanowicz

22 Nov 2024

Feed-forward 3D Gaussians from @Oxford_VGG strike again! Flash3D has now been accepted to 3DV 2025: it is a method for feed-forward single-view 3D scene reconstruction.
Project page: robots.ox.ac.uk/~vgg/researc…
Code: github.com/eldar/flash3d
Arxiv: arxiv.org/pdf/2406.04343
A 🧵👇

0:28

453

Eldar Insafutdinov retweeted

Jakob Foerster

@j_foerst

22 Nov 2024

Joao Henriques (joao.science) and I are hiring a fully funded PhD student (UK/international) for the FAIR-Oxford program. The student will spend 50% of their time @UniofOxford and 50% @AIatMeta (FAIR), while completing a DPhil (Oxford PhD). Deadline: 2nd of Dec AOE!!

231

51,560

Eldar Insafutdinov retweeted

Angjoo Kanazawa @akanazawa

6 Nov 2024

Hi! If you found the gsplat library (docs.gsplat.studio) useful, we wrote a whitepaper with benchmarking, conventions, derivations, and new features (great effort led by @_maturk & @ruilong_li 🙌). arxiv.org/abs/2409.06765

275

30,922