Introducing: Long-Tail Internet Photo Reconstruction (CVPRā26)
We go beyond densely captured imagery to train more general 3D foundation models for the long tail of noisy, sparse, incomplete Internet photo collections of 3D scenes. Yet, we face a data bottleneck: models need ground truth for these long-tail scenes, which classical SfM fails to provide. How do we bypass it?
We break this bottleneck with two key contributions. First, we introduceĀ MegaDepth-X, a large new dataset of scenes with high-quality 3D supervision. Second, we propose a new way to simulate difficult image sets for training.
Project page:
megadepth-x.github.io/