Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Jul 08, 2025

Aleksandar Jevtić, Christoph Reich, Felix Wimbauer, Oliver Hahn, Christian Rupprecht, Stefan Roth, Daniel Cremers

Figure 1 for Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Figure 2 for Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Figure 3 for Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Figure 4 for Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Share this with someone who'll enjoy it:

Abstract:Semantic scene completion (SSC) aims to infer both the 3D geometry and semantics of a scene from single images. In contrast to prior work on SSC that heavily relies on expensive ground-truth annotations, we approach SSC in an unsupervised setting. Our novel method, SceneDINO, adapts techniques from self-supervised representation learning and 2D unsupervised scene understanding to SSC. Our training exclusively utilizes multi-view consistency self-supervision without any form of semantic or geometric ground truth. Given a single input image, SceneDINO infers the 3D geometry and expressive 3D DINO features in a feed-forward manner. Through a novel 3D feature distillation approach, we obtain unsupervised 3D semantics. In both 3D and 2D unsupervised scene understanding, SceneDINO reaches state-of-the-art segmentation accuracy. Linear probing our 3D features matches the segmentation accuracy of a current supervised SSC approach. Additionally, we showcase the domain generalization and multi-view consistency of SceneDINO, taking the first steps towards a strong foundation for single image 3D scene understanding.

* To appear at ICCV 2025. Christoph Reich and Aleksandar Jevti\'c - both authors contributed equally. Code: https://github.com/tum-vision/scenedino Project page: https://visinf.github.io/scenedino

View paper on

Share this with someone who'll enjoy it:

Title:Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Paper and Code