Title: FroDO: From Detections to 3D Objects

May 11, 2020

Kejie Li, Martin Rünz, Meng Tang, Lingni Ma, Chen Kong, Tanner Schmidt, Ian Reid, Lourdes Agapito, Julian Straub, Steven Lovegrove (+1 more)

Abstract: Object-oriented maps are important for scene understanding because they jointly capture geometry and semantics and allow individual instantiation of, and meaningful reasoning about, objects. We introduce FroDO, a method for accurate 3D reconstruction of object instances from RGB video that infers object location, pose, and shape in a coarse-to-fine manner. Key to FroDO is embedding object shapes in a novel learnt space that allows seamless switching between sparse point cloud and dense DeepSDF decoding. Given an input sequence of localized RGB frames, FroDO first aggregates 2D detections to instantiate a category-aware 3D bounding box per object. An encoder network then regresses a shape code, after which shape and pose are optimized further under the learnt shape priors using both sparse and dense shape representations. The optimization uses multi-view geometric, photometric, and silhouette losses. We evaluate on real-world datasets, including Pix3D, Redwood-OS, and ScanNet, for single-view, multi-view, and multi-object reconstruction.

* To be published in CVPR 2020. The first two authors contributed equally.
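
The first stage described in the abstract, aggregating per-frame 2D detections into a 3D object instance, can be illustrated with a small sketch. This is not the authors' implementation. It assumes that each detection's box center has already been back-projected into a viewing ray (camera center plus direction) using the known camera poses, and it estimates a single object center as the least-squares point closest to all rays. The function name `nearest_point_to_rays` and the helper `_det3` are hypothetical, chosen only for this illustration.

```python
from typing import List, Sequence, Tuple


def _det3(m: Sequence[Sequence[float]]) -> float:
    """Determinant of a 3x3 matrix (hypothetical helper for this sketch)."""
    return (
        m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
        - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
        + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0])
    )


def nearest_point_to_rays(
    origins: Sequence[Sequence[float]],
    directions: Sequence[Sequence[float]],
) -> Tuple[float, float, float]:
    """Least-squares 3D point closest to a set of rays.

    Assumption: each ray is the back-projection of one 2D detection center.
    Minimizes the sum of squared perpendicular distances to the rays by
    solving  sum_k (I - u_k u_k^T) p = sum_k (I - u_k u_k^T) o_k.
    """
    A: List[List[float]] = [[0.0] * 3 for _ in range(3)]
    b: List[float] = [0.0] * 3
    for o, d in zip(origins, directions):
        norm = sum(c * c for c in d) ** 0.5
        u = [c / norm for c in d]  # unit ray direction
        for i in range(3):
            for j in range(3):
                # Entry of the projector onto the plane orthogonal to the ray.
                p_ij = (1.0 if i == j else 0.0) - u[i] * u[j]
                A[i][j] += p_ij
                b[i] += p_ij * o[j]

    det = _det3(A)
    if abs(det) < 1e-12:
        raise ValueError("rays are (nearly) parallel; the point is not determined")

    # Solve the 3x3 system with Cramer's rule.
    solution = []
    for k in range(3):
        A_k = [row[:] for row in A]
        for i in range(3):
            A_k[i][k] = b[i]
        solution.append(_det3(A_k) / det)
    return (solution[0], solution[1], solution[2])


# Three cameras looking at the same point (1, 2, 3) from different positions.
cam_origins = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (0.0, 5.0, 0.0)]
cam_rays = [(1.0, 2.0, 3.0), (-4.0, 2.0, 3.0), (1.0, -3.0, 3.0)]
center = nearest_point_to_rays(cam_origins, cam_rays)
```

With noisy detections the rays no longer intersect exactly, and the same solve returns the point that best agrees with all views. A full system would still need to estimate the extent and orientation of the bounding box, which this sketch does not attempt.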
