Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sofien Bouaziz

RePose: Learning Deep Kinematic Priors for Fast Human Pose Estimation

Feb 10, 2020

Hossam Isack, Christian Haene, Cem Keskin, Sofien Bouaziz, Yuri Boykov, Shahram Izadi, Sameh Khamis

Figure 1 for RePose: Learning Deep Kinematic Priors for Fast Human Pose Estimation

Figure 2 for RePose: Learning Deep Kinematic Priors for Fast Human Pose Estimation

Figure 3 for RePose: Learning Deep Kinematic Priors for Fast Human Pose Estimation

Figure 4 for RePose: Learning Deep Kinematic Priors for Fast Human Pose Estimation

Abstract:We propose a novel efficient and lightweight model for human pose estimation from a single image. Our model is designed to achieve competitive results at a fraction of the number of parameters and computational cost of various state-of-the-art methods. To this end, we explicitly incorporate part-based structural and geometric priors in a hierarchical prediction framework. At the coarsest resolution, and in a manner similar to classical part-based approaches, we leverage the kinematic structure of the human body to propagate convolutional feature updates between the keypoints or body parts. Unlike classical approaches, we adopt end-to-end training to learn this geometric prior through feature updates from data. We then propagate the feature representation at the coarsest resolution up the hierarchy to refine the predicted pose in a coarse-to-fine fashion. The final network effectively models the geometric prior and intuition within a lightweight deep neural network, yielding state-of-the-art results for a model of this size on two standard datasets, Leeds Sports Pose and MPII Human Pose.

Via

Access Paper or Ask Questions

CvxNets: Learnable Convex Decomposition

Sep 12, 2019

Boyang Deng, Kyle Genova, Soroosh Yazdani, Sofien Bouaziz, Geoffrey Hinton, Andrea Tagliasacchi

Figure 1 for CvxNets: Learnable Convex Decomposition

Figure 2 for CvxNets: Learnable Convex Decomposition

Figure 3 for CvxNets: Learnable Convex Decomposition

Figure 4 for CvxNets: Learnable Convex Decomposition

Abstract:Any solid object can be decomposed into a collection of convex polytopes (in short, convexes). When a small number of convexes are used, such a decomposition can be thought of as a piece-wise approximation of the geometry. This decomposition is fundamental to real-time physics simulation in computer graphics, where it creates a unifying representation of dynamic geometry for collision detection. A convex object also has the property of being simultaneously an explicit and implicit representation: one can interpret it explicitly as a mesh derived by computing the vertices of a convex hull, or implicitly as the collection of half-space constraints or support functions. Their implicit representation makes them particularly well suited for neural network training, as they abstract away from the topology of the geometry they need to represent. We introduce a network architecture to represent a low dimensional family of convexes. This family is automatically derived via an autoencoding process. We investigate the applications of the network including automatic convex decomposition, image to 3D reconstruction, and part-based shape retrieval.

Via

Access Paper or Ask Questions

Multiview Aggregation for Learning Category-Specific Shape Reconstruction

Jul 01, 2019

Srinath Sridhar, Davis Rempe, Julien Valentin, Sofien Bouaziz, Leonidas J. Guibas

Figure 1 for Multiview Aggregation for Learning Category-Specific Shape Reconstruction

Figure 2 for Multiview Aggregation for Learning Category-Specific Shape Reconstruction

Figure 3 for Multiview Aggregation for Learning Category-Specific Shape Reconstruction

Figure 4 for Multiview Aggregation for Learning Category-Specific Shape Reconstruction

Abstract:We investigate the problem of learning category-specific 3D surface shape reconstruction from a variable number of RGB views of previously unobserved object instances. Most approaches for multiview shape reconstruction operate on sparse shape representations, or assume a fixed number of views. We present a method that can estimate dense 3D shape, and aggregate shape across multiple and varying number of input views. Given a single input view of an object instance, we propose a representation that encodes the dense shape of the visible object surface parts as well as the surface behind line of sight and occluded by the visible surface. When multiple input views are available, the shape representation is designed to be aggregated into a single 3D shape using an inexpesive union operation. We train a 2D CNN to learn to predict this representation from a variable number of views (1 or more). We further aggregate multiview information by using permutation equivariant layers that promote order-agnostic view information exchange at the feature level. Experiments show that our approach is able to produce dense reconstructions of objects, and is able to produce better results as more views are added.

* Under review

Via

Access Paper or Ask Questions