Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Guangzhao He

Category-Agnostic Neural Object Rigging

May 26, 2025

Guangzhao He, Chen Geng, Shangzhe Wu, Jiajun Wu

Abstract:The motion of deformable 4D objects lies in a low-dimensional manifold. To better capture the low dimensionality and enable better controllability, traditional methods have devised several heuristic-based methods, i.e., rigging, for manipulating dynamic objects in an intuitive fashion. However, such representations are not scalable due to the need for expert knowledge of specific categories. Instead, we study the automatic exploration of such low-dimensional structures in a purely data-driven manner. Specifically, we design a novel representation that encodes deformable 4D objects into a sparse set of spatially grounded blobs and an instance-aware feature volume to disentangle the pose and instance information of the 3D shape. With such a representation, we can manipulate the pose of 3D objects intuitively by modifying the parameters of the blobs, while preserving rich instance-specific information. We evaluate the proposed method on a variety of object categories and demonstrate the effectiveness of the proposed framework. Project page: https://guangzhaohe.com/canor

* Accepted to CVPR 2025. Project Page: https://guangzhaohe.com/canor

Via

Access Paper or Ask Questions

EasyVolcap: Accelerating Neural Volumetric Video Research

Dec 11, 2023

Zhen Xu, Tao Xie, Sida Peng, Haotong Lin, Qing Shuai, Zhiyuan Yu, Guangzhao He, Jiaming Sun, Hujun Bao, Xiaowei Zhou

Figure 1 for EasyVolcap: Accelerating Neural Volumetric Video Research

Figure 2 for EasyVolcap: Accelerating Neural Volumetric Video Research

Figure 3 for EasyVolcap: Accelerating Neural Volumetric Video Research

Figure 4 for EasyVolcap: Accelerating Neural Volumetric Video Research

Abstract:Volumetric video is a technology that digitally records dynamic events such as artistic performances, sporting events, and remote conversations. When acquired, such volumography can be viewed from any viewpoint and timestamp on flat screens, 3D displays, or VR headsets, enabling immersive viewing experiences and more flexible content creation in a variety of applications such as sports broadcasting, video conferencing, gaming, and movie productions. With the recent advances and fast-growing interest in neural scene representations for volumetric video, there is an urgent need for a unified open-source library to streamline the process of volumetric video capturing, reconstruction, and rendering for both researchers and non-professional users to develop various algorithms and applications of this emerging technology. In this paper, we present EasyVolcap, a Python & Pytorch library for accelerating neural volumetric video research with the goal of unifying the process of multi-view data processing, 4D scene reconstruction, and efficient dynamic volumetric video rendering. Our source code is available at https://github.com/zju3dv/EasyVolcap.

* SIGGRAPH Asia 2023 Technical Communications. Source code: https://github.com/zju3dv/EasyVolcap

Via

Access Paper or Ask Questions

4K4D: Real-Time 4D View Synthesis at 4K Resolution

Oct 28, 2023

Zhen Xu, Sida Peng, Haotong Lin, Guangzhao He, Jiaming Sun, Yujun Shen, Hujun Bao, Xiaowei Zhou

Figure 1 for 4K4D: Real-Time 4D View Synthesis at 4K Resolution

Figure 2 for 4K4D: Real-Time 4D View Synthesis at 4K Resolution

Figure 3 for 4K4D: Real-Time 4D View Synthesis at 4K Resolution

Figure 4 for 4K4D: Real-Time 4D View Synthesis at 4K Resolution

Abstract:This paper targets high-fidelity and real-time view synthesis of dynamic 3D scenes at 4K resolution. Recently, some methods on dynamic view synthesis have shown impressive rendering quality. However, their speed is still limited when rendering high-resolution images. To overcome this problem, we propose 4K4D, a 4D point cloud representation that supports hardware rasterization and enables unprecedented rendering speed. Our representation is built on a 4D feature grid so that the points are naturally regularized and can be robustly optimized. In addition, we design a novel hybrid appearance model that significantly boosts the rendering quality while preserving efficiency. Moreover, we develop a differentiable depth peeling algorithm to effectively learn the proposed model from RGB videos. Experiments show that our representation can be rendered at over 400 FPS on the DNA-Rendering dataset at 1080p resolution and 80 FPS on the ENeRF-Outdoor dataset at 4K resolution using an RTX 4090 GPU, which is 30x faster than previous methods and achieves the state-of-the-art rendering quality. Our project page is available at https://zju3dv.github.io/4k4d/.

* Project Page: https://zju3dv.github.io/4k4d

Via

Access Paper or Ask Questions