Picture for Song-Hai Zhang

Song-Hai Zhang

TopoMesh: High-Fidelity Mesh Autoencoding via Topological Unification

Add code
Mar 25, 2026
Viaarxiv icon

UCM: Unifying Camera Control and Memory with Time-aware Positional Encoding Warping for World Models

Add code
Feb 26, 2026
Viaarxiv icon

DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation

Add code
Jul 02, 2025
Figure 1 for DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
Figure 2 for DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
Figure 3 for DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
Figure 4 for DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
Viaarxiv icon

GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

Add code
Apr 01, 2025
Figure 1 for GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Figure 2 for GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Figure 3 for GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Figure 4 for GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Viaarxiv icon

Splatter-360: Generalizable 360$^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images

Add code
Dec 09, 2024
Viaarxiv icon

DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion

Add code
Oct 31, 2024
Figure 1 for DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Figure 2 for DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Figure 3 for DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Figure 4 for DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Viaarxiv icon

3D Gaussian Editing with A Single Image

Add code
Aug 14, 2024
Figure 1 for 3D Gaussian Editing with A Single Image
Figure 2 for 3D Gaussian Editing with A Single Image
Figure 3 for 3D Gaussian Editing with A Single Image
Figure 4 for 3D Gaussian Editing with A Single Image
Viaarxiv icon

MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing

Add code
Apr 29, 2024
Viaarxiv icon

Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing

Add code
Mar 15, 2024
Figure 1 for Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing
Figure 2 for Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing
Figure 3 for Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing
Figure 4 for Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing
Viaarxiv icon

MAL: Motion-Aware Loss with Temporal and Distillation Hints for Self-Supervised Depth Estimation

Add code
Feb 18, 2024
Figure 1 for MAL: Motion-Aware Loss with Temporal and Distillation Hints for Self-Supervised Depth Estimation
Figure 2 for MAL: Motion-Aware Loss with Temporal and Distillation Hints for Self-Supervised Depth Estimation
Figure 3 for MAL: Motion-Aware Loss with Temporal and Distillation Hints for Self-Supervised Depth Estimation
Figure 4 for MAL: Motion-Aware Loss with Temporal and Distillation Hints for Self-Supervised Depth Estimation
Viaarxiv icon