Picture for Guangcong Wang

Guangcong Wang

Style4D-Bench: A Benchmark Suite for 4D Stylization

Add code
Aug 26, 2025
Viaarxiv icon

ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models

Add code
Aug 25, 2025
Viaarxiv icon

Distilled-3DGS:Distilled 3D Gaussian Splatting

Add code
Aug 19, 2025
Viaarxiv icon

Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency

Add code
Mar 26, 2025
Viaarxiv icon

DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding

Add code
Mar 24, 2025
Figure 1 for DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding
Figure 2 for DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding
Figure 3 for DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding
Figure 4 for DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding
Viaarxiv icon

HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding

Add code
Jan 03, 2025
Figure 1 for HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding
Figure 2 for HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding
Figure 3 for HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding
Figure 4 for HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding
Viaarxiv icon

GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting

Add code
Dec 02, 2024
Figure 1 for GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting
Figure 2 for GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting
Figure 3 for GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting
Figure 4 for GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting
Viaarxiv icon

From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding

Add code
Sep 27, 2024
Figure 1 for From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding
Figure 2 for From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding
Figure 3 for From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding
Figure 4 for From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding
Viaarxiv icon

Text-based Talking Video Editing with Cascaded Conditional Diffusion

Add code
Jul 20, 2024
Figure 1 for Text-based Talking Video Editing with Cascaded Conditional Diffusion
Figure 2 for Text-based Talking Video Editing with Cascaded Conditional Diffusion
Figure 3 for Text-based Talking Video Editing with Cascaded Conditional Diffusion
Figure 4 for Text-based Talking Video Editing with Cascaded Conditional Diffusion
Viaarxiv icon

WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation

Add code
Jul 02, 2024
Figure 1 for WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation
Figure 2 for WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation
Figure 3 for WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation
Figure 4 for WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation
Viaarxiv icon