Picture for Hsin-Ying Lee

Hsin-Ying Lee

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control

Add code
Jul 17, 2024
Figure 1 for VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Figure 2 for VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Figure 3 for VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Figure 4 for VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Viaarxiv icon

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models

Add code
Jun 11, 2024
Figure 1 for 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models
Figure 2 for 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models
Figure 3 for 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models
Figure 4 for 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models
Viaarxiv icon

GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement

Add code
Jun 09, 2024
Viaarxiv icon

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

Add code
May 28, 2024
Figure 1 for 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
Figure 2 for 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
Figure 3 for 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
Figure 4 for 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
Viaarxiv icon

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Add code
Feb 29, 2024
Figure 1 for Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Figure 2 for Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Figure 3 for Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Figure 4 for Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Viaarxiv icon

Visual Concept-driven Image Generation with Text-to-Image Diffusion Model

Add code
Feb 18, 2024
Viaarxiv icon

AToM: Amortized Text-to-Mesh using 2D Diffusion

Add code
Feb 01, 2024
Figure 1 for AToM: Amortized Text-to-Mesh using 2D Diffusion
Figure 2 for AToM: Amortized Text-to-Mesh using 2D Diffusion
Figure 3 for AToM: Amortized Text-to-Mesh using 2D Diffusion
Figure 4 for AToM: Amortized Text-to-Mesh using 2D Diffusion
Viaarxiv icon

Diffusion Priors for Dynamic View Synthesis from Monocular Videos

Add code
Jan 10, 2024
Viaarxiv icon

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

Add code
Jan 04, 2024
Viaarxiv icon

Virtual Pets: Animatable Animal Generation in 3D Scenes

Add code
Dec 21, 2023
Viaarxiv icon