Sergey Tulyakov

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control

Jul 17, 2024
Via arXiv

Efficient Training with Denoised Neural Weights

Jul 16, 2024
Via arXiv

VIMI: Grounding Video Generation through Multi-modal Instruction

Jul 08, 2024
Via arXiv

Taming Data and Transformers for Audio Generation

Jun 27, 2024
Via arXiv

Lightweight Predictive 3D Gaussian Splats

Jun 27, 2024
Via arXiv

VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing

Jun 18, 2024
Via arXiv

Hierarchical Patch Diffusion Models for High-Resolution Video Generation

Jun 12, 2024
Via arXiv

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models

Jun 11, 2024
Via arXiv

GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement

Jun 09, 2024
Via arXiv

SF-V: Single Forward Video Generation Model

Jun 06, 2024
Via arXiv