Picture for Sergey Tulyakov

Sergey Tulyakov

Nested Attention: Semantic-aware Attention Values for Concept Personalization

Add code
Jan 02, 2025
Viaarxiv icon

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

Add code
Dec 19, 2024
Figure 1 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Figure 2 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Figure 3 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Figure 4 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Viaarxiv icon

Wonderland: Navigating 3D Scenes from a Single Image

Add code
Dec 16, 2024
Figure 1 for Wonderland: Navigating 3D Scenes from a Single Image
Figure 2 for Wonderland: Navigating 3D Scenes from a Single Image
Figure 3 for Wonderland: Navigating 3D Scenes from a Single Image
Figure 4 for Wonderland: Navigating 3D Scenes from a Single Image
Viaarxiv icon

SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device

Add code
Dec 13, 2024
Figure 1 for SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
Figure 2 for SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
Figure 3 for SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
Figure 4 for SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
Viaarxiv icon

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Add code
Dec 12, 2024
Viaarxiv icon

Omni-ID: Holistic Identity Representation Designed for Generative Tasks

Add code
Dec 12, 2024
Viaarxiv icon

Video Motion Transfer with Diffusion Transformers

Add code
Dec 10, 2024
Figure 1 for Video Motion Transfer with Diffusion Transformers
Figure 2 for Video Motion Transfer with Diffusion Transformers
Figure 3 for Video Motion Transfer with Diffusion Transformers
Figure 4 for Video Motion Transfer with Diffusion Transformers
Viaarxiv icon

Mind the Time: Temporally-Controlled Multi-Event Video Generation

Add code
Dec 06, 2024
Figure 1 for Mind the Time: Temporally-Controlled Multi-Event Video Generation
Figure 2 for Mind the Time: Temporally-Controlled Multi-Event Video Generation
Figure 3 for Mind the Time: Temporally-Controlled Multi-Event Video Generation
Figure 4 for Mind the Time: Temporally-Controlled Multi-Event Video Generation
Viaarxiv icon

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion

Add code
Dec 05, 2024
Figure 1 for 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Figure 2 for 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Figure 3 for 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Figure 4 for 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Viaarxiv icon

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Add code
Dec 02, 2024
Figure 1 for AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
Figure 2 for AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
Figure 3 for AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
Figure 4 for AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
Viaarxiv icon