Picture for Zhe Lin

Zhe Lin

Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models

Add code
Jun 08, 2025
Viaarxiv icon

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Add code
May 22, 2025
Viaarxiv icon

ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration

Add code
Apr 11, 2025
Viaarxiv icon

TurboFill: Adapting Few-step Text-to-image Model for Fast Image Inpainting

Add code
Apr 01, 2025
Viaarxiv icon

Robust Latent Matters: Boosting Image Generation with Sampling Error

Add code
Mar 11, 2025
Viaarxiv icon

ObjectMover: Generative Object Movement with Video Prior

Add code
Mar 11, 2025
Viaarxiv icon

Multitwine: Multi-Object Compositing with Text and Layout Control

Add code
Feb 07, 2025
Viaarxiv icon

TransPixar: Advancing Text-to-Video Generation with Transparency

Add code
Jan 06, 2025
Figure 1 for TransPixar: Advancing Text-to-Video Generation with Transparency
Figure 2 for TransPixar: Advancing Text-to-Video Generation with Transparency
Figure 3 for TransPixar: Advancing Text-to-Video Generation with Transparency
Figure 4 for TransPixar: Advancing Text-to-Video Generation with Transparency
Viaarxiv icon

Generative Video Propagation

Add code
Dec 27, 2024
Viaarxiv icon

Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers

Add code
Dec 22, 2024
Viaarxiv icon