Yanyu Li

S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation

Jan 19, 2026
Via arXiv

SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices

Jan 13, 2026
Via arXiv

Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning

Jan 07, 2026
Via arXiv

Taming Diffusion Transformer for Real-Time Mobile Video Generation

Jul 17, 2025
Via arXiv

CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion

May 07, 2025
Via arXiv

H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models

Apr 14, 2025
Via arXiv

Improving the Diffusability of Autoencoders

Feb 20, 2025
Via arXiv

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Dec 17, 2024
Via arXiv

SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device

Dec 13, 2024
Via arXiv

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Dec 12, 2024
Via arXiv