Rundong Su

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Mar 19, 2026
Via arXiv

DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers

Mar 28, 2025
Via arXiv