Picture for Daehyun Ahn

Daehyun Ahn

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference

Add code
Feb 15, 2024
Viaarxiv icon

Squeezing Large-Scale Diffusion Models for Mobile

Add code
Jul 03, 2023
Figure 1 for Squeezing Large-Scale Diffusion Models for Mobile
Figure 2 for Squeezing Large-Scale Diffusion Models for Mobile
Figure 3 for Squeezing Large-Scale Diffusion Models for Mobile
Figure 4 for Squeezing Large-Scale Diffusion Models for Mobile
Viaarxiv icon

Temporal Dynamic Quantization for Diffusion Models

Add code
Jun 04, 2023
Figure 1 for Temporal Dynamic Quantization for Diffusion Models
Figure 2 for Temporal Dynamic Quantization for Diffusion Models
Figure 3 for Temporal Dynamic Quantization for Diffusion Models
Figure 4 for Temporal Dynamic Quantization for Diffusion Models
Viaarxiv icon