Picture for Daehyun Ahn

Daehyun Ahn

GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Add code
May 26, 2025
Viaarxiv icon

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference

Add code
Feb 15, 2024
Viaarxiv icon

Squeezing Large-Scale Diffusion Models for Mobile

Add code
Jul 03, 2023
Viaarxiv icon

Temporal Dynamic Quantization for Diffusion Models

Add code
Jun 04, 2023
Viaarxiv icon