Hoseung Kim

PrefillShare: A Shared Prefill Module for KV Reuse in Multi-LLM Disaggregated Serving

Feb 12, 2026
Via arXiv

CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs

Dec 19, 2025
Via arXiv