Picture for Byeongcheol Kim

Byeongcheol Kim

ELMoE-3D: Leveraging Intrinsic Elasticity of MoE for Hybrid-Bonding-Enabled Self-Speculative Decoding in On-Premises Serving

Add code
Apr 16, 2026
Viaarxiv icon

SeVeDo: A Heterogeneous Transformer Accelerator for Low-Bit Inference via Hierarchical Group Quantization and SVD-Guided Mixed Precision

Add code
Dec 15, 2025
Viaarxiv icon