MoE-nD: Per-Layer Mixture-of-Experts Routing for Multi-Axis KV Cache Compression

Apr 20, 2026
