Picture for Jan Eitzinger

Jan Eitzinger

Move the Query, Not the Cache: Characterizing Cross-Instance Latent Attention Redistribution Across GPU Fabrics

Add code
May 31, 2026
Viaarxiv icon

Leyline: KV Cache Directives for Agentic Inference

Add code
May 31, 2026
Viaarxiv icon

Diagnosing Overhead in Dispatch Operations: Cross-architecture Observatory

Add code
May 20, 2026
Viaarxiv icon

The Illusion of Power Capping in LLM Decode: A Phase-Aware Energy Characterisation Across Attention Architectures

Add code
May 12, 2026
Viaarxiv icon

Irminsul: MLA-Native Position-Independent Caching for Agentic LLM Serving

Add code
May 07, 2026
Viaarxiv icon