Picture for John P. Shen

John P. Shen

TaxBreak: Unmasking the Hidden Costs of LLM Inference Through Overhead Decomposition

Add code
Mar 12, 2026
Viaarxiv icon