Pierre Abillama

Memory-Efficient Acceleration of Block Low-Rank Foundation Models on Resource Constrained GPUs

Dec 24, 2025
Via arXiv

MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention

May 24, 2025
Via arXiv