Picture for Jinxin Yu

Jinxin Yu

From Buffers to Registers: Unlocking Fine-Grained FlashAttention with Hybrid-Bonded 3D NPU Co-Design

Add code
Feb 11, 2026
Viaarxiv icon