Janna Lu

RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale

May 07, 2025
Via arXiv

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Mar 18, 2025
Via arXiv