Alert button

Gated Linear Attention Transformers with Hardware-Efficient Training

Dec 24, 2023
Songlin Yang, Bailin Wang, Yikang Shen, Rameswar Panda, Yoon Kim

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: