Picture for Abdoul Majid O. Thiombiano

Abdoul Majid O. Thiombiano

MoxE: Mixture of xLSTM Experts with Entropy-Aware Routing for Efficient Language Modeling

Add code
May 01, 2025
Viaarxiv icon

Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures

Add code
Mar 24, 2025
Viaarxiv icon