ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs

Feb 06, 2024
Zhengyan Zhang, Yixin Song, Guanghui Yu, Xu Han, Yankai Lin, Chaojun Xiao, Chenyang Song, Zhiyuan Liu, Zeyu Mi, Maosong Sun

View paper on arXiv