Alert button
Picture for Eric Alcaide

Eric Alcaide

Alert button

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Add code
Bookmark button
Alert button
Apr 10, 2024
Bo Peng, Daniel Goldstein, Quentin Anthony, Alon Albalak, Eric Alcaide, Stella Biderman, Eugene Cheah, Xingjian Du, Teddy Ferdinan, Haowen Hou, Przemysław Kazienko, Kranthi Kiran GV, Jan Kocoń, Bartłomiej Koptyra, Satyapriya Krishna, Ronald McClelland Jr., Niklas Muennighoff, Fares Obeid, Atsushi Saito, Guangyu Song, Haoqin Tu, Stanisław Woźniak, Ruichong Zhang, Bingchen Zhao, Qihang Zhao, Peng Zhou, Jian Zhu, Rui-Jie Zhu

Viaarxiv icon

RWKV: Reinventing RNNs for the Transformer Era

Add code
Bookmark button
Alert button
May 22, 2023
Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran GV, Xuzheng He, Haowen Hou, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Xiangru Tang, Bolun Wang, Johan S. Wind, Stansilaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Jian Zhu, Rui-Jie Zhu

Figure 1 for RWKV: Reinventing RNNs for the Transformer Era
Figure 2 for RWKV: Reinventing RNNs for the Transformer Era
Figure 3 for RWKV: Reinventing RNNs for the Transformer Era
Figure 4 for RWKV: Reinventing RNNs for the Transformer Era
Viaarxiv icon

Improving Graph Property Prediction with Generalized Readout Functions

Add code
Bookmark button
Alert button
Sep 21, 2020
Eric Alcaide

Figure 1 for Improving Graph Property Prediction with Generalized Readout Functions
Figure 2 for Improving Graph Property Prediction with Generalized Readout Functions
Figure 3 for Improving Graph Property Prediction with Generalized Readout Functions
Viaarxiv icon

E-swish: Adjusting Activations to Different Network Depths

Add code
Bookmark button
Alert button
Jan 22, 2018
Eric Alcaide

Figure 1 for E-swish: Adjusting Activations to Different Network Depths
Figure 2 for E-swish: Adjusting Activations to Different Network Depths
Figure 3 for E-swish: Adjusting Activations to Different Network Depths
Figure 4 for E-swish: Adjusting Activations to Different Network Depths
Viaarxiv icon