Alert button
Picture for Alon Albalak

Alon Albalak

Alert button

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Add code
Bookmark button
Alert button
Apr 10, 2024
Bo Peng, Daniel Goldstein, Quentin Anthony, Alon Albalak, Eric Alcaide, Stella Biderman, Eugene Cheah, Xingjian Du, Teddy Ferdinan, Haowen Hou, Przemysław Kazienko, Kranthi Kiran GV, Jan Kocoń, Bartłomiej Koptyra, Satyapriya Krishna, Ronald McClelland Jr., Niklas Muennighoff, Fares Obeid, Atsushi Saito, Guangyu Song, Haoqin Tu, Stanisław Woźniak, Ruichong Zhang, Bingchen Zhao, Qihang Zhao, Peng Zhou, Jian Zhu, Rui-Jie Zhu

Viaarxiv icon

A Survey on Data Selection for Language Models

Add code
Bookmark button
Alert button
Mar 08, 2024
Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre, Nathan Lambert, Xinyi Wang, Niklas Muennighoff, Bairu Hou, Liangming Pan, Haewon Jeong, Colin Raffel, Shiyu Chang, Tatsunori Hashimoto, William Yang Wang

Viaarxiv icon

Efficient Online Data Mixing For Language Model Pre-Training

Add code
Bookmark button
Alert button
Dec 05, 2023
Alon Albalak, Liangming Pan, Colin Raffel, William Yang Wang

Viaarxiv icon

RWKV: Reinventing RNNs for the Transformer Era

Add code
Bookmark button
Alert button
May 22, 2023
Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran GV, Xuzheng He, Haowen Hou, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Xiangru Tang, Bolun Wang, Johan S. Wind, Stansilaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Jian Zhu, Rui-Jie Zhu

Figure 1 for RWKV: Reinventing RNNs for the Transformer Era
Figure 2 for RWKV: Reinventing RNNs for the Transformer Era
Figure 3 for RWKV: Reinventing RNNs for the Transformer Era
Figure 4 for RWKV: Reinventing RNNs for the Transformer Era
Viaarxiv icon

Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning

Add code
Bookmark button
Alert button
May 20, 2023
Liangming Pan, Alon Albalak, Xinyi Wang, William Yang Wang

Figure 1 for Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning
Figure 2 for Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning
Figure 3 for Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning
Figure 4 for Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning
Viaarxiv icon

Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data

Add code
Bookmark button
Alert button
Feb 06, 2023
Alon Albalak, Colin Raffel, William Yang Wang

Figure 1 for Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
Figure 2 for Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
Figure 3 for Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
Figure 4 for Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
Viaarxiv icon

CausalDialogue: Modeling Utterance-level Causality in Conversations

Add code
Bookmark button
Alert button
Dec 20, 2022
Yi-Lin Tuan, Alon Albalak, Wenda Xu, Michael Saxon, Connor Pryor, Lise Getoor, William Yang Wang

Figure 1 for CausalDialogue: Modeling Utterance-level Causality in Conversations
Figure 2 for CausalDialogue: Modeling Utterance-level Causality in Conversations
Figure 3 for CausalDialogue: Modeling Utterance-level Causality in Conversations
Figure 4 for CausalDialogue: Modeling Utterance-level Causality in Conversations
Viaarxiv icon

Data-Efficiency with a Single GPU: An Exploration of Transfer Methods for Small Language Models

Add code
Bookmark button
Alert button
Oct 08, 2022
Alon Albalak, Akshat Shrivastava, Chinnadhurai Sankar, Adithya Sagar, Mike Ross

Figure 1 for Data-Efficiency with a Single GPU: An Exploration of Transfer Methods for Small Language Models
Figure 2 for Data-Efficiency with a Single GPU: An Exploration of Transfer Methods for Small Language Models
Figure 3 for Data-Efficiency with a Single GPU: An Exploration of Transfer Methods for Small Language Models
Figure 4 for Data-Efficiency with a Single GPU: An Exploration of Transfer Methods for Small Language Models
Viaarxiv icon