Colin Raffel

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

Apr 08, 2024
Bowen Pan, Yikang Shen, Haokun Liu, Mayank Mishra, Gaoyuan Zhang, Aude Oliva, Colin Raffel, Rameswar Panda

Via arXiv

A Survey on Data Selection for Language Models

Mar 08, 2024
Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre, Nathan Lambert, Xinyi Wang, Niklas Muennighoff, Bairu Hou, Liangming Pan, Haewon Jeong, Colin Raffel, Shiyu Chang, Tatsunori Hashimoto, William Yang Wang

Via arXiv

DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows

Feb 16, 2024
Ajay Patel, Colin Raffel, Chris Callison-Burch

Via arXiv

Learning to Route Among Specialized Experts for Zero-Shot Generalization

Feb 08, 2024
Mohammed Muqeeth, Haokun Liu, Yufan Liu, Colin Raffel

Via arXiv

Distributed Inference and Fine-tuning of Large Language Models Over The Internet

Dec 13, 2023
Alexander Borzunov, Max Ryabinin, Artem Chumachenko, Dmitry Baranchuk, Tim Dettmers, Younes Belkada, Pavel Samygin, Colin Raffel

Via arXiv

Merging by Matching Models in Task Subspaces

Dec 07, 2023
Derek Tam, Mohit Bansal, Colin Raffel

Via arXiv

Efficient Online Data Mixing For Language Model Pre-Training

Dec 05, 2023
Alon Albalak, Liangming Pan, Colin Raffel, William Yang Wang

Via arXiv

ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization

Nov 22, 2023
Prateek Yadav, Leshem Choshen, Colin Raffel, Mohit Bansal

Via arXiv

Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

Oct 26, 2023
Haikang Deng, Colin Raffel

Via arXiv