Alert button
Picture for Christopher Re

Christopher Re

Alert button

Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions

Oct 28, 2023
Stefano Massaroli, Michael Poli, Daniel Y. Fu, Hermann Kumbong, Rom N. Parnichkun, Aman Timalsina, David W. Romero, Quinn McIntyre, Beidi Chen, Atri Rudra, Ce Zhang, Christopher Re, Stefano Ermon, Yoshua Bengio

Figure 1 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 2 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 3 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 4 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Viaarxiv icon

Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

Oct 26, 2023
Zichang Liu, Jue Wang, Tri Dao, Tianyi Zhou, Binhang Yuan, Zhao Song, Anshumali Shrivastava, Ce Zhang, Yuandong Tian, Christopher Re, Beidi Chen

Figure 1 for Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Figure 2 for Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Figure 3 for Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Figure 4 for Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Viaarxiv icon

Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees

Jun 02, 2022
Jue Wang, Binhang Yuan, Luka Rimanic, Yongjun He, Tri Dao, Beidi Chen, Christopher Re, Ce Zhang

Figure 1 for Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees
Figure 2 for Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees
Figure 3 for Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees
Figure 4 for Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees
Viaarxiv icon

Decentralized Training of Foundation Models in Heterogeneous Environments

Jun 02, 2022
Binhang Yuan, Yongjun He, Jared Quincy Davis, Tianyi Zhang, Tri Dao, Beidi Chen, Percy Liang, Christopher Re, Ce Zhang

Figure 1 for Decentralized Training of Foundation Models in Heterogeneous Environments
Figure 2 for Decentralized Training of Foundation Models in Heterogeneous Environments
Figure 3 for Decentralized Training of Foundation Models in Heterogeneous Environments
Figure 4 for Decentralized Training of Foundation Models in Heterogeneous Environments
Viaarxiv icon

Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models

Nov 30, 2021
Beidi Chen, Tri Dao, Kaizhao Liang, Jiaming Yang, Zhao Song, Atri Rudra, Christopher Re

Figure 1 for Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
Figure 2 for Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
Figure 3 for Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
Figure 4 for Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
Viaarxiv icon

Metadata Shaping: Natural Language Annotations for the Tail

Oct 16, 2021
Simran Arora, Sen Wu, Enci Liu, Christopher Re

Figure 1 for Metadata Shaping: Natural Language Annotations for the Tail
Figure 2 for Metadata Shaping: Natural Language Annotations for the Tail
Figure 3 for Metadata Shaping: Natural Language Annotations for the Tail
Figure 4 for Metadata Shaping: Natural Language Annotations for the Tail
Viaarxiv icon

Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation

Oct 23, 2020
Laurel Orr, Megan Leszczynski, Simran Arora, Sen Wu, Neel Guha, Xiao Ling, Christopher Re

Figure 1 for Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation
Figure 2 for Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation
Figure 3 for Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation
Figure 4 for Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation
Viaarxiv icon

Leveraging Organizational Resources to Adapt Models to New Data Modalities

Aug 23, 2020
Sahaana Suri, Raghuveer Chanda, Neslihan Bulut, Pradyumna Narayana, Yemao Zeng, Peter Bailis, Sugato Basu, Girija Narlikar, Christopher Re, Abishek Sethi

Figure 1 for Leveraging Organizational Resources to Adapt Models to New Data Modalities
Figure 2 for Leveraging Organizational Resources to Adapt Models to New Data Modalities
Figure 3 for Leveraging Organizational Resources to Adapt Models to New Data Modalities
Figure 4 for Leveraging Organizational Resources to Adapt Models to New Data Modalities
Viaarxiv icon