Picture for Ruisi Cai

Ruisi Cai

Flextron: Many-in-One Flexible Large Language Model

Add code
Jun 11, 2024
Viaarxiv icon

LoCoCo: Dropping In Convolutions for Long Context Compression

Add code
Jun 08, 2024
Viaarxiv icon

Robust Mixture-of-Expert Training for Convolutional Neural Networks

Add code
Aug 19, 2023
Figure 1 for Robust Mixture-of-Expert Training for Convolutional Neural Networks
Figure 2 for Robust Mixture-of-Expert Training for Convolutional Neural Networks
Figure 3 for Robust Mixture-of-Expert Training for Convolutional Neural Networks
Figure 4 for Robust Mixture-of-Expert Training for Convolutional Neural Networks
Viaarxiv icon

H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Add code
Jul 19, 2023
Figure 1 for H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Figure 2 for H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Figure 3 for H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Figure 4 for H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Viaarxiv icon

Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?

Add code
Feb 24, 2023
Figure 1 for Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?
Figure 2 for Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?
Figure 3 for Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?
Figure 4 for Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?
Viaarxiv icon