Alert button

Parallel Attention and Feed-Forward Net Design for Pre-training and Inference on Transformers

May 22, 2023
Shashank Sonkar, Richard G. Baraniuk

Figure 1 for Parallel Attention and Feed-Forward Net Design for Pre-training and Inference on Transformers
Figure 2 for Parallel Attention and Feed-Forward Net Design for Pre-training and Inference on Transformers

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: