Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Mar 28, 2022

View paper on arXiv