Picture for Kevin Ro Wang

Kevin Ro Wang

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Add code
Mar 28, 2022
Figure 1 for Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space
Figure 2 for Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space
Figure 3 for Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space
Figure 4 for Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space
Viaarxiv icon