
Shahar Katz

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space

Feb 20, 2024
Via arXiv

Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT

May 22, 2023
Via arXiv