Geometric Dynamics of Signal Propagation Predict Trainability of Transformers

Mar 05, 2024
Aditya Cowsik, Tamra Nebabu, Xiao-Liang Qi, Surya Ganguli

[Figures 1-4 for Geometric Dynamics of Signal Propagation Predict Trainability of Transformers]
