Tamra Nebabu

Geometric Dynamics of Signal Propagation Predict Trainability of Transformers

Mar 05, 2024
Aditya Cowsik, Tamra Nebabu, Xiao-Liang Qi, Surya Ganguli

Via arXiv