Alert button
Picture for Aitor Lewkowycz

Aitor Lewkowycz

Alert button

Show Your Work: Scratchpads for Intermediate Computation with Language Models

Add code
Bookmark button
Alert button
Nov 30, 2021
Maxwell Nye, Anders Johan Andreassen, Guy Gur-Ari, Henryk Michalewski, Jacob Austin, David Bieber, David Dohan, Aitor Lewkowycz, Maarten Bosma, David Luan, Charles Sutton, Augustus Odena

Figure 1 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Figure 2 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Figure 3 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Figure 4 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Viaarxiv icon

How to decay your learning rate

Add code
Bookmark button
Alert button
Mar 23, 2021
Aitor Lewkowycz

Figure 1 for How to decay your learning rate
Figure 2 for How to decay your learning rate
Figure 3 for How to decay your learning rate
Figure 4 for How to decay your learning rate
Viaarxiv icon

On the training dynamics of deep networks with $L_2$ regularization

Add code
Bookmark button
Alert button
Jun 15, 2020
Aitor Lewkowycz, Guy Gur-Ari

Figure 1 for On the training dynamics of deep networks with $L_2$ regularization
Figure 2 for On the training dynamics of deep networks with $L_2$ regularization
Figure 3 for On the training dynamics of deep networks with $L_2$ regularization
Figure 4 for On the training dynamics of deep networks with $L_2$ regularization
Viaarxiv icon

The large learning rate phase of deep learning: the catapult mechanism

Add code
Bookmark button
Alert button
Mar 04, 2020
Aitor Lewkowycz, Yasaman Bahri, Ethan Dyer, Jascha Sohl-Dickstein, Guy Gur-Ari

Figure 1 for The large learning rate phase of deep learning: the catapult mechanism
Figure 2 for The large learning rate phase of deep learning: the catapult mechanism
Figure 3 for The large learning rate phase of deep learning: the catapult mechanism
Figure 4 for The large learning rate phase of deep learning: the catapult mechanism
Viaarxiv icon