Hattie Zhou

Vanishing Gradients in Reinforcement Finetuning of Language Models

Oct 31, 2023
Noam Razin, Hattie Zhou, Omid Saremi, Vimal Thilak, Arwen Bradley, Preetum Nakkiran, Joshua Susskind, Etai Littwin

What Algorithms can Transformers Learn? A Study in Length Generalization

Oct 24, 2023
Hattie Zhou, Arwen Bradley, Etai Littwin, Noam Razin, Omid Saremi, Josh Susskind, Samy Bengio, Preetum Nakkiran

Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok

Jun 23, 2023
Pascal Jr. Tikeng Notsawo, Hattie Zhou, Mohammad Pezeshki, Irina Rish, Guillaume Dumas

Teaching Algorithmic Reasoning via In-context Learning

Nov 15, 2022
Hattie Zhou, Azade Nova, Hugo Larochelle, Aaron Courville, Behnam Neyshabur, Hanie Sedghi

Fortuitous Forgetting in Connectionist Networks

Feb 01, 2022
Hattie Zhou, Ankit Vani, Hugo Larochelle, Aaron Courville

LCA: Loss Change Allocation for Neural Network Training

Sep 03, 2019
Janice Lan, Rosanne Liu, Hattie Zhou, Jason Yosinski

Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask

May 03, 2019
Hattie Zhou, Janice Lan, Rosanne Liu, Jason Yosinski
