Alert button
Picture for Joshua Susskind

Joshua Susskind

Alert button

Vanishing Gradients in Reinforcement Finetuning of Language Models

Add code
Bookmark button
Alert button
Oct 31, 2023
Noam Razin, Hattie Zhou, Omid Saremi, Vimal Thilak, Arwen Bradley, Preetum Nakkiran, Joshua Susskind, Etai Littwin

Viaarxiv icon

When can transformers reason with abstract symbols?

Add code
Bookmark button
Alert button
Oct 15, 2023
Enric Boix-Adsera, Omid Saremi, Emmanuel Abbe, Samy Bengio, Etai Littwin, Joshua Susskind

Figure 1 for When can transformers reason with abstract symbols?
Figure 2 for When can transformers reason with abstract symbols?
Figure 3 for When can transformers reason with abstract symbols?
Figure 4 for When can transformers reason with abstract symbols?
Viaarxiv icon

Transformers learn through gradual rank increase

Add code
Bookmark button
Alert button
Jun 12, 2023
Enric Boix-Adsera, Etai Littwin, Emmanuel Abbe, Samy Bengio, Joshua Susskind

Figure 1 for Transformers learn through gradual rank increase
Figure 2 for Transformers learn through gradual rank increase
Figure 3 for Transformers learn through gradual rank increase
Figure 4 for Transformers learn through gradual rank increase
Viaarxiv icon

Position Prediction as an Effective Pretraining Strategy

Add code
Bookmark button
Alert button
Jul 15, 2022
Shuangfei Zhai, Navdeep Jaitly, Jason Ramapuram, Dan Busbridge, Tatiana Likhomanenko, Joseph Yitan Cheng, Walter Talbott, Chen Huang, Hanlin Goh, Joshua Susskind

Figure 1 for Position Prediction as an Effective Pretraining Strategy
Figure 2 for Position Prediction as an Effective Pretraining Strategy
Figure 3 for Position Prediction as an Effective Pretraining Strategy
Figure 4 for Position Prediction as an Effective Pretraining Strategy
Viaarxiv icon

The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon

Add code
Bookmark button
Alert button
Jun 13, 2022
Vimal Thilak, Etai Littwin, Shuangfei Zhai, Omid Saremi, Roni Paiss, Joshua Susskind

Figure 1 for The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon
Figure 2 for The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon
Figure 3 for The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon
Figure 4 for The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon
Viaarxiv icon

Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation

Add code
Bookmark button
Alert button
Jan 28, 2022
Martin Bertran, Walter Talbott, Nitish Srivastava, Joshua Susskind

Viaarxiv icon

Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
May 17, 2021
Yue Wu, Shuangfei Zhai, Nitish Srivastava, Joshua Susskind, Jian Zhang, Ruslan Salakhutdinov, Hanlin Goh

Figure 1 for Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Figure 2 for Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Figure 3 for Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Figure 4 for Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Viaarxiv icon

Collegial Ensembles

Add code
Bookmark button
Alert button
Jun 17, 2020
Etai Littwin, Ben Myara, Sima Sabah, Joshua Susskind, Shuangfei Zhai, Oren Golan

Figure 1 for Collegial Ensembles
Figure 2 for Collegial Ensembles
Figure 3 for Collegial Ensembles
Figure 4 for Collegial Ensembles
Viaarxiv icon