Alert button
Picture for James Martens

James Martens

Alert button

Dj

Disentangling the Causes of Plasticity Loss in Neural Networks

Add code
Bookmark button
Alert button
Feb 29, 2024
Clare Lyle, Zeyu Zheng, Khimya Khetarpal, Hado van Hasselt, Razvan Pascanu, James Martens, Will Dabney

Viaarxiv icon

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation

Add code
Bookmark button
Alert button
Feb 20, 2023
Bobby He, James Martens, Guodong Zhang, Aleksandar Botev, Andrew Brock, Samuel L Smith, Yee Whye Teh

Figure 1 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Figure 2 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Figure 3 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Figure 4 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Viaarxiv icon

Pre-training via Denoising for Molecular Property Prediction

Add code
Bookmark button
Alert button
May 31, 2022
Sheheryar Zaidi, Michael Schaarschmidt, James Martens, Hyunjik Kim, Yee Whye Teh, Alvaro Sanchez-Gonzalez, Peter Battaglia, Razvan Pascanu, Jonathan Godwin

Figure 1 for Pre-training via Denoising for Molecular Property Prediction
Figure 2 for Pre-training via Denoising for Molecular Property Prediction
Figure 3 for Pre-training via Denoising for Molecular Property Prediction
Figure 4 for Pre-training via Denoising for Molecular Property Prediction
Viaarxiv icon

Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers

Add code
Bookmark button
Alert button
Mar 15, 2022
Guodong Zhang, Aleksandar Botev, James Martens

Figure 1 for Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers
Figure 2 for Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers
Figure 3 for Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers
Figure 4 for Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers
Viaarxiv icon

Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping

Add code
Bookmark button
Alert button
Oct 05, 2021
James Martens, Andy Ballard, Guillaume Desjardins, Grzegorz Swirszcz, Valentin Dalibard, Jascha Sohl-Dickstein, Samuel S. Schoenholz

Viaarxiv icon

On the validity of kernel approximations for orthogonally-initialized neural networks

Add code
Bookmark button
Alert button
Apr 13, 2021
James Martens

Viaarxiv icon

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model

Add code
Bookmark button
Alert button
Jul 09, 2019
Guodong Zhang, Lala Li, Zachary Nado, James Martens, Sushant Sachdeva, George E. Dahl, Christopher J. Shallue, Roger Grosse

Figure 1 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 2 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 3 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 4 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Viaarxiv icon

Adversarial Robustness through Local Linearization

Add code
Bookmark button
Alert button
Jul 04, 2019
Chongli Qin, James Martens, Sven Gowal, Dilip Krishnan, Krishnamurthy, Dvijotham, Alhussein Fawzi, Soham De, Robert Stanforth, Pushmeet Kohli

Figure 1 for Adversarial Robustness through Local Linearization
Figure 2 for Adversarial Robustness through Local Linearization
Figure 3 for Adversarial Robustness through Local Linearization
Figure 4 for Adversarial Robustness through Local Linearization
Viaarxiv icon

Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks

Add code
Bookmark button
Alert button
May 27, 2019
Guodong Zhang, James Martens, Roger Grosse

Figure 1 for Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks
Viaarxiv icon

Differentiable Game Mechanics

Add code
Bookmark button
Alert button
May 13, 2019
Alistair Letcher, David Balduzzi, Sebastien Racaniere, James Martens, Jakob Foerster, Karl Tuyls, Thore Graepel

Figure 1 for Differentiable Game Mechanics
Figure 2 for Differentiable Game Mechanics
Figure 3 for Differentiable Game Mechanics
Figure 4 for Differentiable Game Mechanics
Viaarxiv icon