Alert button
Picture for Siddhant M. Jayakumar

Siddhant M. Jayakumar

Alert button

Powerpropagation: A sparsity inducing weight reparameterisation

Add code
Bookmark button
Alert button
Oct 06, 2021
Jonathan Schwarz, Siddhant M. Jayakumar, Razvan Pascanu, Peter E. Latham, Yee Whye Teh

Figure 1 for Powerpropagation: A sparsity inducing weight reparameterisation
Figure 2 for Powerpropagation: A sparsity inducing weight reparameterisation
Figure 3 for Powerpropagation: A sparsity inducing weight reparameterisation
Figure 4 for Powerpropagation: A sparsity inducing weight reparameterisation
Viaarxiv icon

Top-KAST: Top-K Always Sparse Training

Add code
Bookmark button
Alert button
Jun 07, 2021
Siddhant M. Jayakumar, Razvan Pascanu, Jack W. Rae, Simon Osindero, Erich Elsen

Figure 1 for Top-KAST: Top-K Always Sparse Training
Figure 2 for Top-KAST: Top-K Always Sparse Training
Figure 3 for Top-KAST: Top-K Always Sparse Training
Figure 4 for Top-KAST: Top-K Always Sparse Training
Viaarxiv icon

Perception-Prediction-Reaction Agents for Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 26, 2020
Adam Stooke, Valentin Dalibard, Siddhant M. Jayakumar, Wojciech M. Czarnecki, Max Jaderberg

Figure 1 for Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Figure 2 for Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Figure 3 for Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Figure 4 for Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Viaarxiv icon

Compressive Transformers for Long-Range Sequence Modelling

Add code
Bookmark button
Alert button
Nov 13, 2019
Jack W. Rae, Anna Potapenko, Siddhant M. Jayakumar, Timothy P. Lillicrap

Figure 1 for Compressive Transformers for Long-Range Sequence Modelling
Figure 2 for Compressive Transformers for Long-Range Sequence Modelling
Figure 3 for Compressive Transformers for Long-Range Sequence Modelling
Figure 4 for Compressive Transformers for Long-Range Sequence Modelling
Viaarxiv icon

Stabilizing Transformers for Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 13, 2019
Emilio Parisotto, H. Francis Song, Jack W. Rae, Razvan Pascanu, Caglar Gulcehre, Siddhant M. Jayakumar, Max Jaderberg, Raphael Lopez Kaufman, Aidan Clark, Seb Noury, Matthew M. Botvinick, Nicolas Heess, Raia Hadsell

Figure 1 for Stabilizing Transformers for Reinforcement Learning
Figure 2 for Stabilizing Transformers for Reinforcement Learning
Figure 3 for Stabilizing Transformers for Reinforcement Learning
Figure 4 for Stabilizing Transformers for Reinforcement Learning
Viaarxiv icon

Meta-learning of Sequential Strategies

Add code
Bookmark button
Alert button
May 08, 2019
Pedro A. Ortega, Jane X. Wang, Mark Rowland, Tim Genewein, Zeb Kurth-Nelson, Razvan Pascanu, Nicolas Heess, Joel Veness, Alex Pritzel, Pablo Sprechmann, Siddhant M. Jayakumar, Tom McGrath, Kevin Miller, Mohammad Azar, Ian Osband, Neil Rabinowitz, András György, Silvia Chiappa, Simon Osindero, Yee Whye Teh, Hado van Hasselt, Nando de Freitas, Matthew Botvinick, Shane Legg

Figure 1 for Meta-learning of Sequential Strategies
Figure 2 for Meta-learning of Sequential Strategies
Figure 3 for Meta-learning of Sequential Strategies
Figure 4 for Meta-learning of Sequential Strategies
Viaarxiv icon

Information asymmetry in KL-regularized RL

Add code
Bookmark button
Alert button
May 03, 2019
Alexandre Galashov, Siddhant M. Jayakumar, Leonard Hasenclever, Dhruva Tirumala, Jonathan Schwarz, Guillaume Desjardins, Wojciech M. Czarnecki, Yee Whye Teh, Razvan Pascanu, Nicolas Heess

Figure 1 for Information asymmetry in KL-regularized RL
Figure 2 for Information asymmetry in KL-regularized RL
Figure 3 for Information asymmetry in KL-regularized RL
Figure 4 for Information asymmetry in KL-regularized RL
Viaarxiv icon

Distilling Policy Distillation

Add code
Bookmark button
Alert button
Feb 06, 2019
Wojciech Marian Czarnecki, Razvan Pascanu, Simon Osindero, Siddhant M. Jayakumar, Grzegorz Swirszcz, Max Jaderberg

Figure 1 for Distilling Policy Distillation
Figure 2 for Distilling Policy Distillation
Figure 3 for Distilling Policy Distillation
Figure 4 for Distilling Policy Distillation
Viaarxiv icon

Adapting Auxiliary Losses Using Gradient Similarity

Add code
Bookmark button
Alert button
Dec 05, 2018
Yunshu Du, Wojciech M. Czarnecki, Siddhant M. Jayakumar, Razvan Pascanu, Balaji Lakshminarayanan

Figure 1 for Adapting Auxiliary Losses Using Gradient Similarity
Figure 2 for Adapting Auxiliary Losses Using Gradient Similarity
Figure 3 for Adapting Auxiliary Losses Using Gradient Similarity
Figure 4 for Adapting Auxiliary Losses Using Gradient Similarity
Viaarxiv icon