Alert button
Picture for Iurii Kemaev

Iurii Kemaev

Alert button

Return-based Scaling: Yet Another Normalisation Trick for Deep RL

May 11, 2021
Tom Schaul, Georg Ostrovski, Iurii Kemaev, Diana Borsa

Figure 1 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 2 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 3 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 4 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Viaarxiv icon

Podracer architectures for scalable Reinforcement Learning

Apr 13, 2021
Matteo Hessel, Manuel Kroiss, Aidan Clark, Iurii Kemaev, John Quan, Thomas Keck, Fabio Viola, Hado van Hasselt

Figure 1 for Podracer architectures for scalable Reinforcement Learning
Figure 2 for Podracer architectures for scalable Reinforcement Learning
Figure 3 for Podracer architectures for scalable Reinforcement Learning
Figure 4 for Podracer architectures for scalable Reinforcement Learning
Viaarxiv icon

Discovery of Options via Meta-Learned Subgoals

Feb 12, 2021
Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Figure 1 for Discovery of Options via Meta-Learned Subgoals
Figure 2 for Discovery of Options via Meta-Learned Subgoals
Figure 3 for Discovery of Options via Meta-Learned Subgoals
Figure 4 for Discovery of Options via Meta-Learned Subgoals
Viaarxiv icon

Discovering a set of policies for the worst case reward

Feb 08, 2021
Tom Zahavy, Andre Barreto, Daniel J Mankowitz, Shaobo Hou, Brendan O'Donoghue, Iurii Kemaev, Satinder Baveja Singh

Figure 1 for Discovering a set of policies for the worst case reward
Figure 2 for Discovering a set of policies for the worst case reward
Figure 3 for Discovering a set of policies for the worst case reward
Figure 4 for Discovering a set of policies for the worst case reward
Viaarxiv icon

ReSet: Learning Recurrent Dynamic Routing in ResNet-like Neural Networks

Nov 11, 2018
Iurii Kemaev, Daniil Polykovskiy, Dmitry Vetrov

Figure 1 for ReSet: Learning Recurrent Dynamic Routing in ResNet-like Neural Networks
Figure 2 for ReSet: Learning Recurrent Dynamic Routing in ResNet-like Neural Networks
Figure 3 for ReSet: Learning Recurrent Dynamic Routing in ResNet-like Neural Networks
Figure 4 for ReSet: Learning Recurrent Dynamic Routing in ResNet-like Neural Networks
Viaarxiv icon