Oleg Klimov

Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft

Jun 28, 2021
Ingmar Kanitscheider, Joost Huizinga, David Farhi, William Hebgen Guss, Brandon Houghton, Raul Sampedro, Peter Zhokhov, Bowen Baker, Adrien Ecoffet, Jie Tang, Oleg Klimov, Jeff Clune

Phasic Policy Gradient

Sep 09, 2020
Karl Cobbe, Jacob Hilton, Oleg Klimov, John Schulman

Quantifying Generalization in Reinforcement Learning

Dec 20, 2018
Karl Cobbe, Oleg Klimov, Chris Hesse, Taehoon Kim, John Schulman

Exploration by Random Network Distillation

Oct 30, 2018
Yuri Burda, Harrison Edwards, Amos Storkey, Oleg Klimov

Gotta Learn Fast: A New Benchmark for Generalization in RL

Apr 23, 2018
Alex Nichol, Vicki Pfau, Christopher Hesse, Oleg Klimov, John Schulman

Proximal Policy Optimization Algorithms

Aug 28, 2017
John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov
