Alert button
Picture for Szymon Sidor

Szymon Sidor

Alert button

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Add code
Bookmark button
Alert button
Mar 28, 2022
Greg Yang, Edward J. Hu, Igor Babuschkin, Szymon Sidor, Xiaodong Liu, David Farhi, Nick Ryder, Jakub Pachocki, Weizhu Chen, Jianfeng Gao

Figure 1 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 2 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 3 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 4 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Viaarxiv icon

Dota 2 with Large Scale Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 13, 2019
OpenAI, :, Christopher Berner, Greg Brockman, Brooke Chan, Vicki Cheung, Przemysław Dębiak, Christy Dennison, David Farhi, Quirin Fischer, Shariq Hashme, Chris Hesse, Rafal Józefowicz, Scott Gray, Catherine Olsson, Jakub Pachocki, Michael Petrov, Henrique Pondé de Oliveira Pinto, Jonathan Raiman, Tim Salimans, Jeremy Schlatter, Jonas Schneider, Szymon Sidor, Ilya Sutskever, Jie Tang, Filip Wolski, Susan Zhang

Figure 1 for Dota 2 with Large Scale Deep Reinforcement Learning
Figure 2 for Dota 2 with Large Scale Deep Reinforcement Learning
Figure 3 for Dota 2 with Large Scale Deep Reinforcement Learning
Figure 4 for Dota 2 with Large Scale Deep Reinforcement Learning
Viaarxiv icon

Learning Dexterous In-Hand Manipulation

Add code
Bookmark button
Alert button
Jan 18, 2019
OpenAI, Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafal Jozefowicz, Bob McGrew, Jakub Pachocki, Arthur Petron, Matthias Plappert, Glenn Powell, Alex Ray, Jonas Schneider, Szymon Sidor, Josh Tobin, Peter Welinder, Lilian Weng, Wojciech Zaremba

Figure 1 for Learning Dexterous In-Hand Manipulation
Figure 2 for Learning Dexterous In-Hand Manipulation
Figure 3 for Learning Dexterous In-Hand Manipulation
Figure 4 for Learning Dexterous In-Hand Manipulation
Viaarxiv icon

Emergent Complexity via Multi-Agent Competition

Add code
Bookmark button
Alert button
Mar 14, 2018
Trapit Bansal, Jakub Pachocki, Szymon Sidor, Ilya Sutskever, Igor Mordatch

Figure 1 for Emergent Complexity via Multi-Agent Competition
Figure 2 for Emergent Complexity via Multi-Agent Competition
Figure 3 for Emergent Complexity via Multi-Agent Competition
Figure 4 for Emergent Complexity via Multi-Agent Competition
Viaarxiv icon

Parameter Space Noise for Exploration

Add code
Bookmark button
Alert button
Jan 31, 2018
Matthias Plappert, Rein Houthooft, Prafulla Dhariwal, Szymon Sidor, Richard Y. Chen, Xi Chen, Tamim Asfour, Pieter Abbeel, Marcin Andrychowicz

Figure 1 for Parameter Space Noise for Exploration
Figure 2 for Parameter Space Noise for Exploration
Figure 3 for Parameter Space Noise for Exploration
Figure 4 for Parameter Space Noise for Exploration
Viaarxiv icon

UCB Exploration via Q-Ensembles

Add code
Bookmark button
Alert button
Nov 07, 2017
Richard Y. Chen, Szymon Sidor, Pieter Abbeel, John Schulman

Figure 1 for UCB Exploration via Q-Ensembles
Figure 2 for UCB Exploration via Q-Ensembles
Figure 3 for UCB Exploration via Q-Ensembles
Figure 4 for UCB Exploration via Q-Ensembles
Viaarxiv icon

Evolution Strategies as a Scalable Alternative to Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 07, 2017
Tim Salimans, Jonathan Ho, Xi Chen, Szymon Sidor, Ilya Sutskever

Figure 1 for Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Figure 2 for Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Figure 3 for Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Figure 4 for Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Viaarxiv icon

Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics

Add code
Bookmark button
Alert button
Aug 17, 2017
Ken Kansky, Tom Silver, David A. Mély, Mohamed Eldawy, Miguel Lázaro-Gredilla, Xinghua Lou, Nimrod Dorfman, Szymon Sidor, Scott Phoenix, Dileep George

Figure 1 for Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics
Figure 2 for Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics
Figure 3 for Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics
Figure 4 for Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics
Viaarxiv icon