Alert button
Picture for Wojciech Jaśkowski

Wojciech Jaśkowski

Alert button

How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization

Add code
Bookmark button
Alert button
Apr 29, 2020
Pierluca D'Oro, Wojciech Jaśkowski

Figure 1 for How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
Figure 2 for How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
Figure 3 for How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
Figure 4 for How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
Viaarxiv icon

Training Agents using Upside-Down Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 05, 2019
Rupesh Kumar Srivastava, Pranav Shyam, Filipe Mutz, Wojciech Jaśkowski, Jürgen Schmidhuber

Figure 1 for Training Agents using Upside-Down Reinforcement Learning
Figure 2 for Training Agents using Upside-Down Reinforcement Learning
Figure 3 for Training Agents using Upside-Down Reinforcement Learning
Figure 4 for Training Agents using Upside-Down Reinforcement Learning
Viaarxiv icon

Artificial Intelligence for Prosthetics - challenge solutions

Add code
Bookmark button
Alert button
Feb 07, 2019
Łukasz Kidziński, Carmichael Ong, Sharada Prasanna Mohanty, Jennifer Hicks, Sean F. Carroll, Bo Zhou, Hongsheng Zeng, Fan Wang, Rongzhong Lian, Hao Tian, Wojciech Jaśkowski, Garrett Andersen, Odd Rune Lykkebø, Nihat Engin Toklu, Pranav Shyam, Rupesh Kumar Srivastava, Sergey Kolesnikov, Oleksii Hrinchuk, Anton Pechenko, Mattias Ljungström, Zhen Wang, Xu Hu, Zehong Hu, Minghui Qiu, Jun Huang, Aleksei Shpilman, Ivan Sosin, Oleg Svidchenko, Aleksandra Malysheva, Daniel Kudenko, Lance Rane, Aditya Bhatt, Zhengfei Wang, Penghui Qi, Zeyang Yu, Peng Peng, Quan Yuan, Wenxin Li, Yunsheng Tian, Ruihan Yang, Pingchuan Ma, Shauharda Khadka, Somdeb Majumdar, Zach Dwiel, Yinyin Liu, Evren Tumer, Jeremy Watson, Marcel Salathé, Sergey Levine, Scott Delp

Figure 1 for Artificial Intelligence for Prosthetics - challenge solutions
Figure 2 for Artificial Intelligence for Prosthetics - challenge solutions
Figure 3 for Artificial Intelligence for Prosthetics - challenge solutions
Figure 4 for Artificial Intelligence for Prosthetics - challenge solutions
Viaarxiv icon

Model-Based Active Exploration

Add code
Bookmark button
Alert button
Oct 29, 2018
Pranav Shyam, Wojciech Jaśkowski, Faustino Gomez

Figure 1 for Model-Based Active Exploration
Figure 2 for Model-Based Active Exploration
Figure 3 for Model-Based Active Exploration
Figure 4 for Model-Based Active Exploration
Viaarxiv icon

ViZDoom Competitions: Playing Doom from Pixels

Add code
Bookmark button
Alert button
Sep 10, 2018
Marek Wydmuch, Michał Kempka, Wojciech Jaśkowski

Figure 1 for ViZDoom Competitions: Playing Doom from Pixels
Figure 2 for ViZDoom Competitions: Playing Doom from Pixels
Figure 3 for ViZDoom Competitions: Playing Doom from Pixels
Figure 4 for ViZDoom Competitions: Playing Doom from Pixels
Viaarxiv icon

Learning to Play Othello with Deep Neural Networks

Add code
Bookmark button
Alert button
Nov 17, 2017
Paweł Liskowski, Wojciech Jaśkowski, Krzysztof Krawiec

Figure 1 for Learning to Play Othello with Deep Neural Networks
Figure 2 for Learning to Play Othello with Deep Neural Networks
Figure 3 for Learning to Play Othello with Deep Neural Networks
Figure 4 for Learning to Play Othello with Deep Neural Networks
Viaarxiv icon

Mastering 2048 with Delayed Temporal Coherence Learning, Multi-Stage Weight Promotion, Redundant Encoding and Carousel Shaping

Add code
Bookmark button
Alert button
Dec 12, 2016
Wojciech Jaśkowski

Figure 1 for Mastering 2048 with Delayed Temporal Coherence Learning, Multi-Stage Weight Promotion, Redundant Encoding and Carousel Shaping
Figure 2 for Mastering 2048 with Delayed Temporal Coherence Learning, Multi-Stage Weight Promotion, Redundant Encoding and Carousel Shaping
Figure 3 for Mastering 2048 with Delayed Temporal Coherence Learning, Multi-Stage Weight Promotion, Redundant Encoding and Carousel Shaping
Figure 4 for Mastering 2048 with Delayed Temporal Coherence Learning, Multi-Stage Weight Promotion, Redundant Encoding and Carousel Shaping
Viaarxiv icon

ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 20, 2016
Michał Kempka, Marek Wydmuch, Grzegorz Runc, Jakub Toczek, Wojciech Jaśkowski

Figure 1 for ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning
Figure 2 for ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning
Figure 3 for ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning
Figure 4 for ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning
Viaarxiv icon

Systematic N-tuple Networks for Position Evaluation: Exceeding 90% in the Othello League

Add code
Bookmark button
Alert button
Jun 25, 2014
Wojciech Jaśkowski

Figure 1 for Systematic N-tuple Networks for Position Evaluation: Exceeding 90% in the Othello League
Figure 2 for Systematic N-tuple Networks for Position Evaluation: Exceeding 90% in the Othello League
Figure 3 for Systematic N-tuple Networks for Position Evaluation: Exceeding 90% in the Othello League
Figure 4 for Systematic N-tuple Networks for Position Evaluation: Exceeding 90% in the Othello League
Viaarxiv icon