Vivek Veeriah

Diversifying AI: Towards Creative Chess with AlphaZero

Aug 29, 2023
Via arXiv

ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs

Feb 02, 2023
Via arXiv

GrASP: Gradient-Based Affordance Selection for Planning

Feb 08, 2022
Via arXiv

Discovery of Options via Meta-Learned Subgoals

Feb 12, 2021
Via arXiv

Learning State Representations from Random Deep Action-conditional Predictions

Feb 09, 2021
Via arXiv

Learning Retrospective Knowledge with Reverse Reinforcement Learning

Jul 09, 2020
Via arXiv

Self-Tuning Deep Reinforcement Learning

Mar 02, 2020
Via arXiv

How Should an Agent Practice?

Dec 15, 2019
Via arXiv

Discovery of Useful Questions as Auxiliary Tasks

Sep 10, 2019
Via arXiv

Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning

Mar 08, 2019
Via arXiv