Vivek Veeriah

Diversifying AI: Towards Creative Chess with AlphaZero

Aug 29, 2023
Tom Zahavy, Vivek Veeriah, Shaobo Hou, Kevin Waugh, Matthew Lai, Edouard Leurent, Nenad Tomasev, Lisa Schut, Demis Hassabis, Satinder Singh

ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs

Feb 02, 2023
Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh, Tom Zahavy

GrASP: Gradient-Based Affordance Selection for Planning

Feb 08, 2022
Vivek Veeriah, Zeyu Zheng, Richard Lewis, Satinder Singh

Discovery of Options via Meta-Learned Subgoals

Feb 12, 2021
Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Learning State Representations from Random Deep Action-conditional Predictions

Feb 09, 2021
Zeyu Zheng, Vivek Veeriah, Risto Vuorio, Richard Lewis, Satinder Singh

Learning Retrospective Knowledge with Reverse Reinforcement Learning

Jul 09, 2020
Shangtong Zhang, Vivek Veeriah, Shimon Whiteson

Self-Tuning Deep Reinforcement Learning

Mar 02, 2020
Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh

How Should an Agent Practice?

Dec 15, 2019
Janarthanan Rajendran, Richard Lewis, Vivek Veeriah, Honglak Lee, Satinder Singh
