Matthew E. Taylor

University of Alberta, Alberta Machine Intelligence Institute

Multi-Agent Advisor Q-Learning

Nov 08, 2021

The Atari Data Scraper

Apr 11, 2021

The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning

Mar 07, 2021

Model-Invariant State Abstractions for Model-Based Reinforcement Learning

Feb 19, 2021

Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems

Feb 16, 2021

Improving Reinforcement Learning with Human Assistance: An Argument for Human Subject Studies with HIPPO Gym

Feb 02, 2021

HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging

Jan 18, 2021

Useful Policy Invariant Shaping from Arbitrary Advice

Nov 02, 2020

Maximum Reward Formulation In Reinforcement Learning

Oct 08, 2020

Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy

Sep 29, 2020