Taher Jafferjee

Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training

Sep 02, 2022
Via arXiv

SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation

Feb 16, 2022
Via arXiv

Reinforcement Learning in Presence of Discrete Markovian Context Evolution

Feb 14, 2022
Via arXiv

DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention

Oct 27, 2021
Via arXiv

Learning to Shape Rewards using a Game of Switching Controls

Mar 16, 2021
Via arXiv

Hallucinating Value: A Pitfall of Dyna-style Planning with Imperfect Environment Models

Jun 08, 2020
Via arXiv