Alert button
Picture for David Mguni

David Mguni

Alert button

SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation

Add code
Bookmark button
Alert button
Feb 16, 2022
Aivar Sootla, Alexander I. Cowen-Rivers, Taher Jafferjee, Ziyan Wang, David Mguni, Jun Wang, Haitham Bou-Ammar

Figure 1 for SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
Figure 2 for SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
Figure 3 for SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
Figure 4 for SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
Viaarxiv icon

DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention

Add code
Bookmark button
Alert button
Oct 27, 2021
David Mguni, Joel Jennings, Taher Jafferjee, Aivar Sootla, Yaodong Yang, Changmin Yu, Usman Islam, Ziyan Wang, Jun Wang

Figure 1 for DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention
Figure 2 for DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention
Figure 3 for DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention
Figure 4 for DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention
Viaarxiv icon

Learning to Shape Rewards using a Game of Switching Controls

Add code
Bookmark button
Alert button
Mar 16, 2021
David Mguni, Jianhong Wang, Taher Jafferjee, Nicolas Perez-Nieves, Wenbin Song, Yaodong Yang, Feifei Tong, Hui Chen, Jiangcheng Zhu, Yali Du, Jun Wang

Figure 1 for Learning to Shape Rewards using a Game of Switching Controls
Figure 2 for Learning to Shape Rewards using a Game of Switching Controls
Figure 3 for Learning to Shape Rewards using a Game of Switching Controls
Figure 4 for Learning to Shape Rewards using a Game of Switching Controls
Viaarxiv icon

Multi-Agent Determinantal Q-Learning

Add code
Bookmark button
Alert button
Jun 09, 2020
Yaodong Yang, Ying Wen, Liheng Chen, Jun Wang, Kun Shao, David Mguni, Weinan Zhang

Figure 1 for Multi-Agent Determinantal Q-Learning
Figure 2 for Multi-Agent Determinantal Q-Learning
Figure 3 for Multi-Agent Determinantal Q-Learning
Figure 4 for Multi-Agent Determinantal Q-Learning
Viaarxiv icon