Alert button
Picture for David Mguni

David Mguni

Alert button

All Language Models Large and Small

Add code
Bookmark button
Alert button
Feb 19, 2024
Zhixun Chen, Yali Du, David Mguni

Viaarxiv icon

ChessGPT: Bridging Policy Learning and Language Modeling

Add code
Bookmark button
Alert button
Jun 15, 2023
Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang

Viaarxiv icon

Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 02, 2023
Lukas Schäfer, Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V. Albrecht, David Mguni

Figure 1 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Figure 2 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Figure 3 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Figure 4 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Viaarxiv icon

Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training

Add code
Bookmark button
Alert button
Sep 02, 2022
Taher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew Taylor, Kun Shao, Jun Wang, David Mguni

Figure 1 for Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training
Figure 2 for Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training
Figure 3 for Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training
Figure 4 for Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training
Viaarxiv icon

Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints

Add code
Bookmark button
Alert button
Jun 06, 2022
David Mguni, Aivar Sootla, Juliusz Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang

Figure 1 for Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
Figure 2 for Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
Figure 3 for Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
Figure 4 for Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
Viaarxiv icon

SEREN: Knowing When to Explore and When to Exploit

Add code
Bookmark button
Alert button
May 30, 2022
Changmin Yu, David Mguni, Dong Li, Aivar Sootla, Jun Wang, Neil Burgess

Figure 1 for SEREN: Knowing When to Explore and When to Exploit
Figure 2 for SEREN: Knowing When to Explore and When to Exploit
Figure 3 for SEREN: Knowing When to Explore and When to Exploit
Figure 4 for SEREN: Knowing When to Explore and When to Exploit
Viaarxiv icon

On the Convergence of Fictitious Play: A Decomposition Approach

Add code
Bookmark button
Alert button
May 03, 2022
Yurong Chen, Xiaotie Deng, Chenchen Li, David Mguni, Jun Wang, Xiang Yan, Yaodong Yang

Figure 1 for On the Convergence of Fictitious Play: A Decomposition Approach
Figure 2 for On the Convergence of Fictitious Play: A Decomposition Approach
Figure 3 for On the Convergence of Fictitious Play: A Decomposition Approach
Figure 4 for On the Convergence of Fictitious Play: A Decomposition Approach
Viaarxiv icon