Jiangcheng Zhu

Yi: Open Foundation Models by 01.AI

Mar 07, 2024
Via arXiv

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

Aug 09, 2023
Via arXiv

An Empirical Study on Google Research Football Multi-agent Scenarios

May 16, 2023
Via arXiv

Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection

May 09, 2023
Via arXiv

CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning

Mar 16, 2022
Via arXiv

Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization

Feb 16, 2022
Via arXiv

Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention

Nov 16, 2021
Via arXiv

Learning to Shape Rewards using a Game of Switching Controls

Mar 16, 2021
Via arXiv