Alert button
Picture for Jiangcheng Zhu

Jiangcheng Zhu

Alert button

Yi: Open Foundation Models by 01.AI

Add code
Bookmark button
Alert button
Mar 07, 2024
01. AI, :, Alex Young, Bei Chen, Chao Li, Chengen Huang, Ge Zhang, Guanwei Zhang, Heng Li, Jiangcheng Zhu, Jianqun Chen, Jing Chang, Kaidong Yu, Peng Liu, Qiang Liu, Shawn Yue, Senbin Yang, Shiming Yang, Tao Yu, Wen Xie, Wenhao Huang, Xiaohui Hu, Xiaoyi Ren, Xinyao Niu, Pengcheng Nie, Yuchi Xu, Yudong Liu, Yue Wang, Yuxuan Cai, Zhenyu Gu, Zhiyuan Liu, Zonghong Dai

Figure 1 for Yi: Open Foundation Models by 01.AI
Figure 2 for Yi: Open Foundation Models by 01.AI
Figure 3 for Yi: Open Foundation Models by 01.AI
Figure 4 for Yi: Open Foundation Models by 01.AI
Viaarxiv icon

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

Add code
Bookmark button
Alert button
Aug 09, 2023
Yang Li, Kun Xiong, Yingping Zhang, Jiangcheng Zhu, Stephen Mcaleer, Wei Pan, Jun Wang, Zonghong Dai, Yaodong Yang

Figure 1 for JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Figure 2 for JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Figure 3 for JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Figure 4 for JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Viaarxiv icon

An Empirical Study on Google Research Football Multi-agent Scenarios

Add code
Bookmark button
Alert button
May 16, 2023
Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang

Figure 1 for An Empirical Study on Google Research Football Multi-agent Scenarios
Figure 2 for An Empirical Study on Google Research Football Multi-agent Scenarios
Figure 3 for An Empirical Study on Google Research Football Multi-agent Scenarios
Figure 4 for An Empirical Study on Google Research Football Multi-agent Scenarios
Viaarxiv icon

Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection

Add code
Bookmark button
Alert button
May 09, 2023
Jiajun Fan, Yuzheng Zhuang, Yuecheng Liu, Jianye Hao, Bin Wang, Jiangcheng Zhu, Hao Wang, Shu-Tao Xia

Figure 1 for Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
Figure 2 for Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
Figure 3 for Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
Figure 4 for Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
Viaarxiv icon

CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 16, 2022
Jian Zhao, Xunhan Hu, Mingyu Yang, Wengang Zhou, Jiangcheng Zhu, Houqiang Li

Figure 1 for CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
Figure 2 for CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
Figure 3 for CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
Figure 4 for CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
Viaarxiv icon

Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization

Add code
Bookmark button
Alert button
Feb 16, 2022
Jian Zhao, Yue Zhang, Xunhan Hu, Weixun Wang, Wengang Zhou, Jianye Hao, Jiangcheng Zhu, Houqiang Li

Figure 1 for Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Figure 2 for Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Figure 3 for Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Figure 4 for Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Viaarxiv icon

Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention

Add code
Bookmark button
Alert button
Nov 16, 2021
Yunkun Xu, Zhenyu Liu, Guifang Duan, Jiangcheng Zhu, Xiaolong Bai, Jianrong Tan

Figure 1 for Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Figure 2 for Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Figure 3 for Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Figure 4 for Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Viaarxiv icon

Learning to Shape Rewards using a Game of Switching Controls

Add code
Bookmark button
Alert button
Mar 16, 2021
David Mguni, Jianhong Wang, Taher Jafferjee, Nicolas Perez-Nieves, Wenbin Song, Yaodong Yang, Feifei Tong, Hui Chen, Jiangcheng Zhu, Yali Du, Jun Wang

Figure 1 for Learning to Shape Rewards using a Game of Switching Controls
Figure 2 for Learning to Shape Rewards using a Game of Switching Controls
Figure 3 for Learning to Shape Rewards using a Game of Switching Controls
Figure 4 for Learning to Shape Rewards using a Game of Switching Controls
Viaarxiv icon