Alert button
Picture for Zhizhou Ren

Zhizhou Ren

Alert button

Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation

Add code
Bookmark button
Alert button
Nov 20, 2022
Zhizhou Ren, Anji Liu, Yitao Liang, Jian Peng, Jianzhu Ma

Figure 1 for Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Figure 2 for Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Figure 3 for Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Viaarxiv icon

Self-Organized Polynomial-Time Coordination Graphs

Add code
Bookmark button
Alert button
Dec 07, 2021
Qianlan Yang, Weijun Dong, Zhizhou Ren, Jianhao Wang, Tonghan Wang, Chongjie Zhang

Figure 1 for Self-Organized Polynomial-Time Coordination Graphs
Figure 2 for Self-Organized Polynomial-Time Coordination Graphs
Figure 3 for Self-Organized Polynomial-Time Coordination Graphs
Figure 4 for Self-Organized Polynomial-Time Coordination Graphs
Viaarxiv icon

Learning Long-Term Reward Redistribution via Randomized Return Decomposition

Add code
Bookmark button
Alert button
Nov 26, 2021
Zhizhou Ren, Ruihan Guo, Yuan Zhou, Jian Peng

Figure 1 for Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Figure 2 for Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Figure 3 for Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Figure 4 for Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Viaarxiv icon

On the Estimation Bias in Double Q-Learning

Add code
Bookmark button
Alert button
Sep 29, 2021
Zhizhou Ren, Guangxiang Zhu, Hao Hu, Beining Han, Jianglun Chen, Chongjie Zhang

Figure 1 for On the Estimation Bias in Double Q-Learning
Figure 2 for On the Estimation Bias in Double Q-Learning
Figure 3 for On the Estimation Bias in Double Q-Learning
Figure 4 for On the Estimation Bias in Double Q-Learning
Viaarxiv icon

Off-Policy Reinforcement Learning with Delayed Rewards

Add code
Bookmark button
Alert button
Jun 22, 2021
Beining Han, Zhizhou Ren, Zuofan Wu, Yuan Zhou, Jian Peng

Figure 1 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 2 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 3 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 4 for Off-Policy Reinforcement Learning with Delayed Rewards
Viaarxiv icon

Generalizable Episodic Memory for Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 11, 2021
Hao Hu, Jianing Ye, Zhizhou Ren, Guangxiang Zhu, Chongjie Zhang

Figure 1 for Generalizable Episodic Memory for Deep Reinforcement Learning
Figure 2 for Generalizable Episodic Memory for Deep Reinforcement Learning
Figure 3 for Generalizable Episodic Memory for Deep Reinforcement Learning
Figure 4 for Generalizable Episodic Memory for Deep Reinforcement Learning
Viaarxiv icon

QPLEX: Duplex Dueling Multi-Agent Q-Learning

Add code
Bookmark button
Alert button
Aug 03, 2020
Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang

Figure 1 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Figure 2 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Figure 3 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Figure 4 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Viaarxiv icon

Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning

Add code
Bookmark button
Alert button
Jun 23, 2020
Jianhao Wang, Zhizhou Ren, Beining Han, Chongjie Zhang

Figure 1 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Figure 2 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Figure 3 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Viaarxiv icon

Exploration via Hindsight Goal Generation

Add code
Bookmark button
Alert button
Jun 10, 2019
Zhizhou Ren, Kefan Dong, Yuan Zhou, Qiang Liu, Jian Peng

Figure 1 for Exploration via Hindsight Goal Generation
Figure 2 for Exploration via Hindsight Goal Generation
Figure 3 for Exploration via Hindsight Goal Generation
Figure 4 for Exploration via Hindsight Goal Generation
Viaarxiv icon