Jianye Hao

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration

Mar 16, 2022
Pengyi Li, Hongyao Tang, Tianpei Yang, Xiaotian Hao, Tong Sang, Yan Zheng, Jianye Hao, Matthew E. Taylor, Zhen Wang

Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents

Mar 16, 2022
Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li

API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks

Mar 10, 2022
Xiaotian Hao, Weixun Wang, Hangyu Mao, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang, Jianye Hao

Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization

Mar 04, 2022
Minghuan Liu, Zhengbang Zhu, Yuzheng Zhuang, Weinan Zhang, Jianye Hao, Yong Yu, Jun Wang

Generalizable Information Theoretic Causal Representation

Feb 17, 2022
Mengyue Yang, Xinyu Cai, Furui Liu, Xu Chen, Zhitang Chen, Jianye Hao, Jun Wang

Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization

Feb 16, 2022
Jian Zhao, Yue Zhang, Xunhan Hu, Weixun Wang, Wengang Zhou, Jianye Hao, Jiangcheng Zhu, Houqiang Li

Introduction to The Dynamic Pickup and Delivery Problem Benchmark -- ICAPS 2021 Competition

Jan 19, 2022
Jianye Hao, Jiawen Lu, Xijun Li, Xialiang Tong, Xiang Xiang, Mingxuan Yuan, Hankz Hankui Zhuo

Debiased Recommendation with User Feature Balancing

Jan 16, 2022
Mengyue Yang, Guohao Cai, Furui Liu, Zhenhua Dong, Xiuqiang He, Jianye Hao, Jun Wang, Xu Chen

A Survey on Interpretable Reinforcement Learning

Dec 24, 2021
Claire Glanois, Paul Weng, Matthieu Zimmer, Dong Li, Tianpei Yang, Jianye Hao, Wulong Liu
