Alert button
Picture for Xunhan Hu

Xunhan Hu

Alert button

Rethinking Missing Data: Aleatoric Uncertainty-Aware Recommendation

Add code
Bookmark button
Alert button
Sep 22, 2022
Chenxu Wang, Fuli Feng, Yang Zhang, Qifan Wang, Xunhan Hu, Xiangnan He

Figure 1 for Rethinking Missing Data: Aleatoric Uncertainty-Aware Recommendation
Figure 2 for Rethinking Missing Data: Aleatoric Uncertainty-Aware Recommendation
Figure 3 for Rethinking Missing Data: Aleatoric Uncertainty-Aware Recommendation
Figure 4 for Rethinking Missing Data: Aleatoric Uncertainty-Aware Recommendation
Viaarxiv icon

LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
May 05, 2022
Mingyu Yang, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li

Figure 1 for LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
Figure 2 for LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
Figure 3 for LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
Figure 4 for LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning

Add code
Bookmark button
Alert button
Apr 06, 2022
Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li

Figure 1 for DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning
Figure 2 for DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning
Figure 3 for DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning
Figure 4 for DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning
Viaarxiv icon

Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents

Add code
Bookmark button
Alert button
Mar 16, 2022
Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li

Figure 1 for Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents
Figure 2 for Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents
Figure 3 for Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents
Figure 4 for Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents
Viaarxiv icon

CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 16, 2022
Jian Zhao, Xunhan Hu, Mingyu Yang, Wengang Zhou, Jiangcheng Zhu, Houqiang Li

Figure 1 for CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
Figure 2 for CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
Figure 3 for CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
Figure 4 for CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
Viaarxiv icon

DQMIX: A Distributional Perspective on Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 21, 2022
Jian Zhao, Mingyu Yang, Xunhan Hu, Wengang Zhou, Houqiang Li

Figure 1 for DQMIX: A Distributional Perspective on Multi-Agent Reinforcement Learning
Figure 2 for DQMIX: A Distributional Perspective on Multi-Agent Reinforcement Learning
Figure 3 for DQMIX: A Distributional Perspective on Multi-Agent Reinforcement Learning
Figure 4 for DQMIX: A Distributional Perspective on Multi-Agent Reinforcement Learning
Viaarxiv icon

Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization

Add code
Bookmark button
Alert button
Feb 16, 2022
Jian Zhao, Yue Zhang, Xunhan Hu, Weixun Wang, Wengang Zhou, Jianye Hao, Jiangcheng Zhu, Houqiang Li

Figure 1 for Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Figure 2 for Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Figure 3 for Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Figure 4 for Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Viaarxiv icon