Alert button
Picture for Yi Wu

Yi Wu

Alert button

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased

Add code
Bookmark button
Alert button
Feb 03, 2023
Chao Yu, Jiaxuan Gao, Weilin Liu, Botian Xu, Hao Tang, Jiaqi Yang, Yu Wang, Yi Wu

Figure 1 for Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Figure 2 for Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Figure 3 for Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Figure 4 for Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Viaarxiv icon

Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration

Add code
Bookmark button
Alert button
Jan 09, 2023
Chao Yu, Xinyi Yang, Jiaxuan Gao, Jiayu Chen, Yunfei Li, Jijia Liu, Yunfei Xiang, Ruixin Huang, Huazhong Yang, Yi Wu, Yu Wang

Figure 1 for Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Figure 2 for Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Figure 3 for Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Figure 4 for Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Viaarxiv icon

Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 17, 2022
Zhecheng Yuan, Zhengrong Xue, Bo Yuan, Xueqian Wang, Yi Wu, Yang Gao, Huazhe Xu

Figure 1 for Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
Figure 2 for Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
Figure 3 for Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
Figure 4 for Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
Viaarxiv icon

AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process

Add code
Bookmark button
Alert button
Nov 17, 2022
Kevin Du, Ian Gemp, Yi Wu, Yingying Wu

Figure 1 for AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process
Figure 2 for AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process
Figure 3 for AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process
Figure 4 for AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process
Viaarxiv icon

PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation

Add code
Bookmark button
Alert button
Nov 11, 2022
Lianshang Cai, Linhao Zhang, Dehong Ma, Jun Fan, Daiting Shi, Yi Wu, Zhicong Cheng, Simiu Gu, Dawei Yin

Figure 1 for PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation
Figure 2 for PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation
Figure 3 for PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation
Figure 4 for PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation
Viaarxiv icon

FedBA: Non-IID Federated Learning Framework in UAV Networks

Add code
Bookmark button
Alert button
Oct 10, 2022
Pei Li, Zhijun Liu, Luyi Chang, Jialiang Peng, Yi Wu

Figure 1 for FedBA: Non-IID Federated Learning Framework in UAV Networks
Figure 2 for FedBA: Non-IID Federated Learning Framework in UAV Networks
Figure 3 for FedBA: Non-IID Federated Learning Framework in UAV Networks
Figure 4 for FedBA: Non-IID Federated Learning Framework in UAV Networks
Viaarxiv icon

Multi-Task Learning for Emotion Descriptors Estimation at the fourth ABAW Challenge

Add code
Bookmark button
Alert button
Jul 20, 2022
Yanan Chang, Yi Wu, Xiangyu Miao, Jiahe Wang, Shangfei Wang

Figure 1 for Multi-Task Learning for Emotion Descriptors Estimation at the fourth ABAW Challenge
Figure 2 for Multi-Task Learning for Emotion Descriptors Estimation at the fourth ABAW Challenge
Viaarxiv icon

Hand-Assisted Expression Recognition Method from Synthetic Images at the Fourth ABAW Challenge

Add code
Bookmark button
Alert button
Jul 20, 2022
Xiangyu Miao, Jiahe Wang, Yanan Chang, Yi Wu, Shangfei Wang

Figure 1 for Hand-Assisted Expression Recognition Method from Synthetic Images at the Fourth ABAW Challenge
Viaarxiv icon

Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 24, 2022
Yunfei Li, Tian Gao, Jiaqi Yang, Huazhe Xu, Yi Wu

Figure 1 for Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Figure 2 for Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Figure 3 for Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Figure 4 for Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Viaarxiv icon

Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 15, 2022
Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, Yi Wu

Figure 1 for Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Figure 2 for Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Figure 3 for Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Figure 4 for Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon