Yali Du

MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment

Dec 06, 2023
Ziyan Wang, Yali Du, Yudi Zhang, Meng Fang, Biwei Huang

Reduced Policy Optimization for Continuous Control with Hard Constraints

Oct 14, 2023
Shutong Ding, Jingya Wang, Yali Du, Ye Shi

Invariant Learning via Probability of Sufficient and Necessary Causes

Sep 22, 2023
Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jun Wang

Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank

Aug 05, 2023
Jiarui Jin, Xianyu Chen, Weinan Zhang, Mengyue Yang, Yang Wang, Yali Du, Yong Yu, Jun Wang

ChessGPT: Bridging Policy Learning and Language Modeling

Jun 15, 2023
Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang

Zero-shot Preference Learning for Offline RL via Optimal Transport

Jun 06, 2023
Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li

Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination

Jun 05, 2023
Yang Li, Shao Zhang, Jichen Sun, Wenhao Zhang, Yali Du, Ying Wen, Xinbing Wang, Wei Pan

GRD: A Generative Approach for Interpretable Reward Redistribution in Reinforcement Learning

May 28, 2023
Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, Mykola Pechenizkiy

Introspective Tips: Large Language Model for In-Context Decision Making

May 19, 2023
Liting Chen, Lu Wang, Hang Dong, Yali Du, Jie Yan, Fangkai Yang, Shuang Li, Pu Zhao, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning

Apr 15, 2023
Sirui Chen, Zhaowei Zhang, Yali Du, Yaodong Yang
