Alert button
Picture for Yaodong Yang

Yaodong Yang

Alert button

Constrained Update Projection Approach to Safe Policy Optimization

Add code
Bookmark button
Alert button
Sep 15, 2022
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan

Figure 1 for Constrained Update Projection Approach to Safe Policy Optimization
Figure 2 for Constrained Update Projection Approach to Safe Policy Optimization
Figure 3 for Constrained Update Projection Approach to Safe Policy Optimization
Figure 4 for Constrained Update Projection Approach to Safe Policy Optimization
Viaarxiv icon

Debias the Black-box: A Fair Ranking Framework via Knowledge Distillation

Add code
Bookmark button
Alert button
Aug 24, 2022
Zhitao Zhu, Shijing Si, Jianzong Wang, Yaodong Yang, Jing Xiao

Figure 1 for Debias the Black-box: A Fair Ranking Framework via Knowledge Distillation
Figure 2 for Debias the Black-box: A Fair Ranking Framework via Knowledge Distillation
Figure 3 for Debias the Black-box: A Fair Ranking Framework via Knowledge Distillation
Figure 4 for Debias the Black-box: A Fair Ranking Framework via Knowledge Distillation
Viaarxiv icon

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL

Add code
Bookmark button
Alert button
Aug 02, 2022
Jakub Grudzien Kuba, Xidong Feng, Shiyao Ding, Hao Dong, Jun Wang, Yaodong Yang

Figure 1 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 2 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 3 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 4 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Viaarxiv icon

Fully Decentralized Model-based Policy Optimization for Networked Systems

Add code
Bookmark button
Alert button
Jul 13, 2022
Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang

Figure 1 for Fully Decentralized Model-based Policy Optimization for Networked Systems
Figure 2 for Fully Decentralized Model-based Policy Optimization for Networked Systems
Figure 3 for Fully Decentralized Model-based Policy Optimization for Networked Systems
Figure 4 for Fully Decentralized Model-based Policy Optimization for Networked Systems
Viaarxiv icon

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 17, 2022
Yuanpei Chen, Yaodong Yang, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuang Jiang, Stephen Marcus McAleer, Hao Dong, Zongqing Lu, Song-Chun Zhu

Figure 1 for Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Figure 2 for Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Figure 3 for Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Figure 4 for Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Viaarxiv icon

Learning Risk-Averse Equilibria in Multi-Agent Systems

Add code
Bookmark button
Alert button
May 30, 2022
Oliver Slumbers, David Henry Mguni, Stephen McAleer, Jun Wang, Yaodong Yang

Figure 1 for Learning Risk-Averse Equilibria in Multi-Agent Systems
Figure 2 for Learning Risk-Averse Equilibria in Multi-Agent Systems
Figure 3 for Learning Risk-Averse Equilibria in Multi-Agent Systems
Figure 4 for Learning Risk-Averse Equilibria in Multi-Agent Systems
Viaarxiv icon

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

Add code
Bookmark button
Alert button
May 30, 2022
Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang

Figure 1 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Figure 2 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Figure 3 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Figure 4 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Viaarxiv icon

A Review of Safe Reinforcement Learning: Methods, Theory and Applications

Add code
Bookmark button
Alert button
May 23, 2022
Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois Knoll

Figure 1 for A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Figure 2 for A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Figure 3 for A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Figure 4 for A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Viaarxiv icon

On the Convergence of Fictitious Play: A Decomposition Approach

Add code
Bookmark button
Alert button
May 03, 2022
Yurong Chen, Xiaotie Deng, Chenchen Li, David Mguni, Jun Wang, Xiang Yan, Yaodong Yang

Figure 1 for On the Convergence of Fictitious Play: A Decomposition Approach
Figure 2 for On the Convergence of Fictitious Play: A Decomposition Approach
Figure 3 for On the Convergence of Fictitious Play: A Decomposition Approach
Figure 4 for On the Convergence of Fictitious Play: A Decomposition Approach
Viaarxiv icon

API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks

Add code
Bookmark button
Alert button
Mar 10, 2022
Xiaotian Hao, Weixun Wang, Hangyu Mao, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang, Jianye Hao

Figure 1 for API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks
Figure 2 for API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks
Figure 3 for API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks
Figure 4 for API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks
Viaarxiv icon