Alert button
Picture for Yaodong Yang

Yaodong Yang

Alert button

MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization

Add code
Bookmark button
Alert button
Oct 15, 2023
Simin Li, Ruixiao Xu, Jun Guo, Pu Feng, Jiakai Wang, Aishan Liu, Yaodong Yang, Xianglong Liu, Weifeng Lv

Figure 1 for MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
Figure 2 for MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
Figure 3 for MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
Figure 4 for MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
Viaarxiv icon

Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models

Add code
Bookmark button
Alert button
Oct 10, 2023
Chengdong Ma, Ziran Yang, Minquan Gao, Hai Ci, Jun Gao, Xuehai Pan, Yaodong Yang

Figure 1 for Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models
Figure 2 for Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models
Figure 3 for Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models
Figure 4 for Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models
Viaarxiv icon

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models

Add code
Bookmark button
Alert button
Oct 08, 2023
Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai

Figure 1 for GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Figure 2 for GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Figure 3 for GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Figure 4 for GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Viaarxiv icon

Measuring Value Understanding in Language Models through Discriminator-Critique Gap

Add code
Bookmark button
Alert button
Oct 07, 2023
Zhaowei Zhang, Fengshuo Bai, Jun Gao, Yaodong Yang

Figure 1 for Measuring Value Understanding in Language Models through Discriminator-Critique Gap
Figure 2 for Measuring Value Understanding in Language Models through Discriminator-Critique Gap
Figure 3 for Measuring Value Understanding in Language Models through Discriminator-Critique Gap
Figure 4 for Measuring Value Understanding in Language Models through Discriminator-Critique Gap
Viaarxiv icon

Dynamic Handover: Throw and Catch with Bimanual Hands

Add code
Bookmark button
Alert button
Sep 11, 2023
Binghao Huang, Yuanpei Chen, Tianyu Wang, Yuzhe Qin, Yaodong Yang, Nikolay Atanasov, Xiaolong Wang

Figure 1 for Dynamic Handover: Throw and Catch with Bimanual Hands
Figure 2 for Dynamic Handover: Throw and Catch with Bimanual Hands
Figure 3 for Dynamic Handover: Throw and Catch with Bimanual Hands
Figure 4 for Dynamic Handover: Throw and Catch with Bimanual Hands
Viaarxiv icon

Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators

Add code
Bookmark button
Alert button
Sep 07, 2023
Jingbang Chen, Yian Wang, Xingwei Qu, Shuangjia Zheng, Yaodong Yang, Hao Dong, Jie Fu

Figure 1 for Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators
Figure 2 for Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators
Figure 3 for Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators
Figure 4 for Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators
Viaarxiv icon

ProAgent: Building Proactive Cooperative AI with Large Language Models

Add code
Bookmark button
Alert button
Aug 28, 2023
Ceyao Zhang, Kaijie Yang, Siyi Hu, Zihao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, Yaodong Yang

Figure 1 for ProAgent: Building Proactive Cooperative AI with Large Language Models
Figure 2 for ProAgent: Building Proactive Cooperative AI with Large Language Models
Figure 3 for ProAgent: Building Proactive Cooperative AI with Large Language Models
Figure 4 for ProAgent: Building Proactive Cooperative AI with Large Language Models
Viaarxiv icon

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

Add code
Bookmark button
Alert button
Aug 09, 2023
Yang Li, Kun Xiong, Yingping Zhang, Jiangcheng Zhu, Stephen Mcaleer, Wei Pan, Jun Wang, Zonghong Dai, Yaodong Yang

Figure 1 for JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Figure 2 for JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Figure 3 for JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Figure 4 for JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Viaarxiv icon