Yaodong Yang

Masked Pretraining for Multi-Agent Decision Making

Oct 18, 2023

MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization

Oct 15, 2023

Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models

Oct 10, 2023

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models

Oct 08, 2023

Dynamic Handover: Throw and Catch with Bimanual Hands

Sep 11, 2023

Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators

Sep 07, 2023

ProAgent: Building Proactive Cooperative AI with Large Language Models

Aug 28, 2023

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

Aug 09, 2023

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning

Jul 24, 2023

Safe DreamerV3: Safe Reinforcement Learning with World Models

Jul 14, 2023