Alert button
Picture for Zongzhang Zhang

Zongzhang Zhang

Alert button

Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation

Add code
Bookmark button
Alert button
Mar 12, 2024
Chengxing Jia, Fuxiang Zhang, Yi-Chen Li, Chen-Xiao Gao, Xu-Hui Liu, Lei Yuan, Zongzhang Zhang, Yang Yu

Figure 1 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 2 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 3 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 4 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Viaarxiv icon

Reinforced In-Context Black-Box Optimization

Add code
Bookmark button
Alert button
Feb 27, 2024
Lei Song, Chenxiao Gao, Ke Xue, Chenyang Wu, Dong Li, Jianye Hao, Zongzhang Zhang, Chao Qian

Viaarxiv icon

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics

Add code
Bookmark button
Alert button
Feb 17, 2024
Xinyu Zhang, Wenjie Qiu, Yi-Chen Li, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu

Viaarxiv icon

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

Add code
Bookmark button
Alert button
Dec 26, 2023
Renzhe Zhou, Chen-Xiao Gao, Zongzhang Zhang, Yang Yu

Viaarxiv icon

Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments

Add code
Bookmark button
Alert button
Oct 09, 2023
Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Anqi Huang, Kai Xu, Zongzhang Zhang, Yang Yu

Figure 1 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Figure 2 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Figure 3 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Figure 4 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Viaarxiv icon

ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning

Add code
Bookmark button
Alert button
Sep 12, 2023
Chenxiao Gao, Chenyang Wu, Mingjun Cao, Rui Kong, Zongzhang Zhang, Yang Yu

Viaarxiv icon

Policy Regularization with Dataset Constraint for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 11, 2023
Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu

Figure 1 for Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
Figure 2 for Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
Figure 3 for Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
Figure 4 for Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
Viaarxiv icon

Language Model Self-improvement by Reinforcement Learning Contemplation

Add code
Bookmark button
Alert button
May 23, 2023
Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu

Figure 1 for Language Model Self-improvement by Reinforcement Learning Contemplation
Figure 2 for Language Model Self-improvement by Reinforcement Learning Contemplation
Figure 3 for Language Model Self-improvement by Reinforcement Learning Contemplation
Figure 4 for Language Model Self-improvement by Reinforcement Learning Contemplation
Viaarxiv icon

Robust Multi-agent Communication via Multi-view Message Certification

Add code
Bookmark button
Alert button
May 07, 2023
Lei Yuan, Tao Jiang, Lihe Li, Feng Chen, Zongzhang Zhang, Yang Yu

Figure 1 for Robust Multi-agent Communication via Multi-view Message Certification
Figure 2 for Robust Multi-agent Communication via Multi-view Message Certification
Figure 3 for Robust Multi-agent Communication via Multi-view Message Certification
Figure 4 for Robust Multi-agent Communication via Multi-view Message Certification
Viaarxiv icon

How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement

Add code
Bookmark button
Alert button
Mar 03, 2023
Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu

Figure 1 for How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement
Figure 2 for How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement
Figure 3 for How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement
Figure 4 for How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement
Viaarxiv icon