Yaodong Yang

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

Jan 19, 2024
Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Wei Wang, Yaodong Yang, Song-Chun Zhu

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

Dec 12, 2023
Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Nov 30, 2023
Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang

AI Alignment: A Comprehensive Survey

Nov 01, 2023
Jiaming Ji, Tianyi Qiu, Boyuan Chen, Borong Zhang, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Jiayi Zhou, Zhaowei Zhang, Fanzhi Zeng, Kwan Yee Ng, Juntao Dai, Xuehai Pan, Aidan O'Gara, Yingshan Lei, Hua Xu, Brian Tse, Jie Fu, Stephen McAleer, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Yike Guo, Wen Gao

Grasp Multiple Objects with One Hand

Oct 24, 2023
Yuyang Li, Bo Liu, Yiran Geng, Puhao Li, Yaodong Yang, Yixin Zhu, Tengyu Liu, Siyuan Huang

Safe RLHF: Safe Reinforcement Learning from Human Feedback

Oct 19, 2023
Josef Dai, Xuehai Pan, Ruiyang Sun, Jiaming Ji, Xinbo Xu, Mickel Liu, Yizhou Wang, Yaodong Yang

Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Oct 19, 2023
Jiaming Ji, Borong Zhang, Jiayi Zhou, Xuehai Pan, Weidong Huang, Ruiyang Sun, Yiran Geng, Yifan Zhong, Juntao Dai, Yaodong Yang

Measuring Value Understanding in Language Models through Discriminator-Critique Gap

Oct 19, 2023
Zhaowei Zhang, Fengshuo Bai, Jun Gao, Yaodong Yang

Masked Pretraining for Multi-Agent Decision Making

Oct 18, 2023
Jie Liu, Yinmin Zhang, Chuming Li, Chao Yang, Yaodong Yang, Yu Liu, Wanli Ouyang
