Alert button
Picture for Tian Xu

Tian Xu

Alert button

AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials

Add code
Bookmark button
Alert button
Dec 27, 2023
Zijie Yang, Yongjing Yin, Chaojun Kong, Tiange Chi, Wufan Tao, Yue Zhang, Tian Xu

Viaarxiv icon

Policy Optimization in RLHF: The Impact of Out-of-preference Data

Add code
Bookmark button
Alert button
Dec 17, 2023
Ziniu Li, Tian Xu, Yang Yu

Viaarxiv icon

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Add code
Bookmark button
Alert button
Oct 17, 2023
Ziniu Li, Tian Xu, Yushun Zhang, Yang Yu, Ruoyu Sun, Zhi-Quan Luo

Viaarxiv icon

Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 09, 2023
Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu

Figure 1 for Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Figure 2 for Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Figure 3 for Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Figure 4 for Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Viaarxiv icon

Provably Efficient Adversarial Imitation Learning with Unknown Transitions

Add code
Bookmark button
Alert button
Jun 11, 2023
Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo

Figure 1 for Provably Efficient Adversarial Imitation Learning with Unknown Transitions
Figure 2 for Provably Efficient Adversarial Imitation Learning with Unknown Transitions
Figure 3 for Provably Efficient Adversarial Imitation Learning with Unknown Transitions
Viaarxiv icon

Theoretical Analysis of Offline Imitation With Supplementary Dataset

Add code
Bookmark button
Alert button
Jan 27, 2023
Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo

Figure 1 for Theoretical Analysis of Offline Imitation With Supplementary Dataset
Figure 2 for Theoretical Analysis of Offline Imitation With Supplementary Dataset
Figure 3 for Theoretical Analysis of Offline Imitation With Supplementary Dataset
Figure 4 for Theoretical Analysis of Offline Imitation With Supplementary Dataset
Viaarxiv icon

Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis

Add code
Bookmark button
Alert button
Aug 03, 2022
Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo

Figure 1 for Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis
Figure 2 for Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis
Figure 3 for Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis
Figure 4 for Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis
Viaarxiv icon

A Survey on Model-based Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 19, 2022
Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

Figure 1 for A Survey on Model-based Reinforcement Learning
Viaarxiv icon

Model Generation with Provable Coverability for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 08, 2022
Chengxing Jia, Hao Yin, Chenxiao Gao, Tian Xu, Lei Yuan, Zongzhang Zhang, Yang Yu

Figure 1 for Model Generation with Provable Coverability for Offline Reinforcement Learning
Figure 2 for Model Generation with Provable Coverability for Offline Reinforcement Learning
Figure 3 for Model Generation with Provable Coverability for Offline Reinforcement Learning
Figure 4 for Model Generation with Provable Coverability for Offline Reinforcement Learning
Viaarxiv icon