Chenlu Ye

Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

Feb 15, 2024
Chenlu Ye, Jiafan He, Quanquan Gu, Tong Zhang

A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference

Feb 11, 2024
Chenlu Ye, Wei Xiong, Yuheng Zhang, Nan Jiang, Tong Zhang

Gibbs Sampling from Human Feedback: A Provable KL-constrained Framework for RLHF

Dec 18, 2023
Wei Xiong, Hanze Dong, Chenlu Ye, Han Zhong, Nan Jiang, Tong Zhang

Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks

Nov 24, 2023
Jianqing Fan, Zhaoran Wang, Zhuoran Yang, Chenlu Ye

Corruption-Robust Offline Reinforcement Learning with General Function Approximation

Oct 23, 2023
Chenlu Ye, Rui Yang, Quanquan Gu, Tong Zhang

Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning

Sep 05, 2023
Yong Lin, Chen Liu, Chenlu Ye, Qing Lian, Yuan Yao, Tong Zhang

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

Dec 12, 2022
Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang
