Chicheng Zhang

Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM

May 16, 2025

Achieving adaptivity and optimality for multi-armed bandits using Exponential-Kullback Leibler Maillard Sampling

Feb 20, 2025

A Note on Sample Complexity of Interactive Imitation Learning with Log Loss

Dec 09, 2024

Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits

Feb 17, 2024

Ensemble-based Interactive Imitation Learning

Dec 28, 2023

Efficient Active Learning Halfspaces with Tsybakov Noise: A Non-convex Optimization Approach

Oct 23, 2023

Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards

Apr 28, 2023

PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits

Oct 25, 2022

On Efficient Online Imitation Learning via Classification

Sep 26, 2022

Thompson Sampling for Robust Transfer in Multi-Task Bandits

Jun 17, 2022