Picture for Wen Sun

Wen Sun

Contextual Bandits and Imitation Learning via Preference-Based Active Queries

Add code
Jul 24, 2023
Viaarxiv icon

JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning

Add code
Jul 21, 2023
Viaarxiv icon

Selective Sampling and Imitation Learning via Online Regression

Add code
Jul 11, 2023
Viaarxiv icon

Learning to Generate Better Than Your LLM

Add code
Jun 20, 2023
Viaarxiv icon

How to Query Human Feedback Efficiently in RL?

Add code
May 29, 2023
Viaarxiv icon

The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

Add code
May 25, 2023
Figure 1 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Figure 2 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Viaarxiv icon

Provable Offline Reinforcement Learning with Human Feedback

Add code
May 24, 2023
Viaarxiv icon

Distributional Offline Policy Evaluation with Predictive Error Guarantees

Add code
Feb 19, 2023
Figure 1 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 2 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 3 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 4 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Viaarxiv icon

Multi-task Representation Learning for Pure Exploration in Linear Bandits

Add code
Feb 09, 2023
Figure 1 for Multi-task Representation Learning for Pure Exploration in Linear Bandits
Viaarxiv icon

Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR

Add code
Feb 07, 2023
Viaarxiv icon