Alert button
Picture for Wen Sun

Wen Sun

Alert button

Contextual Bandits and Imitation Learning via Preference-Based Active Queries

Jul 24, 2023
Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu

Viaarxiv icon

JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning

Jul 21, 2023
Kaiwen Wang, Junxiong Wang, Yueying Li, Nathan Kallus, Immanuel Trummer, Wen Sun

Figure 1 for JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Figure 2 for JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Figure 3 for JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Figure 4 for JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Viaarxiv icon

Selective Sampling and Imitation Learning via Online Regression

Jul 11, 2023
Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu

Figure 1 for Selective Sampling and Imitation Learning via Online Regression
Figure 2 for Selective Sampling and Imitation Learning via Online Regression
Figure 3 for Selective Sampling and Imitation Learning via Online Regression
Viaarxiv icon

Learning to Generate Better Than Your LLM

Jun 20, 2023
Jonathan D. Chang, Kiante Brantley, Rajkumar Ramamurthy, Dipendra Misra, Wen Sun

Figure 1 for Learning to Generate Better Than Your LLM
Figure 2 for Learning to Generate Better Than Your LLM
Figure 3 for Learning to Generate Better Than Your LLM
Figure 4 for Learning to Generate Better Than Your LLM
Viaarxiv icon

How to Query Human Feedback Efficiently in RL?

May 29, 2023
Wenhao Zhan, Masatoshi Uehara, Wen Sun, Jason D. Lee

Viaarxiv icon

The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

May 25, 2023
Kaiwen Wang, Kevin Zhou, Runzhe Wu, Nathan Kallus, Wen Sun

Figure 1 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Figure 2 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Viaarxiv icon

Provable Offline Reinforcement Learning with Human Feedback

May 24, 2023
Wenhao Zhan, Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun

Viaarxiv icon

Distributional Offline Policy Evaluation with Predictive Error Guarantees

Feb 19, 2023
Runzhe Wu, Masatoshi Uehara, Wen Sun

Figure 1 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 2 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 3 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 4 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Viaarxiv icon

Multi-task Representation Learning for Pure Exploration in Linear Bandits

Feb 09, 2023
Yihan Du, Longbo Huang, Wen Sun

Figure 1 for Multi-task Representation Learning for Pure Exploration in Linear Bandits
Viaarxiv icon

Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR

Feb 07, 2023
Kaiwen Wang, Nathan Kallus, Wen Sun

Viaarxiv icon