Alert button
Picture for Runzhe Wu

Runzhe Wu

Alert button

Making RL with Preference-based Feedback Efficient via Randomization

Add code
Bookmark button
Alert button
Oct 23, 2023
Runzhe Wu, Wen Sun

Viaarxiv icon

Contextual Bandits and Imitation Learning via Preference-Based Active Queries

Add code
Bookmark button
Alert button
Jul 24, 2023
Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu

Viaarxiv icon

Selective Sampling and Imitation Learning via Online Regression

Add code
Bookmark button
Alert button
Jul 11, 2023
Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu

Figure 1 for Selective Sampling and Imitation Learning via Online Regression
Figure 2 for Selective Sampling and Imitation Learning via Online Regression
Figure 3 for Selective Sampling and Imitation Learning via Online Regression
Viaarxiv icon

The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

Add code
Bookmark button
Alert button
May 25, 2023
Kaiwen Wang, Kevin Zhou, Runzhe Wu, Nathan Kallus, Wen Sun

Figure 1 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Figure 2 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Viaarxiv icon

Distributional Offline Policy Evaluation with Predictive Error Guarantees

Add code
Bookmark button
Alert button
Feb 19, 2023
Runzhe Wu, Masatoshi Uehara, Wen Sun

Figure 1 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 2 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 3 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 4 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Viaarxiv icon

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 05, 2021
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Weinan Zhang, Jun Wang

Figure 1 for MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Figure 2 for MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Figure 3 for MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Figure 4 for MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Viaarxiv icon