Picture for Ruihao Zhu

Ruihao Zhu

Reward Learning From Preference With Ties

Add code
Oct 05, 2024
Viaarxiv icon

Satisficing Exploration in Bandit Optimization

Add code
Jun 10, 2024
Viaarxiv icon

Efficient and Interpretable Bandit Algorithms

Add code
Oct 23, 2023
Viaarxiv icon

User Experience Design Professionals' Perceptions of Generative Artificial Intelligence

Add code
Sep 26, 2023
Viaarxiv icon

Phase Transitions in Learning and Earning under Price Protection Guarantee

Add code
Nov 03, 2022
Viaarxiv icon

Learning to Price Supply Chain Contracts against a Learning Retailer

Add code
Nov 02, 2022
Viaarxiv icon

Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing

Add code
Aug 04, 2022
Figure 1 for Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing
Figure 2 for Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing
Viaarxiv icon

Safe Optimal Design with Applications in Policy Learning

Add code
Nov 08, 2021
Figure 1 for Safe Optimal Design with Applications in Policy Learning
Figure 2 for Safe Optimal Design with Applications in Policy Learning
Figure 3 for Safe Optimal Design with Applications in Policy Learning
Figure 4 for Safe Optimal Design with Applications in Policy Learning
Viaarxiv icon

Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs

Add code
Oct 07, 2020
Figure 1 for Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs
Figure 2 for Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs
Figure 3 for Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs
Viaarxiv icon

Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism

Add code
Jun 24, 2020
Figure 1 for Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Figure 2 for Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Figure 3 for Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Figure 4 for Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Viaarxiv icon