Yu-Xiang Wang

University of California, Santa Barbara

Sample-Efficient Reinforcement Learning with loglog Switching Cost

Feb 13, 2022

Towards Agnostic Feature-based Dynamic Pricing: Linear Policies vs Linear Valuation with Unknown Noise

Jan 27, 2022

Optimal Dynamic Regret in Proper Online Learning with Strongly Convex Losses and Beyond

Jan 21, 2022

Multivariate Trend Filtering for Lattice Data

Dec 29, 2021

Privately Publishable Per-instance Privacy

Nov 03, 2021

Towards Instance-Optimal Offline Reinforcement Learning with Pessimism

Oct 17, 2021

Optimal Accounting of Differential Privacy via Characteristic Function

Jun 16, 2021

Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings

May 21, 2021

Optimal Dynamic Regret in Exp-Concave Online Learning

Apr 23, 2021

Non-stationary Online Learning with Memory and Non-stochastic Control

Feb 07, 2021