Yu-Xiang Wang

Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost

Feb 13, 2022
Dan Qiao, Ming Yin, Ming Min, Yu-Xiang Wang

Via arXiv

Towards Agnostic Feature-based Dynamic Pricing: Linear Policies vs Linear Valuation with Unknown Noise

Jan 27, 2022
Jianyu Xu, Yu-Xiang Wang

Via arXiv

Optimal Dynamic Regret in Proper Online Learning with Strongly Convex Losses and Beyond

Jan 21, 2022
Dheeraj Baby, Yu-Xiang Wang

Via arXiv

Multivariate Trend Filtering for Lattice Data

Dec 29, 2021
Veeranjaneyulu Sadhanala, Yu-Xiang Wang, Addison J. Hu, Ryan J. Tibshirani

Via arXiv

Privately Publishable Per-instance Privacy

Nov 03, 2021
Rachel Redberg, Yu-Xiang Wang

Via arXiv

Towards Instance-Optimal Offline Reinforcement Learning with Pessimism

Oct 17, 2021
Ming Yin, Yu-Xiang Wang

Via arXiv

Optimal Accounting of Differential Privacy via Characteristic Function

Jun 16, 2021
Yuqing Zhu, Jinshuo Dong, Yu-Xiang Wang

Via arXiv

Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings

May 21, 2021
Ming Yin, Yu-Xiang Wang

Via arXiv

Characterizing Uniform Convergence in Offline Policy Evaluation via model-based approach: Offline Learning, Task-Agnostic and Reward-Free

May 13, 2021
Ming Yin, Yu-Xiang Wang

Via arXiv

Optimal Dynamic Regret in Exp-Concave Online Learning

Apr 23, 2021
Dheeraj Baby, Yu-Xiang Wang

Via arXiv