David Simchi-Levi

Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff

May 28, 2024
Via arXiv

On the Optimal Regret of Locally Private Linear Contextual Bandit

Apr 15, 2024
Via arXiv

Online Local False Discovery Rate Control: A Resource Allocation Approach

Feb 18, 2024
Via arXiv

Privacy Preserving Adaptive Experiment Design

Feb 05, 2024
Via arXiv

Utility Fairness in Contextual Dynamic Pricing with Demand Learning

Nov 28, 2023
Via arXiv

Regret Distribution in Stochastic Bandits: Optimal Trade-off between Expectation and Tail Risk

Apr 10, 2023
Via arXiv

A Simple and Optimal Policy Design for Online Learning with Safety against Heavy-tailed Risk

Jun 07, 2022
Via arXiv

PAC-Bayesian Based Adaptation for Regularized Learning

Apr 16, 2022
Via arXiv

Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation

Nov 21, 2021
Via arXiv

Dynamic Pricing and Demand Learning on a Large Network of Products: A PAC-Bayesian Approach

Nov 10, 2021
Via arXiv