Picture for Lihong Li

Lihong Li

MESOB: Balancing Equilibria & Social Optimality

Add code
Jul 16, 2023
Viaarxiv icon

Offline Policy Optimization in RL with Variance Regularizaton

Add code
Dec 29, 2022
Viaarxiv icon

A Reinforcement Learning Approach to Estimating Long-term Treatment Effects

Add code
Oct 14, 2022
Figure 1 for A Reinforcement Learning Approach to Estimating Long-term Treatment Effects
Figure 2 for A Reinforcement Learning Approach to Estimating Long-term Treatment Effects
Figure 3 for A Reinforcement Learning Approach to Estimating Long-term Treatment Effects
Figure 4 for A Reinforcement Learning Approach to Estimating Long-term Treatment Effects
Viaarxiv icon

Understanding Domain Randomization for Sim-to-real Transfer

Add code
Oct 07, 2021
Viaarxiv icon

A Map of Bandits for E-commerce

Add code
Jul 01, 2021
Figure 1 for A Map of Bandits for E-commerce
Figure 2 for A Map of Bandits for E-commerce
Figure 3 for A Map of Bandits for E-commerce
Figure 4 for A Map of Bandits for E-commerce
Viaarxiv icon

On the Optimality of Batch Policy Optimization Algorithms

Add code
Apr 06, 2021
Figure 1 for On the Optimality of Batch Policy Optimization Algorithms
Figure 2 for On the Optimality of Batch Policy Optimization Algorithms
Viaarxiv icon

Near-optimal Representation Learning for Linear Bandits and Linear RL

Add code
Feb 08, 2021
Viaarxiv icon

CoinDICE: Off-Policy Confidence Interval Estimation

Add code
Oct 22, 2020
Figure 1 for CoinDICE: Off-Policy Confidence Interval Estimation
Figure 2 for CoinDICE: Off-Policy Confidence Interval Estimation
Figure 3 for CoinDICE: Off-Policy Confidence Interval Estimation
Viaarxiv icon

Neural Thompson Sampling

Add code
Oct 02, 2020
Figure 1 for Neural Thompson Sampling
Figure 2 for Neural Thompson Sampling
Figure 3 for Neural Thompson Sampling
Figure 4 for Neural Thompson Sampling
Viaarxiv icon

Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL

Add code
Sep 15, 2020
Viaarxiv icon