Picture for Zhaoran Wang

Zhaoran Wang

Dynamic Bottleneck for Robust Self-Supervised Exploration

Add code
Oct 25, 2021
Figure 1 for Dynamic Bottleneck for Robust Self-Supervised Exploration
Figure 2 for Dynamic Bottleneck for Robust Self-Supervised Exploration
Figure 3 for Dynamic Bottleneck for Robust Self-Supervised Exploration
Figure 4 for Dynamic Bottleneck for Robust Self-Supervised Exploration
Viaarxiv icon

SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning

Add code
Oct 24, 2021
Figure 1 for SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning
Figure 2 for SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning
Figure 3 for SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning
Figure 4 for SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning
Viaarxiv icon

On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game

Add code
Oct 19, 2021
Viaarxiv icon

Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima

Add code
Oct 12, 2021
Figure 1 for Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima
Figure 2 for Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima
Figure 3 for Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima
Viaarxiv icon

Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation

Add code
Aug 19, 2021
Viaarxiv icon

Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning

Add code
Aug 08, 2021
Figure 1 for Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Figure 2 for Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Figure 3 for Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Figure 4 for Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Viaarxiv icon

Towards General Function Approximation in Zero-Sum Markov Games

Add code
Jul 30, 2021
Viaarxiv icon

A Unified Off-Policy Evaluation Approach for General Value Function

Add code
Jul 06, 2021
Figure 1 for A Unified Off-Policy Evaluation Approach for General Value Function
Figure 2 for A Unified Off-Policy Evaluation Approach for General Value Function
Viaarxiv icon

Gap-Dependent Bounds for Two-Player Markov Games

Add code
Jul 01, 2021
Viaarxiv icon

Randomized Exploration for Reinforcement Learning with General Value Function Approximation

Add code
Jun 15, 2021
Figure 1 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Figure 2 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Figure 3 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Figure 4 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Viaarxiv icon