Zhaoran Wang

Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information

Dec 23, 2022

Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality

Dec 19, 2022

Latent Variable Representation for Reinforcement Learning

Dec 17, 2022

A Posterior Sampling Framework for Interactive Decision Making

Nov 03, 2022

A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

Oct 19, 2022

Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments

Sep 29, 2022

Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL

Sep 26, 2022

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

Sep 18, 2022

Differentiable Bilevel Programming for Stackelberg Congestion Games

Sep 15, 2022

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

Jul 29, 2022