Picture for Zhuoran Yang

Zhuoran Yang

A Posterior Sampling Framework for Interactive Decision Making

Add code
Nov 03, 2022
Viaarxiv icon

A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

Add code
Oct 19, 2022
Figure 1 for A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design
Viaarxiv icon

Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments

Add code
Sep 29, 2022
Figure 1 for Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Figure 2 for Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Figure 3 for Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Figure 4 for Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Viaarxiv icon

Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL

Add code
Sep 26, 2022
Figure 1 for Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
Figure 2 for Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
Figure 3 for Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
Viaarxiv icon

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

Add code
Sep 18, 2022
Figure 1 for Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes
Figure 2 for Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes
Figure 3 for Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes
Viaarxiv icon

Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments

Add code
Aug 23, 2022
Figure 1 for Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments
Viaarxiv icon

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

Add code
Jul 29, 2022
Figure 1 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Figure 2 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Figure 3 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Figure 4 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Viaarxiv icon

Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions

Add code
Jul 25, 2022
Figure 1 for Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
Figure 2 for Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
Viaarxiv icon

Decentralized Optimistic Hyperpolicy Mirror Descent: Provably No-Regret Learning in Markov Games

Add code
Jun 03, 2022
Viaarxiv icon

Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes

Add code
May 26, 2022
Figure 1 for Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Figure 2 for Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Figure 3 for Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Figure 4 for Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Viaarxiv icon