Picture for Zhuoran Yang

Zhuoran Yang

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

Add code
May 08, 2023
Viaarxiv icon

Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization

Add code
Mar 28, 2023
Figure 1 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Figure 2 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Figure 3 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Figure 4 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Viaarxiv icon

A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations

Add code
Mar 20, 2023
Viaarxiv icon

Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model

Add code
Mar 15, 2023
Viaarxiv icon

Can We Find Nash Equilibria at a Linear Rate in Markov Games?

Add code
Mar 03, 2023
Viaarxiv icon

Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning

Add code
Feb 24, 2023
Figure 1 for Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning
Figure 2 for Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning
Viaarxiv icon

Offline Policy Optimization in RL with Variance Regularizaton

Add code
Dec 29, 2022
Figure 1 for Offline Policy Optimization in RL with Variance Regularizaton
Figure 2 for Offline Policy Optimization in RL with Variance Regularizaton
Figure 3 for Offline Policy Optimization in RL with Variance Regularizaton
Figure 4 for Offline Policy Optimization in RL with Variance Regularizaton
Viaarxiv icon

Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information

Add code
Dec 23, 2022
Figure 1 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 2 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 3 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 4 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Viaarxiv icon

Policy learning "without'' overlap: Pessimism and generalized empirical Bernstein's inequality

Add code
Dec 19, 2022
Viaarxiv icon

The Sample Complexity of Online Contract Design

Add code
Nov 10, 2022
Viaarxiv icon