Picture for Zhuoran Yang

Zhuoran Yang

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

Add code
May 29, 2023
Viaarxiv icon

One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration

Add code
May 29, 2023
Viaarxiv icon

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

Add code
May 08, 2023
Figure 1 for Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Viaarxiv icon

Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization

Add code
Mar 28, 2023
Figure 1 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Figure 2 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Figure 3 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Figure 4 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Viaarxiv icon

A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations

Add code
Mar 20, 2023
Viaarxiv icon

Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model

Add code
Mar 15, 2023
Figure 1 for Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model
Viaarxiv icon

Can We Find Nash Equilibria at a Linear Rate in Markov Games?

Add code
Mar 03, 2023
Viaarxiv icon

Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning

Add code
Feb 24, 2023
Figure 1 for Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning
Figure 2 for Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning
Viaarxiv icon

Offline Policy Optimization in RL with Variance Regularizaton

Add code
Dec 29, 2022
Figure 1 for Offline Policy Optimization in RL with Variance Regularizaton
Figure 2 for Offline Policy Optimization in RL with Variance Regularizaton
Figure 3 for Offline Policy Optimization in RL with Variance Regularizaton
Figure 4 for Offline Policy Optimization in RL with Variance Regularizaton
Viaarxiv icon

Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information

Add code
Dec 23, 2022
Figure 1 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 2 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 3 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 4 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Viaarxiv icon