Alert button
Picture for Zhaoran Wang

Zhaoran Wang

Alert button

Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
May 31, 2023
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

Viaarxiv icon

What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization

Add code
Bookmark button
Alert button
May 30, 2023
Yufeng Zhang, Fengzhuo Zhang, Zhuoran Yang, Zhaoran Wang

Figure 1 for What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization
Viaarxiv icon

One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration

Add code
Bookmark button
Alert button
May 29, 2023
Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang

Figure 1 for One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
Figure 2 for One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
Figure 3 for One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
Figure 4 for One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
Viaarxiv icon

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
May 08, 2023
Yulai Zhao, Zhuoran Yang, Zhaoran Wang, Jason D. Lee

Figure 1 for Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Viaarxiv icon

Dynamic Datasets and Market Environments for Financial Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 25, 2023
Xiao-Yang Liu, Ziyi Xia, Hongyang Yang, Jiechao Gao, Daochen Zha, Ming Zhu, Christina Dan Wang, Zhaoran Wang, Jian Guo

Figure 1 for Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Figure 2 for Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Figure 3 for Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Figure 4 for Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Viaarxiv icon

Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization

Add code
Bookmark button
Alert button
Mar 28, 2023
Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Victor Wai Kin Chan, Xianyuan Zhan

Figure 1 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Figure 2 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Figure 3 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Figure 4 for Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Viaarxiv icon

A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations

Add code
Bookmark button
Alert button
Mar 20, 2023
Siyu Chen, Yitan Wang, Zhaoran Wang, Zhuoran Yang

Figure 1 for A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations
Figure 2 for A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations
Figure 3 for A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations
Figure 4 for A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations
Viaarxiv icon

Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 24, 2023
Ruitu Xu, Yifei Min, Tianhao Wang, Zhaoran Wang, Michael I. Jordan, Zhuoran Yang

Figure 1 for Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning
Figure 2 for Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning
Viaarxiv icon

Differentiable Arbitrating in Zero-sum Markov Games

Add code
Bookmark button
Alert button
Feb 20, 2023
Jing Wang, Meichen Song, Feng Gao, Boyi Liu, Zhaoran Wang, Yi Wu

Figure 1 for Differentiable Arbitrating in Zero-sum Markov Games
Figure 2 for Differentiable Arbitrating in Zero-sum Markov Games
Figure 3 for Differentiable Arbitrating in Zero-sum Markov Games
Figure 4 for Differentiable Arbitrating in Zero-sum Markov Games
Viaarxiv icon

An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models

Add code
Bookmark button
Alert button
Dec 30, 2022
Yufeng Zhang, Boyi Liu, Qi Cai, Lingxiao Wang, Zhaoran Wang

Figure 1 for An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Figure 2 for An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Figure 3 for An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Figure 4 for An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Viaarxiv icon