Alert button
Picture for Zhaoran Wang

Zhaoran Wang

Alert button

Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information

Add code
Bookmark button
Alert button
Dec 23, 2022
Zuyue Fu, Zhengling Qi, Zhuoran Yang, Zhaoran Wang, Lan Wang

Figure 1 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 2 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 3 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 4 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Viaarxiv icon

Policy learning "without'' overlap: Pessimism and generalized empirical Bernstein's inequality

Add code
Bookmark button
Alert button
Dec 19, 2022
Ying Jin, Zhimei Ren, Zhuoran Yang, Zhaoran Wang

Figure 1 for Policy learning "without'' overlap: Pessimism and generalized empirical Bernstein's inequality
Figure 2 for Policy learning "without'' overlap: Pessimism and generalized empirical Bernstein's inequality
Figure 3 for Policy learning "without'' overlap: Pessimism and generalized empirical Bernstein's inequality
Figure 4 for Policy learning "without'' overlap: Pessimism and generalized empirical Bernstein's inequality
Viaarxiv icon

Latent Variable Representation for Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 17, 2022
Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai

Figure 1 for Latent Variable Representation for Reinforcement Learning
Figure 2 for Latent Variable Representation for Reinforcement Learning
Figure 3 for Latent Variable Representation for Reinforcement Learning
Figure 4 for Latent Variable Representation for Reinforcement Learning
Viaarxiv icon

A Posterior Sampling Framework for Interactive Decision Making

Add code
Bookmark button
Alert button
Nov 03, 2022
Han Zhong, Wei Xiong, Sirui Zheng, Liwei Wang, Zhaoran Wang, Zhuoran Yang, Tong Zhang

Figure 1 for A Posterior Sampling Framework for Interactive Decision Making
Figure 2 for A Posterior Sampling Framework for Interactive Decision Making
Viaarxiv icon

A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

Add code
Bookmark button
Alert button
Oct 19, 2022
Rui Ai, Boxiang Lyu, Zhaoran Wang, Zhuoran Yang, Michael I. Jordan

Figure 1 for A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design
Viaarxiv icon

Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments

Add code
Bookmark button
Alert button
Sep 29, 2022
Yixuan Wang, Simon Sinong Zhan, Ruochen Jiao, Zhilu Wang, Wanxin Jin, Zhuoran Yang, Zhaoran Wang, Chao Huang, Qi Zhu

Figure 1 for Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Figure 2 for Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Figure 3 for Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Figure 4 for Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Viaarxiv icon

Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL

Add code
Bookmark button
Alert button
Sep 26, 2022
Fengzhuo Zhang, Boyi Liu, Kaixin Wang, Vincent Y. F. Tan, Zhuoran Yang, Zhaoran Wang

Figure 1 for Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
Figure 2 for Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
Figure 3 for Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
Viaarxiv icon

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

Add code
Bookmark button
Alert button
Sep 18, 2022
Zuyue Fu, Zhengling Qi, Zhaoran Wang, Zhuoran Yang, Yanxun Xu, Michael R. Kosorok

Figure 1 for Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes
Figure 2 for Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes
Figure 3 for Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes
Viaarxiv icon

Differentiable Bilevel Programming for Stackelberg Congestion Games

Add code
Bookmark button
Alert button
Sep 15, 2022
Jiayang Li, Jing Yu, Qianni Wang, Boyi Liu, Zhaoran Wang, Yu Marco Nie

Figure 1 for Differentiable Bilevel Programming for Stackelberg Congestion Games
Figure 2 for Differentiable Bilevel Programming for Stackelberg Congestion Games
Figure 3 for Differentiable Bilevel Programming for Stackelberg Congestion Games
Figure 4 for Differentiable Bilevel Programming for Stackelberg Congestion Games
Viaarxiv icon