Alert button
Picture for Zhaoran Wang

Zhaoran Wang

Alert button

A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic

Add code
Bookmark button
Alert button
Jul 10, 2020
Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang

Figure 1 for A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
Viaarxiv icon

Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion

Add code
Bookmark button
Alert button
Jul 04, 2020
Yi Chen, Jinglin Chen, Jing Dong, Jian Peng, Zhaoran Wang

Figure 1 for Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion
Figure 2 for Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion
Figure 3 for Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion
Viaarxiv icon

Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach

Add code
Bookmark button
Alert button
Jul 02, 2020
Luofeng Liao, You-Lin Chen, Zhuoran Yang, Bo Dai, Zhaoran Wang, Mladen Kolar

Figure 1 for Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach
Figure 2 for Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach
Viaarxiv icon

Dynamic Regret of Policy Optimization in Non-stationary Environments

Add code
Bookmark button
Alert button
Jun 30, 2020
Yingjie Fei, Zhuoran Yang, Zhaoran Wang, Qiaomin Xie

Viaarxiv icon

On the Global Optimality of Model-Agnostic Meta-Learning

Add code
Bookmark button
Alert button
Jun 23, 2020
Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

Viaarxiv icon

Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret

Add code
Bookmark button
Alert button
Jun 22, 2020
Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, Qiaomin Xie

Figure 1 for Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
Figure 2 for Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
Viaarxiv icon

Provably Efficient Causal Reinforcement Learning with Confounded Observational Data

Add code
Bookmark button
Alert button
Jun 22, 2020
Lingxiao Wang, Zhuoran Yang, Zhaoran Wang

Figure 1 for Provably Efficient Causal Reinforcement Learning with Confounded Observational Data
Figure 2 for Provably Efficient Causal Reinforcement Learning with Confounded Observational Data
Figure 3 for Provably Efficient Causal Reinforcement Learning with Confounded Observational Data
Figure 4 for Provably Efficient Causal Reinforcement Learning with Confounded Observational Data
Viaarxiv icon

Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 21, 2020
Lingxiao Wang, Zhuoran Yang, Zhaoran Wang

Viaarxiv icon

Neural Certificates for Safe Control Policies

Add code
Bookmark button
Alert button
Jun 15, 2020
Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

Figure 1 for Neural Certificates for Safe Control Policies
Figure 2 for Neural Certificates for Safe Control Policies
Figure 3 for Neural Certificates for Safe Control Policies
Figure 4 for Neural Certificates for Safe Control Policies
Viaarxiv icon

Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory

Add code
Bookmark button
Alert button
Jun 08, 2020
Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang

Figure 1 for Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory
Figure 2 for Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory
Viaarxiv icon