Picture for Zhuoran Yang

Zhuoran Yang

Upper Confidence Primal-Dual Optimization: Stochastically Constrained Markov Decision Processes with Adversarial Losses and Unknown Transitions

Add code
Mar 02, 2020
Viaarxiv icon

Provably Efficient Safe Exploration via Primal-Dual Policy Optimization

Add code
Mar 01, 2020
Viaarxiv icon

Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework

Add code
Feb 10, 2020
Figure 1 for Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework
Figure 2 for Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework
Figure 3 for Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework
Figure 4 for Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework
Viaarxiv icon

On Computation and Generalization of Generative Adversarial Imitation Learning

Add code
Jan 12, 2020
Figure 1 for On Computation and Generalization of Generative Adversarial Imitation Learning
Viaarxiv icon

Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator

Add code
Dec 14, 2019
Figure 1 for Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator
Figure 2 for Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator
Viaarxiv icon

Provably Efficient Exploration in Policy Optimization

Add code
Dec 12, 2019
Viaarxiv icon

Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances

Add code
Dec 09, 2019
Viaarxiv icon

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Add code
Nov 24, 2019
Figure 1 for Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Figure 2 for Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Figure 3 for Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Viaarxiv icon

Convergent Policy Optimization for Safe Reinforcement Learning

Add code
Oct 26, 2019
Figure 1 for Convergent Policy Optimization for Safe Reinforcement Learning
Figure 2 for Convergent Policy Optimization for Safe Reinforcement Learning
Viaarxiv icon

Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games

Add code
Oct 16, 2019
Viaarxiv icon