Picture for Zhuoran Yang

Zhuoran Yang

Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy

Add code
Dec 28, 2020
Viaarxiv icon

Variational Transport: A Convergent Particle-BasedAlgorithm for Distributional Optimization

Add code
Dec 21, 2020
Viaarxiv icon

Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations

Add code
Nov 09, 2020
Figure 1 for Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations
Viaarxiv icon

Provable Fictitious Play for General Mean-Field Games

Add code
Oct 08, 2020
Viaarxiv icon

Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning

Add code
Aug 23, 2020
Figure 1 for Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning
Viaarxiv icon

Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time

Add code
Aug 16, 2020
Figure 1 for Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time
Figure 2 for Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time
Viaarxiv icon

Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy

Add code
Aug 02, 2020
Figure 1 for Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Viaarxiv icon

Understanding Implicit Regularization in Over-Parameterized Nonlinear Statistical Model

Add code
Jul 16, 2020
Figure 1 for Understanding Implicit Regularization in Over-Parameterized Nonlinear Statistical Model
Figure 2 for Understanding Implicit Regularization in Over-Parameterized Nonlinear Statistical Model
Figure 3 for Understanding Implicit Regularization in Over-Parameterized Nonlinear Statistical Model
Figure 4 for Understanding Implicit Regularization in Over-Parameterized Nonlinear Statistical Model
Viaarxiv icon

A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic

Add code
Jul 10, 2020
Figure 1 for A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
Viaarxiv icon

Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach

Add code
Jul 02, 2020
Figure 1 for Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach
Figure 2 for Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach
Viaarxiv icon