Picture for Wotao Yin

Wotao Yin

Provably Efficient Exploration for RL with Unsupervised Learning

Add code
Mar 15, 2020
Figure 1 for Provably Efficient Exploration for RL with Unsupervised Learning
Figure 2 for Provably Efficient Exploration for RL with Unsupervised Learning
Figure 3 for Provably Efficient Exploration for RL with Unsupervised Learning
Viaarxiv icon

Safeguarded Learned Convex Optimization

Add code
Mar 04, 2020
Figure 1 for Safeguarded Learned Convex Optimization
Figure 2 for Safeguarded Learned Convex Optimization
Figure 3 for Safeguarded Learned Convex Optimization
Figure 4 for Safeguarded Learned Convex Optimization
Viaarxiv icon

LASG: Lazily Aggregated Stochastic Gradients for Communication-Efficient Distributed Learning

Add code
Feb 26, 2020
Figure 1 for LASG: Lazily Aggregated Stochastic Gradients for Communication-Efficient Distributed Learning
Figure 2 for LASG: Lazily Aggregated Stochastic Gradients for Communication-Efficient Distributed Learning
Figure 3 for LASG: Lazily Aggregated Stochastic Gradients for Communication-Efficient Distributed Learning
Figure 4 for LASG: Lazily Aggregated Stochastic Gradients for Communication-Efficient Distributed Learning
Viaarxiv icon

Does Knowledge Transfer Always Help to Learn a Better Policy?

Add code
Dec 06, 2019
Figure 1 for Does Knowledge Transfer Always Help to Learn a Better Policy?
Figure 2 for Does Knowledge Transfer Always Help to Learn a Better Policy?
Figure 3 for Does Knowledge Transfer Always Help to Learn a Better Policy?
Viaarxiv icon

XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

Add code
Nov 20, 2019
Figure 1 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Figure 2 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Figure 3 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Figure 4 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Viaarxiv icon

ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems and GANs

Add code
Jun 06, 2019
Figure 1 for ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems and GANs
Figure 2 for ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems and GANs
Figure 3 for ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems and GANs
Figure 4 for ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems and GANs
Viaarxiv icon

Plug-and-Play Methods Provably Converge with Properly Trained Denoisers

Add code
May 14, 2019
Figure 1 for Plug-and-Play Methods Provably Converge with Properly Trained Denoisers
Figure 2 for Plug-and-Play Methods Provably Converge with Properly Trained Denoisers
Figure 3 for Plug-and-Play Methods Provably Converge with Properly Trained Denoisers
Figure 4 for Plug-and-Play Methods Provably Converge with Properly Trained Denoisers
Viaarxiv icon

ExtraPush for convex smooth decentralized optimization over directed networks

Add code
Jan 30, 2019
Figure 1 for ExtraPush for convex smooth decentralized optimization over directed networks
Viaarxiv icon

AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Reinforcement Learning with Near-Optimal Sample Complexity

Add code
Dec 03, 2018
Figure 1 for AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Reinforcement Learning with Near-Optimal Sample Complexity
Figure 2 for AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Reinforcement Learning with Near-Optimal Sample Complexity
Figure 3 for AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Reinforcement Learning with Near-Optimal Sample Complexity
Figure 4 for AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Reinforcement Learning with Near-Optimal Sample Complexity
Viaarxiv icon

Markov Chain Block Coordinate Descent

Add code
Nov 22, 2018
Figure 1 for Markov Chain Block Coordinate Descent
Viaarxiv icon