Picture for Lin F. Yang

Lin F. Yang

Online Sub-Sampling for Reinforcement Learning with General Function Approximation

Add code
Jun 14, 2021
Viaarxiv icon

Safe Reinforcement Learning with Linear Function Approximation

Add code
Jun 11, 2021
Figure 1 for Safe Reinforcement Learning with Linear Function Approximation
Figure 2 for Safe Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Global Neighbor Sampling for Mixed CPU-GPU Training on Giant Graphs

Add code
Jun 11, 2021
Figure 1 for Global Neighbor Sampling for Mixed CPU-GPU Training on Giant Graphs
Figure 2 for Global Neighbor Sampling for Mixed CPU-GPU Training on Giant Graphs
Figure 3 for Global Neighbor Sampling for Mixed CPU-GPU Training on Giant Graphs
Figure 4 for Global Neighbor Sampling for Mixed CPU-GPU Training on Giant Graphs
Viaarxiv icon

Provably Correct Optimization and Exploration with Non-linear Policies

Add code
Mar 22, 2021
Figure 1 for Provably Correct Optimization and Exploration with Non-linear Policies
Figure 2 for Provably Correct Optimization and Exploration with Non-linear Policies
Figure 3 for Provably Correct Optimization and Exploration with Non-linear Policies
Figure 4 for Provably Correct Optimization and Exploration with Non-linear Policies
Viaarxiv icon

Provably Breaking the Quadratic Error Compounding Barrier in Imitation Learning, Optimally

Add code
Feb 25, 2021
Figure 1 for Provably Breaking the Quadratic Error Compounding Barrier in Imitation Learning, Optimally
Figure 2 for Provably Breaking the Quadratic Error Compounding Barrier in Imitation Learning, Optimally
Viaarxiv icon

A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost

Add code
Jan 02, 2021
Figure 1 for A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost
Figure 2 for A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost
Viaarxiv icon

Minimax Sample Complexity for Turn-based Stochastic Game

Add code
Nov 29, 2020
Viaarxiv icon

Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning

Add code
Nov 25, 2020
Figure 1 for Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning
Viaarxiv icon

Episodic Linear Quadratic Regulators with Low-rank Transitions

Add code
Nov 03, 2020
Figure 1 for Episodic Linear Quadratic Regulators with Low-rank Transitions
Figure 2 for Episodic Linear Quadratic Regulators with Low-rank Transitions
Figure 3 for Episodic Linear Quadratic Regulators with Low-rank Transitions
Viaarxiv icon

Random Walk Bandits

Add code
Nov 03, 2020
Figure 1 for Random Walk Bandits
Figure 2 for Random Walk Bandits
Figure 3 for Random Walk Bandits
Viaarxiv icon