Lin F. Yang

Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation

Jun 01, 2022

Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

May 26, 2022

Solving Multi-Arm Bandit Using a Few Bits of Communication

Nov 11, 2021

Settling the Horizon-Dependence of Sample Complexity in Reinforcement Learning

Nov 01, 2021

Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs

Oct 26, 2021

Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration

Oct 12, 2021

Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation

Oct 09, 2021

Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver

Oct 08, 2021

Gap-Dependent Unsupervised Exploration for Reinforcement Learning

Aug 11, 2021

Randomized Exploration for Reinforcement Learning with General Value Function Approximation

Jun 15, 2021