Picture for Alekh Agarwal

Alekh Agarwal

Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach

Add code
Feb 02, 2022
Figure 1 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Figure 2 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Figure 3 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Figure 4 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Viaarxiv icon

Provable RL with Exogenous Distractors via Multistep Inverse Dynamics

Add code
Oct 17, 2021
Figure 1 for Provable RL with Exogenous Distractors via Multistep Inverse Dynamics
Figure 2 for Provable RL with Exogenous Distractors via Multistep Inverse Dynamics
Figure 3 for Provable RL with Exogenous Distractors via Multistep Inverse Dynamics
Figure 4 for Provable RL with Exogenous Distractors via Multistep Inverse Dynamics
Viaarxiv icon

Bellman-consistent Pessimism for Offline Reinforcement Learning

Add code
Jul 01, 2021
Figure 1 for Bellman-consistent Pessimism for Offline Reinforcement Learning
Viaarxiv icon

Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation

Add code
Mar 24, 2021
Figure 1 for Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation
Viaarxiv icon

Provably Correct Optimization and Exploration with Non-linear Policies

Add code
Mar 22, 2021
Figure 1 for Provably Correct Optimization and Exploration with Non-linear Policies
Figure 2 for Provably Correct Optimization and Exploration with Non-linear Policies
Figure 3 for Provably Correct Optimization and Exploration with Non-linear Policies
Figure 4 for Provably Correct Optimization and Exploration with Non-linear Policies
Viaarxiv icon

Towards a Dimension-Free Understanding of Adaptive Linear Control

Add code
Mar 19, 2021
Viaarxiv icon

Model-free Representation Learning and Exploration in Low-rank MDPs

Add code
Feb 14, 2021
Figure 1 for Model-free Representation Learning and Exploration in Low-rank MDPs
Viaarxiv icon

PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning

Add code
Aug 13, 2020
Figure 1 for PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Figure 2 for PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Figure 3 for PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Figure 4 for PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Viaarxiv icon

Provably Good Batch Reinforcement Learning Without Great Exploration

Add code
Jul 22, 2020
Figure 1 for Provably Good Batch Reinforcement Learning Without Great Exploration
Figure 2 for Provably Good Batch Reinforcement Learning Without Great Exploration
Figure 3 for Provably Good Batch Reinforcement Learning Without Great Exploration
Figure 4 for Provably Good Batch Reinforcement Learning Without Great Exploration
Viaarxiv icon

Policy Improvement from Multiple Experts

Add code
Jul 01, 2020
Figure 1 for Policy Improvement from Multiple Experts
Figure 2 for Policy Improvement from Multiple Experts
Figure 3 for Policy Improvement from Multiple Experts
Figure 4 for Policy Improvement from Multiple Experts
Viaarxiv icon