Picture for Alekh Agarwal

Alekh Agarwal

Provable RL with Exogenous Distractors via Multistep Inverse Dynamics

Add code
Oct 17, 2021
Figure 1 for Provable RL with Exogenous Distractors via Multistep Inverse Dynamics
Figure 2 for Provable RL with Exogenous Distractors via Multistep Inverse Dynamics
Figure 3 for Provable RL with Exogenous Distractors via Multistep Inverse Dynamics
Figure 4 for Provable RL with Exogenous Distractors via Multistep Inverse Dynamics
Viaarxiv icon

Bellman-consistent Pessimism for Offline Reinforcement Learning

Add code
Jul 01, 2021
Figure 1 for Bellman-consistent Pessimism for Offline Reinforcement Learning
Viaarxiv icon

Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation

Add code
Mar 24, 2021
Figure 1 for Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation
Viaarxiv icon

Provably Correct Optimization and Exploration with Non-linear Policies

Add code
Mar 22, 2021
Figure 1 for Provably Correct Optimization and Exploration with Non-linear Policies
Figure 2 for Provably Correct Optimization and Exploration with Non-linear Policies
Figure 3 for Provably Correct Optimization and Exploration with Non-linear Policies
Figure 4 for Provably Correct Optimization and Exploration with Non-linear Policies
Viaarxiv icon

Towards a Dimension-Free Understanding of Adaptive Linear Control

Add code
Mar 19, 2021
Viaarxiv icon

Model-free Representation Learning and Exploration in Low-rank MDPs

Add code
Feb 14, 2021
Figure 1 for Model-free Representation Learning and Exploration in Low-rank MDPs
Viaarxiv icon

PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning

Add code
Aug 13, 2020
Figure 1 for PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Figure 2 for PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Figure 3 for PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Figure 4 for PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Viaarxiv icon

Provably Good Batch Reinforcement Learning Without Great Exploration

Add code
Jul 22, 2020
Figure 1 for Provably Good Batch Reinforcement Learning Without Great Exploration
Figure 2 for Provably Good Batch Reinforcement Learning Without Great Exploration
Figure 3 for Provably Good Batch Reinforcement Learning Without Great Exploration
Figure 4 for Provably Good Batch Reinforcement Learning Without Great Exploration
Viaarxiv icon

Policy Improvement from Multiple Experts

Add code
Jul 01, 2020
Figure 1 for Policy Improvement from Multiple Experts
Figure 2 for Policy Improvement from Multiple Experts
Figure 3 for Policy Improvement from Multiple Experts
Figure 4 for Policy Improvement from Multiple Experts
Viaarxiv icon

Safe Reinforcement Learning via Curriculum Induction

Add code
Jun 22, 2020
Figure 1 for Safe Reinforcement Learning via Curriculum Induction
Figure 2 for Safe Reinforcement Learning via Curriculum Induction
Figure 3 for Safe Reinforcement Learning via Curriculum Induction
Figure 4 for Safe Reinforcement Learning via Curriculum Induction
Viaarxiv icon