Alekh Agarwal

Leveraging User-Triggered Supervision in Contextual Bandits

Feb 07, 2023

Learning in POMDPs is Sample-Efficient with Hindsight Observability

Feb 03, 2023

VO$Q$L: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation

Dec 12, 2022

On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

Jun 21, 2022

Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity

Jun 15, 2022

Provable Benefits of Representational Transfer in Reinforcement Learning

May 29, 2022

Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling

Mar 15, 2022

Minimax Regret Optimization for Robust Machine Learning under Distribution Shift

Feb 11, 2022

Adversarially Trained Actor Critic for Offline Reinforcement Learning

Feb 05, 2022

Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach

Feb 02, 2022