Picture for Alekh Agarwal

Alekh Agarwal

Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations

Add code
May 12, 2019
Figure 1 for Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations
Figure 2 for Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations
Figure 3 for Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations
Figure 4 for Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations
Viaarxiv icon

Off-Policy Policy Gradient with State Distribution Correction

Add code
Apr 17, 2019
Figure 1 for Off-Policy Policy Gradient with State Distribution Correction
Figure 2 for Off-Policy Policy Gradient with State Distribution Correction
Figure 3 for Off-Policy Policy Gradient with State Distribution Correction
Figure 4 for Off-Policy Policy Gradient with State Distribution Correction
Viaarxiv icon

Provably efficient RL with Rich Observations via Latent State Decoding

Add code
Jan 25, 2019
Figure 1 for Provably efficient RL with Rich Observations via Latent State Decoding
Viaarxiv icon

Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback

Add code
Jan 02, 2019
Figure 1 for Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Figure 2 for Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Figure 3 for Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Viaarxiv icon

Model-Based Reinforcement Learning in Contextual Decision Processes

Add code
Nov 21, 2018
Figure 1 for Model-Based Reinforcement Learning in Contextual Decision Processes
Viaarxiv icon

On Oracle-Efficient PAC RL with Rich Observations

Add code
Oct 31, 2018
Figure 1 for On Oracle-Efficient PAC RL with Rich Observations
Viaarxiv icon

A Reductions Approach to Fair Classification

Add code
Jul 16, 2018
Figure 1 for A Reductions Approach to Fair Classification
Figure 2 for A Reductions Approach to Fair Classification
Viaarxiv icon

Hierarchical Imitation and Reinforcement Learning

Add code
Jun 09, 2018
Figure 1 for Hierarchical Imitation and Reinforcement Learning
Figure 2 for Hierarchical Imitation and Reinforcement Learning
Figure 3 for Hierarchical Imitation and Reinforcement Learning
Viaarxiv icon

Efficient Contextual Bandits in Non-stationary Worlds

Add code
Jun 07, 2018
Figure 1 for Efficient Contextual Bandits in Non-stationary Worlds
Viaarxiv icon

A Contextual Bandit Bake-off

Add code
May 30, 2018
Figure 1 for A Contextual Bandit Bake-off
Figure 2 for A Contextual Bandit Bake-off
Figure 3 for A Contextual Bandit Bake-off
Figure 4 for A Contextual Bandit Bake-off
Viaarxiv icon