Alert button
Picture for Alekh Agarwal

Alekh Agarwal

Alert button

On the Optimality of Sparse Model-Based Planning for Markov Decision Processes

Add code
Bookmark button
Alert button
Jul 04, 2019
Alekh Agarwal, Sham Kakade, Lin F. Yang

Figure 1 for On the Optimality of Sparse Model-Based Planning for Markov Decision Processes
Viaarxiv icon

Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting

Add code
Bookmark button
Alert button
Jun 23, 2019
Aditya Grover, Jiaming Song, Alekh Agarwal, Kenneth Tran, Ashish Kapoor, Eric Horvitz, Stefano Ermon

Figure 1 for Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting
Figure 2 for Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting
Figure 3 for Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting
Figure 4 for Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting
Viaarxiv icon

Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds

Add code
Bookmark button
Alert button
Jun 09, 2019
Jordan T. Ash, Chicheng Zhang, Akshay Krishnamurthy, John Langford, Alekh Agarwal

Figure 1 for Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds
Figure 2 for Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds
Figure 3 for Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds
Figure 4 for Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds
Viaarxiv icon

Fair Regression: Quantitative Definitions and Reduction-based Algorithms

Add code
Bookmark button
Alert button
May 30, 2019
Alekh Agarwal, Miroslav Dudík, Zhiwei Steven Wu

Figure 1 for Fair Regression: Quantitative Definitions and Reduction-based Algorithms
Figure 2 for Fair Regression: Quantitative Definitions and Reduction-based Algorithms
Figure 3 for Fair Regression: Quantitative Definitions and Reduction-based Algorithms
Figure 4 for Fair Regression: Quantitative Definitions and Reduction-based Algorithms
Viaarxiv icon

Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations

Add code
Bookmark button
Alert button
May 12, 2019
Aditya Modi, Debadeepta Dey, Alekh Agarwal, Adith Swaminathan, Besmira Nushi, Sean Andrist, Eric Horvitz

Figure 1 for Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations
Figure 2 for Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations
Figure 3 for Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations
Figure 4 for Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations
Viaarxiv icon

Off-Policy Policy Gradient with State Distribution Correction

Add code
Bookmark button
Alert button
Apr 17, 2019
Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

Figure 1 for Off-Policy Policy Gradient with State Distribution Correction
Figure 2 for Off-Policy Policy Gradient with State Distribution Correction
Figure 3 for Off-Policy Policy Gradient with State Distribution Correction
Figure 4 for Off-Policy Policy Gradient with State Distribution Correction
Viaarxiv icon

Provably efficient RL with Rich Observations via Latent State Decoding

Add code
Bookmark button
Alert button
Jan 25, 2019
Simon S. Du, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal, Miroslav Dudík, John Langford

Figure 1 for Provably efficient RL with Rich Observations via Latent State Decoding
Viaarxiv icon

Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback

Add code
Bookmark button
Alert button
Jan 02, 2019
Chicheng Zhang, Alekh Agarwal, Hal Daumé III, John Langford, Sahand N Negahban

Figure 1 for Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Figure 2 for Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Figure 3 for Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Viaarxiv icon

Model-Based Reinforcement Learning in Contextual Decision Processes

Add code
Bookmark button
Alert button
Nov 21, 2018
Wen Sun, Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford

Figure 1 for Model-Based Reinforcement Learning in Contextual Decision Processes
Viaarxiv icon