Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning

Aug 13, 2020
Alekh Agarwal, Mikael Henaff, Sham Kakade, Wen Sun


  Access Paper or Ask Questions

Provably Good Batch Reinforcement Learning Without Great Exploration

Jul 22, 2020
Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

* 36 pages, 7 figures 

  Access Paper or Ask Questions

Policy Improvement from Multiple Experts

Jul 01, 2020
Ching-An Cheng, Andrey Kolobov, Alekh Agarwal


  Access Paper or Ask Questions

Safe Reinforcement Learning via Curriculum Induction

Jun 22, 2020
Matteo Turchetta, Andrey Kolobov, Shital Shah, Andreas Krause, Alekh Agarwal


  Access Paper or Ask Questions

Optimizing Interactive Systems via Data-Driven Objectives

Jun 19, 2020
Ziming Li, Julia Kiseleva, Alekh Agarwal, Maarten de Rijke, Ryen W. White

* 30 pages, 12 figures. arXiv admin note: text overlap with arXiv:1802.06306 

  Access Paper or Ask Questions

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

Jun 18, 2020
Alekh Agarwal, Sham Kakade, Akshay Krishnamurthy, Wen Sun


  Access Paper or Ask Questions

Reparameterized Variational Divergence Minimization for Stable Imitation

Jun 18, 2020
Dilip Arumugam, Debadeepta Dey, Alekh Agarwal, Asli Celikyilmaz, Elnaz Nouri, Bill Dolan


  Access Paper or Ask Questions

Federated Residual Learning

Mar 28, 2020
Alekh Agarwal, John Langford, Chen-Yu Wei


  Access Paper or Ask Questions

Taking a hint: How to leverage loss predictors in contextual bandits?

Mar 04, 2020
Chen-Yu Wei, Haipeng Luo, Alekh Agarwal


  Access Paper or Ask Questions

Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes

Aug 29, 2019
Alekh Agarwal, Sham M. Kakade, Jason D. Lee, Gaurav Mahajan

* Additional references and discussion of prior work 

  Access Paper or Ask Questions

On the Optimality of Sparse Model-Based Planning for Markov Decision Processes

Jul 04, 2019
Alekh Agarwal, Sham Kakade, Lin F. Yang


  Access Paper or Ask Questions

Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting

Jun 23, 2019
Aditya Grover, Jiaming Song, Alekh Agarwal, Kenneth Tran, Ashish Kapoor, Eric Horvitz, Stefano Ermon


  Access Paper or Ask Questions

Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds

Jun 09, 2019
Jordan T. Ash, Chicheng Zhang, Akshay Krishnamurthy, John Langford, Alekh Agarwal


  Access Paper or Ask Questions

Fair Regression: Quantitative Definitions and Reduction-based Algorithms

May 30, 2019
Alekh Agarwal, Miroslav Dudík, Zhiwei Steven Wu


  Access Paper or Ask Questions

Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations

May 12, 2019
Aditya Modi, Debadeepta Dey, Alekh Agarwal, Adith Swaminathan, Besmira Nushi, Sean Andrist, Eric Horvitz

* 12 pages, 7 figures, 2 tables 

  Access Paper or Ask Questions

Off-Policy Policy Gradient with State Distribution Correction

Apr 17, 2019
Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill


  Access Paper or Ask Questions

Provably efficient RL with Rich Observations via Latent State Decoding

Jan 25, 2019
Simon S. Du, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal, Miroslav Dudík, John Langford


  Access Paper or Ask Questions

Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback

Jan 02, 2019
Chicheng Zhang, Alekh Agarwal, Hal Daumé III, John Langford, Sahand N Negahban

* 43 pages, 21 figures 

  Access Paper or Ask Questions

Model-Based Reinforcement Learning in Contextual Decision Processes

Nov 21, 2018
Wen Sun, Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford

* 30 

  Access Paper or Ask Questions

On Oracle-Efficient PAC RL with Rich Observations

Oct 31, 2018
Christoph Dann, Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford, Robert E. Schapire

* appearing at NIPS 18; full paper including appendix 

  Access Paper or Ask Questions

A Reductions Approach to Fair Classification

Jul 16, 2018
Alekh Agarwal, Alina Beygelzimer, Miroslav Dudík, John Langford, Hanna Wallach


  Access Paper or Ask Questions

Hierarchical Imitation and Reinforcement Learning

Jun 09, 2018
Hoang M. Le, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, Hal Daumé III

* Proceedings of the 35th International Conference on Machine Learning (ICML 2018) 

  Access Paper or Ask Questions

Efficient Contextual Bandits in Non-stationary Worlds

Jun 07, 2018
Haipeng Luo, Chen-Yu Wei, Alekh Agarwal, John Langford


  Access Paper or Ask Questions

A Contextual Bandit Bake-off

May 30, 2018
Alberto Bietti, Alekh Agarwal, John Langford


  Access Paper or Ask Questions

Practical Contextual Bandits with Regression Oracles

Mar 03, 2018
Dylan J. Foster, Alekh Agarwal, Miroslav Dudík, Haipeng Luo, Robert E. Schapire


  Access Paper or Ask Questions

Active Learning for Cost-Sensitive Classification

Nov 13, 2017
Akshay Krishnamurthy, Alekh Agarwal, Tzu-Kuo Huang, Hal Daume III, John Langford


  Access Paper or Ask Questions