Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Personalization Improves Privacy-Accuracy Tradeoffs in Federated Optimization


Feb 10, 2022
Alberto Bietti, Chen-Yu Wei, Miroslav Dudik, John Langford, Zhiwei Steven Wu

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Constrained episodic reinforcement learning in concave-convex and knapsack settings


Jun 09, 2020
Kianté Brantley, Miroslav Dudik, Thodoris Lykouris, Sobhan Miryoosefi, Max Simchowitz, Aleksandrs Slivkins, Wen Sun

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Reinforcement Learning with Convex Constraints


Jun 21, 2019
Sobhan Miryoosefi, Kianté Brantley, Hal Daumé III, Miroslav Dudik, Robert Schapire

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Optimal and Adaptive Off-policy Evaluation in Contextual Bandits


Nov 11, 2017
Yu-Xiang Wang, Alekh Agarwal, Miroslav Dudik

Add code

* International Conference on Machine Learning (pp. 3589-3597) (2017) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Contextual Semibandits via Supervised Learning Oracles


Nov 04, 2016
Akshay Krishnamurthy, Alekh Agarwal, Miroslav Dudik

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Para-active learning


Oct 30, 2013
Alekh Agarwal, Leon Bottou, Miroslav Dudik, John Langford

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

A Reliable Effective Terascale Linear Learning System


Jul 12, 2013
Alekh Agarwal, Olivier Chapelle, Miroslav Dudik, John Langford

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Sample-efficient Nonstationary Policy Evaluation for Contextual Bandits


Oct 16, 2012
Miroslav Dudik, Dumitru Erhan, John Langford, Lihong Li

Add code

* Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

First-Order Mixed Integer Linear Programming


May 09, 2012
Geoffrey Gordon, Sue Ann Hong, Miroslav Dudik

Add code

* Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Efficient Optimal Learning for Contextual Bandits


Jun 13, 2011
Miroslav Dudik, Daniel Hsu, Satyen Kale, Nikos Karampatziakis, John Langford, Lev Reyzin, Tong Zhang

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
>>