Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Daniel Russo

Learning to Stop with Surprisingly Few Samples


Feb 22, 2021
Daniel Russo, Assaf Zeevi, Tianyi Zhang


  Access Paper or Ask Questions

Approximation Benefits of Policy Gradient Methods with Aggregated States


Jul 22, 2020
Daniel Russo


  Access Paper or Ask Questions

A Note on the Linear Convergence of Policy Gradient Methods


Jul 21, 2020
Jalaj Bhandari, Daniel Russo


  Access Paper or Ask Questions

SQuAP-Ont: an Ontology of Software Quality Relational Factors from Financial Systems


Sep 04, 2019
Paolo Ciancarini, Andrea Giovanni Nuzzolese, Valentina Presutti, Daniel Russo


  Access Paper or Ask Questions

Worst-Case Regret Bounds for Exploration via Randomized Value Functions


Jun 22, 2019
Daniel Russo


  Access Paper or Ask Questions

Global Optimality Guarantees For Policy Gradient Methods


Jun 05, 2019
Jalaj Bhandari, Daniel Russo


  Access Paper or Ask Questions

A Note on the Equivalence of Upper Confidence Bounds and Gittins Indices for Patient Agents


Apr 09, 2019
Daniel Russo


  Access Paper or Ask Questions

A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation


Nov 06, 2018
Jalaj Bhandari, Daniel Russo, Raghav Singal


  Access Paper or Ask Questions

Simple Bayesian Algorithms for Best Arm Identification


Jun 08, 2018
Daniel Russo


  Access Paper or Ask Questions

Deep Exploration via Randomized Value Functions


Jun 06, 2018
Ian Osband, Benjamin Van Roy, Daniel Russo, Zheng Wen


  Access Paper or Ask Questions

Satisficing in Time-Sensitive Bandit Learning


Mar 07, 2018
Daniel Russo, Benjamin Van Roy

* This submission largely supersedes earlier work in arXiv:1704.09028 

  Access Paper or Ask Questions

A Tutorial on Thompson Sampling


Nov 19, 2017
Daniel Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband, Zheng Wen


  Access Paper or Ask Questions

Learning to Optimize via Information-Directed Sampling


Jul 07, 2017
Daniel Russo, Benjamin Van Roy

* arXiv admin note: substantial text overlap with arXiv:1403.5341 

  Access Paper or Ask Questions

Improving the Expected Improvement Algorithm


May 29, 2017
Chao Qin, Diego Klabjan, Daniel Russo

* Submitted to NIPS 2017 

  Access Paper or Ask Questions

Time-Sensitive Bandit Learning and Satisficing Thompson Sampling


Apr 28, 2017
Daniel Russo, David Tse, Benjamin Van Roy


  Access Paper or Ask Questions

How much does your data exploration overfit? Controlling bias via information usage


Oct 06, 2016
Daniel Russo, James Zou


  Access Paper or Ask Questions

An Information-Theoretic Analysis of Thompson Sampling


Jun 08, 2015
Daniel Russo, Benjamin Van Roy


  Access Paper or Ask Questions

Learning to Optimize Via Posterior Sampling


Feb 03, 2014
Daniel Russo, Benjamin Van Roy


  Access Paper or Ask Questions

(More) Efficient Reinforcement Learning via Posterior Sampling


Dec 26, 2013
Ian Osband, Daniel Russo, Benjamin Van Roy

* 10 pages 

  Access Paper or Ask Questions