Yasin Abbasi-Yadkori

Efficient Local Planning with Linear Function Approximation

Aug 12, 2021
Dong Yin, Botao Hao, Yasin Abbasi-Yadkori, Nevena Lazić, Csaba Szepesvári

Parameter and Feature Selection in Stochastic Linear Bandits

Jun 09, 2021
Ahmadreza Moradipari, Yasin Abbasi-Yadkori, Mahnoosh Alizadeh, Mohammad Ghavamzadeh

Improved Regret Bound and Experience Replay in Regularized Policy Iteration

Feb 25, 2021
Nevena Lazic, Dong Yin, Yasin Abbasi-Yadkori, Csaba Szepesvari

Optimization Issues in KL-Constrained Approximate Policy Iteration

Feb 11, 2021
Nevena Lazić, Botao Hao, Yasin Abbasi-Yadkori, Dale Schuurmans, Csaba Szepesvári

On Query-efficient Planning in MDPs under Linear Realizability of the Optimal State-value Function

Feb 04, 2021
Gellert Weisz, Philip Amortila, Barnabás Janzer, Yasin Abbasi-Yadkori, Nan Jiang, Csaba Szepesvári

The Elliptical Potential Lemma Revisited

Oct 20, 2020
Alexandra Carpentier, Claire Vernade, Yasin Abbasi-Yadkori

* 8 pages 

Regret Balancing for Bandit and RL Model Selection

Jun 09, 2020
Yasin Abbasi-Yadkori, Aldo Pacchiano, My Phan

* Submitted to the Thirty-Fourth Annual Conference on Neural Information Processing Systems (NeurIPS 2020) 

Sample Efficient Graph-Based Optimization with Noisy Observations

Jun 04, 2020
Tan Nguyen, Ali Shameli, Yasin Abbasi-Yadkori, Anup Rao, Branislav Kveton

* AISTATS 2019 
* The first version of this paper appeared in AISTATS 2019. Thanks to community feedback, some typos and a minor issue have been identified. Specifically, on page 4, column 2, line 18, the statement $\Delta_{1,s} \ge (1+m)^{S-1-s} \Delta_1$ is not valid, and in the proof of Theorem 2, "By Lemma 1" should be "By Definition 2". These problems are fixed in the updated version published on arXiv.

Provably Efficient Adaptive Approximate Policy Iteration

Mar 15, 2020
Botao Hao, Nevena Lazic, Yasin Abbasi-Yadkori, Pooria Joulani, Csaba Szepesvari

Model Selection in Contextual Stochastic Bandit Problems

Mar 03, 2020
Aldo Pacchiano, My Phan, Yasin Abbasi-Yadkori, Anup Rao, Julian Zimmert, Tor Lattimore, Csaba Szepesvari

* 12 main pages, 2 figures, 14 appendix pages 

Exploration-Enhanced POLITEX

Aug 27, 2019
Yasin Abbasi-Yadkori, Nevena Lazic, Csaba Szepesvari, Gellert Weisz

Thompson Sampling and Approximate Inference

Aug 14, 2019
My Phan, Yasin Abbasi-Yadkori, Justin Domke

Bootstrapping Upper Confidence Bound

Jul 23, 2019
Botao Hao, Yasin Abbasi-Yadkori, Zheng Wen, Guang Cheng

Large-Scale Markov Decision Problems via the Linear Programming Dual

Jan 06, 2019
Yasin Abbasi-Yadkori, Peter L. Bartlett, Xi Chen, Alan Malek

* 53 pages. arXiv admin note: text overlap with arXiv:1402.6763 

Posterior Sampling for Large Scale Reinforcement Learning

Oct 22, 2018
Georgios Theocharous, Zheng Wen, Yasin Abbasi-Yadkori, Nikos Vlassis

Model-Free Linear Quadratic Control via Reduction to Expert Prediction

Oct 05, 2018
Yasin Abbasi-Yadkori, Nevena Lazic, Csaba Szepesvari

Sharp Convergence Rates for Langevin Dynamics in the Nonconvex Setting

Sep 07, 2018
Xiang Cheng, Niladri S. Chatterji, Yasin Abbasi-Yadkori, Peter L. Bartlett, Michael I. Jordan

* 37 pages 

Offline Evaluation of Ranking Policies with Click Models

Jun 13, 2018
Shuai Li, Yasin Abbasi-Yadkori, Branislav Kveton, S. Muthukrishnan, Vishwa Vinay, Zheng Wen

New Insights into Bootstrapping for Bandits

May 24, 2018
Sharan Vaswani, Branislav Kveton, Zheng Wen, Anup Rao, Mark Schmidt, Yasin Abbasi-Yadkori

Optimizing over a Restricted Policy Class in Markov Decision Processes

Feb 26, 2018
Ershad Banijamali, Yasin Abbasi-Yadkori, Mohammad Ghavamzadeh, Nikos Vlassis

* 14 pages 

A Continuation Method for Discrete Optimization and its Application to Nearest Neighbor Classification

Feb 10, 2018
Ali Shameli, Yasin Abbasi-Yadkori

Stochastic Low-Rank Bandits

Dec 13, 2017
Branislav Kveton, Csaba Szepesvari, Anup Rao, Zheng Wen, Yasin Abbasi-Yadkori, S. Muthukrishnan

Conservative Contextual Linear Bandits

Mar 04, 2017
Abbas Kazerouni, Mohammad Ghavamzadeh, Yasin Abbasi-Yadkori, Benjamin Van Roy

Hit-and-Run for Sampling and Planning in Non-Convex Spaces

Oct 19, 2016
Yasin Abbasi-Yadkori, Peter L. Bartlett, Victor Gabillon, Alan Malek

Online learning in MDPs with side information

Jun 26, 2014
Yasin Abbasi-Yadkori, Gergely Neu

Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm

Jun 16, 2014
Yasin Abbasi-Yadkori, Csaba Szepesvari

Linear Programming for Large-Scale Markov Decision Problems

Feb 27, 2014
Yasin Abbasi-Yadkori, Peter L. Bartlett, Alan Malek

* 27 pages, 3 figures 
