Tadashi Kozuno

Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences


Jul 17, 2021
Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White

* Submitted to JMLR 


Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation


Jun 24, 2021
Yunhao Tang, Tadashi Kozuno, Mark Rowland, Rémi Munos, Michal Valko



Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall


Jun 11, 2021
Tadashi Kozuno, Pierre Ménard, Rémi Munos, Michal Valko

* 20 pages 


Identifying Co-Adaptation of Algorithmic and Implementational Innovations in Deep Reinforcement Learning: A Taxonomy and Case Study of Inference-based Algorithms


Mar 31, 2021
Hiroki Furuta, Tadashi Kozuno, Tatsuya Matsushima, Yutaka Matsuo, Shixiang Shane Gu

* The implementation is available at: https://github.com/frt03/inference-based-rl 


Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning


Mar 23, 2021
Hiroki Furuta, Tatsuya Matsushima, Tadashi Kozuno, Yutaka Matsuo, Sergey Levine, Ofir Nachum, Shixiang Shane Gu



Revisiting Peng's Q($λ$) for Modern Reinforcement Learning


Feb 27, 2021
Tadashi Kozuno, Yunhao Tang, Mark Rowland, Rémi Munos, Steven Kapturowski, Will Dabney, Michal Valko, David Abel

* 26 pages, 7 figures, 2 tables 


Leverage the Average: an Analysis of Regularization in RL


Apr 10, 2020
Nino Vieillard, Tadashi Kozuno, Bruno Scherrer, Olivier Pietquin, Rémi Munos, Matthieu Geist



Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning


Jun 18, 2019
Tadashi Kozuno, Dongqi Han, Kenji Doya



Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming


Oct 30, 2017
Tadashi Kozuno, Eiji Uchibe, Kenji Doya

