Toshinori Kitamura

A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

Feb 02, 2024
Toshinori Kitamura, Tadashi Kozuno, Masahiro Kato, Yuki Ichihara, Soichiro Nishimori, Akiyoshi Sannai, Sho Sonoda, Wataru Kumagai, Yutaka Matsuo

Via arXiv

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

May 22, 2023
Toshinori Kitamura, Tadashi Kozuno, Yunhao Tang, Nino Vieillard, Michal Valko, Wenhao Yang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári, Wataru Kumagai, Yutaka Matsuo

Via arXiv

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal

May 27, 2022
Tadashi Kozuno, Wenhao Yang, Nino Vieillard, Toshinori Kitamura, Yunhao Tang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Michal Valko, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári

Via arXiv

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

Dec 08, 2021
Toshinori Kitamura, Ryo Yonetani

Via arXiv

Geometric Value Iteration: Dynamic Error-Aware KL Regularization for Reinforcement Learning

Jul 16, 2021
Toshinori Kitamura, Lingwei Zhu, Takamitsu Matsubara

Via arXiv

Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning

Jul 13, 2021
Lingwei Zhu, Toshinori Kitamura, Takamitsu Matsubara

Via arXiv

Cautious Actor-Critic

Jul 12, 2021
Lingwei Zhu, Toshinori Kitamura, Takamitsu Matsubara

Via arXiv