Picture for Volodymyr Tkachuk

Volodymyr Tkachuk

Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^π$-Realizability and Concentrability

Add code
May 27, 2024
Viaarxiv icon

Regret Minimization via Saddle Point Optimization

Add code
Mar 15, 2024
Figure 1 for Regret Minimization via Saddle Point Optimization
Figure 2 for Regret Minimization via Saddle Point Optimization
Viaarxiv icon

Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning

Add code
Feb 08, 2023
Figure 1 for Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Figure 2 for Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Figure 3 for Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning

Add code
Mar 07, 2021
Figure 1 for The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning
Figure 2 for The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning
Viaarxiv icon