Picture for Csaba Szepesvari

Csaba Szepesvari

Dj

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Add code
Jun 15, 2021
Figure 1 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 2 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 3 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 4 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Viaarxiv icon

Leveraging Non-uniformity in First-order Non-convex Optimization

Add code
May 13, 2021
Figure 1 for Leveraging Non-uniformity in First-order Non-convex Optimization
Figure 2 for Leveraging Non-uniformity in First-order Non-convex Optimization
Figure 3 for Leveraging Non-uniformity in First-order Non-convex Optimization
Figure 4 for Leveraging Non-uniformity in First-order Non-convex Optimization
Viaarxiv icon

On the Optimality of Batch Policy Optimization Algorithms

Add code
Apr 06, 2021
Figure 1 for On the Optimality of Batch Policy Optimization Algorithms
Figure 2 for On the Optimality of Batch Policy Optimization Algorithms
Viaarxiv icon

Improved Regret Bound and Experience Replay in Regularized Policy Iteration

Add code
Feb 25, 2021
Figure 1 for Improved Regret Bound and Experience Replay in Regularized Policy Iteration
Viaarxiv icon

On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method

Add code
Feb 17, 2021
Figure 1 for On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method
Viaarxiv icon

Meta-Thompson Sampling

Add code
Feb 11, 2021
Figure 1 for Meta-Thompson Sampling
Figure 2 for Meta-Thompson Sampling
Figure 3 for Meta-Thompson Sampling
Viaarxiv icon

Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes

Add code
Jan 07, 2021
Figure 1 for Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes
Viaarxiv icon

Variational Policy Gradient Method for Reinforcement Learning with General Utilities

Add code
Jul 04, 2020
Figure 1 for Variational Policy Gradient Method for Reinforcement Learning with General Utilities
Figure 2 for Variational Policy Gradient Method for Reinforcement Learning with General Utilities
Figure 3 for Variational Policy Gradient Method for Reinforcement Learning with General Utilities
Viaarxiv icon

PAC-Bayes Analysis Beyond the Usual Bounds

Add code
Jun 23, 2020
Viaarxiv icon

Differentiable Meta-Learning in Contextual Bandits

Add code
Jun 09, 2020
Figure 1 for Differentiable Meta-Learning in Contextual Bandits
Figure 2 for Differentiable Meta-Learning in Contextual Bandits
Figure 3 for Differentiable Meta-Learning in Contextual Bandits
Figure 4 for Differentiable Meta-Learning in Contextual Bandits
Viaarxiv icon