Csaba Szepesvari

Dj

Stochastic Gradient Succeeds for Bandits

Feb 27, 2024

Sample Efficient Deep Reinforcement Learning via Local Planning

Jan 29, 2023

The Role of Baselines in Policy Gradient Optimization

Jan 16, 2023

Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making

Sep 29, 2022

Towards Painless Policy Optimization for Constrained MDPs

Apr 11, 2022

Understanding the Effect of Stochasticity in Policy Optimization

Oct 29, 2021

On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data

Jun 18, 2021

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Jun 15, 2021

Leveraging Non-uniformity in First-order Non-convex Optimization

May 13, 2021

On the Optimality of Batch Policy Optimization Algorithms

Apr 06, 2021