Picture for Mohammad Ghavamzadeh

Mohammad Ghavamzadeh

INRIA Lille - Nord Europe

Non-Stationary Latent Bandits

Add code
Dec 01, 2020
Figure 1 for Non-Stationary Latent Bandits
Figure 2 for Non-Stationary Latent Bandits
Figure 3 for Non-Stationary Latent Bandits
Viaarxiv icon

Soft-Robust Algorithms for Handling Model Misspecification

Add code
Nov 30, 2020
Figure 1 for Soft-Robust Algorithms for Handling Model Misspecification
Figure 2 for Soft-Robust Algorithms for Handling Model Misspecification
Figure 3 for Soft-Robust Algorithms for Handling Model Misspecification
Figure 4 for Soft-Robust Algorithms for Handling Model Misspecification
Viaarxiv icon

A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges

Add code
Nov 17, 2020
Figure 1 for A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges
Figure 2 for A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges
Figure 3 for A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges
Figure 4 for A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges
Viaarxiv icon

Variance-Reduced Off-Policy Memory-Efficient Policy Search

Add code
Sep 14, 2020
Figure 1 for Variance-Reduced Off-Policy Memory-Efficient Policy Search
Figure 2 for Variance-Reduced Off-Policy Memory-Efficient Policy Search
Figure 3 for Variance-Reduced Off-Policy Memory-Efficient Policy Search
Figure 4 for Variance-Reduced Off-Policy Memory-Efficient Policy Search
Viaarxiv icon

Finite-Sample Analysis of Proximal Gradient TD Algorithms

Add code
Jul 03, 2020
Figure 1 for Finite-Sample Analysis of Proximal Gradient TD Algorithms
Figure 2 for Finite-Sample Analysis of Proximal Gradient TD Algorithms
Figure 3 for Finite-Sample Analysis of Proximal Gradient TD Algorithms
Figure 4 for Finite-Sample Analysis of Proximal Gradient TD Algorithms
Viaarxiv icon

Deep Bayesian Quadrature Policy Optimization

Add code
Jun 28, 2020
Figure 1 for Deep Bayesian Quadrature Policy Optimization
Figure 2 for Deep Bayesian Quadrature Policy Optimization
Figure 3 for Deep Bayesian Quadrature Policy Optimization
Figure 4 for Deep Bayesian Quadrature Policy Optimization
Viaarxiv icon

Variational Model-based Policy Optimization

Add code
Jun 24, 2020
Figure 1 for Variational Model-based Policy Optimization
Figure 2 for Variational Model-based Policy Optimization
Figure 3 for Variational Model-based Policy Optimization
Figure 4 for Variational Model-based Policy Optimization
Viaarxiv icon

Control-Aware Representations for Model-based Reinforcement Learning

Add code
Jun 24, 2020
Figure 1 for Control-Aware Representations for Model-based Reinforcement Learning
Figure 2 for Control-Aware Representations for Model-based Reinforcement Learning
Figure 3 for Control-Aware Representations for Model-based Reinforcement Learning
Figure 4 for Control-Aware Representations for Model-based Reinforcement Learning
Viaarxiv icon

Stochastic Bandits with Linear Constraints

Add code
Jun 17, 2020
Figure 1 for Stochastic Bandits with Linear Constraints
Viaarxiv icon

Mirror Descent Policy Optimization

Add code
Jun 09, 2020
Figure 1 for Mirror Descent Policy Optimization
Figure 2 for Mirror Descent Policy Optimization
Figure 3 for Mirror Descent Policy Optimization
Figure 4 for Mirror Descent Policy Optimization
Viaarxiv icon