Mohammad Ghavamzadeh

Finite-Sample Analysis of Proximal Gradient TD Algorithms

Jul 03, 2020
Bo Liu, Ji Liu, Mohammad Ghavamzadeh, Sridhar Mahadevan, Marek Petrik

Deep Bayesian Quadrature Policy Optimization

Jun 28, 2020
Akella Ravi Tej, Kamyar Azizzadenesheli, Mohammad Ghavamzadeh, Anima Anandkumar, Yisong Yue

Variational Model-based Policy Optimization

Jun 24, 2020
Yinlam Chow, Brandon Cui, MoonKyung Ryu, Mohammad Ghavamzadeh

Control-Aware Representations for Model-based Reinforcement Learning

Jun 24, 2020
Brandon Cui, Yinlam Chow, Mohammad Ghavamzadeh

Stochastic Bandits with Linear Constraints

Jun 17, 2020
Aldo Pacchiano, Mohammad Ghavamzadeh, Peter Bartlett, Heinrich Jiang

Mirror Descent Policy Optimization

Jun 09, 2020
Manan Tomar, Lior Shani, Yonathan Efroni, Mohammad Ghavamzadeh

Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity

Jun 06, 2020
Bo Liu, Ian Gemp, Mohammad Ghavamzadeh, Ji Liu, Sridhar Mahadevan, Marek Petrik

Finite-Sample Analysis of GTD Algorithms

Jun 06, 2020
Bo Liu, Ji Liu, Mohammad Ghavamzadeh, Sridhar Mahadevan, Marek Petrik

Automatic Policy Synthesis to Improve the Safety of Nonlinear Dynamical Systems

Jun 06, 2020
Arash Mehrjou, Mohammad Ghavamzadeh, Bernhard Schölkopf
