Sharan Vaswani

From Inverse Optimization to Feasibility to ERM

Feb 27, 2024
Saurabh Mishra, Anant Raj, Sharan Vaswani

Noise-adaptive (Accelerated) Stochastic Heavy-Ball Momentum

Jan 12, 2024
Anh Dang, Reza Babanezhad, Sharan Vaswani

Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees

May 24, 2023
Sharan Vaswani, Amirreza Kazemi, Reza Babanezhad, Nicolas Le Roux

Target-based Surrogates for Stochastic Optimization

Feb 06, 2023
Jonathan Wilder Lavington, Sharan Vaswani, Reza Babanezhad, Mark Schmidt, Nicolas Le Roux

Improved Policy Optimization for Online Imitation Learning

Jul 29, 2022
Jonathan Wilder Lavington, Sharan Vaswani, Mark Schmidt

Near-Optimal Sample Complexity Bounds for Constrained MDPs

Jun 13, 2022
Sharan Vaswani, Lin F. Yang, Csaba Szepesvári

Towards Painless Policy Optimization for Constrained MDPs

Apr 11, 2022
Arushi Jain, Sharan Vaswani, Reza Babanezhad, Csaba Szepesvari, Doina Precup

Towards Noise-adaptive, Problem-adaptive Stochastic Gradient Descent

Oct 21, 2021
Sharan Vaswani, Benjamin Dubois-Taine, Reza Babanezhad

A functional mirror ascent view of policy gradient methods with function approximation

Aug 12, 2021
Sharan Vaswani, Olivier Bachem, Simone Totaro, Robert Mueller, Matthieu Geist, Marlos C. Machado, Pablo Samuel Castro, Nicolas Le Roux

SVRG Meets AdaGrad: Painless Variance Reduction

Feb 18, 2021
Benjamin Dubois-Taine, Sharan Vaswani, Reza Babanezhad, Mark Schmidt, Simon Lacoste-Julien
