Picture for Tobias Sutter

Tobias Sutter

A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance

Add code
May 07, 2025
Viaarxiv icon

Towards Optimal Offline Reinforcement Learning

Add code
Mar 15, 2025
Viaarxiv icon

Newton Losses: Using Curvature Information for Learning with Differentiable Algorithms

Add code
Oct 24, 2024
Viaarxiv icon

Finding the DeepDream for Time Series: Activation Maximization for Univariate Time Series

Add code
Aug 20, 2024
Figure 1 for Finding the DeepDream for Time Series: Activation Maximization for Univariate Time Series
Figure 2 for Finding the DeepDream for Time Series: Activation Maximization for Univariate Time Series
Figure 3 for Finding the DeepDream for Time Series: Activation Maximization for Univariate Time Series
Figure 4 for Finding the DeepDream for Time Series: Activation Maximization for Univariate Time Series
Viaarxiv icon

Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces

Add code
May 24, 2024
Viaarxiv icon

Regularized Q-learning through Robust Averaging

Add code
May 03, 2024
Viaarxiv icon

End-to-End Learning for Stochastic Optimization: A Bayesian Perspective

Add code
Jun 11, 2023
Figure 1 for End-to-End Learning for Stochastic Optimization: A Bayesian Perspective
Figure 2 for End-to-End Learning for Stochastic Optimization: A Bayesian Perspective
Figure 3 for End-to-End Learning for Stochastic Optimization: A Bayesian Perspective
Figure 4 for End-to-End Learning for Stochastic Optimization: A Bayesian Perspective
Viaarxiv icon

Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets

Add code
May 31, 2023
Figure 1 for Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets
Figure 2 for Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets
Viaarxiv icon

Optimal Learning via Moderate Deviations Theory

Add code
May 23, 2023
Viaarxiv icon

ISAAC Newton: Input-based Approximate Curvature for Newton's Method

Add code
May 01, 2023
Figure 1 for ISAAC Newton: Input-based Approximate Curvature for Newton's Method
Figure 2 for ISAAC Newton: Input-based Approximate Curvature for Newton's Method
Figure 3 for ISAAC Newton: Input-based Approximate Curvature for Newton's Method
Figure 4 for ISAAC Newton: Input-based Approximate Curvature for Newton's Method
Viaarxiv icon