Picture for Steffen Udluft

Steffen Udluft

From Classical Data to Quantum Advantage -- Quantum Policy Evaluation on Quantum Hardware

Add code
Sep 09, 2025
Viaarxiv icon

Variational Quantum Circuits in Offline Contextual Bandit Problems

Add code
Sep 09, 2025
Viaarxiv icon

Is Q-learning an Ill-posed Problem?

Add code
Feb 21, 2025
Viaarxiv icon

TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning

Add code
Nov 28, 2024
Figure 1 for TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning
Figure 2 for TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning
Figure 3 for TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning
Figure 4 for TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning
Viaarxiv icon

Neural-ANOVA: Model Decomposition for Interpretable Machine Learning

Add code
Aug 22, 2024
Figure 1 for Neural-ANOVA: Model Decomposition for Interpretable Machine Learning
Figure 2 for Neural-ANOVA: Model Decomposition for Interpretable Machine Learning
Figure 3 for Neural-ANOVA: Model Decomposition for Interpretable Machine Learning
Figure 4 for Neural-ANOVA: Model Decomposition for Interpretable Machine Learning
Viaarxiv icon

Why long model-based rollouts are no reason for bad Q-value estimates

Add code
Jul 16, 2024
Viaarxiv icon

Model-based Offline Quantum Reinforcement Learning

Add code
Apr 14, 2024
Figure 1 for Model-based Offline Quantum Reinforcement Learning
Figure 2 for Model-based Offline Quantum Reinforcement Learning
Figure 3 for Model-based Offline Quantum Reinforcement Learning
Figure 4 for Model-based Offline Quantum Reinforcement Learning
Viaarxiv icon

Learning Control Policies for Variable Objectives from Offline Data

Add code
Aug 11, 2023
Figure 1 for Learning Control Policies for Variable Objectives from Offline Data
Figure 2 for Learning Control Policies for Variable Objectives from Offline Data
Figure 3 for Learning Control Policies for Variable Objectives from Offline Data
Figure 4 for Learning Control Policies for Variable Objectives from Offline Data
Viaarxiv icon

Automatic Trade-off Adaptation in Offline RL

Add code
Jun 16, 2023
Viaarxiv icon

Safe Policy Improvement Approaches and their Limitations

Add code
Aug 01, 2022
Figure 1 for Safe Policy Improvement Approaches and their Limitations
Figure 2 for Safe Policy Improvement Approaches and their Limitations
Figure 3 for Safe Policy Improvement Approaches and their Limitations
Figure 4 for Safe Policy Improvement Approaches and their Limitations
Viaarxiv icon