Picture for Johannes Kirschner

Johannes Kirschner

Regret Minimization via Saddle Point Optimization

Add code
Mar 15, 2024
Figure 1 for Regret Minimization via Saddle Point Optimization
Figure 2 for Regret Minimization via Saddle Point Optimization
Viaarxiv icon

Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning

Add code
Feb 08, 2023
Figure 1 for Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Figure 2 for Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Figure 3 for Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications

Add code
Feb 07, 2023
Figure 1 for Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications
Figure 2 for Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications
Viaarxiv icon

Near-optimal Policy Identification in Active Reinforcement Learning

Add code
Dec 19, 2022
Figure 1 for Near-optimal Policy Identification in Active Reinforcement Learning
Figure 2 for Near-optimal Policy Identification in Active Reinforcement Learning
Figure 3 for Near-optimal Policy Identification in Active Reinforcement Learning
Figure 4 for Near-optimal Policy Identification in Active Reinforcement Learning
Viaarxiv icon

Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off

Add code
Dec 17, 2022
Figure 1 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 2 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 3 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 4 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Viaarxiv icon

Tuning Particle Accelerators with Safety Constraints using Bayesian Optimization

Add code
Mar 29, 2022
Figure 1 for Tuning Particle Accelerators with Safety Constraints using Bayesian Optimization
Figure 2 for Tuning Particle Accelerators with Safety Constraints using Bayesian Optimization
Figure 3 for Tuning Particle Accelerators with Safety Constraints using Bayesian Optimization
Figure 4 for Tuning Particle Accelerators with Safety Constraints using Bayesian Optimization
Viaarxiv icon

Bias-Robust Bayesian Optimization via Dueling Bandits

Add code
Jun 09, 2021
Figure 1 for Bias-Robust Bayesian Optimization via Dueling Bandits
Figure 2 for Bias-Robust Bayesian Optimization via Dueling Bandits
Viaarxiv icon

Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback

Add code
Jan 21, 2021
Figure 1 for Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback
Figure 2 for Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback
Figure 3 for Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback
Figure 4 for Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback
Viaarxiv icon

Asymptotically Optimal Information-Directed Sampling

Add code
Nov 11, 2020
Figure 1 for Asymptotically Optimal Information-Directed Sampling
Figure 2 for Asymptotically Optimal Information-Directed Sampling
Figure 3 for Asymptotically Optimal Information-Directed Sampling
Figure 4 for Asymptotically Optimal Information-Directed Sampling
Viaarxiv icon

Distributionally Robust Bayesian Optimization

Add code
Mar 22, 2020
Figure 1 for Distributionally Robust Bayesian Optimization
Figure 2 for Distributionally Robust Bayesian Optimization
Figure 3 for Distributionally Robust Bayesian Optimization
Figure 4 for Distributionally Robust Bayesian Optimization
Viaarxiv icon