Picture for Omer Gottesman

Omer Gottesman

Structure-Informed Deep Reinforcement Learning for Inventory Management

Add code
Jul 29, 2025
Viaarxiv icon

Geometry of Neural Reinforcement Learning in Continuous State and Action Spaces

Add code
Jul 28, 2025
Viaarxiv icon

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

Add code
Jul 10, 2024
Viaarxiv icon

TD Convergence: An Optimization Perspective

Add code
Jun 30, 2023
Viaarxiv icon

Robust Decision-Focused Learning for Reward Transfer

Add code
Apr 06, 2023
Figure 1 for Robust Decision-Focused Learning for Reward Transfer
Figure 2 for Robust Decision-Focused Learning for Reward Transfer
Figure 3 for Robust Decision-Focused Learning for Reward Transfer
Figure 4 for Robust Decision-Focused Learning for Reward Transfer
Viaarxiv icon

On the Geometry of Reinforcement Learning in Continuous State and Action Spaces

Add code
Dec 29, 2022
Figure 1 for On the Geometry of Reinforcement Learning in Continuous State and Action Spaces
Figure 2 for On the Geometry of Reinforcement Learning in Continuous State and Action Spaces
Figure 3 for On the Geometry of Reinforcement Learning in Continuous State and Action Spaces
Figure 4 for On the Geometry of Reinforcement Learning in Continuous State and Action Spaces
Viaarxiv icon

A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes

Add code
Jul 30, 2022
Figure 1 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Figure 2 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Figure 3 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Figure 4 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Viaarxiv icon

Deep Q-Network with Proximal Iteration

Add code
Dec 10, 2021
Figure 1 for Deep Q-Network with Proximal Iteration
Figure 2 for Deep Q-Network with Proximal Iteration
Figure 3 for Deep Q-Network with Proximal Iteration
Figure 4 for Deep Q-Network with Proximal Iteration
Viaarxiv icon

Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation

Add code
Nov 28, 2021
Figure 1 for Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation
Figure 2 for Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation
Figure 3 for Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation
Figure 4 for Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation
Viaarxiv icon

Coarse-Grained Smoothness for RL in Metric Spaces

Add code
Oct 23, 2021
Figure 1 for Coarse-Grained Smoothness for RL in Metric Spaces
Figure 2 for Coarse-Grained Smoothness for RL in Metric Spaces
Figure 3 for Coarse-Grained Smoothness for RL in Metric Spaces
Figure 4 for Coarse-Grained Smoothness for RL in Metric Spaces
Viaarxiv icon