Shie Mannor

Faculty of Electrical Engineering, Technion, Israel Institute of Technology

The Geometry of Robust Value Functions

Jan 30, 2022

Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms

Jan 30, 2022

Planning and Learning with Adaptive Lookahead

Jan 28, 2022

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

Oct 13, 2021

Twice regularized MDPs and the equivalence between robustness and regularization

Oct 12, 2021

Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits

Oct 12, 2021

Reinforcement Learning in Reward-Mixing MDPs

Oct 07, 2021

Continuous-Time Fitted Value Iteration for Robust Policies

Oct 05, 2021

Sim and Real: Better Together

Oct 05, 2021

Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning

Sep 22, 2021