Alert button
Picture for Shie Mannor

Shie Mannor

Alert button

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 13, 2021
Guy Tennenholtz, Assaf Hallak, Gal Dalal, Shie Mannor, Gal Chechik, Uri Shalit

Figure 1 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Figure 2 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Figure 3 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Figure 4 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Viaarxiv icon

Twice regularized MDPs and the equivalence between robustness and regularization

Add code
Bookmark button
Alert button
Oct 12, 2021
Esther Derman, Matthieu Geist, Shie Mannor

Figure 1 for Twice regularized MDPs and the equivalence between robustness and regularization
Figure 2 for Twice regularized MDPs and the equivalence between robustness and regularization
Figure 3 for Twice regularized MDPs and the equivalence between robustness and regularization
Figure 4 for Twice regularized MDPs and the equivalence between robustness and regularization
Viaarxiv icon

Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits

Add code
Bookmark button
Alert button
Oct 12, 2021
Nadav Merlis, Yonathan Efroni, Shie Mannor

Figure 1 for Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits
Figure 2 for Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits
Figure 3 for Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits
Figure 4 for Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits
Viaarxiv icon

Reinforcement Learning in Reward-Mixing MDPs

Add code
Bookmark button
Alert button
Oct 07, 2021
Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor

Viaarxiv icon

Continuous-Time Fitted Value Iteration for Robust Policies

Add code
Bookmark button
Alert button
Oct 05, 2021
Michael Lutter, Boris Belousov, Shie Mannor, Dieter Fox, Animesh Garg, Jan Peters

Figure 1 for Continuous-Time Fitted Value Iteration for Robust Policies
Figure 2 for Continuous-Time Fitted Value Iteration for Robust Policies
Figure 3 for Continuous-Time Fitted Value Iteration for Robust Policies
Figure 4 for Continuous-Time Fitted Value Iteration for Robust Policies
Viaarxiv icon

Sim and Real: Better Together

Add code
Bookmark button
Alert button
Oct 05, 2021
Shirli Di Castro Shashua, Dotan Di Castro, Shie Mannor

Figure 1 for Sim and Real: Better Together
Figure 2 for Sim and Real: Better Together
Figure 3 for Sim and Real: Better Together
Figure 4 for Sim and Real: Better Together
Viaarxiv icon

Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 22, 2021
Roy Zohar, Shie Mannor, Guy Tennenholtz

Figure 1 for Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction

Add code
Bookmark button
Alert button
Jul 04, 2021
Assaf Hallak, Gal Dalal, Steven Dalton, Iuri Frosio, Shie Mannor, Gal Chechik

Figure 1 for Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Figure 2 for Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Figure 3 for Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Figure 4 for Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Viaarxiv icon

Robust Value Iteration for Continuous Control Tasks

Add code
Bookmark button
Alert button
May 25, 2021
Michael Lutter, Shie Mannor, Jan Peters, Dieter Fox, Animesh Garg

Figure 1 for Robust Value Iteration for Continuous Control Tasks
Figure 2 for Robust Value Iteration for Continuous Control Tasks
Figure 3 for Robust Value Iteration for Continuous Control Tasks
Figure 4 for Robust Value Iteration for Continuous Control Tasks
Viaarxiv icon