Alert button
Picture for Shie Mannor

Shie Mannor

Alert button

Towards Deployable RL -- What's Broken with RL Research and a Potential Fix

Jan 03, 2023
Shie Mannor, Aviv Tamar

Viaarxiv icon

DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles

Dec 13, 2022
Peter Karkus, Boris Ivanovic, Shie Mannor, Marco Pavone

Figure 1 for DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles
Figure 2 for DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles
Figure 3 for DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles
Figure 4 for DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles
Viaarxiv icon

Tractable Optimality in Episodic Latent MABs

Oct 05, 2022
Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor

Figure 1 for Tractable Optimality in Episodic Latent MABs
Figure 2 for Tractable Optimality in Episodic Latent MABs
Figure 3 for Tractable Optimality in Episodic Latent MABs
Viaarxiv icon

Reward-Mixing MDPs with a Few Latent Contexts are Learnable

Oct 05, 2022
Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor

Viaarxiv icon

Policy Gradient for Reinforcement Learning with General Utilities

Oct 03, 2022
Navdeep Kumar, Kaixin Wang, Kfir Levy, Shie Mannor

Viaarxiv icon

SoftTreeMax: Policy Gradient with Tree Search

Sep 28, 2022
Gal Dalal, Assaf Hallak, Shie Mannor, Gal Chechik

Figure 1 for SoftTreeMax: Policy Gradient with Tree Search
Figure 2 for SoftTreeMax: Policy Gradient with Tree Search
Figure 3 for SoftTreeMax: Policy Gradient with Tree Search
Viaarxiv icon

Actor-Critic based Improper Reinforcement Learning

Jul 19, 2022
Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor

Figure 1 for Actor-Critic based Improper Reinforcement Learning
Figure 2 for Actor-Critic based Improper Reinforcement Learning
Figure 3 for Actor-Critic based Improper Reinforcement Learning
Figure 4 for Actor-Critic based Improper Reinforcement Learning
Viaarxiv icon

Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

Jul 05, 2022
Benjamin Fuhrer, Yuval Shpigelman, Chen Tessler, Shie Mannor, Gal Chechik, Eitan Zahavi, Gal Dalal

Figure 1 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Figure 2 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Figure 3 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Figure 4 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Viaarxiv icon

Analysis of Stochastic Processes through Replay Buffers

Jun 26, 2022
Shirli Di Castro Shashua, Shie Mannor, Dotan Di-Castro

Figure 1 for Analysis of Stochastic Processes through Replay Buffers
Figure 2 for Analysis of Stochastic Processes through Replay Buffers
Viaarxiv icon

Reinforcement Learning with a Terminator

May 30, 2022
Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal

Figure 1 for Reinforcement Learning with a Terminator
Figure 2 for Reinforcement Learning with a Terminator
Figure 3 for Reinforcement Learning with a Terminator
Figure 4 for Reinforcement Learning with a Terminator
Viaarxiv icon