Picture for Shie Mannor

Shie Mannor

Faculty of Electrical Engineering, Technion, Israel Institute of Technology

An Efficient Solution to s-Rectangular Robust Markov Decision Processes

Add code
Jan 31, 2023
Figure 1 for An Efficient Solution to s-Rectangular Robust Markov Decision Processes
Figure 2 for An Efficient Solution to s-Rectangular Robust Markov Decision Processes
Figure 3 for An Efficient Solution to s-Rectangular Robust Markov Decision Processes
Figure 4 for An Efficient Solution to s-Rectangular Robust Markov Decision Processes
Viaarxiv icon

Policy Gradient for s-Rectangular Robust Markov Decision Processes

Add code
Jan 31, 2023
Figure 1 for Policy Gradient for s-Rectangular Robust Markov Decision Processes
Figure 2 for Policy Gradient for s-Rectangular Robust Markov Decision Processes
Figure 3 for Policy Gradient for s-Rectangular Robust Markov Decision Processes
Figure 4 for Policy Gradient for s-Rectangular Robust Markov Decision Processes
Viaarxiv icon

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search

Add code
Jan 30, 2023
Figure 1 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Figure 2 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Figure 3 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Figure 4 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Viaarxiv icon

Train Hard, Fight Easy: Robust Meta Reinforcement Learning

Add code
Jan 26, 2023
Figure 1 for Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Figure 2 for Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Figure 3 for Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Figure 4 for Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Viaarxiv icon

Towards Deployable RL -- What's Broken with RL Research and a Potential Fix

Add code
Jan 03, 2023
Viaarxiv icon

DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles

Add code
Dec 13, 2022
Figure 1 for DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles
Figure 2 for DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles
Figure 3 for DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles
Figure 4 for DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles
Viaarxiv icon

Tractable Optimality in Episodic Latent MABs

Add code
Oct 05, 2022
Figure 1 for Tractable Optimality in Episodic Latent MABs
Figure 2 for Tractable Optimality in Episodic Latent MABs
Figure 3 for Tractable Optimality in Episodic Latent MABs
Viaarxiv icon

Reward-Mixing MDPs with a Few Latent Contexts are Learnable

Add code
Oct 05, 2022
Viaarxiv icon

Policy Gradient for Reinforcement Learning with General Utilities

Add code
Oct 03, 2022
Viaarxiv icon

SoftTreeMax: Policy Gradient with Tree Search

Add code
Sep 28, 2022
Figure 1 for SoftTreeMax: Policy Gradient with Tree Search
Figure 2 for SoftTreeMax: Policy Gradient with Tree Search
Figure 3 for SoftTreeMax: Policy Gradient with Tree Search
Viaarxiv icon