Alert button
Picture for Shie Mannor

Shie Mannor

Alert button

Value Iteration in Continuous Actions, States and Time

Add code
Bookmark button
Alert button
May 10, 2021
Michael Lutter, Shie Mannor, Jan Peters, Dieter Fox, Animesh Garg

Figure 1 for Value Iteration in Continuous Actions, States and Time
Figure 2 for Value Iteration in Continuous Actions, States and Time
Figure 3 for Value Iteration in Continuous Actions, States and Time
Figure 4 for Value Iteration in Continuous Actions, States and Time
Viaarxiv icon

Noise Estimation Is Not Optimal: How To Use Kalman Filter The Right Way

Add code
Bookmark button
Alert button
May 04, 2021
Ido Greenberg, Netanel Yannay, Shie Mannor

Figure 1 for Noise Estimation Is Not Optimal: How To Use Kalman Filter The Right Way
Figure 2 for Noise Estimation Is Not Optimal: How To Use Kalman Filter The Right Way
Figure 3 for Noise Estimation Is Not Optimal: How To Use Kalman Filter The Right Way
Figure 4 for Noise Estimation Is Not Optimal: How To Use Kalman Filter The Right Way
Viaarxiv icon

Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling

Add code
Bookmark button
Alert button
May 01, 2021
Mohammani Zaki, Avi Mohan, Aditya Gopalan, Shie Mannor

Figure 1 for Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling
Figure 2 for Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling
Viaarxiv icon

Using Kalman Filter The Right Way: Noise Estimation Is Not Optimal

Add code
Bookmark button
Alert button
Apr 06, 2021
Ido Greenberg, Shie Mannor, Netanel Yannay

Figure 1 for Using Kalman Filter The Right Way: Noise Estimation Is Not Optimal
Figure 2 for Using Kalman Filter The Right Way: Noise Estimation Is Not Optimal
Figure 3 for Using Kalman Filter The Right Way: Noise Estimation Is Not Optimal
Figure 4 for Using Kalman Filter The Right Way: Noise Estimation Is Not Optimal
Viaarxiv icon

Maximum Entropy Reinforcement Learning with Mixture Policies

Add code
Bookmark button
Alert button
Mar 18, 2021
Nir Baram, Guy Tennenholtz, Shie Mannor

Figure 1 for Maximum Entropy Reinforcement Learning with Mixture Policies
Figure 2 for Maximum Entropy Reinforcement Learning with Mixture Policies
Figure 3 for Maximum Entropy Reinforcement Learning with Mixture Policies
Viaarxiv icon

Action Redundancy in Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 22, 2021
Nir Baram, Guy Tennenholtz, Shie Mannor

Figure 1 for Action Redundancy in Reinforcement Learning
Figure 2 for Action Redundancy in Reinforcement Learning
Figure 3 for Action Redundancy in Reinforcement Learning
Figure 4 for Action Redundancy in Reinforcement Learning
Viaarxiv icon

GELATO: Geometrically Enriched Latent Model for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 22, 2021
Guy Tennenholtz, Nir Baram, Shie Mannor

Figure 1 for GELATO: Geometrically Enriched Latent Model for Offline Reinforcement Learning
Figure 2 for GELATO: Geometrically Enriched Latent Model for Offline Reinforcement Learning
Figure 3 for GELATO: Geometrically Enriched Latent Model for Offline Reinforcement Learning
Figure 4 for GELATO: Geometrically Enriched Latent Model for Offline Reinforcement Learning
Viaarxiv icon

Improper Learning with Gradient-based Policy Optimization

Add code
Bookmark button
Alert button
Feb 21, 2021
Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor

Figure 1 for Improper Learning with Gradient-based Policy Optimization
Figure 2 for Improper Learning with Gradient-based Policy Optimization
Figure 3 for Improper Learning with Gradient-based Policy Optimization
Figure 4 for Improper Learning with Gradient-based Policy Optimization
Viaarxiv icon

Reinforcement Learning for Datacenter Congestion Control

Add code
Bookmark button
Alert button
Feb 18, 2021
Chen Tessler, Yuval Shpigelman, Gal Dalal, Amit Mandelbaum, Doron Haritan Kazakov, Benjamin Fuhrer, Gal Chechik, Shie Mannor

Figure 1 for Reinforcement Learning for Datacenter Congestion Control
Figure 2 for Reinforcement Learning for Datacenter Congestion Control
Figure 3 for Reinforcement Learning for Datacenter Congestion Control
Figure 4 for Reinforcement Learning for Datacenter Congestion Control
Viaarxiv icon