Alert button
Picture for Shie Mannor

Shie Mannor

Alert button

A Problem-Adaptive Algorithm for Resource Allocation

Add code
Bookmark button
Alert button
Feb 12, 2019
Xavier Fontaine, Shie Mannor, Vianney Perchet

Figure 1 for A Problem-Adaptive Algorithm for Resource Allocation
Figure 2 for A Problem-Adaptive Algorithm for Resource Allocation
Figure 3 for A Problem-Adaptive Algorithm for Resource Allocation
Figure 4 for A Problem-Adaptive Algorithm for Resource Allocation
Viaarxiv icon

The Natural Language of Actions

Add code
Bookmark button
Alert button
Feb 04, 2019
Guy Tennenholtz, Shie Mannor

Figure 1 for The Natural Language of Actions
Figure 2 for The Natural Language of Actions
Figure 3 for The Natural Language of Actions
Figure 4 for The Natural Language of Actions
Viaarxiv icon

Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 27, 2019
Chao Qu, Shie Mannor, Huan Xu, Yuan Qi, Le Song, Junwu Xiong

Figure 1 for Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning
Figure 2 for Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning
Figure 3 for Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning
Figure 4 for Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning
Viaarxiv icon

Action Robust Reinforcement Learning and Applications in Continuous Control

Add code
Bookmark button
Alert button
Jan 26, 2019
Chen Tessler, Yonathan Efroni, Shie Mannor

Figure 1 for Action Robust Reinforcement Learning and Applications in Continuous Control
Figure 2 for Action Robust Reinforcement Learning and Applications in Continuous Control
Figure 3 for Action Robust Reinforcement Learning and Applications in Continuous Control
Figure 4 for Action Robust Reinforcement Learning and Applications in Continuous Control
Viaarxiv icon

Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching

Add code
Bookmark button
Alert button
Jan 24, 2019
Tom Zahavy, Shie Mannor

Figure 1 for Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Figure 2 for Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Figure 3 for Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Figure 4 for Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Viaarxiv icon

Trust Region Value Optimization using Kalman Filtering

Add code
Bookmark button
Alert button
Jan 23, 2019
Shirli Di-Castro Shashua, Shie Mannor

Figure 1 for Trust Region Value Optimization using Kalman Filtering
Figure 2 for Trust Region Value Optimization using Kalman Filtering
Figure 3 for Trust Region Value Optimization using Kalman Filtering
Figure 4 for Trust Region Value Optimization using Kalman Filtering
Viaarxiv icon

Multi Instance Learning For Unbalanced Data

Add code
Bookmark button
Alert button
Dec 17, 2018
Mark Kozdoba, Edward Moroshko, Lior Shani, Takuya Takagi, Takashi Katoh, Shie Mannor, Koby Crammer

Figure 1 for Multi Instance Learning For Unbalanced Data
Figure 2 for Multi Instance Learning For Unbalanced Data
Figure 3 for Multi Instance Learning For Unbalanced Data
Figure 4 for Multi Instance Learning For Unbalanced Data
Viaarxiv icon

Revisiting Exploration-Conscious Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 13, 2018
Lior Shani, Yonathan Efroni, Shie Mannor

Figure 1 for Revisiting Exploration-Conscious Reinforcement Learning
Figure 2 for Revisiting Exploration-Conscious Reinforcement Learning
Figure 3 for Revisiting Exploration-Conscious Reinforcement Learning
Figure 4 for Revisiting Exploration-Conscious Reinforcement Learning
Viaarxiv icon

Soft-Robust Actor-Critic Policy-Gradient

Add code
Bookmark button
Alert button
Oct 24, 2018
Esther Derman, Daniel J. Mankowitz, Timothy A. Mann, Shie Mannor

Figure 1 for Soft-Robust Actor-Critic Policy-Gradient
Figure 2 for Soft-Robust Actor-Critic Policy-Gradient
Viaarxiv icon

Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 20, 2018
Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor

Figure 1 for Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning
Viaarxiv icon