Alert button
Picture for Shie Mannor

Shie Mannor

Alert button

Online Apprenticeship Learning

Add code
Bookmark button
Alert button
Feb 13, 2021
Lior Shani, Tom Zahavy, Shie Mannor

Figure 1 for Online Apprenticeship Learning
Figure 2 for Online Apprenticeship Learning
Figure 3 for Online Apprenticeship Learning
Figure 4 for Online Apprenticeship Learning
Viaarxiv icon

RL for Latent MDPs: Regret Guarantees and a Lower Bound

Add code
Bookmark button
Alert button
Feb 09, 2021
Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor

Figure 1 for RL for Latent MDPs: Regret Guarantees and a Lower Bound
Figure 2 for RL for Latent MDPs: Regret Guarantees and a Lower Bound
Figure 3 for RL for Latent MDPs: Regret Guarantees and a Lower Bound
Figure 4 for RL for Latent MDPs: Regret Guarantees and a Lower Bound
Viaarxiv icon

Dimension Free Generalization Bounds for Non Linear Metric Learning

Add code
Bookmark button
Alert button
Feb 07, 2021
Mark Kozdoba, Shie Mannor

Figure 1 for Dimension Free Generalization Bounds for Non Linear Metric Learning
Figure 2 for Dimension Free Generalization Bounds for Non Linear Metric Learning
Figure 3 for Dimension Free Generalization Bounds for Non Linear Metric Learning
Viaarxiv icon

Online Limited Memory Neural-Linear Bandits with Likelihood Matching

Add code
Bookmark button
Alert button
Feb 07, 2021
Ofir Nabati, Tom Zahavy, Shie Mannor

Figure 1 for Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Figure 2 for Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Figure 3 for Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Figure 4 for Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Viaarxiv icon

Confidence-Budget Matching for Sequential Budgeted Learning

Add code
Bookmark button
Alert button
Feb 05, 2021
Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor

Viaarxiv icon

Acting in Delayed Environments with Non-Stationary Markov Policies

Add code
Bookmark button
Alert button
Jan 28, 2021
Esther Derman, Gal Dalal, Shie Mannor

Figure 1 for Acting in Delayed Environments with Non-Stationary Markov Policies
Figure 2 for Acting in Delayed Environments with Non-Stationary Markov Policies
Figure 3 for Acting in Delayed Environments with Non-Stationary Markov Policies
Figure 4 for Acting in Delayed Environments with Non-Stationary Markov Policies
Viaarxiv icon

The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems

Add code
Bookmark button
Alert button
Dec 08, 2020
Ahmet Inci, Evgeny Bolotin, Yaosheng Fu, Gal Dalal, Shie Mannor, David Nellans, Diana Marculescu

Figure 1 for The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems
Figure 2 for The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems
Figure 3 for The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems
Figure 4 for The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems
Viaarxiv icon

How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks

Add code
Bookmark button
Alert button
Oct 26, 2020
Eli A. Meirom, Haggai Maron, Shie Mannor, Gal Chechik

Figure 1 for How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks
Figure 2 for How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks
Figure 3 for How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks
Figure 4 for How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks
Viaarxiv icon

Drift Detection in Episodic Data: Detect When Your Agent Starts Faltering

Add code
Bookmark button
Alert button
Oct 22, 2020
Ido Greenberg, Shie Mannor

Figure 1 for Drift Detection in Episodic Data: Detect When Your Agent Starts Faltering
Figure 2 for Drift Detection in Episodic Data: Detect When Your Agent Starts Faltering
Figure 3 for Drift Detection in Episodic Data: Detect When Your Agent Starts Faltering
Figure 4 for Drift Detection in Episodic Data: Detect When Your Agent Starts Faltering
Viaarxiv icon