Shie Mannor

Faculty of Electrical Engineering, Technion, Israel Institute of Technology

Online Apprenticeship Learning

Feb 13, 2021

RL for Latent MDPs: Regret Guarantees and a Lower Bound

Feb 09, 2021

Dimension Free Generalization Bounds for Non Linear Metric Learning

Feb 07, 2021

Online Limited Memory Neural-Linear Bandits with Likelihood Matching

Feb 07, 2021

Confidence-Budget Matching for Sequential Budgeted Learning

Feb 05, 2021

Acting in Delayed Environments with Non-Stationary Markov Policies

Jan 28, 2021

The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems

Dec 08, 2020

How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks

Oct 26, 2020

Drift Detection in Episodic Data: Detect When Your Agent Starts Faltering

Oct 22, 2020

Lenient Regret for Multi-Armed Bandits

Sep 13, 2020