Picture for Shie Mannor

Shie Mannor

Faculty of Electrical Engineering, Technion, Israel Institute of Technology

Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach

Add code
Aug 14, 2018
Figure 1 for Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach
Figure 2 for Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach
Figure 3 for Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach
Figure 4 for Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach
Viaarxiv icon

Beyond the One Step Greedy Approach in Reinforcement Learning

Add code
Jul 30, 2018
Figure 1 for Beyond the One Step Greedy Approach in Reinforcement Learning
Viaarxiv icon

A General Approach to Multi-Armed Bandits Under Risk Criteria

Add code
Jun 04, 2018
Figure 1 for A General Approach to Multi-Armed Bandits Under Risk Criteria
Viaarxiv icon

Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning

Add code
Jun 04, 2018
Figure 1 for Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning
Figure 2 for Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning
Figure 3 for Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning
Figure 4 for Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning
Viaarxiv icon

Reward Constrained Policy Optimization

Add code
May 28, 2018
Figure 1 for Reward Constrained Policy Optimization
Figure 2 for Reward Constrained Policy Optimization
Figure 3 for Reward Constrained Policy Optimization
Figure 4 for Reward Constrained Policy Optimization
Viaarxiv icon

Nonlinear Distributional Gradient Temporal-Difference Learning

Add code
May 20, 2018
Figure 1 for Nonlinear Distributional Gradient Temporal-Difference Learning
Figure 2 for Nonlinear Distributional Gradient Temporal-Difference Learning
Viaarxiv icon

Interdependent Gibbs Samplers

Add code
Apr 19, 2018
Figure 1 for Interdependent Gibbs Samplers
Figure 2 for Interdependent Gibbs Samplers
Figure 3 for Interdependent Gibbs Samplers
Figure 4 for Interdependent Gibbs Samplers
Viaarxiv icon

Deep Learning Reconstruction of Ultra-Short Pulses

Add code
Mar 15, 2018
Figure 1 for Deep Learning Reconstruction of Ultra-Short Pulses
Figure 2 for Deep Learning Reconstruction of Ultra-Short Pulses
Figure 3 for Deep Learning Reconstruction of Ultra-Short Pulses
Figure 4 for Deep Learning Reconstruction of Ultra-Short Pulses
Viaarxiv icon

Unit Commitment using Nearest Neighbor as a Short-Term Proxy

Add code
Feb 28, 2018
Figure 1 for Unit Commitment using Nearest Neighbor as a Short-Term Proxy
Figure 2 for Unit Commitment using Nearest Neighbor as a Short-Term Proxy
Figure 3 for Unit Commitment using Nearest Neighbor as a Short-Term Proxy
Figure 4 for Unit Commitment using Nearest Neighbor as a Short-Term Proxy
Viaarxiv icon

Train on Validation: Squeezing the Data Lemon

Add code
Feb 16, 2018
Figure 1 for Train on Validation: Squeezing the Data Lemon
Figure 2 for Train on Validation: Squeezing the Data Lemon
Figure 3 for Train on Validation: Squeezing the Data Lemon
Figure 4 for Train on Validation: Squeezing the Data Lemon
Viaarxiv icon