Alert button
Picture for Arghyadip Roy

Arghyadip Roy

Alert button

Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings

Add code
Bookmark button
Alert button
Sep 14, 2020
Arghyadip Roy, Sanjay Shakkottai, R. Srikant

Figure 1 for Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings
Figure 2 for Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings
Viaarxiv icon

Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

Add code
Bookmark button
Alert button
Dec 21, 2019
Arghyadip Roy, Vivek Borkar, Abhay Karandikar, Prasanna Chaporkar

Figure 1 for Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
Figure 2 for Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
Figure 3 for Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
Viaarxiv icon

A Structure-aware Online Learning Algorithm for Markov Decision Processes

Add code
Bookmark button
Alert button
Nov 28, 2018
Arghyadip Roy, Vivek Borkar, Abhay Karandikar, Prasanna Chaporkar

Figure 1 for A Structure-aware Online Learning Algorithm for Markov Decision Processes
Figure 2 for A Structure-aware Online Learning Algorithm for Markov Decision Processes
Figure 3 for A Structure-aware Online Learning Algorithm for Markov Decision Processes
Viaarxiv icon