Alert button
Picture for R. Srikant

R. Srikant

Alert button

Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings

Add code
Bookmark button
Alert button
Sep 14, 2020
Arghyadip Roy, Sanjay Shakkottai, R. Srikant

Figure 1 for Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings
Figure 2 for Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings
Viaarxiv icon

Provably-Efficient Double Q-Learning

Add code
Bookmark button
Alert button
Jul 09, 2020
Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant

Figure 1 for Provably-Efficient Double Q-Learning
Figure 2 for Provably-Efficient Double Q-Learning
Figure 3 for Provably-Efficient Double Q-Learning
Viaarxiv icon

Robust Multi-Agent Multi-Armed Bandits

Add code
Bookmark button
Alert button
Jul 07, 2020
Daniel Vial, Sanjay Shakkottai, R. Srikant

Figure 1 for Robust Multi-Agent Multi-Armed Bandits
Figure 2 for Robust Multi-Agent Multi-Armed Bandits
Viaarxiv icon

Continuous-Time Multi-Armed Bandits with Controlled Restarts

Add code
Bookmark button
Alert button
Jun 30, 2020
Semih Cayci, Atilla Eryilmaz, R. Srikant

Figure 1 for Continuous-Time Multi-Armed Bandits with Controlled Restarts
Figure 2 for Continuous-Time Multi-Armed Bandits with Controlled Restarts
Figure 3 for Continuous-Time Multi-Armed Bandits with Controlled Restarts
Viaarxiv icon

Budget-Constrained Bandits over General Cost and Reward Distributions

Add code
Bookmark button
Alert button
Feb 29, 2020
Semih Cayci, Atilla Eryilmaz, R. Srikant

Figure 1 for Budget-Constrained Bandits over General Cost and Reward Distributions
Viaarxiv icon

Revisiting Landscape Analysis in Deep Neural Networks: Eliminating Decreasing Paths to Infinity

Add code
Bookmark button
Alert button
Dec 31, 2019
Shiyu Liang, Ruoyu Sun, R. Srikant

Viaarxiv icon

Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 14, 2019
Harsh Gupta, R. Srikant, Lei Ying

Figure 1 for Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning
Figure 2 for Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning
Viaarxiv icon

Finite-Time Error Bounds For Linear Stochastic Approximation and TD Learning

Add code
Bookmark button
Alert button
Mar 07, 2019
R. Srikant, Lei Ying

Viaarxiv icon

Almost Boltzmann Exploration

Add code
Bookmark button
Alert button
Jan 25, 2019
Harsh Gupta, Seo Taek Kong, R. Srikant, Weina Wang

Figure 1 for Almost Boltzmann Exploration
Figure 2 for Almost Boltzmann Exploration
Figure 3 for Almost Boltzmann Exploration
Figure 4 for Almost Boltzmann Exploration
Viaarxiv icon