Picture for R. Srikant

R. Srikant

Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation

Add code
Oct 13, 2022
Viaarxiv icon

MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics

Add code
Sep 02, 2022
Figure 1 for MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics
Figure 2 for MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics
Figure 3 for MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics
Figure 4 for MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics
Viaarxiv icon

Finite-Time Analysis of Entropy-Regularized Neural Natural Actor-Critic Algorithm

Add code
Jun 02, 2022
Figure 1 for Finite-Time Analysis of Entropy-Regularized Neural Natural Actor-Critic Algorithm
Viaarxiv icon

Minimax Regret for Cascading Bandits

Add code
Mar 23, 2022
Figure 1 for Minimax Regret for Cascading Bandits
Viaarxiv icon

Robust Multi-Agent Bandits Over Undirected Graphs

Add code
Feb 28, 2022
Figure 1 for Robust Multi-Agent Bandits Over Undirected Graphs
Viaarxiv icon

Learning to Control Partially Observed Systems with Finite Memory

Add code
Feb 22, 2022
Viaarxiv icon

The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation

Add code
Sep 28, 2021
Figure 1 for The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation
Figure 2 for The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation
Figure 3 for The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation
Figure 4 for The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation
Viaarxiv icon

Improved Algorithms for Misspecified Linear Markov Decision Processes

Add code
Sep 12, 2021
Figure 1 for Improved Algorithms for Misspecified Linear Markov Decision Processes
Viaarxiv icon

Linear Convergence of Entropy-Regularized Natural Policy Gradient with Linear Function Approximation

Add code
Jun 08, 2021
Viaarxiv icon

Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation

Add code
May 04, 2021
Figure 1 for Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation
Figure 2 for Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation
Viaarxiv icon