Alert button
Picture for R. Srikant

R. Srikant

Alert button

Modified Policy Iteration for Exponential Cost Risk Sensitive MDPs

Add code
Bookmark button
Alert button
Feb 08, 2023
Yashaswini Murthy, Mehrdad Moharrami, R. Srikant

Viaarxiv icon

Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms

Add code
Bookmark button
Alert button
Feb 02, 2023
Yashaswini Murthy, Mehrdad Moharrami, R. Srikant

Viaarxiv icon

On The Convergence Of Policy Iteration-Based Reinforcement Learning With Monte Carlo Policy Evaluation

Add code
Bookmark button
Alert button
Jan 23, 2023
Anna Winnicki, R. Srikant

Viaarxiv icon

Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation

Add code
Bookmark button
Alert button
Oct 13, 2022
Anna Winnicki, R. Srikant

Viaarxiv icon

MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics

Add code
Bookmark button
Alert button
Sep 02, 2022
Zixian Yang, R. Srikant, Lei Ying

Figure 1 for MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics
Figure 2 for MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics
Figure 3 for MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics
Figure 4 for MaxWeight With Discounted UCB: A Provably Stable Scheduling Policy for Nonstationary Multi-Server Systems With Unknown Statistics
Viaarxiv icon

Finite-Time Analysis of Entropy-Regularized Neural Natural Actor-Critic Algorithm

Add code
Bookmark button
Alert button
Jun 02, 2022
Semih Cayci, Niao He, R. Srikant

Figure 1 for Finite-Time Analysis of Entropy-Regularized Neural Natural Actor-Critic Algorithm
Viaarxiv icon

Minimax Regret for Cascading Bandits

Add code
Bookmark button
Alert button
Mar 23, 2022
Daniel Vial, Sujay Sanghavi, Sanjay Shakkottai, R. Srikant

Figure 1 for Minimax Regret for Cascading Bandits
Viaarxiv icon

Robust Multi-Agent Bandits Over Undirected Graphs

Add code
Bookmark button
Alert button
Feb 28, 2022
Daniel Vial, Sanjay Shakkottai, R. Srikant

Figure 1 for Robust Multi-Agent Bandits Over Undirected Graphs
Viaarxiv icon

Learning to Control Partially Observed Systems with Finite Memory

Add code
Bookmark button
Alert button
Feb 22, 2022
Semih Cayci, Niao He, R. Srikant

Viaarxiv icon

The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation

Add code
Bookmark button
Alert button
Sep 28, 2021
Anna Winnicki, Joseph Lubars, Michael Livesay, R. Srikant

Figure 1 for The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation
Figure 2 for The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation
Figure 3 for The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation
Figure 4 for The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation
Viaarxiv icon