Picture for R. Srikant

R. Srikant

Convergence for Natural Policy Gradient on Infinite-State Average-Reward Markov Decision Processes

Add code
Feb 07, 2024
Viaarxiv icon

Rates of Convergence in the Central Limit Theorem for Markov Chains, with an Application to TD Learning

Add code
Jan 28, 2024
Viaarxiv icon

Cascading Reinforcement Learning

Add code
Jan 17, 2024
Viaarxiv icon

Striking a Balance: An Optimal Mechanism Design for Heterogenous Differentially Private Data Acquisition for Logistic Regression

Add code
Sep 19, 2023
Viaarxiv icon

Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits

Add code
May 30, 2023
Figure 1 for Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits
Figure 2 for Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits
Viaarxiv icon

A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games

Add code
Mar 17, 2023
Viaarxiv icon

Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms

Add code
Feb 15, 2023
Viaarxiv icon

A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks

Add code
Feb 14, 2023
Figure 1 for A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks
Figure 2 for A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks
Figure 3 for A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks
Figure 4 for A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks
Viaarxiv icon

Modified Policy Iteration for Exponential Cost Risk Sensitive MDPs

Add code
Feb 08, 2023
Figure 1 for Modified Policy Iteration for Exponential Cost Risk Sensitive MDPs
Figure 2 for Modified Policy Iteration for Exponential Cost Risk Sensitive MDPs
Figure 3 for Modified Policy Iteration for Exponential Cost Risk Sensitive MDPs
Viaarxiv icon

On The Convergence Of Policy Iteration-Based Reinforcement Learning With Monte Carlo Policy Evaluation

Add code
Jan 23, 2023
Viaarxiv icon