Alert button
Picture for R. Srikant

R. Srikant

Alert button

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

Add code
Bookmark button
Alert button
Mar 11, 2024
Navdeep Kumar, Yashaswini Murthy, Itai Shufaro, Kfir Y. Levy, R. Srikant, Shie Mannor

Figure 1 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Figure 2 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Figure 3 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Figure 4 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Viaarxiv icon

Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization

Add code
Bookmark button
Alert button
Feb 15, 2024
Yihan Du, Anna Winnicki, Gal Dalal, Shie Mannor, R. Srikant

Viaarxiv icon

Convergence for Natural Policy Gradient on Infinite-State Average-Reward Markov Decision Processes

Add code
Bookmark button
Alert button
Feb 07, 2024
Isaac Grosof, Siva Theja Maguluri, R. Srikant

Viaarxiv icon

Rates of Convergence in the Central Limit Theorem for Markov Chains, with an Application to TD Learning

Add code
Bookmark button
Alert button
Jan 28, 2024
R. Srikant

Viaarxiv icon

Cascading Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 17, 2024
Yihan Du, R. Srikant, Wei Chen

Viaarxiv icon

Striking a Balance: An Optimal Mechanism Design for Heterogenous Differentially Private Data Acquisition for Logistic Regression

Add code
Bookmark button
Alert button
Sep 19, 2023
Ameya Anjarlekar, Rasoul Etesami, R. Srikant

Figure 1 for Striking a Balance: An Optimal Mechanism Design for Heterogenous Differentially Private Data Acquisition for Logistic Regression
Figure 2 for Striking a Balance: An Optimal Mechanism Design for Heterogenous Differentially Private Data Acquisition for Logistic Regression
Figure 3 for Striking a Balance: An Optimal Mechanism Design for Heterogenous Differentially Private Data Acquisition for Logistic Regression
Figure 4 for Striking a Balance: An Optimal Mechanism Design for Heterogenous Differentially Private Data Acquisition for Logistic Regression
Viaarxiv icon

Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits

Add code
Bookmark button
Alert button
May 30, 2023
Ronshee Chawla, Daniel Vial, Sanjay Shakkottai, R. Srikant

Figure 1 for Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits
Figure 2 for Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits
Viaarxiv icon

A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games

Add code
Bookmark button
Alert button
Mar 17, 2023
Anna Winnicki, R. Srikant

Viaarxiv icon

Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms

Add code
Bookmark button
Alert button
Feb 15, 2023
Yashaswini Murthy, Mehrdad Moharrami, R. Srikant

Viaarxiv icon

A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks

Add code
Bookmark button
Alert button
Feb 14, 2023
Seo Taek Kong, Saptarshi Mandal, Dimitrios Katselis, R. Srikant

Figure 1 for A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks
Figure 2 for A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks
Figure 3 for A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks
Figure 4 for A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks
Viaarxiv icon