Picture for R. Srikant

R. Srikant

Finite-Time Bounds for Distributionally Robust TD Learning with Linear Function Approximation

Add code
Oct 02, 2025
Viaarxiv icon

Joint Optimal Transport and Embedding for Network Alignment

Add code
Feb 26, 2025
Figure 1 for Joint Optimal Transport and Embedding for Network Alignment
Figure 2 for Joint Optimal Transport and Embedding for Network Alignment
Figure 3 for Joint Optimal Transport and Embedding for Network Alignment
Figure 4 for Joint Optimal Transport and Embedding for Network Alignment
Viaarxiv icon

Reinforcement Learning with Segment Feedback

Add code
Feb 03, 2025
Figure 1 for Reinforcement Learning with Segment Feedback
Figure 2 for Reinforcement Learning with Segment Feedback
Figure 3 for Reinforcement Learning with Segment Feedback
Figure 4 for Reinforcement Learning with Segment Feedback
Viaarxiv icon

A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks

Add code
Dec 12, 2024
Figure 1 for A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks
Figure 2 for A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks
Viaarxiv icon

Decentralized and Uncoordinated Learning of Stable Matchings: A Game-Theoretic Approach

Add code
Jul 31, 2024
Viaarxiv icon

Performance of NPG in Countable State-Space Average-Cost RL

Add code
May 30, 2024
Viaarxiv icon

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

Add code
Mar 11, 2024
Figure 1 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Figure 2 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Figure 3 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Figure 4 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Viaarxiv icon

Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization

Add code
Feb 15, 2024
Figure 1 for Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Figure 2 for Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Figure 3 for Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Viaarxiv icon

Convergence for Natural Policy Gradient on Infinite-State Average-Reward Markov Decision Processes

Add code
Feb 07, 2024
Viaarxiv icon

Rates of Convergence in the Central Limit Theorem for Markov Chains, with an Application to TD Learning

Add code
Jan 28, 2024
Viaarxiv icon