Alert button
Picture for Siva Theja Maguluri

Siva Theja Maguluri

Alert button

Convergence for Natural Policy Gradient on Infinite-State Average-Reward Markov Decision Processes

Add code
Bookmark button
Alert button
Feb 07, 2024
Isaac Grosof, Siva Theja Maguluri, R. Srikant

Viaarxiv icon

Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise

Add code
Bookmark button
Alert button
Dec 31, 2023
Shaan Ul Haque, Sajad Khodadadian, Siva Theja Maguluri

Viaarxiv icon

Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise

Add code
Bookmark button
Alert button
Mar 28, 2023
Zaiwei Chen, Siva Theja Maguluri, Martin Zubeldia

Figure 1 for Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise
Figure 2 for Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise
Figure 3 for Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise
Viaarxiv icon

Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation

Add code
Bookmark button
Alert button
Aug 05, 2022
Zaiwei Chen, Siva Theja Maguluri

Viaarxiv icon

Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling

Add code
Bookmark button
Alert button
Jun 21, 2022
Sajad Khodadadian, Pranay Sharma, Gauri Joshi, Siva Theja Maguluri

Figure 1 for Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling
Figure 2 for Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling
Viaarxiv icon

Target Network and Truncation Overcome The Deadly triad in $Q$-Learning

Add code
Bookmark button
Alert button
Mar 05, 2022
Zaiwei Chen, John Paul Clarke, Siva Theja Maguluri

Figure 1 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Figure 2 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Figure 3 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Figure 4 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Viaarxiv icon

Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization

Add code
Bookmark button
Alert button
Nov 11, 2021
Zaiwei Chen, Shancong Mou, Siva Theja Maguluri

Figure 1 for Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization
Figure 2 for Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization
Figure 3 for Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization
Figure 4 for Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization
Viaarxiv icon

Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators

Add code
Bookmark button
Alert button
Jun 24, 2021
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam

Viaarxiv icon

Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation

Add code
Bookmark button
Alert button
May 26, 2021
Zaiwei Chen, Sajad Khodadadian, Siva Theja Maguluri

Figure 1 for Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation
Viaarxiv icon

On the Linear convergence of Natural Policy Gradient Algorithm

Add code
Bookmark button
Alert button
May 04, 2021
Sajad Khodadadian, Prakirt Raj Jhunjhunwala, Sushil Mahavir Varma, Siva Theja Maguluri

Figure 1 for On the Linear convergence of Natural Policy Gradient Algorithm
Viaarxiv icon