Gugan Thoppe

Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries

Mar 15, 2024
Swetha Ganesh, Jiayu Chen, Gugan Thoppe, Vaneet Aggarwal

VaR and CVaR Estimation in a Markov Cost Process: Lower and Upper Bounds

Oct 17, 2023
Sanjay Bhat, Prashanth L. A., Gugan Thoppe

Online Learning with Adversaries: A Differential Inclusion Analysis

Apr 04, 2023
Swetha Ganesh, Alexandre Reiffers-Masson, Gugan Thoppe

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search

Jan 30, 2023
Gal Dalal, Assaf Hallak, Gugan Thoppe, Shie Mannor, Gal Chechik

Improving Sample Efficiency in Evolutionary RL Using Off-Policy Ranking

Aug 22, 2022
Eshwar S R, Shishir Kolathaya, Gugan Thoppe

Approximate Q-learning and SARSA(0) under the ε-greedy Policy: a Differential Inclusion Analysis

May 26, 2022
Aditya Gopalan, Gugan Thoppe

A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning

Nov 10, 2021
Gugan Thoppe, Bhumesh Kumar

Does Momentum Help? A Sample Complexity Analysis

Oct 29, 2021
Gugan Thoppe, Rohan Deb, Swetha Ganesh, Amarjit Budhiraja

Scale Invariant Solutions for Overdetermined Linear Systems with Applications to Reinforcement Learning

Apr 15, 2021
Rahul Madhavan, Gugan Thoppe, Hemanta Makwana
