Siddharth Chandak

Heavy-Tailed and Long-Range Dependent Noise in Stochastic Approximation: A Finite-Time Analysis

Mar 20, 2026
Via arXiv

High-Probability Bounds for SGD under the Polyak-Lojasiewicz Condition with Markovian Noise

Mar 15, 2026
Via arXiv

Regret and Sample Complexity of Online Q-Learning via Concentration of Stochastic Approximation with Time-Inhomogeneous Markov Chains

Feb 18, 2026
Via arXiv

$O(1/k)$ Finite-Time Bound for Non-Linear Two-Time-Scale Stochastic Approximation

Apr 27, 2025
Via arXiv

Non-Expansive Mappings in Two-Time-Scale Stochastic Approximation: Finite-Time Analysis

Jan 18, 2025
Via arXiv

Learning to Control Unknown Strongly Monotone Games

Jun 30, 2024
Via arXiv

A Concentration Bound for TD with Function Approximation

Dec 16, 2023
Via arXiv

Equilibrium Bandits: Learning Optimal Equilibria of Unknown Dynamics

Feb 27, 2023
Via arXiv

Reinforcement Learning in Non-Markovian Environments

Nov 03, 2022
Via arXiv

A Concentration Bound for LSPE($\lambda$)

Nov 04, 2021
Via arXiv