Shalabh Bhatnagar

Policy Gradient Methods for Non-Markovian Reinforcement Learning

May 11, 2026

Finite-time analysis of Multi-timescale Stochastic Optimization Algorithms

Mar 31, 2026

High-Probability Bounds for SGD under the Polyak-Lojasiewicz Condition with Markovian Noise

Mar 15, 2026

Generalized Random Direction Newton Algorithms for Stochastic Optimization

Feb 23, 2026

Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization

Nov 10, 2025

Convergence of Multiagent Learning Systems for Traffic control

Nov 10, 2025

An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes

Feb 17, 2025

Gradient-Weighted Feature Back-Projection: A Fast Alternative to Feature Distillation in 3D Gaussian Splatting

Nov 19, 2024

Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks

Sep 18, 2024

Critic-Actor for Average Reward MDPs with Function Approximation: A Finite-Time Analysis

Feb 02, 2024