Picture for Shalabh Bhatnagar

Shalabh Bhatnagar

High-Probability Bounds for SGD under the Polyak-Lojasiewicz Condition with Markovian Noise

Add code
Mar 15, 2026
Viaarxiv icon

Generalized Random Direction Newton Algorithms for Stochastic Optimization

Add code
Feb 23, 2026
Viaarxiv icon

Convergence of Multiagent Learning Systems for Traffic control

Add code
Nov 10, 2025
Figure 1 for Convergence of Multiagent Learning Systems for Traffic control
Figure 2 for Convergence of Multiagent Learning Systems for Traffic control
Viaarxiv icon

Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization

Add code
Nov 10, 2025
Viaarxiv icon

An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes

Add code
Feb 17, 2025
Figure 1 for An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes
Figure 2 for An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes
Figure 3 for An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes
Figure 4 for An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes
Viaarxiv icon

Gradient-Weighted Feature Back-Projection: A Fast Alternative to Feature Distillation in 3D Gaussian Splatting

Add code
Nov 19, 2024
Viaarxiv icon

Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks

Add code
Sep 18, 2024
Viaarxiv icon

Critic-Actor for Average Reward MDPs with Function Approximation: A Finite-Time Analysis

Add code
Feb 02, 2024
Viaarxiv icon

Approximate Linear Programming and Decentralized Policy Improvement in Cooperative Multi-agent Markov Decision Processes

Add code
Nov 20, 2023
Figure 1 for Approximate Linear Programming and Decentralized Policy Improvement in Cooperative Multi-agent Markov Decision Processes
Figure 2 for Approximate Linear Programming and Decentralized Policy Improvement in Cooperative Multi-agent Markov Decision Processes
Figure 3 for Approximate Linear Programming and Decentralized Policy Improvement in Cooperative Multi-agent Markov Decision Processes
Figure 4 for Approximate Linear Programming and Decentralized Policy Improvement in Cooperative Multi-agent Markov Decision Processes
Viaarxiv icon

Finite Time Analysis of Constrained Actor Critic and Constrained Natural Actor Critic Algorithms

Add code
Oct 25, 2023
Viaarxiv icon