S. R. Eshwar

Reliable Policy Iteration: Performance Robustness Across Architecture and Environment Perturbations

Dec 12, 2025
Via arXiv

Reinforcement Learning with Quasi-Hyperbolic Discounting

Sep 16, 2024
Via arXiv

Online Learning of Weakly Coupled MDP Policies for Load Balancing and Auto Scaling

Jun 20, 2024
Via arXiv