L. A. Prashanth

Optimizing Shortfall Risk Metric for Learning Regression Models

May 23, 2025
Via arXiv

Concentration Bounds for Optimized Certainty Equivalent Risk Estimation

May 31, 2024
Via arXiv

Stochastic approximation for speeding up LSTD (and LSPI)

Nov 28, 2017
Via arXiv

Weighted bandits or: How bandits learn distorted values that are not expected

Nov 30, 2016
Via arXiv

On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence

Sep 01, 2015
Via arXiv

Actor-Critic Algorithms for Learning Nash Equilibria in N-player General-Sum Games

Jul 02, 2015
Via arXiv

Simultaneous Perturbation Algorithms for Batch Off-Policy Search

Mar 31, 2014
Via arXiv