Alert button
Picture for Sridhar Mahadevan

Sridhar Mahadevan

Alert button

Universal Decision Models

Add code
Bookmark button
Alert button
Oct 28, 2021
Sridhar Mahadevan

Figure 1 for Universal Decision Models
Figure 2 for Universal Decision Models
Figure 3 for Universal Decision Models
Figure 4 for Universal Decision Models
Viaarxiv icon

Causal Inference in Network Economics

Add code
Bookmark button
Alert button
Sep 20, 2021
Sridhar Mahadevan

Figure 1 for Causal Inference in Network Economics
Figure 2 for Causal Inference in Network Economics
Figure 3 for Causal Inference in Network Economics
Figure 4 for Causal Inference in Network Economics
Viaarxiv icon

Asymptotic Causal Inference

Add code
Bookmark button
Alert button
Sep 20, 2021
Sridhar Mahadevan

Figure 1 for Asymptotic Causal Inference
Figure 2 for Asymptotic Causal Inference
Figure 3 for Asymptotic Causal Inference
Figure 4 for Asymptotic Causal Inference
Viaarxiv icon

Multiscale Manifold Warping

Add code
Bookmark button
Alert button
Sep 19, 2021
Sridhar Mahadevan, Anup Rao, Georgios Theocharous, Jennifer Healey

Figure 1 for Multiscale Manifold Warping
Figure 2 for Multiscale Manifold Warping
Figure 3 for Multiscale Manifold Warping
Figure 4 for Multiscale Manifold Warping
Viaarxiv icon

Finite-Sample Analysis of Proximal Gradient TD Algorithms

Add code
Bookmark button
Alert button
Jul 03, 2020
Bo Liu, Ji Liu, Mohammad Ghavamzadeh, Sridhar Mahadevan, Marek Petrik

Figure 1 for Finite-Sample Analysis of Proximal Gradient TD Algorithms
Figure 2 for Finite-Sample Analysis of Proximal Gradient TD Algorithms
Figure 3 for Finite-Sample Analysis of Proximal Gradient TD Algorithms
Figure 4 for Finite-Sample Analysis of Proximal Gradient TD Algorithms
Viaarxiv icon

Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity

Add code
Bookmark button
Alert button
Jun 06, 2020
Bo Liu, Ian Gemp, Mohammad Ghavamzadeh, Ji Liu, Sridhar Mahadevan, Marek Petrik

Figure 1 for Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity
Figure 2 for Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity
Figure 3 for Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity
Figure 4 for Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity
Viaarxiv icon

Regularized Off-Policy TD-Learning

Add code
Bookmark button
Alert button
Jun 06, 2020
Bo Liu, Sridhar Mahadevan, Ji Liu

Figure 1 for Regularized Off-Policy TD-Learning
Figure 2 for Regularized Off-Policy TD-Learning
Figure 3 for Regularized Off-Policy TD-Learning
Viaarxiv icon

Finite-Sample Analysis of GTD Algorithms

Add code
Bookmark button
Alert button
Jun 06, 2020
Bo Liu, Ji Liu, Mohammad Ghavamzadeh, Sridhar Mahadevan, Marek Petrik

Figure 1 for Finite-Sample Analysis of GTD Algorithms
Figure 2 for Finite-Sample Analysis of GTD Algorithms
Figure 3 for Finite-Sample Analysis of GTD Algorithms
Figure 4 for Finite-Sample Analysis of GTD Algorithms
Viaarxiv icon

Optimizing for the Future in Non-Stationary MDPs

Add code
Bookmark button
Alert button
Jun 02, 2020
Yash Chandak, Georgios Theocharous, Shiv Shankar, Martha White, Sridhar Mahadevan, Philip S. Thomas

Figure 1 for Optimizing for the Future in Non-Stationary MDPs
Figure 2 for Optimizing for the Future in Non-Stationary MDPs
Figure 3 for Optimizing for the Future in Non-Stationary MDPs
Figure 4 for Optimizing for the Future in Non-Stationary MDPs
Viaarxiv icon