Steve Marcus

Weighted bandits or: How bandits learn distorted values that are not expected

Nov 30, 2016
Aditya Gopalan, L. A. Prashanth, Michael Fu, Steve Marcus

Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control

Feb 26, 2016
Prashanth L. A., Cheng Jie, Michael Fu, Steve Marcus, Csaba Szepesvári

Adaptive system optimization using random directions stochastic approximation

Aug 08, 2015
Prashanth L. A., Shalabh Bhatnagar, Michael Fu, Steve Marcus
