Alert button
Picture for Paniz Behboudian

Paniz Behboudian

Alert button

Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 14, 2023
Yash Satsangi, Paniz Behboudian

Figure 1 for Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning
Figure 2 for Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning
Figure 3 for Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning
Viaarxiv icon

Useful Policy Invariant Shaping from Arbitrary Advice

Add code
Bookmark button
Alert button
Nov 02, 2020
Paniz Behboudian, Yash Satsangi, Matthew E. Taylor, Anna Harutyunyan, Michael Bowling

Figure 1 for Useful Policy Invariant Shaping from Arbitrary Advice
Figure 2 for Useful Policy Invariant Shaping from Arbitrary Advice
Figure 3 for Useful Policy Invariant Shaping from Arbitrary Advice
Figure 4 for Useful Policy Invariant Shaping from Arbitrary Advice
Viaarxiv icon