Alert button

TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning

Add code
Bookmark button
Alert button
May 17, 2019
Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan

Figure 1 for TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning
Figure 2 for TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning
Figure 3 for TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning
Figure 4 for TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: