Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic

Mar 18, 2024
Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha

Figures 1-3 for Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic

View paper on arXiv
