Alert button
Picture for Paulo Rauber

Paulo Rauber

Alert button

Posterior Sampling for Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 30, 2023
Remo Sasso, Michelangelo Conserva, Paulo Rauber

Figure 1 for Posterior Sampling for Deep Reinforcement Learning
Figure 2 for Posterior Sampling for Deep Reinforcement Learning
Figure 3 for Posterior Sampling for Deep Reinforcement Learning
Figure 4 for Posterior Sampling for Deep Reinforcement Learning
Viaarxiv icon

Hardness in Markov Decision Processes: Theory and Practice

Add code
Bookmark button
Alert button
Oct 24, 2022
Michelangelo Conserva, Paulo Rauber

Figure 1 for Hardness in Markov Decision Processes: Theory and Practice
Figure 2 for Hardness in Markov Decision Processes: Theory and Practice
Figure 3 for Hardness in Markov Decision Processes: Theory and Practice
Figure 4 for Hardness in Markov Decision Processes: Theory and Practice
Viaarxiv icon

Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits

Add code
Bookmark button
Alert button
Jul 09, 2020
Aditya Ramesh, Paulo Rauber, Jürgen Schmidhuber

Figure 1 for Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits
Figure 2 for Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits
Figure 3 for Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits
Figure 4 for Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits
Viaarxiv icon

Hindsight policy gradients

Add code
Bookmark button
Alert button
Feb 20, 2019
Paulo Rauber, Avinash Ummadisingu, Filipe Mutz, Juergen Schmidhuber

Figure 1 for Hindsight policy gradients
Figure 2 for Hindsight policy gradients
Figure 3 for Hindsight policy gradients
Figure 4 for Hindsight policy gradients
Viaarxiv icon