Alert button
Picture for Michal Nauman

Michal Nauman

Alert button

A Case for Validation Buffer in Pessimistic Actor-Critic

Add code
Bookmark button
Alert button
Mar 01, 2024
Michal Nauman, Mateusz Ostaszewski, Marek Cygan

Figure 1 for A Case for Validation Buffer in Pessimistic Actor-Critic
Figure 2 for A Case for Validation Buffer in Pessimistic Actor-Critic
Figure 3 for A Case for Validation Buffer in Pessimistic Actor-Critic
Figure 4 for A Case for Validation Buffer in Pessimistic Actor-Critic
Viaarxiv icon

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 01, 2024
Michal Nauman, Michał Bortkiewicz, Mateusz Ostaszewski, Piotr Miłoś, Tomasz Trzciński, Marek Cygan

Figure 1 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 2 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 3 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 4 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Viaarxiv icon

Decoupled Actor-Critic

Add code
Bookmark button
Alert button
Oct 30, 2023
Michal Nauman, Marek Cygan

Viaarxiv icon

On All-Action Policy Gradients

Add code
Bookmark button
Alert button
Oct 24, 2022
Michal Nauman, Marek Cygan

Figure 1 for On All-Action Policy Gradients
Figure 2 for On All-Action Policy Gradients
Figure 3 for On All-Action Policy Gradients
Figure 4 for On All-Action Policy Gradients
Viaarxiv icon

Low-Variance Policy Gradient Estimation with World Models

Add code
Bookmark button
Alert button
Oct 29, 2020
Michal Nauman, Floris Den Hengst

Figure 1 for Low-Variance Policy Gradient Estimation with World Models
Figure 2 for Low-Variance Policy Gradient Estimation with World Models
Figure 3 for Low-Variance Policy Gradient Estimation with World Models
Figure 4 for Low-Variance Policy Gradient Estimation with World Models
Viaarxiv icon