Alert button
Picture for Michał Bortkiewicz

Michał Bortkiewicz

Alert button

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 01, 2024
Michal Nauman, Michał Bortkiewicz, Mateusz Ostaszewski, Piotr Miłoś, Tomasz Trzciński, Marek Cygan

Figure 1 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 2 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 3 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 4 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Viaarxiv icon

Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem

Add code
Bookmark button
Alert button
Feb 05, 2024
Maciej Wołczyk, Bartłomiej Cupiał, Mateusz Ostaszewski, Michał Bortkiewicz, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś

Viaarxiv icon

Emergency action termination for immediate reaction in hierarchical reinforcement learning

Add code
Bookmark button
Alert button
Nov 11, 2022
Michał Bortkiewicz, Jakub Łyskawa, Paweł Wawrzyński, Mateusz Ostaszewski, Artur Grudkowski, Tomasz Trzciński

Figure 1 for Emergency action termination for immediate reaction in hierarchical reinforcement learning
Figure 2 for Emergency action termination for immediate reaction in hierarchical reinforcement learning
Figure 3 for Emergency action termination for immediate reaction in hierarchical reinforcement learning
Figure 4 for Emergency action termination for immediate reaction in hierarchical reinforcement learning
Viaarxiv icon

Progressive Latent Replay for efficient Generative Rehearsal

Add code
Bookmark button
Alert button
Jul 05, 2022
Stanisław Pawlak, Filip Szatkowski, Michał Bortkiewicz, Jan Dubiński, Tomasz Trzciński

Figure 1 for Progressive Latent Replay for efficient Generative Rehearsal
Figure 2 for Progressive Latent Replay for efficient Generative Rehearsal
Figure 3 for Progressive Latent Replay for efficient Generative Rehearsal
Figure 4 for Progressive Latent Replay for efficient Generative Rehearsal
Viaarxiv icon