Alert button
Picture for Mateusz Ostaszewski

Mateusz Ostaszewski

Alert button

A Case for Validation Buffer in Pessimistic Actor-Critic

Add code
Bookmark button
Alert button
Mar 01, 2024
Michal Nauman, Mateusz Ostaszewski, Marek Cygan

Figure 1 for A Case for Validation Buffer in Pessimistic Actor-Critic
Figure 2 for A Case for Validation Buffer in Pessimistic Actor-Critic
Figure 3 for A Case for Validation Buffer in Pessimistic Actor-Critic
Figure 4 for A Case for Validation Buffer in Pessimistic Actor-Critic
Viaarxiv icon

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 01, 2024
Michal Nauman, Michał Bortkiewicz, Mateusz Ostaszewski, Piotr Miłoś, Tomasz Trzciński, Marek Cygan

Figure 1 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 2 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 3 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 4 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Viaarxiv icon

Curriculum reinforcement learning for quantum architecture search under hardware errors

Add code
Bookmark button
Alert button
Feb 05, 2024
Yash J. Patel, Akash Kundu, Mateusz Ostaszewski, Xavier Bonet-Monroig, Vedran Dunjko, Onur Danaci

Viaarxiv icon

Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem

Add code
Bookmark button
Alert button
Feb 05, 2024
Maciej Wołczyk, Bartłomiej Cupiał, Mateusz Ostaszewski, Michał Bortkiewicz, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś

Viaarxiv icon

On consequences of finetuning on data with highly discriminative features

Add code
Bookmark button
Alert button
Oct 30, 2023
Wojciech Masarczyk, Tomasz Trzciński, Mateusz Ostaszewski

Viaarxiv icon

Enhancing variational quantum state diagonalization using reinforcement learning techniques

Add code
Bookmark button
Alert button
Jun 22, 2023
Akash Kundu, Przemysław Bedełek, Mateusz Ostaszewski, Onur Danaci, Yash J. Patel, Vedran Dunjko, Jarosław A. Miszczak

Viaarxiv icon

Enhancing quantum variational state diagonalization using reinforcement learning techniques

Add code
Bookmark button
Alert button
Jun 19, 2023
Akash Kundu, Przemysław Bedełek, Mateusz Ostaszewski, Onur Danaci, Vedran Dunjko, Jarosław A. Miszczak

Viaarxiv icon

The Tunnel Effect: Building Data Representations in Deep Neural Networks

Add code
Bookmark button
Alert button
May 31, 2023
Wojciech Masarczyk, Mateusz Ostaszewski, Ehsan Imani, Razvan Pascanu, Piotr Miłoś, Tomasz Trzciński

Figure 1 for The Tunnel Effect: Building Data Representations in Deep Neural Networks
Figure 2 for The Tunnel Effect: Building Data Representations in Deep Neural Networks
Figure 3 for The Tunnel Effect: Building Data Representations in Deep Neural Networks
Figure 4 for The Tunnel Effect: Building Data Representations in Deep Neural Networks
Viaarxiv icon

Emergency action termination for immediate reaction in hierarchical reinforcement learning

Add code
Bookmark button
Alert button
Nov 11, 2022
Michał Bortkiewicz, Jakub Łyskawa, Paweł Wawrzyński, Mateusz Ostaszewski, Artur Grudkowski, Tomasz Trzciński

Figure 1 for Emergency action termination for immediate reaction in hierarchical reinforcement learning
Figure 2 for Emergency action termination for immediate reaction in hierarchical reinforcement learning
Figure 3 for Emergency action termination for immediate reaction in hierarchical reinforcement learning
Figure 4 for Emergency action termination for immediate reaction in hierarchical reinforcement learning
Viaarxiv icon

Reinforcement learning with experience replay and adaptation of action dispersion

Add code
Bookmark button
Alert button
Jul 30, 2022
Paweł Wawrzyński, Wojciech Masarczyk, Mateusz Ostaszewski

Figure 1 for Reinforcement learning with experience replay and adaptation of action dispersion
Figure 2 for Reinforcement learning with experience replay and adaptation of action dispersion
Figure 3 for Reinforcement learning with experience replay and adaptation of action dispersion
Figure 4 for Reinforcement learning with experience replay and adaptation of action dispersion
Viaarxiv icon