Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kyungsoo Kim

Self-Predictive Dynamics for Generalization of Vision-based Reinforcement Learning

Jun 05, 2025

Kyungsoo Kim, Jeongsoo Ha, Yusung Kim

Abstract:Vision-based reinforcement learning requires efficient and robust representations of image-based observations, especially when the images contain distracting (task-irrelevant) elements such as shadows, clouds, and light. It becomes more important if those distractions are not exposed during training. We design a Self-Predictive Dynamics (SPD) method to extract task-relevant features efficiently, even in unseen observations after training. SPD uses weak and strong augmentations in parallel, and learns representations by predicting inverse and forward transitions across the two-way augmented versions. In a set of MuJoCo visual control tasks and an autonomous driving task (CARLA), SPD outperforms previous studies in complex observations, and significantly improves the generalization performance for unseen observations. Our code is available at https://github.com/unigary/SPD.

* IJCAI 2022

Via

Access Paper or Ask Questions

Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual Distractions

Jun 05, 2025

Jeongsoo Ha, Kyungsoo Kim, Yusung Kim

Figure 1 for Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual Distractions

Figure 2 for Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual Distractions

Figure 3 for Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual Distractions

Figure 4 for Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual Distractions

Abstract:Model-based reinforcement learning (MBRL) has been used to efficiently solve vision-based control tasks in highdimensional image observations. Although recent MBRL algorithms perform well in trained observations, they fail when faced with visual distractions in observations. These task-irrelevant distractions (e.g., clouds, shadows, and light) may be constantly present in real-world scenarios. In this study, we propose a novel self-supervised method, Dream to Generalize (Dr. G), for zero-shot MBRL. Dr. G trains its encoder and world model with dual contrastive learning which efficiently captures task-relevant features among multi-view data augmentations. We also introduce a recurrent state inverse dynamics model that helps the world model to better understand the temporal structure. The proposed methods can enhance the robustness of the world model against visual distractions. To evaluate the generalization performance, we first train Dr. G on simple backgrounds and then test it on complex natural video backgrounds in the DeepMind Control suite, and the randomizing environments in Robosuite. Dr. G yields a performance improvement of 117% and 14% over prior works, respectively. Our code is open-sourced and available at https://github.com/JeongsooHa/DrG.git

* AAAI 2023

Via

Access Paper or Ask Questions

Enhanced physics-informed neural networks with domain scaling and residual correction methods for multi-frequency elliptic problems

Nov 07, 2023

Deok-Kyu Jang, Hyea Hyun Kim, Kyungsoo Kim

Abstract:In this paper, neural network approximation methods are developed for elliptic partial differential equations with multi-frequency solutions. Neural network work approximation methods have advantages over classical approaches in that they can be applied without much concerns on the form of the differential equations or the shape or dimension of the problem domain. When applied to problems with multi-frequency solutions, the performance and accuracy of neural network approximation methods are strongly affected by the contrast of the high- and low-frequency parts in the solutions. To address this issue, domain scaling and residual correction methods are proposed. The efficiency and accuracy of the proposed methods are demonstrated for multi-frequency model problems.

Via

Access Paper or Ask Questions