Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Byungchan Ko

Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

Jun 03, 2022

Byungchan Ko, Jungseul Ok

Figure 1 for Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

Figure 2 for Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

Figure 3 for Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

Figure 4 for Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

Abstract:In deep reinforcement learning (RL), data augmentation is widely considered as a tool to induce a set of useful priors about semantic consistency and improve sample efficiency and generalization performance. However, even when the prior is useful for generalization, distilling it to RL agent often interferes with RL training and degenerates sample efficiency. Meanwhile, the agent is forgetful of the prior due to the non-stationary nature of RL. These observations suggest two extreme schedules of distillation: (i) over the entire training; or (ii) only at the end. Hence, we devise a stand-alone network distillation method to inject the consistency prior at any time (even after RL), and a simple yet efficient framework to automatically schedule the distillation. Specifically, the proposed framework first focuses on mastering train environments regardless of generalization by adaptively deciding which {\it or no} augmentation to be used for the training. After this, we add the distillation to extract the remaining benefits for generalization from all the augmentations, which requires no additional new samples. In our experiments, we demonstrate the utility of the proposed framework, in particular, that considers postponing the augmentation to the end of RL training.

* I want to update my previous submitted paper arXiv:2102.08581,rather than new submission. Thus, I want to withdraw this submission, and i already replace 2102.08581 by updated paper

Via

Access Paper or Ask Questions

Time Matters in Using Data Augmentation for Vision-based Deep Reinforcement Learning

Feb 17, 2021

Byungchan Ko, Jungseul Ok

Figure 1 for Time Matters in Using Data Augmentation for Vision-based Deep Reinforcement Learning

Figure 2 for Time Matters in Using Data Augmentation for Vision-based Deep Reinforcement Learning

Figure 3 for Time Matters in Using Data Augmentation for Vision-based Deep Reinforcement Learning

Figure 4 for Time Matters in Using Data Augmentation for Vision-based Deep Reinforcement Learning

Abstract:Data augmentation technique from computer vision has been widely considered as a regularization method to improve data efficiency and generalization performance in vision-based reinforcement learning. We variate the timing of using augmentation, which is, in turn, critical depending on tasks to be solved in training and testing. According to our experiments on Open AI Procgen Benchmark, if the regularization imposed by augmentation is helpful only in testing, it is better to procrastinate the augmentation after training than to use it during training in terms of sample and computation complexity. We note that some of such augmentations can disturb the training process. Conversely, an augmentation providing regularization useful in training needs to be used during the whole training period to fully utilize its benefit in terms of not only generalization but also data efficiency. These phenomena suggest a useful timing control of data augmentation in reinforcement learning.

Via

Access Paper or Ask Questions