Peijia Li

RL-Selector: Reinforcement Learning-Guided Data Selection via Redundancy Assessment

Jun 26, 2025

Suorong Yang, Peijia Li, Furao Shen, Jian Zhao

Abstract: Modern deep architectures often rely on large-scale datasets, but training on these datasets incurs high computational and storage overhead. Real-world datasets often contain substantial redundancies, prompting the need for more data-efficient training paradigms. Data selection has shown promise in mitigating redundancy by identifying the most representative samples, thereby reducing training costs without compromising performance. Existing methods typically rely on static scoring metrics or pretrained models, overlooking the combined effect of selected samples and their evolving dynamics during training. We introduce the concept of epsilon-sample cover, which quantifies sample redundancy based on inter-sample relationships, capturing the intrinsic structure of the dataset. Based on this, we reformulate data selection as a reinforcement learning (RL) process and propose RL-Selector, where a lightweight RL agent optimizes the selection policy by leveraging epsilon-sample cover, derived from the evolving dataset distribution, as a reward signal. Extensive experiments across benchmark datasets and diverse architectures demonstrate that our method consistently outperforms existing state-of-the-art baselines. Models trained with our selected datasets show enhanced generalization performance with improved training efficiency.
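
The abstract does not spell out the definition of epsilon-sample cover, so the following is only a minimal sketch of one plausible reading: a sample counts as covered when some other selected sample lies within Euclidean distance `eps` of it in feature space. The function names `epsilon_covered` and `redundancy`, the choice of Euclidean distance, and the toy feature vectors are all assumptions made for illustration, not the paper's formulation.

```python
import math

def epsilon_covered(features, selected, eps):
    """Indices of samples lying within eps of at least one OTHER selected sample.

    Assumption: 'covered' means Euclidean distance <= eps in feature space.
    """
    covered = []
    for i, x in enumerate(features):
        for j in selected:
            if j != i and math.dist(x, features[j]) <= eps:
                covered.append(i)
                break
    return covered

def redundancy(features, selected, eps):
    """Fraction of the selected samples that another selected sample covers."""
    if not selected:
        return 0.0
    chosen = set(selected)
    hits = [i for i in epsilon_covered(features, selected, eps) if i in chosen]
    return len(hits) / len(selected)

# Toy data (hypothetical): two near-duplicate points and one distant point.
features = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0)]
print(redundancy(features, [0, 1, 2], eps=0.5))  # both near-duplicates are redundant
print(redundancy(features, [0, 2], eps=0.5))     # no selected sample covers another
```

Under this reading, a selection policy could be rewarded for keeping the redundancy of the selected set low while still covering the unselected samples; how RL-Selector actually turns the cover into a reward is not stated in the abstract.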

* ICCV 2025

AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation

May 23, 2024

Suorong Yang, Peijia Li, Xin Xiong, Furao Shen, Jian Zhao

Abstract: Data augmentation (DA) is widely employed to improve the generalization performance of deep models. However, most existing DA methods use augmentation operations with random magnitudes throughout training. While this fosters diversity, it can also introduce uncontrolled variability in augmented data, which may cause misalignment with the evolving training status of the target models. Both theoretical and empirical findings suggest that this misalignment increases the risks of underfitting and overfitting. To address these limitations, we propose AdaAugment, an innovative and tuning-free Adaptive Augmentation method that utilizes reinforcement learning to dynamically adjust augmentation magnitudes for individual training samples based on real-time feedback from the target network. Specifically, AdaAugment features a dual-model architecture consisting of a policy network and a target network, which are jointly optimized to effectively adapt augmentation magnitudes. The policy network optimizes the variability within the augmented data, while the target network utilizes the adaptively augmented samples for training. Extensive experiments across benchmark datasets and deep architectures demonstrate that AdaAugment consistently outperforms other state-of-the-art DA methods in effectiveness while maintaining remarkable efficiency.
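
To make the idea of per-sample magnitude adaptation concrete, here is a toy sketch of the feedback loop. It deliberately swaps the paper's reinforcement-learning policy network for a hand-written threshold rule, so it illustrates only the direction of the adjustment, not AdaAugment itself. The function `update_magnitude`, the `target_loss` threshold, the `step` size, and the loss values are all hypothetical.

```python
def update_magnitude(magnitude, loss, target_loss=1.0, step=0.1):
    """Nudge one sample's augmentation magnitude using the target network's loss.

    Assumption: a low loss means the sample is easy, so augmentation gets
    stronger; a high loss means it is hard, so augmentation is eased off.
    The result is clamped to the range [0, 1].
    """
    if loss < target_loss:
        magnitude += step
    else:
        magnitude -= step
    return min(1.0, max(0.0, magnitude))

# Hypothetical per-sample state: current magnitudes and the latest losses.
magnitudes = {"sample_a": 0.5, "sample_b": 0.5}
losses = {"sample_a": 0.2, "sample_b": 2.0}

for name in magnitudes:
    magnitudes[name] = update_magnitude(magnitudes[name], losses[name])

print(magnitudes)
```

In the method described above, a learned policy network takes the place of this fixed rule and is optimized jointly with the target network.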
