Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xulin Gu

Temporal Saliency-Guided Distillation: A Scalable Framework for Distilling Video Datasets

May 27, 2025

Xulin Gu, Xinhao Zhong, Zhixing Wei, Yimin Zhou, Shuoyang Sun, Bin Chen, Hongpeng Wang, Yuan Luo

Figure 1 for Temporal Saliency-Guided Distillation: A Scalable Framework for Distilling Video Datasets

Figure 2 for Temporal Saliency-Guided Distillation: A Scalable Framework for Distilling Video Datasets

Figure 3 for Temporal Saliency-Guided Distillation: A Scalable Framework for Distilling Video Datasets

Figure 4 for Temporal Saliency-Guided Distillation: A Scalable Framework for Distilling Video Datasets

Abstract:Dataset distillation (DD) has emerged as a powerful paradigm for dataset compression, enabling the synthesis of compact surrogate datasets that approximate the training utility of large-scale ones. While significant progress has been achieved in distilling image datasets, extending DD to the video domain remains challenging due to the high dimensionality and temporal complexity inherent in video data. Existing video distillation (VD) methods often suffer from excessive computational costs and struggle to preserve temporal dynamics, as na\"ive extensions of image-based approaches typically lead to degraded performance. In this paper, we propose a novel uni-level video dataset distillation framework that directly optimizes synthetic videos with respect to a pre-trained model. To address temporal redundancy and enhance motion preservation, we introduce a temporal saliency-guided filtering mechanism that leverages inter-frame differences to guide the distillation process, encouraging the retention of informative temporal cues while suppressing frame-level redundancy. Extensive experiments on standard video benchmarks demonstrate that our method achieves state-of-the-art performance, bridging the gap between real and distilled video data and offering a scalable solution for video dataset compression.

Via

Access Paper or Ask Questions

Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization

Dec 13, 2024

Xinhao Zhong, Shuoyang Sun, Xulin Gu, Zhaoyang Xu, Yaowei Wang, Jianlong Wu, Bin Chen

Figure 1 for Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization

Figure 2 for Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization

Figure 3 for Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization

Figure 4 for Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization

Abstract:Dataset distillation offers an efficient way to reduce memory and computational costs by optimizing a smaller dataset with performance comparable to the full-scale original. However, for large datasets and complex deep networks (e.g., ImageNet-1K with ResNet-101), the extensive optimization space limits performance, reducing its practicality. Recent approaches employ pre-trained diffusion models to generate informative images directly, avoiding pixel-level optimization and achieving notable results. However, these methods often face challenges due to distribution shifts between pre-trained models and target datasets, along with the need for multiple distillation steps across varying settings. To address these issues, we propose a novel framework orthogonal to existing diffusion-based distillation methods, leveraging diffusion models for selection rather than generation. Our method starts by predicting noise generated by the diffusion model based on input images and text prompts (with or without label text), then calculates the corresponding loss for each pair. With the loss differences, we identify distinctive regions of the original images. Additionally, we perform intra-class clustering and ranking on selected patches to maintain diversity constraints. This streamlined framework enables a single-step distillation process, and extensive experiments demonstrate that our approach outperforms state-of-the-art methods across various metrics.

* arXiv admin note: text overlap with arXiv:2408.02752 by other authors

Via

Access Paper or Ask Questions

Going Beyond Feature Similarity: Effective Dataset distillation based on Class-aware Conditional Mutual Information

Dec 13, 2024

Xinhao Zhong, Bin Chen, Hao Fang, Xulin Gu, Shu-Tao Xia, En-Hui Yang

Figure 1 for Going Beyond Feature Similarity: Effective Dataset distillation based on Class-aware Conditional Mutual Information

Figure 2 for Going Beyond Feature Similarity: Effective Dataset distillation based on Class-aware Conditional Mutual Information

Figure 3 for Going Beyond Feature Similarity: Effective Dataset distillation based on Class-aware Conditional Mutual Information

Figure 4 for Going Beyond Feature Similarity: Effective Dataset distillation based on Class-aware Conditional Mutual Information

Abstract:Dataset distillation (DD) aims to minimize the time and memory consumption needed for training deep neural networks on large datasets, by creating a smaller synthetic dataset that has similar performance to that of the full real dataset. However, current dataset distillation methods often result in synthetic datasets that are excessively difficult for networks to learn from, due to the compression of a substantial amount of information from the original data through metrics measuring feature similarity, e,g., distribution matching (DM). In this work, we introduce conditional mutual information (CMI) to assess the class-aware complexity of a dataset and propose a novel method by minimizing CMI. Specifically, we minimize the distillation loss while constraining the class-aware complexity of the synthetic dataset by minimizing its empirical CMI from the feature space of pre-trained networks, simultaneously. Conducting on a thorough set of experiments, we show that our method can serve as a general regularization method to existing DD methods and improve the performance and training efficiency.

Via

Access Paper or Ask Questions

Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

Jun 12, 2024

Xinhao Zhong, Hao Fang, Bin Chen, Xulin Gu, Tao Dai, Meikang Qiu, Shu-Tao Xia

Figure 1 for Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

Figure 2 for Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

Figure 3 for Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

Figure 4 for Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

Abstract:Dataset distillation is an emerging dataset reduction method, which condenses large-scale datasets while maintaining task accuracy. Current methods have integrated parameterization techniques to boost synthetic dataset performance by shifting the optimization space from pixel to another informative feature domain. However, they limit themselves to a fixed optimization space for distillation, neglecting the diverse guidance across different informative latent spaces. To overcome this limitation, we propose a novel parameterization method dubbed Hierarchical Generative Latent Distillation (H-GLaD), to systematically explore hierarchical layers within the generative adversarial networks (GANs). This allows us to progressively span from the initial latent space to the final pixel space. In addition, we introduce a novel class-relevant feature distance metric to alleviate the computational burden associated with synthetic dataset evaluation, bridging the gap between synthetic and original datasets. Experimental results demonstrate that the proposed H-GLaD achieves a significant improvement in both same-architecture and cross-architecture performance with equivalent time consumption.

Via

Access Paper or Ask Questions