Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Julien Mairal

LJK

On Good Practices for Task-Specific Distillation of Large Pretrained Models

Feb 17, 2024

Juliette Marrie, Michael Arbel, Julien Mairal, Diane Larlus

Figure 1 for On Good Practices for Task-Specific Distillation of Large Pretrained Models

Figure 2 for On Good Practices for Task-Specific Distillation of Large Pretrained Models

Figure 3 for On Good Practices for Task-Specific Distillation of Large Pretrained Models

Figure 4 for On Good Practices for Task-Specific Distillation of Large Pretrained Models

Abstract:Large pretrained visual models exhibit remarkable generalization across diverse recognition tasks. Yet, real-world applications often demand compact models tailored to specific problems. Variants of knowledge distillation have been devised for such a purpose, enabling task-specific compact models (the students) to learn from a generic large pretrained one (the teacher). In this paper, we show that the excellent robustness and versatility of recent pretrained models challenge common practices established in the literature, calling for a new set of optimal guidelines for task-specific distillation. To address the lack of samples in downstream tasks, we also show that a variant of Mixup based on stable diffusion complements standard data augmentation. This strategy eliminates the need for engineered text prompts and improves distillation of generic models into streamlined specialized networks.

Via

Access Paper or Ask Questions

Fast Semi-supervised Unmixing using Non-convex Optimization

Jan 23, 2024

Behnood Rasti, Alexandre Zouaoui, Julien Mairal, Jocelyn Chanussot

Abstract:In this paper, we introduce a novel linear model tailored for semisupervised/library-based unmixing. Our model incorporates considerations for library mismatch while enabling the enforcement of the abundance sum-to-one constraint (ASC). Unlike conventional sparse unmixing methods, this model involves nonconvex optimization, presenting significant computational challenges. We demonstrate the efficacy of Alternating Methods of Multipliers (ADMM) in cyclically solving these intricate problems. We propose two semisupervised unmixing approaches, each relying on distinct priors applied to the new model in addition to the ASC: sparsity prior and convexity constraint. Our experimental results validate that enforcing the convexity constraint outperforms the sparsity prior for the endmember library. These results are corroborated across three simulated datasets (accounting for spectral variability and varying pixel purity levels) and the Cuprite dataset. Additionally, our comparison with conventional sparse unmixing methods showcases considerable advantages of our proposed model, which entails nonconvex optimization. Notably, our implementations of the proposed algorithms-fast semisupervised unmixing (FaSUn) and sparse unmixing using soft-shrinkage (SUnS)-prove considerably more efficient than traditional sparse unmixing methods. SUnS and FaSUn were implemented using PyTorch and provided in a dedicated Python package called Fast Semisupervised Unmixing (FUnmix), which is open-source and available at https://github.com/BehnoodRasti/FUnmix

Via

Access Paper or Ask Questions

Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation

Dec 08, 2023

Bruno Lecouat, Yann Dubois de Mont-Marin, Théo Bodrito, Julien Mairal, Jean Ponce

Abstract:This paper introduces a novel approach to the fine alignment of images in a burst captured by a handheld camera. In contrast to traditional techniques that estimate two-dimensional transformations between frame pairs or rely on discrete correspondences, the proposed algorithm establishes dense correspondences by optimizing both the camera motion and surface depth and orientation at every pixel. This approach improves alignment, particularly in scenarios with parallax challenges. Extensive experiments with synthetic bursts featuring small and even tiny baselines demonstrate that it outperforms the best optical flow methods available today in this setting, without requiring any training. Beyond enhanced alignment, our method opens avenues for tasks beyond simple image restoration, such as depth estimation and 3D reconstruction, as supported by promising preliminary results. This positions our approach as a versatile tool for various burst image processing applications.

Via

Access Paper or Ask Questions

Towards Real-World Focus Stacking with Deep Learning

Nov 29, 2023

Alexandre Araujo, Jean Ponce, Julien Mairal

Figure 1 for Towards Real-World Focus Stacking with Deep Learning

Figure 2 for Towards Real-World Focus Stacking with Deep Learning

Figure 3 for Towards Real-World Focus Stacking with Deep Learning

Figure 4 for Towards Real-World Focus Stacking with Deep Learning

Abstract:Focus stacking is widely used in micro, macro, and landscape photography to reconstruct all-in-focus images from multiple frames obtained with focus bracketing, that is, with shallow depth of field and different focus planes. Existing deep learning approaches to the underlying multi-focus image fusion problem have limited applicability to real-world imagery since they are designed for very short image sequences (two to four images), and are typically trained on small, low-resolution datasets either acquired by light-field cameras or generated synthetically. We introduce a new dataset consisting of 94 high-resolution bursts of raw images with focus bracketing, with pseudo ground truth computed from the data using state-of-the-art commercial software. This dataset is used to train the first deep learning algorithm for focus stacking capable of handling bursts of sufficient length for real-world applications. Qualitative experiments demonstrate that it is on par with existing commercial solutions in the long-burst, realistic regime while being significantly more tolerant to noise. The code and dataset are available at https://github.com/araujoalexandre/FocusStackingDataset.

Via

Access Paper or Ask Questions

Vision Transformers Need Registers

Sep 28, 2023

Timothée Darcet, Maxime Oquab, Julien Mairal, Piotr Bojanowski

Figure 1 for Vision Transformers Need Registers

Figure 2 for Vision Transformers Need Registers

Figure 3 for Vision Transformers Need Registers

Figure 4 for Vision Transformers Need Registers

Abstract:Transformers have recently emerged as a powerful tool for learning visual representations. In this paper, we identify and characterize artifacts in feature maps of both supervised and self-supervised ViT networks. The artifacts correspond to high-norm tokens appearing during inference primarily in low-informative background areas of images, that are repurposed for internal computations. We propose a simple yet effective solution based on providing additional tokens to the input sequence of the Vision Transformer to fill that role. We show that this solution fixes that problem entirely for both supervised and self-supervised models, sets a new state of the art for self-supervised visual models on dense visual prediction tasks, enables object discovery methods with larger models, and most importantly leads to smoother feature maps and attention maps for downstream visual processing.

Via

Access Paper or Ask Questions

Image Processing and Machine Learning for Hyperspectral Unmixing: An Overview and the HySUPP Python Package

Aug 18, 2023

Behnood Rasti, Alexandre Zouaoui, Julien Mairal, Jocelyn Chanussot

Figure 1 for Image Processing and Machine Learning for Hyperspectral Unmixing: An Overview and the HySUPP Python Package

Figure 2 for Image Processing and Machine Learning for Hyperspectral Unmixing: An Overview and the HySUPP Python Package

Figure 3 for Image Processing and Machine Learning for Hyperspectral Unmixing: An Overview and the HySUPP Python Package

Figure 4 for Image Processing and Machine Learning for Hyperspectral Unmixing: An Overview and the HySUPP Python Package

Abstract:Spectral pixels are often a mixture of the pure spectra of the materials, called endmembers, due to the low spatial resolution of hyperspectral sensors, double scattering, and intimate mixtures of materials in the scenes. Unmixing estimates the fractional abundances of the endmembers within the pixel. Depending on the prior knowledge of endmembers, linear unmixing can be divided into three main groups: supervised, semi-supervised, and unsupervised (blind) linear unmixing. Advances in Image processing and machine learning substantially affected unmixing. This paper provides an overview of advanced and conventional unmixing approaches. Additionally, we draw a critical comparison between advanced and conventional techniques from the three categories. We compare the performance of the unmixing techniques on three simulated and two real datasets. The experimental results reveal the advantages of different unmixing categories for different unmixing scenarios. Moreover, we provide an open-source Python-based package available at https://github.com/BehnoodRasti/HySUPP to reproduce the results.

Via

Access Paper or Ask Questions

SUnAA: Sparse Unmixing using Archetypal Analysis

Aug 09, 2023

Behnood Rasti, Alexandre Zouaoui, Julien Mairal, Jocelyn Chanussot

Abstract:This paper introduces a new sparse unmixing technique using archetypal analysis (SUnAA). First, we design a new model based on archetypal analysis. We assume that the endmembers of interest are a convex combination of endmembers provided by a spectral library and that the number of endmembers of interest is known. Then, we propose a minimization problem. Unlike most conventional sparse unmixing methods, here the minimization problem is non-convex. We minimize the optimization objective iteratively using an active set algorithm. Our method is robust to the initialization and only requires the number of endmembers of interest. SUnAA is evaluated using two simulated datasets for which results confirm its better performance over other conventional and advanced techniques in terms of signal-to-reconstruction error. SUnAA is also applied to Cuprite dataset and the results are compared visually with the available geological map provided for this dataset. The qualitative assessment demonstrates the successful estimation of the minerals abundances and significantly improves the detection of dominant minerals compared to the conventional regression-based sparse unmixing methods. The Python implementation of SUnAA can be found at: https://github.com/BehnoodRasti/SUnAA.

* IEEE Geoscience and Remote Sensing Letters, 2023, 20, pp.1-5

Via

Access Paper or Ask Questions

GloptiNets: Scalable Non-Convex Optimization with Certificates

Jun 26, 2023

Gaspard Beugnot, Julien Mairal, Alessandro Rudi

Figure 1 for GloptiNets: Scalable Non-Convex Optimization with Certificates

Figure 2 for GloptiNets: Scalable Non-Convex Optimization with Certificates

Figure 3 for GloptiNets: Scalable Non-Convex Optimization with Certificates

Figure 4 for GloptiNets: Scalable Non-Convex Optimization with Certificates

Abstract:We present a novel approach to non-convex optimization with certificates, which handles smooth functions on the hypercube or on the torus. Unlike traditional methods that rely on algebraic properties, our algorithm exploits the regularity of the target function intrinsic in the decay of its Fourier spectrum. By defining a tractable family of models, we allow at the same time to obtain precise certificates and to leverage the advanced and powerful computational techniques developed to optimize neural networks. In this way the scalability of our approach is naturally enhanced by parallel computing with GPUs. Our approach, when applied to the case of polynomials of moderate dimensions but with thousands of coefficients, outperforms the state-of-the-art optimization methods with certificates, as the ones based on Lasserre's hierarchy, addressing problems intractable for the competitors.

Via

Access Paper or Ask Questions

Combining multi-spectral data with statistical and deep-learning models for improved exoplanet detection in direct imaging at high contrast

Jun 21, 2023

Olivier Flasseur, Théo Bodrito, Julien Mairal, Jean Ponce, Maud Langlois, Anne-Marie Lagrange

Abstract:Exoplanet detection by direct imaging is a difficult task: the faint signals from the objects of interest are buried under a spatially structured nuisance component induced by the host star. The exoplanet signals can only be identified when combining several observations with dedicated detection algorithms. In contrast to most of existing methods, we propose to learn a model of the spatial, temporal and spectral characteristics of the nuisance, directly from the observations. In a pre-processing step, a statistical model of their correlations is built locally, and the data are centered and whitened to improve both their stationarity and signal-to-noise ratio (SNR). A convolutional neural network (CNN) is then trained in a supervised fashion to detect the residual signature of synthetic sources in the pre-processed images. Our method leads to a better trade-off between precision and recall than standard approaches in the field. It also outperforms a state-of-the-art algorithm based solely on a statistical framework. Besides, the exploitation of the spectral diversity improves the performance compared to a similar model built solely from spatio-temporal data.

* accepted to EUSIPCO 2023

Via

Access Paper or Ask Questions

SLACK: Stable Learning of Augmentations with Cold-start and KL regularization

Jun 16, 2023

Juliette Marrie, Michael Arbel, Diane Larlus, Julien Mairal

Figure 1 for SLACK: Stable Learning of Augmentations with Cold-start and KL regularization

Figure 2 for SLACK: Stable Learning of Augmentations with Cold-start and KL regularization

Figure 3 for SLACK: Stable Learning of Augmentations with Cold-start and KL regularization

Figure 4 for SLACK: Stable Learning of Augmentations with Cold-start and KL regularization

Abstract:Data augmentation is known to improve the generalization capabilities of neural networks, provided that the set of transformations is chosen with care, a selection often performed manually. Automatic data augmentation aims at automating this process. However, most recent approaches still rely on some prior information; they start from a small pool of manually-selected default transformations that are either used to pretrain the network or forced to be part of the policy learned by the automatic data augmentation algorithm. In this paper, we propose to directly learn the augmentation policy without leveraging such prior knowledge. The resulting bilevel optimization problem becomes more challenging due to the larger search space and the inherent instability of bilevel optimization algorithms. To mitigate these issues (i) we follow a successive cold-start strategy with a Kullback-Leibler regularization, and (ii) we parameterize magnitudes as continuous distributions. Our approach leads to competitive results on standard benchmarks despite a more challenging setting, and generalizes beyond natural images.

* Accepted to CVPR 2023

Via

Access Paper or Ask Questions