Andrew D. Bagdanov

Covariances for Free: Exploiting Mean Distributions for Federated Learning with Pre-Trained Models

Dec 18, 2024

Dipam Goswami, Simone Magistri, Kai Wang, Bartłomiej Twardowski, Andrew D. Bagdanov, Joost van de Weijer

Abstract: Using pre-trained models has been found to reduce the effect of data heterogeneity and speed up federated learning algorithms. Recent works have investigated the use of first-order statistics and second-order statistics to aggregate local client data distributions at the server and achieve very high performance without any training. In this work we propose a training-free method based on an unbiased estimator of class covariance matrices. Our method, which only uses first-order statistics in the form of class means communicated by clients to the server, incurs only a fraction of the communication costs required by methods based on communicating second-order statistics. We show how these estimated class covariances can be used to initialize a linear classifier, thus exploiting the covariances without actually sharing them. When compared to state-of-the-art methods which also share only class means, our approach improves performance in the range of 4-26% with exactly the same communication cost. Moreover, our method achieves performance competitive or superior to sharing second-order statistics with dramatically less communication overhead. Finally, using our method to initialize classifiers and then performing federated fine-tuning yields better and faster convergence. Code is available at https://github.com/dipamgoswami/FedCOF.
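
The idea of recovering a class covariance from class means alone can be illustrated with a small sketch. It is not the authors' implementation (see the linked repository for that). It assumes that each client's class mean is the average of n_k i.i.d. feature vectors, so that the mean has covariance Σ/n_k and the count-weighted scatter of the client means around the global mean, divided by K-1, is an unbiased estimate of Σ. The function name and argument layout are hypothetical.

```python
def estimate_class_covariance(client_means, client_counts):
    """Estimate one class's covariance from per-client class means.

    client_means:  list of K feature-mean vectors (one per client)
    client_counts: list of K sample counts n_k behind each mean
    Assumes i.i.d. samples across clients (an illustrative assumption).
    """
    k = len(client_means)
    if k < 2:
        raise ValueError("need at least two client means")
    d = len(client_means[0])
    total = sum(client_counts)

    # Global class mean: count-weighted average of the client means.
    global_mean = [
        sum(n * m[j] for m, n in zip(client_means, client_counts)) / total
        for j in range(d)
    ]

    # Count-weighted scatter of client means, normalised by K - 1.
    cov = [[0.0] * d for _ in range(d)]
    for m, n in zip(client_means, client_counts):
        diff = [m[j] - global_mean[j] for j in range(d)]
        for a in range(d):
            for b in range(d):
                cov[a][b] += n * diff[a] * diff[b] / (k - 1)
    return global_mean, cov
```

With two clients holding one sample each, at (0, 0) and (2, 0), the estimate reduces to the ordinary sample covariance of those two points: variance 2.0 along the first axis and zero elsewhere. The server only ever sees the means and counts.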

RE-tune: Incremental Fine Tuning of Biomedical Vision-Language Models for Multi-label Chest X-ray Classification

Oct 23, 2024

Marco Mistretta, Andrew D. Bagdanov

Abstract: In this paper we introduce RE-tune, a novel approach for fine-tuning pre-trained Multimodal Biomedical Vision-Language models (VLMs) in Incremental Learning scenarios for multi-label chest disease diagnosis. RE-tune freezes the backbones and only trains simple adaptors on top of the Image and Text encoders of the VLM. By engineering positive and negative text prompts for diseases, we leverage the ability of Large Language Models to steer the training trajectory. We evaluate RE-tune in three realistic incremental learning scenarios: class-incremental, label-incremental, and data-incremental. Our results demonstrate that Biomedical VLMs are natural continual learners and prevent catastrophic forgetting. RE-tune not only achieves accurate multi-label classification results, but also prioritizes patient privacy and distinguishes itself through exceptional computational efficiency, rendering it highly suitable for broad adoption in real-world healthcare settings.

* Accepted for publication at Medical Imaging meets NeurIPS (NeurIPS23)

How green is continual learning, really? Analyzing the energy consumption in continual training of vision foundation models

Sep 27, 2024

Tomaso Trinci, Simone Magistri, Roberto Verdecchia, Andrew D. Bagdanov

Abstract: With the ever-growing adoption of AI, its impact on the environment is no longer negligible. Despite the potential that continual learning could have towards Green AI, its environmental sustainability remains relatively uncharted. In this work we aim to gain a systematic understanding of the energy efficiency of continual learning algorithms. To that end, we conducted an extensive set of empirical experiments comparing the energy consumption of recent representation-, prompt-, and exemplar-based continual learning algorithms and two standard baselines (fine tuning and joint training) when used to continually adapt a pre-trained ViT-B/16 foundation model. We performed our experiments on three standard datasets: CIFAR-100, ImageNet-R, and DomainNet. Additionally, we propose a novel metric, the Energy NetScore, which we use to measure algorithm efficiency in terms of the energy-accuracy trade-off. Through numerous evaluations varying the number and size of the incremental learning steps, our experiments demonstrate that different types of continual learning algorithms have very different impacts on energy consumption during both training and inference. Although often overlooked in the continual learning literature, we found that the energy consumed during the inference phase is crucial for evaluating the environmental sustainability of continual learning models.

* This manuscript has been accepted at the Green FOundation MOdels (GreenFOMO) ECCV 2024 Workshop

Offline Reinforcement Learning with Imputed Rewards

Jul 15, 2024

Carlo Romeo, Andrew D. Bagdanov

Abstract: Offline Reinforcement Learning (ORL) offers a robust solution to training agents in applications where interactions with the environment must be strictly limited due to cost, safety, or lack of accurate simulation environments. Despite its potential to facilitate deployment of artificial agents in the real world, Offline Reinforcement Learning typically requires very many demonstrations annotated with ground-truth rewards. Consequently, state-of-the-art ORL algorithms can be difficult or impossible to apply in data-scarce scenarios. In this paper we propose a simple but effective Reward Model that can estimate the reward signal from a very limited sample of environment transitions annotated with rewards. Once the reward signal is modeled, we use the Reward Model to impute rewards for a large sample of reward-free transitions, thus enabling the application of ORL techniques. We demonstrate the potential of our approach on several D4RL continuous locomotion tasks. Our results show that, using only 1% of reward-labeled transitions from the original datasets, our learned reward model is able to impute rewards for the remaining 99% of the transitions, from which performant agents can be learned using Offline Reinforcement Learning.

* RLBRew Workshop @ RLC 2024
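
The two-stage recipe in the abstract (fit a reward model on the few labeled transitions, then fill in rewards for the rest) can be pictured with a toy sketch. The abstract does not state the model class, so a linear model trained by full-batch gradient descent stands in here purely as an assumption. The function names and the `(features, reward)` tuple layout are hypothetical, with `None` marking a reward-free transition.

```python
def fit_linear_reward_model(features, rewards, lr=0.1, steps=5000):
    """Fit r ~ w.x + b by gradient descent on mean squared error.

    A linear model is an illustrative stand-in, not the paper's Reward Model.
    """
    d = len(features[0])
    n = len(features)
    w = [0.0] * d
    b = 0.0
    for _ in range(steps):
        grad_w = [0.0] * d
        grad_b = 0.0
        for x, r in zip(features, rewards):
            err = sum(wi * xi for wi, xi in zip(w, x)) + b - r
            for j in range(d):
                grad_w[j] += 2.0 * err * x[j] / n
            grad_b += 2.0 * err / n
        w = [wi - lr * g for wi, g in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def impute_rewards(transitions, w, b):
    """Replace missing rewards (None) with model predictions.

    Transitions that already carry a reward are passed through unchanged.
    """
    completed = []
    for x, r in transitions:
        if r is None:
            r = sum(wi * xi for wi, xi in zip(w, x)) + b
        completed.append((x, r))
    return completed
```

The completed list can then be handed to any offline RL algorithm that expects fully labeled transitions. In practice the features would be built from the state, action, and next state of each transition.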

A Benchmark Environment for Offline Reinforcement Learning in Racing Games

Jul 12, 2024

Girolamo Macaluso, Alessandro Sestini, Andrew D. Bagdanov

Abstract: Offline Reinforcement Learning (ORL) is a promising approach to reduce the high sample complexity of traditional Reinforcement Learning (RL) by eliminating the need for continuous environmental interactions. ORL exploits a dataset of pre-collected transitions and thus expands the range of application of RL to tasks in which excessive environment queries increase training time and decrease efficiency, such as in modern AAA games. This paper introduces OfflineMania, a novel environment for ORL research. It is inspired by the iconic TrackMania series and developed using the Unity 3D game engine. The environment simulates a single-agent racing game in which the objective is to complete the track through optimal navigation. We provide a variety of datasets to assess ORL performance. These datasets, created from policies of varying ability and in different sizes, aim to offer a challenging testbed for algorithm development and evaluation. We further establish a set of baselines for a range of Online RL, ORL, and hybrid Offline to Online RL approaches using our environment.

* Accepted at IEEE Conference on Games

Exemplar-free Continual Representation Learning via Learnable Drift Compensation

Jul 11, 2024

Alex Gomez-Villa, Dipam Goswami, Kai Wang, Andrew D. Bagdanov, Bartłomiej Twardowski, Joost van de Weijer

Abstract: Exemplar-free class-incremental learning using a backbone trained from scratch and starting from a small first task presents a significant challenge for continual representation learning. Prototype-based approaches, when continually updated, face the critical issue of semantic drift, due to which the old class prototypes drift to different positions in the new feature space. Through an analysis of prototype-based continual learning, we show that forgetting is not due to diminished discriminative power of the feature extractor, and can potentially be corrected by drift compensation. To address this, we propose Learnable Drift Compensation (LDC), which can effectively mitigate drift in any moving backbone, whether supervised or unsupervised. LDC is fast and straightforward to integrate on top of existing continual learning approaches. Furthermore, we showcase how LDC can be applied in combination with self-supervised CL methods, resulting in the first exemplar-free semi-supervised continual learning approach. We achieve state-of-the-art performance in both supervised and semi-supervised settings across multiple datasets. Code is available at https://github.com/alviur/ldc.

* Accepted to ECCV 2024
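
One way to picture drift compensation is as a learned map from the old feature space to the new one, fit on current-task data and then applied to the stored prototypes of old classes. The abstract does not specify the form of the compensation, so the affine map below, trained by gradient descent on paired old and new features, is an assumption made for illustration. The function names are hypothetical; the authors' code is in the linked repository.

```python
def fit_drift_projector(old_feats, new_feats, lr=0.1, steps=3000):
    """Learn an affine map (W, c) with W @ old + c ~ new, by gradient descent.

    old_feats / new_feats: features of the same current-task samples,
    extracted with the previous and the updated backbone respectively.
    """
    d_in, d_out = len(old_feats[0]), len(new_feats[0])
    n = len(old_feats)
    W = [[0.0] * d_in for _ in range(d_out)]
    c = [0.0] * d_out
    for _ in range(steps):
        gW = [[0.0] * d_in for _ in range(d_out)]
        gc = [0.0] * d_out
        for x, y in zip(old_feats, new_feats):
            for i in range(d_out):
                err = sum(W[i][j] * x[j] for j in range(d_in)) + c[i] - y[i]
                for j in range(d_in):
                    gW[i][j] += 2.0 * err * x[j] / n
                gc[i] += 2.0 * err / n
        for i in range(d_out):
            for j in range(d_in):
                W[i][j] -= lr * gW[i][j]
            c[i] -= lr * gc[i]
    return W, c


def compensate_prototype(prototype, W, c):
    """Move an old-class prototype into the new feature space."""
    return [
        sum(W[i][j] * prototype[j] for j in range(len(prototype))) + c[i]
        for i in range(len(c))
    ]
```

No exemplars of old classes are needed in this picture: the map is fit only on data from the current task, and old classes are represented solely by their prototypes.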

Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation

Jul 03, 2024

Marco Mistretta, Alberto Baldrati, Marco Bertini, Andrew D. Bagdanov

Abstract: Vision-Language Models (VLMs) demonstrate remarkable zero-shot generalization to unseen tasks, but fall short of the performance of supervised methods in generalizing to downstream tasks with limited data. Prompt learning is emerging as a parameter-efficient method for adapting VLMs, but state-of-the-art approaches require annotated samples. In this paper we propose a novel approach to prompt learning based on unsupervised knowledge distillation from more powerful models. Our approach, which we call Knowledge Distillation Prompt Learning (KDPL), can be integrated into existing prompt learning techniques and eliminates the need for labeled examples during adaptation. Our experiments on more than ten standard benchmark datasets demonstrate that KDPL is very effective at improving generalization of learned prompts for zero-shot domain generalization, zero-shot cross-dataset generalization, and zero-shot base-to-novel class generalization problems. KDPL requires no ground-truth labels for adaptation, and moreover we show that even in the absence of any knowledge of training class names it can be used to effectively transfer knowledge. The code is publicly available at https://github.com/miccunifi/KDPL.

* Accepted for publication at ECCV24
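
Label-free distillation of the kind described above is commonly driven by a divergence between the teacher's and the student's class distributions for the same unlabeled image. The abstract does not state which objective KDPL uses, so the KL divergence below is an assumption chosen because it is a standard distillation loss. The function names and the `temperature` parameter are hypothetical; the authors' code is in the linked repository.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    log_norm = m + math.log(sum(math.exp(z - m) for z in scaled))
    return [z - log_norm for z in scaled]


def distillation_loss(teacher_logits, student_logits, temperature=1.0):
    """KL(teacher || student) between the two softmax distributions.

    No ground-truth label appears anywhere: the teacher's prediction
    is the only supervision signal.
    """
    log_p = log_softmax(teacher_logits, temperature)
    log_q = log_softmax(student_logits, temperature)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
```

The loss is zero when the student reproduces the teacher's distribution exactly and positive otherwise. In a prompt-learning setting, only the student's prompt parameters would receive gradients from it.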

EUFCC-340K: A Faceted Hierarchical Dataset for Metadata Annotation in GLAM Collections

Jun 04, 2024

Francesc Net, Marc Folia, Pep Casals, Andrew D. Bagdanov, Lluis Gomez

Abstract: In this paper, we address the challenges of automatic metadata annotation in the domain of Galleries, Libraries, Archives, and Museums (GLAMs) by introducing a novel dataset, EUFCC-340K, collected from the Europeana portal. Comprising over 340,000 images, the EUFCC-340K dataset is organized across multiple facets: Materials, Object Types, Disciplines, and Subjects, following a hierarchical structure based on the Art & Architecture Thesaurus (AAT). We developed several baseline models, incorporating multiple heads on a ConvNeXT backbone for multi-label image tagging on these facets, and fine-tuning a CLIP model with our image-text pairs. Our experiments to evaluate model robustness and generalization capabilities in two different test scenarios demonstrate the utility of the dataset in improving multi-label classification tools that have the potential to alleviate cataloging tasks in the cultural heritage sector.

* 23 pages, 13 figures

Elastic Feature Consolidation for Cold Start Exemplar-free Incremental Learning

Feb 06, 2024

Simone Magistri, Tomaso Trinci, Albin Soutif-Cormerais, Joost van de Weijer, Andrew D. Bagdanov

Abstract: Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, which results in feature drift that is difficult to compensate for in the exemplar-free setting. To address this problem, we propose a simple and effective approach that consolidates feature representations by regularizing drift in directions highly relevant to previous tasks and employs prototypes to reduce task-recency bias. Our method, called Elastic Feature Consolidation (EFC), exploits a tractable second-order approximation of feature drift based on an Empirical Feature Matrix (EFM). The EFM induces a pseudo-metric in feature space which we use to regularize feature drift in important directions and to update Gaussian prototypes used in a novel asymmetric cross entropy loss which effectively balances prototype rehearsal with data from new tasks. Experimental results on CIFAR-100, Tiny-ImageNet, ImageNet-Subset and ImageNet-1K demonstrate that Elastic Feature Consolidation is better able to learn new tasks by maintaining model plasticity and significantly outperforms the state-of-the-art.

* Accepted at Twelfth International Conference on Learning Representations (ICLR 2024)
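
A matrix-induced pseudo-metric of the kind mentioned in the abstract can be read as a quadratic form: drift between old and new features is penalized as (f_new - f_old)^T E (f_new - f_old) for a positive semi-definite matrix E. The sketch below shows only that reading. How the Empirical Feature Matrix is actually computed is not given in the abstract, so `efm` is simply taken as an input, and the function name is hypothetical.

```python
def weighted_drift_penalty(old_feat, new_feat, efm):
    """Quadratic-form drift penalty: (new - old)^T * efm * (new - old).

    efm is assumed symmetric positive semi-definite. Directions in its
    null space can drift freely (zero penalty), which is what makes this
    a pseudo-metric rather than a metric.
    """
    diff = [n - o for o, n in zip(old_feat, new_feat)]
    d = len(diff)
    return sum(
        diff[a] * efm[a][b] * diff[b]
        for a in range(d)
        for b in range(d)
    )
```

With `efm = [[1, 0], [0, 0]]`, for example, any drift along the second feature axis costs nothing while drift along the first is penalized quadratically, which matches the stated goal of regularizing only the directions that matter for previous tasks.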

Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation

Dec 19, 2023

Girolamo Macaluso, Alessandro Sestini, Andrew D. Bagdanov

Abstract: Offline reinforcement learning leverages pre-collected datasets of transitions to train policies. It can serve as effective initialization for online algorithms, enhancing sample efficiency and speeding up convergence. However, when such datasets are limited in size and quality, offline pre-training can produce sub-optimal policies and lead to degraded online reinforcement learning performance. In this paper we propose a model-based data augmentation strategy to maximize the benefits of offline reinforcement learning pre-training and reduce the scale of data needed to be effective. Our approach leverages a world model of the environment trained on the offline dataset to augment states during offline pre-training. We evaluate our approach on a variety of MuJoCo robotic tasks and our results show it can jump-start online fine-tuning and substantially reduce - in some cases by an order of magnitude - the required number of environment interactions.
