Alert button
Picture for Da Li

Da Li

Alert button

Better Practices for Domain Adaptation

Sep 07, 2023
Linus Ericsson, Da Li, Timothy M. Hospedales

Distribution shifts are all too common in real-world applications of machine learning. Domain adaptation (DA) aims to address this by providing various frameworks for adapting models to the deployment data without using labels. However, the domain shift scenario raises a second more subtle challenge: the difficulty of performing hyperparameter optimisation (HPO) for these adaptation algorithms without access to a labelled validation set. The unclear validation protocol for DA has led to bad practices in the literature, such as performing HPO using the target test labels when, in real-world scenarios, they are not available. This has resulted in over-optimism about DA research progress compared to reality. In this paper, we analyse the state of DA when using good evaluation practice, by benchmarking a suite of candidate validation criteria and using them to assess popular adaptation algorithms. We show that there are challenges across all three branches of domain adaptation methodology including Unsupervised Domain Adaptation (UDA), Source-Free Domain Adaptation (SFDA), and Test Time Adaptation (TTA). While the results show that realistically achievable performance is often worse than expected, they also show that using proper validation splits is beneficial, as well as showing that some previously unexplored validation metrics provide the best options to date. Altogether, our improved practices covering data, training, validation and hyperparameter optimisation form a new rigorous pipeline to improve benchmarking, and hence research progress, within this important field going forward.

* AutoML 2023 (Best paper award) 
Viaarxiv icon

Label Calibration for Semantic Segmentation Under Domain Shift

Jul 20, 2023
Ondrej Bohdal, Da Li, Timothy Hospedales

Performance of a pre-trained semantic segmentation model is likely to substantially decrease on data from a new domain. We show a pre-trained model can be adapted to unlabelled target domain data by calculating soft-label prototypes under the domain shift and making predictions according to the prototype closest to the vector with predicted class probabilities. The proposed adaptation procedure is fast, comes almost for free in terms of computational resources and leads to considerable performance improvements. We demonstrate the benefits of such label calibration on the highly-practical synthetic-to-real semantic segmentation problem.

* ICLR 2023 Workshop on Pitfalls of Limited Data and Computation for Trustworthy ML 
Viaarxiv icon

Feed-Forward Source-Free Domain Adaptation via Class Prototypes

Jul 20, 2023
Ondrej Bohdal, Da Li, Timothy Hospedales

Source-free domain adaptation has become popular because of its practical usefulness and no need to access source data. However, the adaptation process still takes a considerable amount of time and is predominantly based on optimization that relies on back-propagation. In this work we present a simple feed-forward approach that challenges the need for back-propagation based adaptation. Our approach is based on computing prototypes of classes under the domain shift using a pre-trained model. It achieves strong improvements in accuracy compared to the pre-trained model and requires only a small fraction of time of existing domain adaptation methods.

* ECCV 2022 Workshop on Out of Distribution Generalization in Computer Vision (OOD-CV) 
Viaarxiv icon

Neural Fine-Tuning Search for Few-Shot Learning

Jun 15, 2023
Panagiotis Eustratiadis, Łukasz Dudziak, Da Li, Timothy Hospedales

Figure 1 for Neural Fine-Tuning Search for Few-Shot Learning
Figure 2 for Neural Fine-Tuning Search for Few-Shot Learning
Figure 3 for Neural Fine-Tuning Search for Few-Shot Learning
Figure 4 for Neural Fine-Tuning Search for Few-Shot Learning

In few-shot recognition, a classifier that has been trained on one set of classes is required to rapidly adapt and generalize to a disjoint, novel set of classes. To that end, recent studies have shown the efficacy of fine-tuning with carefully crafted adaptation architectures. However this raises the question of: How can one design the optimal adaptation strategy? In this paper, we study this question through the lens of neural architecture search (NAS). Given a pre-trained neural network, our algorithm discovers the optimal arrangement of adapters, which layers to keep frozen and which to fine-tune. We demonstrate the generality of our NAS method by applying it to both residual networks and vision transformers and report state-of-the-art performance on Meta-Dataset and Meta-Album.

Viaarxiv icon

Meta Omnium: A Benchmark for General-Purpose Learning-to-Learn

May 12, 2023
Ondrej Bohdal, Yinbing Tian, Yongshuo Zong, Ruchika Chavhan, Da Li, Henry Gouk, Li Guo, Timothy Hospedales

Figure 1 for Meta Omnium: A Benchmark for General-Purpose Learning-to-Learn
Figure 2 for Meta Omnium: A Benchmark for General-Purpose Learning-to-Learn
Figure 3 for Meta Omnium: A Benchmark for General-Purpose Learning-to-Learn
Figure 4 for Meta Omnium: A Benchmark for General-Purpose Learning-to-Learn

Meta-learning and other approaches to few-shot learning are widely studied for image recognition, and are increasingly applied to other vision tasks such as pose estimation and dense prediction. This naturally raises the question of whether there is any few-shot meta-learning algorithm capable of generalizing across these diverse task types? To support the community in answering this question, we introduce Meta Omnium, a dataset-of-datasets spanning multiple vision tasks including recognition, keypoint localization, semantic segmentation and regression. We experiment with popular few-shot meta-learning baselines and analyze their ability to generalize across tasks and to transfer knowledge between them. Meta Omnium enables meta-learning researchers to evaluate model generalization to a much wider array of tasks than previously possible, and provides a single framework for evaluating meta-learners across a wide suite of vision applications in a consistent manner.

* Accepted at CVPR 2023. Project page: https://edi-meta-learning.github.io/meta-omnium 
Viaarxiv icon

Multi-Spectrally Constrained Low-PAPR Waveform Optimization for MIMO Radar Space-Time Adaptive Processing

Apr 05, 2023
Da Li, Bo Tang, Lei Xue

Figure 1 for Multi-Spectrally Constrained Low-PAPR Waveform Optimization for MIMO Radar Space-Time Adaptive Processing
Figure 2 for Multi-Spectrally Constrained Low-PAPR Waveform Optimization for MIMO Radar Space-Time Adaptive Processing
Figure 3 for Multi-Spectrally Constrained Low-PAPR Waveform Optimization for MIMO Radar Space-Time Adaptive Processing
Figure 4 for Multi-Spectrally Constrained Low-PAPR Waveform Optimization for MIMO Radar Space-Time Adaptive Processing

This paper focuses on the joint design of transmit waveforms and receive filters for airborne multiple-input-multiple-output (MIMO) radar systems in spectrally crowded environments. The purpose is to maximize the output signal-to-interference-plus-noise-ratio (SINR) in the presence of signal-dependent clutter. To improve the practicability of the radar waveforms, both a multi-spectral constraint and a peak-to-average-power ratio (PAPR) constraint are imposed. A cyclic method is derived to iteratively optimize the transmit waveforms and receive filters. In particular, to tackle the encountered non-convex constrained fractional programming in designing the waveforms (for fixed filters), we resort to the Dinkelbach's transform, minorization-maximization (MM), and leverage the alternating direction method of multipliers (ADMM). We highlight that the proposed algorithm can iterate from an infeasible initial point and the waveforms at convergence not only satisfy the stringent constraints, but also attain superior performance.

* 2023 IEEE Transactions on Aerospace and Electronic Systems  
Viaarxiv icon

Relative Entropy-Based Waveform Optimization for Rician Target Detection with Dual-Function Radar Communication Systems

Apr 05, 2023
Xuyang Wang, Bo Tang, Wenjun Wu, Da Li

Figure 1 for Relative Entropy-Based Waveform Optimization for Rician Target Detection with Dual-Function Radar Communication Systems
Figure 2 for Relative Entropy-Based Waveform Optimization for Rician Target Detection with Dual-Function Radar Communication Systems
Figure 3 for Relative Entropy-Based Waveform Optimization for Rician Target Detection with Dual-Function Radar Communication Systems
Figure 4 for Relative Entropy-Based Waveform Optimization for Rician Target Detection with Dual-Function Radar Communication Systems

In this paper, we consider waveform design for dualfunction radar-communication systems based on multiple-inputmultiple-out arrays. To achieve better Rician target detection performance, we use the relative entropy associated with the formulated detection problem as the design metric. We also impose a multiuser interference energy constraint on the waveforms to ensure the achievable sum-rate of the communications. Two algorithms are presented to tackle the nonlinear non-convex waveform design problem. In the first algorithm, we derive a quadratic function to minorize the objective function. To tackle the quadratically constrained quadratic programming problem at each iteration, a semidefinite relaxation approach followed by a rank-one decomposition procedure and an efficient alternating direction method of multipliers (ADMM) are proposed, respectively. In the second algorithm, we present a novel ADMM algorithm to tackle the optimization problem and employ an efficient minorization-maximization approach in the inner loop of the ADMM algorithm. Numerical results demonstrate the superiority of both algorithms. Moreover, the presented algorithms can be extended to synthesize peak-to-average-power ratio constrained waveforms, which allows the radio frequency amplifier to operate at an increased efficiency.

Viaarxiv icon

Co-Design for Spectral Coexistence between RIS-aided MIMO Radar and MIMO Communication Systems

Apr 04, 2023
Da Li, Bo Tang, Xuyang Wang, Wenjun Wu, Lei Xue

Figure 1 for Co-Design for Spectral Coexistence between RIS-aided MIMO Radar and MIMO Communication Systems
Figure 2 for Co-Design for Spectral Coexistence between RIS-aided MIMO Radar and MIMO Communication Systems
Figure 3 for Co-Design for Spectral Coexistence between RIS-aided MIMO Radar and MIMO Communication Systems
Figure 4 for Co-Design for Spectral Coexistence between RIS-aided MIMO Radar and MIMO Communication Systems

Reconfigurable intelligent surface (RIS) refers to a signal reflection surface containing a large number of low-cost passive reflecting elements. RIS can improve the performance of radar and communication systems by dynamically modulating the wireless channels. In this paper, we consider the co-design for improving the co-existence between multiple-input-multiple-output (MIMO) radar and MIMO communication system with the aid of RIS.The design purpose is to improve the radar detection performance and guarantee the communication capability. Due to the unimodular constraint on the RIS coefficients and the constant-envelope constraint on the radar transmit waveforms, the associated optimization problem is non-convex.To tackle this problem, we develop a cyclic method based on minorization-maximization, semi-definite programming, and alternating direction method of multipliers. Numerical examples verify the effectiveness of the proposed algorithm.

Viaarxiv icon

Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style

Mar 25, 2023
Fengyin Lin, Mingkang Li, Da Li, Timothy Hospedales, Yi-Zhe Song, Yonggang Qi

Figure 1 for Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style
Figure 2 for Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style
Figure 3 for Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style
Figure 4 for Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style

This paper studies the problem of zero-short sketch-based image retrieval (ZS-SBIR), however with two significant differentiators to prior art (i) we tackle all variants (inter-category, intra-category, and cross datasets) of ZS-SBIR with just one network (``everything''), and (ii) we would really like to understand how this sketch-photo matching operates (``explainable''). Our key innovation lies with the realization that such a cross-modal matching problem could be reduced to comparisons of groups of key local patches -- akin to the seasoned ``bag-of-words'' paradigm. Just with this change, we are able to achieve both of the aforementioned goals, with the added benefit of no longer requiring external semantic knowledge. Technically, ours is a transformer-based cross-modal network, with three novel components (i) a self-attention module with a learnable tokenizer to produce visual tokens that correspond to the most informative local regions, (ii) a cross-attention module to compute local correspondences between the visual tokens across two modalities, and finally (iii) a kernel-based relation network to assemble local putative matches and produce an overall similarity metric for a sketch-photo pair. Experiments show ours indeed delivers superior performances across all ZS-SBIR settings. The all important explainable goal is elegantly achieved by visualizing cross-modal token correspondences, and for the first time, via sketch to photo synthesis by universal replacement of all matched photo patches. Code and model are available at \url{https://github.com/buptLinfy/ZSE-SBIR}.

* CVPR 2023 (Highlight) 
Viaarxiv icon

Generative Model Based Noise Robust Training for Unsupervised Domain Adaptation

Mar 10, 2023
Zhongying Deng, Da Li, Junjun He, Yi-Zhe Song, Tao Xiang

Figure 1 for Generative Model Based Noise Robust Training for Unsupervised Domain Adaptation
Figure 2 for Generative Model Based Noise Robust Training for Unsupervised Domain Adaptation
Figure 3 for Generative Model Based Noise Robust Training for Unsupervised Domain Adaptation
Figure 4 for Generative Model Based Noise Robust Training for Unsupervised Domain Adaptation

Target domain pseudo-labelling has shown effectiveness in unsupervised domain adaptation (UDA). However, pseudo-labels of unlabeled target domain data are inevitably noisy due to the distribution shift between source and target domains. This paper proposes a Generative model-based Noise-Robust Training method (GeNRT), which eliminates domain shift while mitigating label noise. GeNRT incorporates a Distribution-based Class-wise Feature Augmentation (D-CFA) and a Generative-Discriminative classifier Consistency (GDC), both based on the class-wise target distributions modelled by generative models. D-CFA minimizes the domain gap by augmenting the source data with distribution-sampled target features, and trains a noise-robust discriminative classifier by using target domain knowledge from the generative models. GDC regards all the class-wise generative models as generative classifiers and enforces a consistency regularization between the generative and discriminative classifiers. It exploits an ensemble of target knowledge from all the generative models to train a noise-robust discriminative classifier and eventually gets theoretically linked to the Ben-David domain adaptation theorem for reducing the domain gap. Extensive experiments on Office-Home, PACS, and Digit-Five show that our GeNRT achieves comparable performance to state-of-the-art methods under single-source and multi-source UDA settings.

Viaarxiv icon