Large-scale pre-trained Vision-Language Models (VLMs), such as CLIP and ALIGN, have introduced a new paradigm for learning transferable visual representations. Recently, there has been a surge of interest among researchers in developing lightweight fine-tuning techniques to adapt these models to downstream visual tasks. We recognize that current state-of-the-art fine-tuning methods, such as Tip-Adapter, simply consider the covariance between the query image feature and features of support few-shot training samples, which only captures linear relations and potentially instigates a deceptive perception of independence. To address this issue, in this work, we innovatively introduce Brownian Distance Covariance (BDC) to the field of vision-language reasoning. The BDC metric can model all possible relations, providing a robust metric for measuring feature dependence. Based on this, we present a novel method called BDC-Adapter, which integrates BDC prototype similarity reasoning and multi-modal reasoning network prediction to perform classification tasks. Our extensive experimental results show that the proposed BDC-Adapter can freely handle non-linear relations and fully characterize independence, outperforming the current state-of-the-art methods by large margins.
In this paper, we present an innovative symbol-level precoding (SLP) approach for a wideband multi-user multi-input multi-output (MU-MIMO) downlink Integrated Sensing and Communications (ISAC) system employing faster-than-Nyquist (FTN) signaling. Our proposed technique minimizes the minimum mean squared error (MMSE) for the sensed parameter estimation while ensuring the communication per-user quality-of-service through the utilization of constructive interference (CI) methodologies. While the formulated problem is non-convex in general, we tackle this issue using proficient minorization and successive convex approximation (SCA) strategies. Numerical results substantiate that our FTN-ISAC-SLP framework significantly enhances communication throughput while preserving satisfactory sensing performance.
In this paper, we propose a novel symbol-level precoding (SLP) method for a multi-user multi-input multi-output (MU-MIMO) downlink Integrated Sensing and Communications (ISAC) system based on the faster-than-Nyquist (FTN) signaling, where an ISAC signal is designed to simultaneously accomplish target sensing and wireless communication tasks. In particular, we minimize the minimum mean squared error (MMSE) for target parameter estimation, while guaranteeing the per-user quality-of-service by exploiting both multi-user and inter-symbol interference with constructive interference (CI) techniques. While the formulated problem is non-convex in general, we propose an efficient successive convex approximation (SCA) method, which solves a convex second-order cone program (SOCP) subproblem at each iteration. Numerical results demonstrate the effectiveness of the proposed FTN-ISAC-SLP design, showing that out method significantly outperforms conventional benchmark approaches in terms of both communication and sensing performance.