Abstract:This paper presents ProFi-Net, a novel few-shot learning framework for WiFi-based gesture recognition that overcomes the challenges of limited training data and sparse feature representations. ProFi-Net employs a prototype-based metric learning architecture enhanced with a feature-level attention mechanism, which dynamically refines the Euclidean distance by emphasizing the most discriminative feature dimensions. Additionally, our approach introduces a curriculum-inspired data augmentation strategy exclusively on the query set. By progressively incorporating Gaussian noise of increasing magnitude, the model is exposed to a broader range of challenging variations, thereby improving its generalization and robustness to overfitting. Extensive experiments conducted across diverse real-world environments demonstrate that ProFi-Net significantly outperforms conventional prototype networks and other state-of-the-art few-shot learning methods in terms of classification accuracy and training efficiency.
Abstract:As the number of multiple-input multiple-output (MIMO) antennas increases drastically with the development towards 6G systems, channel state information (CSI) compression becomes crucial for mitigating feedback overhead. In recent years, learning models such as autoencoders (AE) have been studied for CSI compression, aiming to eliminate model assumptions and reduce compression loss. However, current learning methods are often designed and trained mainly for individual channel scenarios, with limited generalizability across different scenarios, of which the channel characteristics are prominently discrepant. Motivated by this, we propose a novel AE-based learning method named attention-infused autoencoder network (AiANet), which can parallelly and adaptively extract channel-wise and spatial features of CSI with an attention fusion mechanism. In addition, a locally-aware self-attention mechanism is developed to extract both global and local spatial patterns, to better capture the unique CSI features of different scenarios. Moreover, a mixed-training scheme is introduced to enable the proposed AiANet to gain generalizability across indoor and outdoor scenarios. Results show that when trained and tested in the same scenario, AiANet can substantially outperform the existing AE-based methods such as ACRNet, with an improvement of up to 3.42 dB in terms of normalized mean squared error (NMSE). With the mixed-training scheme, AiANet exhibits superior cross-scenario generalizability compared to the benchmark methods which are trained in one scenario and misused in another.