Abstract:A compact dynamic omnidirectional array is proposed for antenna-level physical-layer security through directional modulation. Unlike conventional directional-modulation transmitters based on phased-array beam synthesis or multiple RF chains, the proposed architecture uses a single RF input and a switching-controlled four-element printed meander-line monopole array operating at 5.05 GHz. The state-dependent excitation introduces controllable magnitude and phase perturbations in the radiated field, producing angle-dependent constellation distortion and bit error rate behavior. Reliable information recovery is confined to a narrow broadside region in the E-plane, whereas the H-plane remains quasi-static and omnidirectional, providing a full 360-degree information-recoverable region. The antenna is implemented on a single-layer Rogers RO4350B substrate with a compact footprint of 0.57 x 1.11 lambda_0^2. A four-path switching network based on commercial RF components is used for experimental validation. Communication measurements using 16-QAM at 5.05 GHz demonstrate BER-defined E-plane information beamwidths of 30 to 36 degrees for calibrated switching modes under a BER <= 10^-3 criterion, while no bit errors are observed in the measured H-plane and the SNR remains above approximately 33 dB. Feed-phase offsets are also used to steer the BER-defined information-recoverable sector, demonstrating information-beam steering with the same antenna-level switching mechanism. These results show that compact antenna-level directional modulation can provide angularly selective information recovery in one principal plane while preserving omnidirectional coverage in the orthogonal plane.
Abstract:A compact dynamic four-element array with omnidirectional H-plane coverage is presented for planar physical-layer security using antenna-level directional modulation. The proposed approach achieves angularly selective information transmission without phased-array beamforming or multiple RF chains by dynamically switching the excitation paths of a four-element array. The antenna comprises four printed meander-line monopole elements operating at 5.05 GHz with independently controlled differential power excitation, which introduces magnitude and phase pattern modulation and dynamic motion of the apparent element spacing, resulting in strongly angle-dependent signal distortion and bit error rate (BER) performance. Reliable information recovery is confined to a narrow broadside region in the E-plane, while significantly elevated BER is observed at off-broadside angles. In contrast, the H-plane radiation remains static and omnidirectional, enabling full 360-degree information-recoverable coverage in the orthogonal plane. The antenna is fabricated on a single-layer Rogers RO4350B substrate with a compact footprint of 0.55 x 1.73 lambda_0^2. A four-path switching network implemented using commercial RF components validates the concept experimentally. Communication measurements under high-SNR conditions above 19 dB using 16-QAM demonstrate a planar information beamwidth below 24 degrees, confirming effective antenna-level directional modulation with angle-dependent BER characteristics and omnidirectional H-plane coverage.
Abstract:In computational pathology, few-shot whole slide image classification is primarily driven by the extreme scarcity of expert-labeled slides. Recent vision-language methods incorporate textual semantics generated by large language models, but treat these descriptions as static class-level priors that are shared across all samples and lack sample-wise refinement. This limits both the diversity and precision of visual-semantic alignment, hindering generalization under limited supervision. To overcome this, we propose the stochastic MUlti-view Semantic Enhancement (MUSE), a framework that first refines semantic precision via sample-wise adaptation and then enhances semantic richness through retrieval-augmented multi-view generation. Specifically, MUSE introduces Sample-wise Fine-grained Semantic Enhancement (SFSE), which yields a fine-grained semantic prior for each sample through MoE-based adaptive visual-semantic interaction. Guided by this prior, Stochastic Multi-view Model Optimization (SMMO) constructs an LLM-generated knowledge base of diverse pathological descriptions per class, then retrieves and stochastically integrates multiple matched textual views during training. These dynamically selected texts serve as enriched semantic supervisions to stochastically optimize the vision-language model, promoting robustness and mitigating overfitting. Experiments on three benchmark WSI datasets show that MUSE consistently outperforms existing vision-language baselines in few-shot settings, demonstrating that effective few-shot pathology learning requires not only richer semantic sources but also their active and sample-aware semantic optimization. Our code is available at: https://github.com/JiahaoXu-god/CVPR2026_MUSE.
Abstract:While existing multivariate time series forecasting models have advanced significantly in modeling periodicity, they largely neglect the periodic heterogeneity common in real-world data, where variates exhibit distinct and dynamically changing periods. To effectively capture this periodic heterogeneity, we propose PHAT (Period Heterogeneity-Aware Transformer). Specifically, PHAT arranges multivariate inputs into a three-dimensional "periodic bucket" tensor, where the dimensions correspond to variate group characteristics with similar periodicity, time steps aligned by phase, and offsets within the period. By restricting interactions within buckets and masking cross-bucket connections, PHAT effectively avoids interference from inconsistent periods. We also propose a positive-negative attention mechanism, which captures periodic dependencies from two perspectives: periodic alignment and periodic deviation. Additionally, the periodic alignment attention scores are decomposed into positive and negative components, with a modulation term encoding periodic priors. This modulation constrains the attention mechanism to more faithfully reflect the underlying periodic trends. A mathematical explanation is provided to support this property. We evaluate PHAT comprehensively on 14 real-world datasets against 18 baselines, and the results show that it significantly outperforms existing methods, achieving highly competitive forecasting performance. Our sources is available at GitHub.




Abstract:We propose a novel omnidirectional antenna design incorporating directional modulation for secure narrow planar information transmission. The proposed antenna features a compact size and stable omnidirectional radiation performance by employing two tightly spaced, printed meander line monopole antennas, acting as a single radiating element. To achieve a narrow information secure region, the proposed antenna is fed by differential power excitation of two ports with real-time dynamic switching. This leads to phase pattern modulation only along the electrical polarization, resulting in directionally confined information recoverable region in the E-plane, while maintaining highly constant or static omnidirectional H-plane pattern, inducing a $360^\circ$ information recoverable region. The dynamic antenna is designed and fabricated on a single layer of Rogers RO4350B which provides a miniaturized planar size of $0.36 \times 0.5 , \lambda_0^2$ at 2.7 GHz and easy integration. To validate the wireless communication performance, the fabricated antenna is directly fed with a 10 dB power ratio by a radio frequency (RF) switching system and evaluated for 16-QAM and 256-QAM transmission in a high signal-to-noise ratio (SNR) environment. Experimental results demonstrate that for 16-QAM transmission, a narrow E-plane information beam (IB) of approximately $34^\circ$ and omnidirectional H-plane IB are obtained, and a narrower E-plane IB is achieved around $15^\circ$ for 256-QAM. These results confirm that the proposed antenna offers a simple yet effective approach to enhance planar physical information security with a compact dynamic antenna system.
Abstract:Due to the profound impact of air pollution on human health, livelihoods, and economic development, air quality forecasting is of paramount significance. Initially, we employ the causal graph method to scrutinize the constraints of existing research in comprehensively modeling the causal relationships between the air quality index (AQI) and meteorological features. In order to enhance prediction accuracy, we introduce a novel air quality forecasting model, AirCade, which incorporates a causal decoupling approach. AirCade leverages a spatiotemporal module in conjunction with knowledge embedding techniques to capture the internal dynamics of AQI. Subsequently, a causal decoupling module is proposed to disentangle synchronous causality from past AQI and meteorological features, followed by the dissemination of acquired knowledge to future time steps to enhance performance. Additionally, we introduce a causal intervention mechanism to explicitly represent the uncertainty of future meteorological features, thereby bolstering the model's robustness. Our evaluation of AirCade on an open-source air quality dataset demonstrates over 20\% relative improvement over state-of-the-art models.
Abstract:Radio frequency (RF) fingerprinting is widely used for supporting physical layer security in various wireless applications. In this paper, we present the design and implementation of a small antenna with low-cost fabrication that can be directly integrated with nonlinear passive devices, forming a passive RF tag providing unique nonlinear signatures for RF fingerprinting. We first propose a miniaturized meander line dipole, achieved by two folded arms on two sides of the substrate. This leads to antenna with a simple feeding structure and compact size, making it ideal for planar integration. Two antennas on Rogers 4350B and ultra-thin flexible Panasonic Felios are fabricated, achieving small size at $0.21 \times 0.06 \times 0.004 \lambda_0^3$ and $0.14 \times 0.1 \times 0.0008 \lambda_0^3$ with realized gain of 1.87 dBi and 1.46 dBi. The passive tag consists of the proposed antenna structure and an integrated RF diode, and is further developed on both substrates, aiming to generate inter-modulation products (IMP) due to the nonlinearity of the diode, which can be used for device identification through classification algorithms. We investigate the nonlinearity of the designed tags for transmission at 15 dBm using two-tone signals. All tags produce a significant increased power at IMP frequencies at a range of 0.4 m. The tags on Rogers substrate provide around 23 dB IMP power increase and tags on flexible substrate embedded in lossy material provide around 16 dB power increase. These findings confirm that the proposed solution offers a simple passive tag design to support unique nonlinear signatures for RF fingerprinting applications in a simple, low-cost device.




Abstract:Graph node clustering is a fundamental unsupervised task. Existing methods typically train an encoder through selfsupervised learning and then apply K-means to the encoder output. Some methods use this clustering result directly as the final assignment, while others initialize centroids based on this initial clustering and then finetune both the encoder and these learnable centroids. However, due to their reliance on K-means, these methods inherit its drawbacks when the cluster separability of encoder output is low, facing challenges from the Uniform Effect and Cluster Assimilation. We summarize three reasons for the low cluster separability in existing methods: (1) lack of contextual information prevents discrimination between similar nodes from different clusters; (2) training tasks are not sufficiently aligned with the downstream clustering task; (3) the cluster information in the graph structure is not appropriately exploited. To address these issues, we propose conTrastive grapH clustEring by SwApping fUsed gRomov-wasserstein coUplingS (THESAURUS). Our method introduces semantic prototypes to provide contextual information, and employs a cross-view assignment prediction pretext task that aligns well with the downstream clustering task. Additionally, it utilizes Gromov-Wasserstein Optimal Transport (GW-OT) along with the proposed prototype graph to thoroughly exploit cluster information in the graph structure. To adapt to diverse real-world data, THESAURUS updates the prototype graph and the prototype marginal distribution in OT by using momentum. Extensive experiments demonstrate that THESAURUS achieves higher cluster separability than the prior art, effectively mitigating the Uniform Effect and Cluster Assimilation issues




Abstract:Real-world data consistently exhibits a long-tailed distribution, often spanning multiple categories. This complexity underscores the challenge of content comprehension, particularly in scenarios requiring Long-Tailed Multi-Label image Classification (LTMLC). In such contexts, imbalanced data distribution and multi-object recognition pose significant hurdles. To address this issue, we propose a novel and effective approach for LTMLC, termed Category-Prompt Refined Feature Learning (CPRFL), utilizing semantic correlations between different categories and decoupling category-specific visual representations for each category. Specifically, CPRFL initializes category-prompts from the pretrained CLIP's embeddings and decouples category-specific visual representations through interaction with visual features, thereby facilitating the establishment of semantic correlations between the head and tail classes. To mitigate the visual-semantic domain bias, we design a progressive Dual-Path Back-Propagation mechanism to refine the prompts by progressively incorporating context-related visual information into prompts. Simultaneously, the refinement process facilitates the progressive purification of the category-specific visual representations under the guidance of the refined prompts. Furthermore, taking into account the negative-positive sample imbalance, we adopt the Asymmetric Loss as our optimization objective to suppress negative samples across all classes and potentially enhance the head-to-tail recognition performance. We validate the effectiveness of our method on two LTMLC benchmarks and extensive experiments demonstrate the superiority of our work over baselines. The code is available at https://github.com/jiexuanyan/CPRFL.

Abstract:Multiple Instance Learning (MIL) represents the predominant framework in Whole Slide Image (WSI) classification, covering aspects such as sub-typing, diagnosis, and beyond. Current MIL models predominantly rely on instance-level features derived from pretrained models such as ResNet. These models segment each WSI into independent patches and extract features from these local patches, leading to a significant loss of global spatial context and restricting the model's focus to merely local features. To address this issue, we propose a novel MIL framework, named SAM-MIL, that emphasizes spatial contextual awareness and explicitly incorporates spatial context by extracting comprehensive, image-level information. The Segment Anything Model (SAM) represents a pioneering visual segmentation foundational model that can capture segmentation features without the need for additional fine-tuning, rendering it an outstanding tool for extracting spatial context directly from raw WSIs. Our approach includes the design of group feature extraction based on spatial context and a SAM-Guided Group Masking strategy to mitigate class imbalance issues. We implement a dynamic mask ratio for different segmentation categories and supplement these with representative group features of categories. Moreover, SAM-MIL divides instances to generate additional pseudo-bags, thereby augmenting the training set, and introduces consistency of spatial context across pseudo-bags to further enhance the model's performance. Experimental results on the CAMELYON-16 and TCGA Lung Cancer datasets demonstrate that our proposed SAM-MIL model outperforms existing mainstream methods in WSIs classification. Our open-source implementation code is is available at https://github.com/FangHeng/SAM-MIL.