Abstract:Reconfigurable intelligent surfaces (RIS) have recently been proposed as an effective means for spatial interference suppression in large reflector antenna systems. Existing RIS weight optimization algorithms typically rely on accurate theoretical radiation models. However, in practice, distortions on the reflector antenna may cause mismatches between the theoretical and true antenna patterns, leading to degraded interference cancellation performance when these weights are directly applied. In this report, a residual learning network-assisted simulated annealing (ResNet-SA) framework is proposed to address this mismatch without requiring explicit knowledge of the distorted electric field. By learning the residual difference between the theoretical and true antenna gains, a neural network (NN) is embedded in a heuristic optimization algorithm to find the optimal weight vector. Simulation results demonstrate that the proposed approach achieves improved null depth in the true radiation pattern as compared with conventional methods that optimize weights based solely on the theoretical model, validating the effectiveness of the ResNet-SA algorithm for reflector antenna systems with approximate knowledge of the pattern.
Abstract:Reconfigurable Intelligent Surfaces (RIS) have recently gained attention as a means to dynamically shape the wireless propagation environment through programmable reflection control. Among the numerous applications, an important emerging use case is employing RIS as an auxiliary mechanism for spatial interference nulling, particularly in large ground-based reflector antennas where sidelobe interference can significantly degrade the system performance. With the growing density of satellites and terrestrial emitters, algorithms with faster convergence speed and better performance are needed. This work investigates RIS-equipped reflector antennas as a representative example of RIS-assisted spatial nulling and develop algorithms for sidelobe cancellation at specific directions and frequencies under various constraints. For the continuous-phase case, we adapt the gradient projection (GP) and alternating projection (AP) algorithms for scalability and propose a closed-form near-optimal solution that achieves satisfactory nulling performance with significantly reduced complexity. For the discrete-phase case, we reformulate the problem using a penalty method and solve it via majorization-minimization, outperforming the heuristic methods from our earlier work. Further, we analyze the electric field characteristics across multiple interference directions and frequencies to quantify the nulling capability of the RIS-aided reflectors, and identify a simple criterion for the existence of unimodular weights enabling perfect nulls. Simulation results demonstrate the effectiveness of the proposed methods and confirm the theoretical nulling limits.
Abstract:In many sensing (viz., radio astronomy) and radar applications, the received signal of interest (SOI) exhibits a significantly wider bandwidth or weaker power than the interference signal, rendering it indistinguishable from the background noise. Such scenarios arise frequently in applications such as passive radar, cognitive radio, low-probability-of-intercept (LPI) radar, and planetary radar for radio astronomy, where canceling the radio frequency interference (RFI) is critical for uncovering the SOI. In this work, we examine the Demodulation-Remodulation (Demod-Remod) based interference cancellation framework for the RFI. This approach demodulates the unknown interference, creates a noise-free interference replica, and coherently subtracts it from the received signal. To evaluate the performance limits, we employ the performance metric termed \textit{interference rejection ratio} (IRR), which quantifies the interference canceled. We derive the analytical expressions of IRR as a function of the optimal estimation variances of the signal parameters. Simulation results confirm the accuracy of the analytical expression for both single-carrier and multi-carrier interference signals and demonstrate that the method can substantially suppress the interference at a sufficient interference-to-noise ratio (INR), enabling enhanced detection and extraction of the SOI. We further extend the analysis to the scenario where the SOI is above the noise floor, and confirm the validity of the theoretical IRR expression in this scenario. Lastly, we compare the Demod-Remod technique to other time-domain cancellation methods. The result of the comparison identifies the conditions under which each method is preferred, offering practical guidelines for interference mitigation under different scenarios.
Abstract:Structure from Motion (SfM) is a critical task in computer vision, aiming to recover the 3D scene structure and camera motion from a sequence of 2D images. The recent pose-only imaging geometry decouples 3D coordinates from camera poses and demonstrates significantly better SfM performance through pose adjustment. Continuing the pose-only perspective, this paper explores the critical relationship between the scene structures, rotation and translation. Notably, the translation can be expressed in terms of rotation, allowing us to condense the imaging geometry representation onto the rotation manifold. A rotation-only optimization framework based on reprojection error is proposed for both two-view and multi-view scenarios. The experiment results demonstrate superior accuracy and robustness performance over the current state-of-the-art rotation estimation methods, even comparable to multiple bundle adjustment iteration results. Hopefully, this work contributes to even more accurate, efficient and reliable 3D visual computing.
Abstract:Pioneering text-to-image (T2I) diffusion models have ushered in a new era of real-world image super-resolution (Real-ISR), significantly enhancing the visual perception of reconstructed images. However, existing methods typically integrate uniform abstract textual semantics across all blocks, overlooking the distinct semantic requirements at different depths and the fine-grained, concrete semantics inherently present in the images themselves. Moreover, relying solely on a single type of guidance further disrupts the consistency of reconstruction. To address these issues, we propose MegaSR, a novel framework that mines customized block-wise semantics and expressive guidance for diffusion-based ISR. Compared to uniform textual semantics, MegaSR enables flexible adaptation to multi-granularity semantic awareness by dynamically incorporating image attributes at each block. Furthermore, we experimentally identify HED edge maps, depth maps, and segmentation maps as the most expressive guidance, and propose a multi-stage aggregation strategy to modulate them into the T2I models. Extensive experiments demonstrate the superiority of MegaSR in terms of semantic richness and structural consistency.
Abstract:For the 6G wireless networks, achieving high-performance integrated localization and communication (ILAC) is critical to unlock the full potential of wireless networks. To simultaneously enhance localization and communication performance cost-effectively, this paper proposes sparse multiple-input multiple-output (MIMO) based ILAC with nested and co-prime sparse arrays deployed at the base station. Sparse MIMO relaxes the traditional half-wavelength antenna spacing constraint to enlarge the antenna aperture, thus enhancing localization degrees of freedom and providing finer spatial resolution. However, it also leads to undesired grating lobes, which may cause severe inter-user interference for communication and angular ambiguity for localization. While the latter issue can be effectively addressed by the virtual array technology, by forming sum or difference co-arrays via signal (conjugate) correlation among array elements, it is unclear whether the similar virtual array technology also benefits wireless communications for ILAC systems. In this paper, we first reveal that the answer to the above question is negative, by showing that forming virtual arrays for wireless communication will cause destruction of phase information, degradation of signal-to-noise ratio and aggravation of multi-user interference. Therefore, we propose the hybrid processing for sparse MIMO based ILAC, i.e., physical array based communication while virtual array based localization. To this end, we characterize the beam pattern of sparse arrays by three metrics, demonstrating that despite of the introduction of grating lobes, sparse arrays can also bring benefits to communications thanks to its narrower main lobe beam width than the conventional compact arrays. Extensive simulation results are presented to demonstrate the performance gains of sparse MIMO based ILAC over that based on the conventional compact MIMO.
Abstract:Movable antenna (MA), which can flexibly change the position of antenna in three-dimensional (3D) continuous space, is an emerging technology for achieving full spatial performance gains. In this paper, a prototype of MA communication system with ultra-accurate movement control is presented to verify the performance gain of MA in practical environments. The prototype utilizes the feedback control to ensure that each power measurement is performed after the MA moves to a designated position. The system operates at 3.5 GHz or 27.5 GHz, where the MA moves along a one-dimensional horizontal line with a step size of 0.01{\lambda} and in a two-dimensional square region with a step size of 0.05{\lambda}, respectively, with {\lambda} denoting the signal wavelength. The scenario with mixed line-of-sight (LoS) and non-LoS (NLoS) links is considered. Extensive experimental results are obtained with the designed prototype and compared with the simulation results, which validate the great potential of MA technology in improving wireless communication performance. For example, the maximum variation of measured power reaches over 40 dB and 23 dB at 3.5 GHz and 27.5 GHz, respectively, thanks to the flexible antenna movement. In addition, experimental results indicate that the power gain of MA system relies on the estimated path state information (PSI), including the number of paths, their delays, elevation and azimuth angles of arrival (AoAs), as well as the power ratio of each path.




Abstract:Multiple-input multiple-output (MIMO) has been a key technology of wireless communications for decades. A typical MIMO system employs antenna arrays with the inter-antenna spacing being half of the signal wavelength, which we term as compact MIMO. Looking forward towards the future sixth-generation (6G) mobile communication networks, MIMO system will achieve even finer spatial resolution to not only enhance the spectral efficiency of wireless communications, but also enable more accurate wireless sensing. To this end, by removing the restriction of half-wavelength antenna spacing, sparse MIMO has been proposed as a new architecture that is able to significantly enlarge the array aperture as compared to conventional compact MIMO with the same number of array elements. In addition, sparse MIMO leads to a new form of virtual MIMO systems for sensing with their virtual apertures considerably larger than physical apertures. As sparse MIMO is expected to be a viable technology for 6G, we provide in this article a comprehensive overview of it, especially focusing on its appealing advantages for integrated sensing and communication (ISAC) towards 6G. Specifically, assorted sparse MIMO architectures are first introduced, followed by their new benefits as well as challenges. We then discuss the main design issues of sparse MIMO, including beam pattern synthesis, signal processing, grating lobe suppression, beam codebook design, and array geometry optimization. Last, we provide numerical results to evaluate the performance of sparse MIMO for ISAC and point out promising directions for future research.
Abstract:Gastrointestinal (GI) tract cancers pose a global health challenge, demanding precise radiotherapy planning for optimal treatment outcomes. This paper introduces a cutting-edge approach to automate the segmentation of GI tract regions in magnetic resonance imaging (MRI) scans. Leveraging advanced deep learning architectures, the proposed model integrates Inception-V4 for initial classification, UNet++ with a VGG19 encoder for 2.5D data, and Edge UNet for grayscale data segmentation. Meticulous data preprocessing, including innovative 2.5D processing, is employed to enhance adaptability, robustness, and accuracy. This work addresses the manual and time-consuming segmentation process in current radiotherapy planning, presenting a unified model that captures intricate anatomical details. The integration of diverse architectures, each specializing in unique aspects of the segmentation task, signifies a novel and comprehensive solution. This model emerges as an efficient and accurate tool for clinicians, marking a significant advancement in the field of GI tract image segmentation for radiotherapy planning.
Abstract:How to efficiently and accurately handle image matching outliers is a critical issue in two-view relative estimation. The prevailing RANSAC method necessitates that the minimal point pairs be inliers. This paper introduces a linear relative pose estimation algorithm for n $( n \geq 6$) point pairs, which is founded on the recent pose-only imaging geometry to filter out outliers by proper reweighting. The proposed algorithm is able to handle planar degenerate scenes, and enhance robustness and accuracy in the presence of a substantial ratio of outliers. Specifically, we embed the linear global translation (LiGT) constraint into the strategies of iteratively reweighted least-squares (IRLS) and RANSAC so as to realize robust outlier removal. Simulations and real tests of the Strecha dataset show that the proposed algorithm achieves relative rotation accuracy improvement of 2 $\sim$ 10 times in face of as large as 80% outliers.