Empowered by the latest progress on innovative metamaterials/metasurfaces and advanced antenna technologies, holographic multiple-input multiple-output (H-MIMO) emerges as a promising technology to fulfill the extreme goals of the sixth-generation (6G) wireless networks. The antenna arrays utilized in H-MIMO comprise massive (possibly to extreme extent) numbers of antenna elements, densely spaced less than half-a-wavelength and integrated into a compact space, realizing an almost continuous aperture. Thanks to the expected low cost, size, weight, and power consumption, such apertures are expected to be largely fabricated for near-field communications. In addition, the physical features of H-MIMO enable manipulations directly on the electromagnetic (EM) wave domain and spatial multiplexing. To fully leverage this potential, near-field H-MIMO channel modeling, especially from the EM perspective, is of paramount significance. In this article, we overview near-field H-MIMO channel models elaborating on the various modeling categories and respective features, as well as their challenges and evaluation criteria. We also present EM-domain channel models that address the inherit computational and measurement complexities. Finally, the article is concluded with a set of future research directions on the topic.
Training good representations for items is critical in recommender models. Typically, an item is assigned a unique randomly generated ID, and is commonly represented by learning an embedding corresponding to the value of the random ID. Although widely used, this approach have limitations when the number of items are large and items are power-law distributed -- typical characteristics of real-world recommendation systems. This leads to the item cold-start problem, where the model is unable to make reliable inferences for tail and previously unseen items. Removing these ID features and their learned embeddings altogether to combat cold-start issue severely degrades the recommendation quality. Content-based item embeddings are more reliable, but they are expensive to store and use, particularly for users' past item interaction sequence. In this paper, we use Semantic IDs, a compact discrete item representations learned from content embeddings using RQ-VAE that captures hierarchy of concepts in items. We showcase how we use them as a replacement of item IDs in a resource-constrained ranking model used in an industrial-scale video sharing platform. Moreover, we show how Semantic IDs improves the generalization ability of our system, without sacrificing top-level metrics.
Envisioned as one of the most promising technologies, holographic multiple-input multiple-output (H-MIMO) recently attracts notable research interests for its great potential in expanding wireless possibilities and achieving fundamental wireless limits. Empowered by the nearly continuous, large and energy-efficient surfaces with powerful electromagnetic (EM) wave control capabilities, H-MIMO opens up the opportunity for signal processing in a more fundamental EM-domain, paving the way for realizing holographic imaging level communications in supporting the extremely high spectral efficiency and energy efficiency in future networks. In this article, we try to implement a generalized EM-domain near-field channel modeling and study its capacity limit of point-to-point H-MIMO systems that equips arbitrarily placed surfaces in a line-of-sight (LoS) environment. Two effective and computational-efficient channel models are established from their integral counterpart, where one is with a sophisticated formula but showcases more accurate, and another is concise with a slight precision sacrifice. Furthermore, we unveil the capacity limit using our channel model, and derive a tight upper bound based upon an elaborately built analytical framework. Our result reveals that the capacity limit grows logarithmically with the product of transmit element area, receive element area, and the combined effects of $1/{{d}_{mn}^2}$, $1/{{d}_{mn}^4}$, and $1/{{d}_{mn}^6}$ over all transmit and receive antenna elements, where $d_{mn}$ indicates the distance between each transmit and receive elements. Numerical evaluations validate the effectiveness of our channel models, and showcase the slight disparity between the upper bound and the exact capacity, which is beneficial for predicting practical system performance.
Holographic multiple-input multiple-output (H-MIMO) is considered as one of the most promising technologies to enable future wireless communications in supporting the expected extreme requirements, such as high energy and spectral efficiency. Empowered by the powerful capability in electromagnetic (EM) wave manipulations, H-MIMO has the potential to reach the fundamental limit of the wireless environment, and opens up the possibility of signal processing in the EM-domain, which needs to be depicted carefully from an EM perspective, especially the wireless channel. To this aim, we study the line-of-sight (LOS) H-MIMO communications with arbitrary surface placements and establish an exact expression of the wireless channel in the EM-domain. To further obtain a more explicit and computationally-efficient channel models, we solve the implicit integrals of the exact channel model with moderate and reasonable assumptions. Numerical studies are executed and the results show good agreements of our established approximated channel models to the exact channel model.
Recommender systems play an important role in many content platforms. While most recommendation research is dedicated to designing better models to improve user experience, we found that research on stabilizing the training for such models is severely under-explored. As recommendation models become larger and more sophisticated, they are more susceptible to training instability issues, \emph{i.e.}, loss divergence, which can make the model unusable, waste significant resources and block model developments. In this paper, we share our findings and best practices we learned for improving the training stability of a real-world multitask ranking model for YouTube recommendations. We show some properties of the model that lead to unstable training and conjecture on the causes. Furthermore, based on our observations of training dynamics near the point of training instability, we hypothesize why existing solutions would fail, and propose a new algorithm to mitigate the limitations of existing solutions. Our experiments on YouTube production dataset show the proposed algorithm can significantly improve training stability while not compromising convergence, comparing with several commonly used baseline methods.
This paper studies the exploitation of triple polarization (TP) for multi-user (MU) holographic multiple-input multiple-output surface (HMIMOS) wireless communication systems, aiming at capacity boosting without enlarging the antenna array size. We specifically consider that both the transmitter and receiver are equipped with an HMIMOS comprising compact sub-wavelength TP patch antennas. To characterize TP MUHMIMOS systems, a TP near-field channel model is proposed using the dyadic Green's function, whose characteristics are leveraged to design a user-cluster-based precoding scheme for mitigating the cross-polarization and inter-user interference contributions. A theoretical correlation analysis for HMIMOS with infinitely small patch antennas is also presented. According to the proposed scheme, the users are assigned to one of the three polarizations, which is easy to implement, at the cost, however, of reducing the system's diversity. Our numerical results showcase that the cross-polarization channel components have a nonnegligible impact on the system performance, which is efficiently eliminated with the proposed MU precoding scheme.
Future wireless systems are envisioned to create an endogenously holography-capable, intelligent, and programmable radio propagation environment, that will offer unprecedented capabilities for high spectral and energy efficiency, low latency, and massive connectivity. A potential and promising technology for supporting the expected extreme requirements of the sixth-generation (6G) communication systems is the holographic multiple-input multiple-output (MIMO) surface (HMIMOS), which will actualize holographic radios with reasonable power consumption and fabrication cost. An HMIMOS is a nearly continuous aperture that incorporates reconfigurable and sub-wavelength-spaced antennas and/or metamaterials. Such surfaces comprising dense electromagnetic (EM) excited elements are capable of recording and manipulating impinging fields with utmost flexibility and precision, as well as with reduced cost and power consumption, thereby shaping arbitrary-intended EM waves with high energy efficiency. The powerful EM processing capability of HMIMOS opens up the possibility of wireless communications of holographic imaging level, paving the way for signal processing techniques realized in the EM domain, possibly in conjunction with their digital-domain counterparts. However, in spite of the significant potential, the studies on HMIMOS-based wireless systems are still at an initial stage. In this survey, we present a comprehensive overview of the latest advances in holographic MIMO communications, with a special focus on their physical aspects, theoretical foundations, and enabling technologies. We also compare HMIMOS systems with conventional multi-antenna technologies, especially massive MIMO systems, present various promising synergies of HMIMOS with current and future candidate technologies, and provide an extensive list of research challenges and open directions.
This paper investigates the utilization of triple polarization (TP) for multi-user (MU) holographic multiple-input multi-output surface (HMIMOS) wireless communication systems, targeting capacity boosting and diversity exploitation without enlarging the antenna array sizes. We specifically consider that both the transmitter and receiver are both equipped with an HMIMOS consisting of compact sub-wavelength TP patch antennas within the near-field (NF) regime. To characterize TP MU-HMIMOS systems, a TP NF channel model is constructed using the dyadic Green's function, whose characteristics are leveraged to design two precoding schemes for mitigating the cross-polarization and inter-user interference contributions. Specifically, a user-cluster-based precoding scheme assigns different users to one of three polarizations at the expense of the system's diversity, and a two-layer precoding scheme removes interference using the Gaussian elimination method at a high computational cost. The theoretical correlation analysis for HMIMOS in the NF region is also investigated, revealing that both the spacing of transmit patch antennas and user distance impact transmit correlation factors. Our numerical results show that the users far from transmitting HMIMOS experience higher correlation than those closer within the NF regime, resulting in a lower channel capacity. Meanwhile, in terms of channel capacity, TP HMIMOS can almost achieve 1.25 times gain compared with dual-polarized HMIMOS, and 3 times compared with conventional HMIMOS. In addition, the proposed two-layer precoding scheme combined with two-layer power allocation realizes a higher spectral efficiency than other schemes without sacrificing diversity.
There has been a flurry of research in recent years on notions of fairness in ranking and recommender systems, particularly on how to evaluate if a recommender allocates exposure equally across groups of relevant items (also known as provider fairness). While this research has laid an important foundation, it gave rise to different approaches depending on whether relevant items are compared per-user/per-query or aggregated across users. Despite both being established and intuitive, we discover that these two notions can lead to opposite conclusions, a form of Simpson's Paradox. We reconcile these notions and show that the tension is due to differences in distributions of users where items are relevant, and break down the important factors of the user's recommendations. Based on this new understanding, practitioners might be interested in either notions, but might face challenges with the per-user metric due to partial observability of the relevance and user satisfaction, typical in real-world recommenders. We describe a technique based on distribution matching to estimate it in such a scenario. We demonstrate on simulated and real-world recommender data the effectiveness and usefulness of such an approach.