Cheng Zhang

RMTransformer: Accurate Radio Map Construction and Coverage Prediction

Jan 11, 2025

Yuxuan Li, Cheng Zhang, Wen Wang, Yongming Huang

Abstract: Radio map prediction, also called pathloss map prediction, is a crucial method for wireless network modeling and management. By leveraging deep learning to construct pathloss patterns from geographical maps, an accurate digital replica of the transmission environment can be established with less computational overhead and lower prediction error than traditional model-driven techniques. While existing state-of-the-art (SOTA) methods predominantly rely on convolutional architectures, this paper introduces a hybrid transformer-convolution model, termed RMTransformer, to enhance the accuracy of radio map prediction. The proposed model features a multi-scale transformer-based encoder for efficient feature extraction and a convolution-based decoder for precise pixel-level image reconstruction. Simulation results demonstrate that the proposed scheme significantly improves prediction accuracy, achieving over a 30% reduction in root mean square error (RMSE) compared to typical SOTA approaches.

* Submitted to IEEE VTC 2025 Spring
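The RMSE comparison quoted in the abstract can be illustrated with a minimal sketch. The function names and the sample values below are hypothetical and are not taken from the paper; the snippet only shows how RMSE and a relative reduction between a baseline and an improved predictor are typically computed.

```python
import math


def rmse(pred, target):
    """Root mean square error between two equally sized sequences."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    squared = [(p - t) ** 2 for p, t in zip(pred, target)]
    return math.sqrt(sum(squared) / len(squared))


def relative_reduction(baseline_rmse, new_rmse):
    """Fraction by which new_rmse is lower than baseline_rmse."""
    return (baseline_rmse - new_rmse) / baseline_rmse


# Hypothetical normalized pathloss values (illustrative only, not paper data).
target = [0.0, 0.0, 0.0, 0.0]
baseline = [0.2, -0.2, 0.2, -0.2]
improved = [0.1, -0.1, 0.1, -0.1]

reduction = relative_reduction(rmse(baseline, target), rmse(improved, target))
print(f"relative RMSE reduction: {reduction:.0%}")
```

With these made-up numbers the improved predictor halves the error; the abstract's "over 30%" figure refers to the authors' own simulations.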

RadioTransformer: Accurate Radio Map Construction and Coverage Prediction

Jan 09, 2025

Yuxuan Li, Cheng Zhang, Wen Wang, Yongming Huang

Abstract: Radio map prediction, also called pathloss map prediction, is a crucial method for wireless network modeling and management. By leveraging deep learning to construct pathloss patterns from geographical maps, an accurate digital replica of the transmission environment can be established with less computational overhead and lower prediction error than traditional model-driven techniques. While existing state-of-the-art (SOTA) methods predominantly rely on convolutional architectures, this paper introduces a hybrid transformer-convolution model, termed RadioTransformer, to enhance the accuracy of radio map prediction. The proposed model features a multi-scale transformer-based encoder for efficient feature extraction and a convolution-based decoder for precise pixel-level image reconstruction. Simulation results demonstrate that the proposed scheme significantly improves prediction accuracy, achieving over a 30% reduction in root mean square error (RMSE) compared to typical SOTA approaches.

* Submitted to IEEE VTC 2025 Spring

Joint Detection and Angle Estimation for Multiple Jammers in Beamspace Massive MIMO

Jan 09, 2025

Pengguang Du, Cheng Zhang, Changwei Zhang, Zhilei Zhang, Yongming Huang

Abstract: In this paper, we study the joint detection and angle estimation problem for beamspace multiple-input multiple-output (MIMO) systems with multiple random jamming targets. An iterative low-complexity generalized likelihood ratio test (GLRT) is proposed by transforming the composite multiple hypothesis test on the projected vector into a series of binary hypothesis tests based on the spatial covariance matrix. In each iteration, the detector implicitly inhibits the mainlobe effects of the previously detected jammers by utilizing the estimated angles and average jamming-to-signal ratios. This enables the detection of a new potential jammer and the identification of its corresponding spatial covariance. Simulation results demonstrate that the proposed method outperforms existing benchmarks by suppressing sidelobes of the detected jammers and interference from irrelevant angles, especially in medium-to-high jamming-to-noise ratio scenarios.

* 6 pages, 2 figures. The paper has been submitted to an IEEE conference for possible publication

CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models

Dec 23, 2024

Ruibo Tu, Hedvig Kjellström, Gustav Eje Henter, Cheng Zhang

Abstract: Causal reasoning capabilities are essential for large language models (LLMs) in a wide range of applications, such as education and healthcare. However, benchmarks for better understanding such capabilities are still lacking. Current LLM benchmarks are mainly based on conversational tasks, academic math tests, and coding tests. Such benchmarks evaluate LLMs in well-regularized settings, but they are limited in assessing the skills and abilities needed to solve real-world problems. In this work, we provide a benchmark, named CARL-GT, which evaluates CAusal Reasoning capabilities of large Language models using Graphs and Tabular data. The benchmark has a diverse range of tasks for evaluating LLMs from causal graph reasoning, knowledge discovery, and decision-making aspects. In addition, effective zero-shot learning prompts are developed for the tasks. In our experiments, we use the benchmark to evaluate open-source LLMs and provide a detailed comparison of their causal reasoning abilities. We find that LLMs are still weak in causal reasoning, especially when using tabular data to discover new insights. Furthermore, we investigate and discuss the relationships between different benchmark tasks by analyzing the performance of LLMs. The experimental results show that LLMs have different strengths on different tasks, and that their performance on tasks in different categories, i.e., causal graph reasoning, knowledge discovery, and decision-making, shows stronger correlation than their performance on tasks in the same category.

Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models

Dec 18, 2024

Xinxin Liu, Aaron Thomas, Cheng Zhang, Jianyi Cheng, Yiren Zhao, Xitong Gao

Abstract: Parameter-Efficient Fine-Tuning (PEFT) has gained prominence through low-rank adaptation methods like LoRA. In this paper, we focus on sparsity-based PEFT (SPEFT), which introduces trainable sparse adaptations to the weight matrices in the model, offering greater flexibility in selecting fine-tuned parameters compared to low-rank methods. We conduct the first systematic evaluation of salience metrics for SPEFT, inspired by zero-cost NAS proxies, and find that simple gradient-based metrics are reliable, with results on par with the best alternatives, offering both computational efficiency and robust performance. Additionally, we compare static and dynamic masking strategies, finding that static masking, which predetermines non-zero entries before training, delivers efficiency without sacrificing performance, while dynamic masking offers no substantial benefits. Across NLP tasks, a simple gradient-based, static SPEFT consistently outperforms other fine-tuning methods for LLMs, providing a simple yet effective baseline for SPEFT. Our work challenges the notion that complexity is necessary for effective PEFT. Our work is open source and available to the community at https://github.com/0-ml/speft.
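The idea of a static, gradient-based sparse adaptation can be sketched as follows. This is not the code from the linked repository: the salience metric (absolute gradient magnitude), the top-k selection, and all function names are assumptions chosen for illustration. The mask is fixed once before training, and only the masked entries of a sparse delta added to the frozen weights are updated.

```python
def static_topk_mask(grads, k):
    """Return a 0/1 mask keeping the k entries with the largest |gradient|.

    Assumption: |gradient| is used as the salience score.
    """
    scored = [(abs(g), i, j) for i, row in enumerate(grads) for j, g in enumerate(row)]
    keep = {(i, j) for _, i, j in sorted(scored, reverse=True)[:k]}
    return [[1 if (i, j) in keep else 0 for j in range(len(row))]
            for i, row in enumerate(grads)]


def masked_delta_step(delta, grads, mask, lr):
    """One SGD step on the sparse delta; entries outside the mask stay zero."""
    return [[d - lr * g * m for d, g, m in zip(d_row, g_row, m_row)]
            for d_row, g_row, m_row in zip(delta, grads, mask)]


def effective_weights(frozen, delta):
    """Frozen pretrained weights plus the trainable sparse adaptation."""
    return [[w + d for w, d in zip(w_row, d_row)] for w_row, d_row in zip(frozen, delta)]


# Hypothetical 2x2 layer (illustrative values only).
frozen = [[1.0, 1.0], [1.0, 1.0]]
grads = [[0.5, -0.1], [0.05, -0.9]]
mask = static_topk_mask(grads, k=2)          # fixed before training starts
delta = masked_delta_step([[0.0, 0.0], [0.0, 0.0]], grads, mask, lr=0.1)
print(mask)
print(effective_weights(frozen, delta))
```

Because the mask never changes after it is computed, the set of trainable entries is known up front, which is what makes the static variant cheap compared with re-selecting entries during training.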

PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting

Dec 16, 2024

Cheng Zhang, Haofei Xu, Qianyi Wu, Camilo Cruz Gambardella, Dinh Phung, Jianfei Cai

Abstract: With the advent of portable 360° cameras, panorama has gained significant attention in applications like virtual reality (VR), virtual tours, robotics, and autonomous driving. As a result, wide-baseline panorama view synthesis has emerged as a vital task, where high resolution, fast inference, and memory efficiency are essential. Nevertheless, existing methods are typically constrained to lower resolutions (512 × 1024) due to demanding memory and computational requirements. In this paper, we present PanSplat, a generalizable, feed-forward approach that efficiently supports resolution up to 4K (2048 × 4096). Our approach features a tailored spherical 3D Gaussian pyramid with a Fibonacci lattice arrangement, enhancing image quality while reducing information redundancy. To accommodate the demands of high resolution, we propose a pipeline that integrates a hierarchical spherical cost volume and Gaussian heads with local operations, enabling two-step deferred backpropagation for memory-efficient training on a single A100 GPU. Experiments demonstrate that PanSplat achieves state-of-the-art results with superior efficiency and image quality across both synthetic and real-world datasets. Code will be available at https://github.com/chengzhag/PanSplat.

* Project Page: https://chengzhag.github.io/publication/pansplat/ Code: https://github.com/chengzhag/PanSplat
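For readers unfamiliar with the Fibonacci lattice mentioned in the abstract, the sketch below generates near-uniformly spaced points on a unit sphere using the common golden-angle construction. It is a generic textbook formulation, not PanSplat's implementation; the function name and the choice of offset are assumptions.

```python
import math


def fibonacci_sphere(n):
    """Return n roughly evenly spaced points (x, y, z) on the unit sphere."""
    golden_angle = math.pi * (3.0 - math.sqrt(5.0))  # about 2.39996 rad
    points = []
    for i in range(n):
        # Heights are spread uniformly in (-1, 1), which equalizes area per point.
        z = 1.0 - (2.0 * i + 1.0) / n
        radius = math.sqrt(max(0.0, 1.0 - z * z))
        # Each successive point is rotated by the golden angle around the z-axis.
        theta = golden_angle * i
        points.append((radius * math.cos(theta), radius * math.sin(theta), z))
    return points


pts = fibonacci_sphere(1000)
print(len(pts), pts[0])
```

Compared with a regular latitude-longitude grid, which oversamples the poles, this kind of arrangement places points with roughly equal density everywhere, which is consistent with the abstract's stated goal of reducing information redundancy.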

Resonant Inductive Coupling Power Transfer for Mid-Sized Inspection Robot

Nov 26, 2024

Mohd Norhakim Bin Hassan, Simon Watson, Cheng Zhang

Abstract: This paper presents a wireless power transfer (WPT) system for a mid-sized inspection mobile robot. The objective is to transmit 100 W of power over a distance of 1 meter, achieved through lightweight Litz wire coils weighing 320 g held together with a coil structure of 3.54 kg. The Wireless Power Transfer System (WPTS) is mounted onto an unmanned ground vehicle (UGV). The study investigates coil design, accounting for misalignment and tolerance issues in resonance-coupled coils. In experimental validation, the system effectively transmits 109.7 W of power over a 1-meter distance, with obstacles present. This yields a system efficiency of 47.14%, a value that is remarkably close to the maximum power transfer point (50%) when the WPTS utilises the full voltage allowance of the capacitor. The paper shows a WPTS charging time of 5 minutes for 12 V, 0.8 Ah lead acid batteries.
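The figures in the abstract can be cross-checked with some back-of-the-envelope arithmetic. The sketch below assumes that efficiency means delivered power divided by input power, and that the full delivered power charges an empty battery with no losses; both are simplifying assumptions for illustration, not statements from the paper, and the function names are hypothetical.

```python
def input_power_w(output_w, efficiency):
    """Input power implied by a delivered power and an end-to-end efficiency."""
    return output_w / efficiency


def ideal_charge_minutes(voltage_v, capacity_ah, power_w):
    """Idealized time to deliver the battery's nominal energy at constant power."""
    energy_wh = voltage_v * capacity_ah
    return energy_wh / power_w * 60.0


p_in = input_power_w(109.7, 0.4714)               # roughly 233 W
minutes = ideal_charge_minutes(12.0, 0.8, 109.7)  # roughly 5.25 minutes
print(f"implied input power: {p_in:.1f} W")
print(f"idealized charge time: {minutes:.2f} min")
```

Under these assumptions a 12 V, 0.8 Ah battery stores about 9.6 Wh, so the idealized estimate lands close to the 5-minute figure reported in the abstract.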

Kleene algebra with commutativity conditions is undecidable

Nov 24, 2024

Arthur Azevedo de Amorim, Cheng Zhang, Marco Gaboardi

Abstract: We prove that the equational theory of Kleene algebra with commutativity conditions on primitives (or atomic terms) is undecidable, thereby settling a longstanding open question in the theory of Kleene algebra. While this question has also been recently solved independently by Kuznetsov, our results hold even for weaker theories that do not support the induction axioms of Kleene algebra.

* Published at CSL 2025

SPAC-Net: Rethinking Point Cloud Completion with Structural Prior

Nov 22, 2024

Zizhao Wu, Jian Shi, Xuan Deng, Cheng Zhang, Genfu Yang, Ming Zeng, Yunhai Wang

Abstract: Point cloud completion aims to infer a complete shape from a partial observation. Many approaches use a pure encoder-decoder paradigm in which the complete shape is predicted directly from shape priors learned from partial scans; however, these methods inevitably lose details because of feature abstraction. In this paper, we propose a novel framework, termed SPAC-Net, that rethinks the completion task under the guidance of a new structural prior, which we call the interface. Specifically, our method first introduces a Marginal Detector (MAD) module to localize the interface, defined as the intersection between the known observation and the missing parts. Based on the interface, our method predicts the coarse shape by learning the displacement that moves points in the interface to their corresponding positions in the missing parts. Furthermore, we devise an additional Structure Supplement (SSP) module before the upsampling stage to enhance the structural details of the coarse shape, enabling the upsampling module to focus more on the upsampling task. Extensive experiments have been conducted on several challenging benchmarks, and the results demonstrate that our method outperforms existing state-of-the-art approaches.

Hardware and Software Platform Inference

Nov 07, 2024

Cheng Zhang, Hanna Foerster, Robert D. Mullins, Yiren Zhao, Ilia Shumailov

Abstract: It is now a common business practice to buy access to large language model (LLM) inference rather than self-host, because of significant upfront hardware infrastructure and energy costs. However, as a buyer, there is no mechanism to verify the authenticity of the advertised service including the serving hardware platform, e.g. that it is actually being served using an NVIDIA H100. Furthermore, there are reports suggesting that model providers may deliver models that differ slightly from the advertised ones, often to make them run on less expensive hardware. That way, a client pays a premium for access to a capable model on more expensive hardware, yet ends up being served by a (potentially less capable) cheaper model on cheaper hardware. In this paper we introduce hardware and software platform inference (HSPI), a method for identifying the underlying GPU architecture and software stack of a (black-box) machine learning model solely based on its input-output behavior. Our method leverages the inherent differences of various GPU architectures and compilers to distinguish between different GPU types and software stacks. By analyzing the numerical patterns in the model's outputs, we propose a classification framework capable of accurately identifying the GPU used for model inference as well as the underlying software configuration. Our findings demonstrate the feasibility of inferring GPU type from black-box models. We evaluate HSPI against models served on different real hardware and find that in a white-box setting we can distinguish between different GPUs with between 83.9% and 100% accuracy. Even in a black-box setting we are able to achieve results that are up to three times higher than random guess accuracy.
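One plausible source of the numerical patterns the abstract refers to is that floating-point addition is not associative, so different accumulation orders can yield slightly different results for the same inputs. The sketch below demonstrates only this general effect on a CPU; it is not the HSPI method, and the two summation strategies are stand-ins, chosen as assumptions, for how different hardware or kernels might order their reductions.

```python
def sum_left_to_right(values):
    """Sequential accumulation, as a simple loop would do."""
    total = 0.0
    for v in values:
        total += v
    return total


def sum_pairwise(values):
    """Tree-style reduction, similar in spirit to many parallel implementations."""
    if not values:
        return 0.0
    if len(values) == 1:
        return values[0]
    mid = len(values) // 2
    return sum_pairwise(values[:mid]) + sum_pairwise(values[mid:])


values = [0.1, 0.2, 0.3]
sequential = sum_left_to_right(values)
tree = sum_pairwise(values)
# The two results agree mathematically but can differ in the last bits.
print(repr(sequential), repr(tree), sequential == tree)
```

Differences of this size are invisible to a casual user, but a classifier that inspects low-order bits of many outputs could, in principle, tell the two reduction orders apart.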
