Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xinyu Gu

Integrated Influence: Data Attribution with Baseline

Aug 07, 2025

Linxiao Yang, Xinyu Gu, Liang Sun

Abstract:As an effective approach to quantify how training samples influence test sample, data attribution is crucial for understanding data and model and further enhance the transparency of machine learning models. We find that prevailing data attribution methods based on leave-one-out (LOO) strategy suffer from the local-based explanation, as these LOO-based methods only perturb a single training sample, and overlook the collective influence in the training set. On the other hand, the lack of baseline in many data attribution methods reduces the flexibility of the explanation, e.g., failing to provide counterfactual explanations. In this paper, we propose Integrated Influence, a novel data attribution method that incorporates a baseline approach. Our method defines a baseline dataset, follows a data degeneration process to transition the current dataset to the baseline, and accumulates the influence of each sample throughout this process. We provide a solid theoretical framework for our method, and further demonstrate that popular methods, such as influence functions, can be viewed as special cases of our approach. Experimental results show that Integrated Influence generates more reliable data attributions compared to existing methods in both data attribution task and mislablled example identification task.

Via

Access Paper or Ask Questions

EarthLink: A Self-Evolving AI Agent for Climate Science

Jul 24, 2025

Zijie Guo, Jiong Wang, Xiaoyu Yue, Wangxu Wei, Zhe Jiang, Wanghan Xu, Ben Fei, Wenlong Zhang, Xinyu Gu, Lijing Cheng(+7 more)

Figure 1 for EarthLink: A Self-Evolving AI Agent for Climate Science

Figure 2 for EarthLink: A Self-Evolving AI Agent for Climate Science

Figure 3 for EarthLink: A Self-Evolving AI Agent for Climate Science

Abstract:Modern Earth science is at an inflection point. The vast, fragmented, and complex nature of Earth system data, coupled with increasingly sophisticated analytical demands, creates a significant bottleneck for rapid scientific discovery. Here we introduce EarthLink, the first AI agent designed as an interactive copilot for Earth scientists. It automates the end-to-end research workflow, from planning and code generation to multi-scenario analysis. Unlike static diagnostic tools, EarthLink can learn from user interaction, continuously refining its capabilities through a dynamic feedback loop. We validated its performance on a number of core scientific tasks of climate change, ranging from model-observation comparisons to the diagnosis of complex phenomena. In a multi-expert evaluation, EarthLink produced scientifically sound analyses and demonstrated an analytical competency that was rated as comparable to specific aspects of a human junior researcher's workflow. Additionally, its transparent, auditable workflows and natural language interface empower scientists to shift from laborious manual execution to strategic oversight and hypothesis generation. EarthLink marks a pivotal step towards an efficient, trustworthy, and collaborative paradigm for Earth system research in an era of accelerating global change. The system is accessible at our website https://earthlink.intern-ai.org.cn.

Via

Access Paper or Ask Questions

Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models

Jan 20, 2025

Zhuangzhuang Yan, Xinyu Gu, Shilong Fan, Zhenyu Liu

Figure 1 for Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models

Figure 2 for Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models

Figure 3 for Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models

Figure 4 for Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models

Abstract:Accurate and reliable link quality prediction (LQP) is crucial for optimizing network performance, ensuring communication stability, and enhancing user experience in wireless communications. However, LQP faces significant challenges due to the dynamic and lossy nature of wireless links, which are influenced by interference, multipath effects, fading, and blockage. In this paper, we propose GAT-LLM, a novel multivariate wireless link quality prediction model that combines Large Language Models (LLMs) with Graph Attention Networks (GAT) to enable accurate and reliable multivariate LQP of wireless communications. By framing LQP as a time series prediction task and appropriately preprocessing the input data, we leverage LLMs to improve the accuracy of link quality prediction. To address the limitations of LLMs in multivariate prediction due to typically handling one-dimensional data, we integrate GAT to model interdependencies among multiple variables across different protocol layers, enhancing the model's ability to handle complex dependencies. Experimental results demonstrate that GAT-LLM significantly improves the accuracy and robustness of link quality prediction, particularly in multi-step prediction scenarios.

Via

Access Paper or Ask Questions

A CSI Feedback Framework based on Transmitting the Important Values and Generating the Others

Nov 20, 2024

Zhilin Du, Zhenyu Liu, Haozhen Li, Shilong Fan, Xinyu Gu, Lin Zhang

Abstract:The application of deep learning (DL)-based channel state information (CSI) feedback frameworks in massive multiple-input multiple-output (MIMO) systems has significantly improved reconstruction accuracy. However, the limited generalization of widely adopted autoencoder-based networks for CSI feedback challenges consistent performance under dynamic wireless channel conditions and varying communication overhead constraints. To enhance the robustness of DL-based CSI feedback across diverse channel scenarios, we propose a novel framework, ITUG, where the user equipment (UE) transmits only a selected portion of critical values in the CSI matrix, while a generative model deployed at the BS reconstructs the remaining values. Specifically, we introduce a scoring algorithm to identify important values based on amplitude and contrast, an encoding algorithm to convert these values into a bit stream for transmission using adaptive bit length and a modified Huffman codebook, and a Transformer-based generative network named TPMVNet to recover the untransmitted values based on the received important values. Experimental results demonstrate that the ITUG framework, equipped with a single TPMVNet, achieves superior reconstruction performance compared to several high-performance autoencoder models across various channel conditions.

Via

Access Paper or Ask Questions

Passenger hazard perception based on EEG signals for highly automated driving vehicles

Aug 29, 2024

Ashton Yu Xuan Tan, Yingkai Yang, Xiaofei Zhang, Bowen Li, Xiaorong Gao, Sifa Zheng, Jianqiang Wang, Xinyu Gu, Jun Li, Yang Zhao(+2 more)

Figure 1 for Passenger hazard perception based on EEG signals for highly automated driving vehicles

Figure 2 for Passenger hazard perception based on EEG signals for highly automated driving vehicles

Figure 3 for Passenger hazard perception based on EEG signals for highly automated driving vehicles

Figure 4 for Passenger hazard perception based on EEG signals for highly automated driving vehicles

Abstract:Enhancing the safety of autonomous vehicles is crucial, especially given recent accidents involving automated systems. As passengers in these vehicles, humans' sensory perception and decision-making can be integrated with autonomous systems to improve safety. This study explores neural mechanisms in passenger-vehicle interactions, leading to the development of a Passenger Cognitive Model (PCM) and the Passenger EEG Decoding Strategy (PEDS). Central to PEDS is a novel Convolutional Recurrent Neural Network (CRNN) that captures spatial and temporal EEG data patterns. The CRNN, combined with stacking algorithms, achieves an accuracy of $85.0\% \pm 3.18\%$. Our findings highlight the predictive power of pre-event EEG data, enhancing the detection of hazardous scenarios and offering a network-driven framework for safer autonomous vehicles.

Via

Access Paper or Ask Questions

Dig-CSI: A Distributed and Generative Model Assisted CSI Feedback Training Framework

Dec 10, 2023

Zhilin Du, Haozhen Li, Zhenyu Liu, Shilong Fan, Xinyu Gu, Lin Zhang

Figure 1 for Dig-CSI: A Distributed and Generative Model Assisted CSI Feedback Training Framework

Figure 2 for Dig-CSI: A Distributed and Generative Model Assisted CSI Feedback Training Framework

Figure 3 for Dig-CSI: A Distributed and Generative Model Assisted CSI Feedback Training Framework

Figure 4 for Dig-CSI: A Distributed and Generative Model Assisted CSI Feedback Training Framework

Abstract:The advent of deep learning (DL)-based models has significantly advanced Channel State Information (CSI) feedback mechanisms in wireless communication systems. However, traditional approaches often suffer from high communication overhead and potential privacy risks due to the centralized nature of CSI data processing. To address these challenges, we design a CSI feedback training framework called Dig-CSI, in which the dataset for training the CSI feedback model is produced by the distributed generators uploaded by each user equipment (UE), but not through local data upload. Each UE trains an autoencoder, where the decoder is considered as the distributed generator, with local data to gain reconstruction accuracy and the ability to generate. Experimental results show that Dig-CSI can train a global CSI feedback model with comparable performance to the model trained with classical centralized learning with a much lighter communication overhead.

Via

Access Paper or Ask Questions

Multi-task Deep Neural Networks for Massive MIMO CSI Feedback

Apr 18, 2022

Boyuan Zhang, Haozhen Li, Xin Liang, Xinyu Gu, Lin Zhang

Figure 1 for Multi-task Deep Neural Networks for Massive MIMO CSI Feedback

Figure 2 for Multi-task Deep Neural Networks for Massive MIMO CSI Feedback

Figure 3 for Multi-task Deep Neural Networks for Massive MIMO CSI Feedback

Figure 4 for Multi-task Deep Neural Networks for Massive MIMO CSI Feedback

Abstract:Deep learning has been widely applied for the channel state information (CSI) feedback in frequency division duplexing (FDD) massive multiple-input multiple-output (MIMO) system. For the typical supervised training of the feedback model, the requirements of large amounts of task-specific labeled data can hardly be satisfied, and the huge training costs and storage usage of the model in multiple scenarios are hindrance for model application. In this letter, a multi-task learning-based approach is proposed to improve the feasibility of the feedback network. An encoder-shared feedback architecture and the corresponding training scheme are further proposed to facilitate the implementation of the multi-task learning approach. The experimental results indicate that the proposed multi-task learning approach can achieve comprehensive feedback performance with considerable reduction of training cost and storage usage of the feedback model.

* 5 pages, 2 figures

Via

Access Paper or Ask Questions

Changeable Rate and Novel Quantization for CSI Feedback Based on Deep Learning

Feb 28, 2022

Xin Liang, Haoran Chang, Haozhen Li, Xinyu Gu, Lin Zhang

Figure 1 for Changeable Rate and Novel Quantization for CSI Feedback Based on Deep Learning

Figure 2 for Changeable Rate and Novel Quantization for CSI Feedback Based on Deep Learning

Figure 3 for Changeable Rate and Novel Quantization for CSI Feedback Based on Deep Learning

Figure 4 for Changeable Rate and Novel Quantization for CSI Feedback Based on Deep Learning

Abstract:Deep learning (DL)-based channel state information (CSI) feedback improves the capacity and energy efficiency of massive multiple-input multiple-output (MIMO) systems in frequency division duplexing mode. However, multiple neural networks with different lengths of feedback overhead are required by time-varying bandwidth resources. The storage space required at the user equipment (UE) and the base station (BS) for these models increases linearly with the number of models. In this paper, we propose a DL-based changeable-rate framework with novel quantization scheme to improve the efficiency and feasibility of CSI feedback systems. This framework can reutilize all the network layers to achieve overhead-changeable CSI feedback to optimize the storage efficiency at the UE and the BS sides. Designed quantizer in this framework can avoid the normalization and gradient problems faced by traditional quantization schemes. Specifically, we propose two DL-based changeable-rate CSI feedback networks CH-CsiNetPro and CH-DualNetSph by introducing a feedback overhead control unit. Then, a pluggable quantization block (PQB) is developed to further improve the encoding efficiency of CSI feedback in an end-to-end way. Compared with existing CSI feedback methods, the proposed framework saves the storage space by about 50% with changeable-rate scheme and improves the encoding efficiency with the quantization module.

Via

Access Paper or Ask Questions

CSI Sensing and Feedback: A Semi-Supervised Learning Approach

Sep 26, 2021

Haozhen Li, Boyuan Zhang, Xin Liang, Haoran Chang, Xinyu Gu, Lin Zhang

Figure 1 for CSI Sensing and Feedback: A Semi-Supervised Learning Approach

Figure 2 for CSI Sensing and Feedback: A Semi-Supervised Learning Approach

Figure 3 for CSI Sensing and Feedback: A Semi-Supervised Learning Approach

Figure 4 for CSI Sensing and Feedback: A Semi-Supervised Learning Approach

Abstract:Deep learning-based (DL-based) channel state information (CSI) feedback for a Massive multiple-input multiple-output (MIMO) system has proved to be a creative and efficient application. However, the existing systems ignored the wireless channel environment variation sensing, e.g., indoor and outdoor scenarios. Moreover, systems training requires excess pre-labeled CSI data, which is often unavailable. In this letter, to address these issues, we first exploit the rationality of introducing semi-supervised learning on CSI feedback, then one semi-supervised CSI sensing and feedback Network ($S^2$CsiNet) with three classifiers comparisons is proposed. Experiment shows that $S^2$CsiNet primarily improves the feasibility of the DL-based CSI feedback system by \textbf{\textit{indoor}} and \textbf{\textit{outdoor}} environment sensing and at most 96.2\% labeled dataset decreasing and secondarily boost the system performance by data distillation and latent information mining.

Via

Access Paper or Ask Questions