Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xinyi Chen

Princeton University

Leveraging LLM Agents for Translating Network Configurations

Jan 15, 2025

Yunze Wei, Xiaohui Xie, Yiwei Zuo, Tianshuo Hu, Xinyi Chen, Kaiwen Chi, Yong Cui

Abstract:Configuration translation is a critical and frequent task in network operations. When a network device is damaged or outdated, administrators need to replace it to maintain service continuity. The replacement devices may originate from different vendors, necessitating configuration translation to ensure seamless network operation. However, translating configurations manually is a labor-intensive and error-prone process. In this paper, we propose an intent-based framework for translating network configuration with Large Language Model (LLM) Agents. The core of our approach is an Intent-based Retrieval Augmented Generation (IRAG) module that systematically splits a configuration file into fragments, extracts intents, and generates accurate translations. We also design a two-stage verification method to validate the syntax and semantics correctness of the translated configurations. We implement and evaluate the proposed method on real-world network configurations. Experimental results show that our method achieves 97.74% syntax correctness, outperforming state-of-the-art methods in translation accuracy.

Via

Access Paper or Ask Questions

Neural Reflectance Fields for Radio-Frequency Ray Tracing

Jan 05, 2025

Haifeng Jia, Xinyi Chen, Yichen Wei, Yifei Sun, Yibo Pi

Abstract:Ray tracing is widely employed to model the propagation of radio-frequency (RF) signal in complex environment. The modelling performance greatly depends on how accurately the target scene can be depicted, including the scene geometry and surface material properties. The advances in computer vision and LiDAR make scene geometry estimation increasingly accurate, but there still lacks scalable and efficient approaches to estimate the material reflectivity in real-world environment. In this work, we tackle this problem by learning the material reflectivity efficiently from the path loss of the RF signal from the transmitters to receivers. Specifically, we want the learned material reflection coefficients to minimize the gap between the predicted and measured powers of the receivers. We achieve this by translating the neural reflectance field from optics to RF domain by modelling both the amplitude and phase of RF signals to account for the multipath effects. We further propose a differentiable RF ray tracing framework that optimizes the neural reflectance field to match the signal strength measurements. We simulate a complex real-world environment for experiments and our simulation results show that the neural reflectance field can successfully learn the reflection coefficients for all incident angles. As a result, our approach achieves better accuracy in predicting the powers of receivers with significantly less training data compared to existing approaches.

* Accepted by IEEE Global Communications Conference 2024 (GLOBECOM'24)

Via

Access Paper or Ask Questions

S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation

Nov 19, 2024

Yuke Wu, Xiang Liu, Yunyu Shi, Xinyi Chen, Zhenglei Wang, YuQing Xu, Shuo Hong Wang

Figure 1 for S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation

Figure 2 for S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation

Figure 3 for S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation

Figure 4 for S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation

Abstract:The irregular and challenging characteristics of lung adenocarcinoma nodules in computed tomography (CT) images complicate staging diagnosis, making accurate segmentation critical for clinicians to extract detailed lesion information. In this study, we propose a segmentation model, S3TU-Net, which integrates multi-dimensional spatial connectors and a superpixel-based visual transformer. S3TU-Net is built on a multi-view CNN-Transformer hybrid architecture, incorporating superpixel algorithms, structured weighting, and spatial shifting techniques to achieve superior segmentation performance. The model leverages structured convolution blocks (DWF-Conv/D2BR-Conv) to extract multi-scale local features while mitigating overfitting. To enhance multi-scale feature fusion, we introduce the S2-MLP Link, integrating spatial shifting and attention mechanisms at the skip connections. Additionally, the residual-based superpixel visual transformer (RM-SViT) effectively merges global and local features by employing sparse correlation learning and multi-branch attention to capture long-range dependencies, with residual connections enhancing stability and computational efficiency. Experimental results on the LIDC-IDRI dataset demonstrate that S3TU-Net achieves a DSC, precision, and IoU of 89.04%, 90.73%, and 90.70%, respectively. Compared to recent methods, S3TU-Net improves DSC by 4.52% and sensitivity by 3.16%, with other metrics showing an approximate 2% increase. In addition to comparison and ablation studies, we validated the generalization ability of our model on the EPDB private dataset, achieving a DSC of 86.40%.

Via

Access Paper or Ask Questions

Provable Length Generalization in Sequence Prediction via Spectral Filtering

Nov 01, 2024

Annie Marsden, Evan Dogariu, Naman Agarwal, Xinyi Chen, Daniel Suo, Elad Hazan

Figure 1 for Provable Length Generalization in Sequence Prediction via Spectral Filtering

Figure 2 for Provable Length Generalization in Sequence Prediction via Spectral Filtering

Figure 3 for Provable Length Generalization in Sequence Prediction via Spectral Filtering

Figure 4 for Provable Length Generalization in Sequence Prediction via Spectral Filtering

Abstract:We consider the problem of length generalization in sequence prediction. We define a new metric of performance in this setting -- the Asymmetric-Regret -- which measures regret against a benchmark predictor with longer context length than available to the learner. We continue by studying this concept through the lens of the spectral filtering algorithm. We present a gradient-based learning algorithm that provably achieves length generalization for linear dynamical systems. We conclude with proof-of-concept experiments which are consistent with our theory.

* 34 pages, 9 figures

Via

Access Paper or Ask Questions

Toward Understanding In-context vs. In-weight Learning

Oct 30, 2024

Bryan Chan, Xinyi Chen, András György, Dale Schuurmans

Abstract:It has recently been demonstrated empirically that in-context learning emerges in transformers when certain distributional properties are present in the training data, but this ability can also diminish upon further training. We provide a new theoretical understanding of these phenomena by identifying simplified distributional properties that give rise to the emergence and eventual disappearance of in-context learning. We do so by first analyzing a simplified model that uses a gating mechanism to choose between an in-weight and an in-context predictor. Through a combination of a generalization error and regret analysis we identify conditions where in-context and in-weight learning emerge. These theoretical findings are then corroborated experimentally by comparing the behaviour of a full transformer on the simplified distributions to that of the stylized model, demonstrating aligned results. We then extend the study to a full large language model, showing how fine-tuning on various collections of natural language prompts can elicit similar in-context and in-weight learning behaviour.

Via

Access Paper or Ask Questions

LLM$\times$MapReduce: Simplified Long-Sequence Processing using Large Language Models

Oct 12, 2024

Zihan Zhou, Chong Li, Xinyi Chen, Shuo Wang, Yu Chao, Zhili Li, Haoyu Wang, Rongqiao An, Qi Shi, Zhixing Tan(+4 more)

$Figure 1 for LLM$\times$MapReduce: Simplified Long-Sequence Processing using Large Language Models$

$Figure 2 for LLM$\times$MapReduce: Simplified Long-Sequence Processing using Large Language Models$

$Figure 3 for LLM$\times$MapReduce: Simplified Long-Sequence Processing using Large Language Models$

$Figure 4 for LLM$\times$MapReduce: Simplified Long-Sequence Processing using Large Language Models$

Abstract:Enlarging the context window of large language models (LLMs) has become a crucial research area, particularly for applications involving extremely long texts. In this work, we propose a novel training-free framework for processing long texts, utilizing a divide-and-conquer strategy to achieve comprehensive document understanding. The proposed LLM$\times$MapReduce framework splits the entire document into several chunks for LLMs to read and then aggregates the intermediate answers to produce the final output. The main challenge for divide-and-conquer long text processing frameworks lies in the risk of losing essential long-range information when splitting the document, which can lead the model to produce incomplete or incorrect answers based on the segmented texts. Disrupted long-range information can be classified into two categories: inter-chunk dependency and inter-chunk conflict. We design a structured information protocol to better cope with inter-chunk dependency and an in-context confidence calibration mechanism to resolve inter-chunk conflicts. Experimental results demonstrate that LLM$\times$MapReduce can outperform representative open-source and commercial long-context LLMs, and is applicable to several different models.

* Work in Progress. Code: https://github.com/thunlp/LLMxMapReduce

Via

Access Paper or Ask Questions

Test-Time Intensity Consistency Adaptation for Shadow Detection

Oct 10, 2024

Leyi Zhu, Weihuang Liu, Xinyi Chen, Zimeng Li, Xuhang Chen, Zhen Wang, Chi-Man Pun

Figure 1 for Test-Time Intensity Consistency Adaptation for Shadow Detection

Figure 2 for Test-Time Intensity Consistency Adaptation for Shadow Detection

Figure 3 for Test-Time Intensity Consistency Adaptation for Shadow Detection

Figure 4 for Test-Time Intensity Consistency Adaptation for Shadow Detection

Abstract:Shadow detection is crucial for accurate scene understanding in computer vision, yet it is challenged by the diverse appearances of shadows caused by variations in illumination, object geometry, and scene context. Deep learning models often struggle to generalize to real-world images due to the limited size and diversity of training datasets. To address this, we introduce TICA, a novel framework that leverages light-intensity information during test-time adaptation to enhance shadow detection accuracy. TICA exploits the inherent inconsistencies in light intensity across shadow regions to guide the model toward a more consistent prediction. A basic encoder-decoder model is initially trained on a labeled dataset for shadow detection. Then, during the testing phase, the network is adjusted for each test sample by enforcing consistent intensity predictions between two augmented input image versions. This consistency training specifically targets both foreground and background intersection regions to identify shadow regions within images accurately for robust adaptation. Extensive evaluations on the ISTD and SBU shadow detection datasets reveal that TICA significantly demonstrates that TICA outperforms existing state-of-the-art methods, achieving superior results in balanced error rate (BER).

* 15 pages, 5 figures, published to ICONIP 2024

Via

Access Paper or Ask Questions

FutureFill: Fast Generation from Convolutional Sequence Models

Oct 02, 2024

Naman Agarwal, Xinyi Chen, Evan Dogariu, Vlad Feinberg, Daniel Suo, Peter Bartlett, Elad Hazan

Figure 1 for FutureFill: Fast Generation from Convolutional Sequence Models

Figure 2 for FutureFill: Fast Generation from Convolutional Sequence Models

Figure 3 for FutureFill: Fast Generation from Convolutional Sequence Models

Figure 4 for FutureFill: Fast Generation from Convolutional Sequence Models

Abstract:We address the challenge of efficient auto-regressive generation in sequence prediction models by introducing FutureFill: a method for fast generation that applies to any sequence prediction algorithm based on convolutional operators. Our approach reduces the generation time requirement from linear to square root relative to the context length. Additionally, FutureFill requires a prefill cache sized only by the number of tokens generated, which is smaller than the cache requirements for standard convolutional and attention-based models. We validate our theoretical findings with experimental evidence demonstrating correctness and efficiency gains in a synthetic generation task.

Via

Access Paper or Ask Questions

ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation

Sep 12, 2024

Fuchen Zheng, Xinyi Chen, Xuhang Chen, Haolun Li, Xiaojiao Guo, Guoheng Huang, Chi-Man Pun, Shoujun Zhou

Figure 1 for ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation

Figure 2 for ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation

Figure 3 for ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation

Figure 4 for ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation

Abstract:Medical image segmentation, a crucial task in computer vision, facilitates the automated delineation of anatomical structures and pathologies, supporting clinicians in diagnosis, treatment planning, and disease monitoring. Notably, transformers employing shifted window-based self-attention have demonstrated exceptional performance. However, their reliance on local window attention limits the fusion of local and global contextual information, crucial for segmenting microtumors and miniature organs. To address this limitation, we propose the Adaptive Semantic Segmentation Network (ASSNet), a transformer architecture that effectively integrates local and global features for precise medical image segmentation. ASSNet comprises a transformer-based U-shaped encoder-decoder network. The encoder utilizes shifted window self-attention across five resolutions to extract multi-scale features, which are then propagated to the decoder through skip connections. We introduce an augmented multi-layer perceptron within the encoder to explicitly model long-range dependencies during feature extraction. Recognizing the constraints of conventional symmetrical encoder-decoder designs, we propose an Adaptive Feature Fusion (AFF) decoder to complement our encoder. This decoder incorporates three key components: the Long Range Dependencies (LRD) block, the Multi-Scale Feature Fusion (MFF) block, and the Adaptive Semantic Center (ASC) block. These components synergistically facilitate the effective fusion of multi-scale features extracted by the decoder while capturing long-range dependencies and refining object boundaries. Comprehensive experiments on diverse medical image segmentation tasks, including multi-organ, liver tumor, and bladder tumor segmentation, demonstrate that ASSNet achieves state-of-the-art results. Code and models are available at: \url{https://github.com/lzeeorno/ASSNet}.

* 8 pages, 4 figures, 3 tables

Via

Access Paper or Ask Questions

PMSN: A Parallel Multi-compartment Spiking Neuron for Multi-scale Temporal Processing

Aug 27, 2024

Xinyi Chen, Jibin Wu, Chenxiang Ma, Yinsong Yan, Yujie Wu, Kay Chen Tan

Figure 1 for PMSN: A Parallel Multi-compartment Spiking Neuron for Multi-scale Temporal Processing

Figure 2 for PMSN: A Parallel Multi-compartment Spiking Neuron for Multi-scale Temporal Processing

Figure 3 for PMSN: A Parallel Multi-compartment Spiking Neuron for Multi-scale Temporal Processing

Figure 4 for PMSN: A Parallel Multi-compartment Spiking Neuron for Multi-scale Temporal Processing

Abstract:Spiking Neural Networks (SNNs) hold great potential to realize brain-inspired, energy-efficient computational systems. However, current SNNs still fall short in terms of multi-scale temporal processing compared to their biological counterparts. This limitation has resulted in poor performance in many pattern recognition tasks with information that varies across different timescales. To address this issue, we put forward a novel spiking neuron model called Parallel Multi-compartment Spiking Neuron (PMSN). The PMSN emulates biological neurons by incorporating multiple interacting substructures and allows for flexible adjustment of the substructure counts to effectively represent temporal information across diverse timescales. Additionally, to address the computational burden associated with the increased complexity of the proposed model, we introduce two parallelization techniques that decouple the temporal dependencies of neuronal updates, enabling parallelized training across different time steps. Our experimental results on a wide range of pattern recognition tasks demonstrate the superiority of PMSN. It outperforms other state-of-the-art spiking neuron models in terms of its temporal processing capacity, training speed, and computation cost. Specifically, compared with the commonly used Leaky Integrate-and-Fire neuron, PMSN offers a simulation acceleration of over 10 $\times$ and a 30 % improvement in accuracy on Sequential CIFAR10 dataset, while maintaining comparable computational cost.

Via

Access Paper or Ask Questions