Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rui Zhang

Henry

Evolutionary Spiking Neural Networks: A Survey

Jun 18, 2024

Shuaijie Shen, Rui Zhang, Chao Wang, Renzhuo Huang, Aiersi Tuerhong, Qinghai Guo, Zhichao Lu, Jianguo Zhang, Luziwei Leng

Abstract:Spiking neural networks (SNNs) are gaining increasing attention as potential computationally efficient alternatives to traditional artificial neural networks(ANNs). However, the unique information propagation mechanisms and the complexity of SNN neuron models pose challenges for adopting traditional methods developed for ANNs to SNNs. These challenges include both weight learning and architecture design. While surrogate gradient learning has shown some success in addressing the former challenge, the latter remains relatively unexplored. Recently, a novel paradigm utilizing evolutionary computation methods has emerged to tackle these challenges. This approach has resulted in the development of a variety of energy-efficient and high-performance SNNs across a wide range of machine learning benchmarks. In this paper, we present a survey of these works and initiate discussions on potential challenges ahead.

* J Membr Comput (2024)

Via

Access Paper or Ask Questions

CancerLLM: A Large Language Model in Cancer Domain

Jun 15, 2024

Mingchen Li, Anne Blaes, Steven Johnson, Hongfang Liu, Hua Xu, Rui Zhang

Figure 1 for CancerLLM: A Large Language Model in Cancer Domain

Figure 2 for CancerLLM: A Large Language Model in Cancer Domain

Figure 3 for CancerLLM: A Large Language Model in Cancer Domain

Figure 4 for CancerLLM: A Large Language Model in Cancer Domain

Abstract:Medical Large Language Models (LLMs) such as ClinicalCamel 70B, Llama3-OpenBioLLM 70B have demonstrated impressive performance on a wide variety of medical NLP task.However, there still lacks a large language model (LLM) specifically designed for cancer domain. Moreover, these LLMs typically have billions of parameters, making them computationally expensive for healthcare systems.Thus, in this study, we propose CancerLLM, a model with 7 billion parameters and a Mistral-style architecture, pre-trained on 2,676,642 clinical notes and 515,524 pathology reports covering 17 cancer types, followed by fine-tuning on three cancer-relevant tasks, including cancer phenotypes extraction, cancer diagnosis generation, and cancer treatment plan generation. Our evaluation demonstrated that CancerLLM achieves state-of-the-art results compared to other existing LLMs, with an average F1 score improvement of 8.1\%. Additionally, CancerLLM outperforms other models on two proposed robustness testbeds. This illustrates that CancerLLM can be effectively applied to clinical AI systems, enhancing clinical research and healthcare delivery in the field of cancer.

Via

Access Paper or Ask Questions

Rethinking Waveform for 6G: Harnessing Delay-Doppler Alignment Modulation

Jun 13, 2024

Zhiqiang Xiao, Xianda Liu, Yong Zeng, J. Andrew Zhang, Shi Jin, Rui Zhang

Figure 1 for Rethinking Waveform for 6G: Harnessing Delay-Doppler Alignment Modulation

Figure 2 for Rethinking Waveform for 6G: Harnessing Delay-Doppler Alignment Modulation

Figure 3 for Rethinking Waveform for 6G: Harnessing Delay-Doppler Alignment Modulation

Figure 4 for Rethinking Waveform for 6G: Harnessing Delay-Doppler Alignment Modulation

Abstract:Waveform design has served as a cornerstone for each generation of mobile communication systems. The future sixth-generation (6G) mobile communication networks are expected to employ larger-scale antenna arrays and exploit higher-frequency bands for further boosting data transmission rate and providing ubiquitous wireless sensing. This brings new opportunities and challenges for 6G waveform design. In this article, by leveraging the super spatial resolution of large antenna arrays and the multi-path spatial sparsity of highfrequency wireless channels, we introduce a new approach for waveform design based on the recently proposed delay-Doppler alignment modulation (DDAM). In particular, DDAM makes a paradigm shift of waveform design from the conventional manner of tolerating channel delay and Doppler spreads to actively manipulating them. First, we review the fundamental constraints and performance limitations of orthogonal frequency division multiplexing (OFDM) and introduce new opportunities for 6G waveform design. Next, the motivations and basic principles of DDAM are presented, followed by its various extensions to different wireless system setups. Finally, the main design considerations for DDAM are discussed and the new opportunities for future research are highlighted.

Via

Access Paper or Ask Questions

6DMA Enhanced Wireless Network with Flexible Antenna Position and Rotation: Opportunities and Challenges

Jun 11, 2024

Xiaodan Shao, Rui Zhang

Abstract:6DMA (six-dimensional movable antenna) is a new and revolutionizing technology that fully exploits the wireless channel spatial variation at the transmitter/receiver by flexibly adjusting the three-dimensional (3D) positions and 3D rotations of distributed antennas/antenna surfaces (arrays). In this article, we provide an overview of 6DMA for unveiling its great potential in wireless networks, including its motivation and competitive advantages over existing technologies, system/channel modeling, and practical implementation. In particular, we present a variety of 6DMA-enabled performance enhancement in terms of array gain, spatial multiplexing, interference suppression, and geometric gain. Furthermore, we illustrate the main applications of 6DMA in wireless communication and sensing, and elaborate their design challenges as well as promising solutions. Finally, numerical results are provided to demonstrate the significant capacity improvement of 6DMA-aided communication in wireless network.

* 8 pages, 5 figures, 1 table

Via

Access Paper or Ask Questions

W-Net: One-Shot Arbitrary-Style Chinese Character Generation with Deep Neural Networks

Jun 10, 2024

Haochuan Jiang, Guanyu Yang, Kaizhu Huang, Rui Zhang

Figure 1 for W-Net: One-Shot Arbitrary-Style Chinese Character Generation with Deep Neural Networks

Figure 2 for W-Net: One-Shot Arbitrary-Style Chinese Character Generation with Deep Neural Networks

Figure 3 for W-Net: One-Shot Arbitrary-Style Chinese Character Generation with Deep Neural Networks

Figure 4 for W-Net: One-Shot Arbitrary-Style Chinese Character Generation with Deep Neural Networks

Abstract:Due to the huge category number, the sophisticated combinations of various strokes and radicals, and the free writing or printing styles, generating Chinese characters with diverse styles is always considered as a difficult task. In this paper, an efficient and generalized deep framework, namely, the W-Net, is introduced for the one-shot arbitrary-style Chinese character generation task. Specifically, given a single character (one-shot) with a specific style (e.g., a printed font or hand-writing style), the proposed W-Net model is capable of learning and generating any arbitrary characters sharing the style similar to the given single character. Such appealing property was rarely seen in the literature. We have compared the proposed W-Net framework to many other competitive methods. Experimental results showed the proposed method is significantly superior in the one-shot setting.

* 2018, Neural Information Processing - 25th International Conference, ICONIP

Via

Access Paper or Ask Questions

RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering

Jun 09, 2024

Rui Zhang, Tianyue Luo, Weidong Yang, Ben Fei, Jingyi Xu, Qingyuan Zhou, Keyi Liu, Ying He

Figure 1 for RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering

Figure 2 for RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering

Figure 3 for RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering

Figure 4 for RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering

Abstract:3D Gaussian Splatting (3D-GS) has made a notable advancement in the field of neural rendering, 3D scene reconstruction, and novel view synthesis. Nevertheless, 3D-GS encounters the main challenge when it comes to accurately representing physical reflections, especially in the case of total reflection and semi-reflection that are commonly found in real-world scenes. This limitation causes reflections to be mistakenly treated as independent elements with physical presence, leading to imprecise reconstructions. Herein, to tackle this challenge, we propose RefGaussian to disentangle reflections from 3D-GS for realistically modeling reflections. Specifically, we propose to split a scene into transmitted and reflected components and represent these components using two Spherical Harmonics (SH). Given that this decomposition is not fully determined, we employ local regularization techniques to ensure local smoothness for both the transmitted and reflected components, thereby achieving more plausible decomposition outcomes than 3D-GS. Experimental results demonstrate that our approach achieves superior novel view synthesis and accurate depth estimation outcomes. Furthermore, it enables the utilization of scene editing applications, ensuring both high-quality results and physical coherence.

Via

Access Paper or Ask Questions

Denoising-Aware Contrastive Learning for Noisy Time Series

Jun 07, 2024

Shuang Zhou, Daochen Zha, Xiao Shen, Xiao Huang, Rui Zhang, Fu-Lai Chung

Figure 1 for Denoising-Aware Contrastive Learning for Noisy Time Series

Figure 2 for Denoising-Aware Contrastive Learning for Noisy Time Series

Figure 3 for Denoising-Aware Contrastive Learning for Noisy Time Series

Figure 4 for Denoising-Aware Contrastive Learning for Noisy Time Series

Abstract:Time series self-supervised learning (SSL) aims to exploit unlabeled data for pre-training to mitigate the reliance on labels. Despite the great success in recent years, there is limited discussion on the potential noise in the time series, which can severely impair the performance of existing SSL methods. To mitigate the noise, the de facto strategy is to apply conventional denoising methods before model training. However, this pre-processing approach may not fully eliminate the effect of noise in SSL for two reasons: (i) the diverse types of noise in time series make it difficult to automatically determine suitable denoising methods; (ii) noise can be amplified after mapping raw data into latent space. In this paper, we propose denoising-aware contrastive learning (DECL), which uses contrastive learning objectives to mitigate the noise in the representation and automatically selects suitable denoising methods for every sample. Extensive experiments on various datasets verify the effectiveness of our method. The code is open-sourced.

* Accepted to 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

Via

Access Paper or Ask Questions

Prompt-based Visual Alignment for Zero-shot Policy Transfer

Jun 05, 2024

Haihan Gao, Rui Zhang, Qi Yi, Hantao Yao, Haochen Li, Jiaming Guo, Shaohui Peng, Yunkai Gao, QiCheng Wang, Xing Hu(+6 more)

Figure 1 for Prompt-based Visual Alignment for Zero-shot Policy Transfer

Figure 2 for Prompt-based Visual Alignment for Zero-shot Policy Transfer

Figure 3 for Prompt-based Visual Alignment for Zero-shot Policy Transfer

Figure 4 for Prompt-based Visual Alignment for Zero-shot Policy Transfer

Abstract:Overfitting in RL has become one of the main obstacles to applications in reinforcement learning(RL). Existing methods do not provide explicit semantic constrain for the feature extractor, hindering the agent from learning a unified cross-domain representation and resulting in performance degradation on unseen domains. Besides, abundant data from multiple domains are needed. To address these issues, in this work, we propose prompt-based visual alignment (PVA), a robust framework to mitigate the detrimental domain bias in the image for zero-shot policy transfer. Inspired that Visual-Language Model (VLM) can serve as a bridge to connect both text space and image space, we leverage the semantic information contained in a text sequence as an explicit constraint to train a visual aligner. Thus, the visual aligner can map images from multiple domains to a unified domain and achieve good generalization performance. To better depict semantic information, prompt tuning is applied to learn a sequence of learnable tokens. With explicit constraints of semantic information, PVA can learn unified cross-domain representation under limited access to cross-domain data and achieves great zero-shot generalization ability in unseen domains. We verify PVA on a vision-based autonomous driving task with CARLA simulator. Experiments show that the agent generalizes well on unseen domains under limited access to multi-domain data.

* This paper has been accepted by ICML2024

Via

Access Paper or Ask Questions

ST-DPGAN: A Privacy-preserving Framework for Spatiotemporal Data Generation

Jun 04, 2024

Wei Shao, Rongyi Zhu, Cai Yang, Chandra Thapa, Muhammad Ejaz Ahmed, Seyit Camtepe, Rui Zhang, DuYong Kim, Hamid Menouar, Flora D. Salim

Figure 1 for ST-DPGAN: A Privacy-preserving Framework for Spatiotemporal Data Generation

Figure 2 for ST-DPGAN: A Privacy-preserving Framework for Spatiotemporal Data Generation

Figure 3 for ST-DPGAN: A Privacy-preserving Framework for Spatiotemporal Data Generation

Figure 4 for ST-DPGAN: A Privacy-preserving Framework for Spatiotemporal Data Generation

Abstract:Spatiotemporal data is prevalent in a wide range of edge devices, such as those used in personal communication and financial transactions. Recent advancements have sparked a growing interest in integrating spatiotemporal analysis with large-scale language models. However, spatiotemporal data often contains sensitive information, making it unsuitable for open third-party access. To address this challenge, we propose a Graph-GAN-based model for generating privacy-protected spatiotemporal data. Our approach incorporates spatial and temporal attention blocks in the discriminator and a spatiotemporal deconvolution structure in the generator. These enhancements enable efficient training under Gaussian noise to achieve differential privacy. Extensive experiments conducted on three real-world spatiotemporal datasets validate the efficacy of our model. Our method provides a privacy guarantee while maintaining the data utility. The prediction model trained on our generated data maintains a competitive performance compared to the model trained on the original data.

Via

Access Paper or Ask Questions

Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Jun 04, 2024

Yusen Zhang, Ruoxi Sun, Yanfei Chen, Tomas Pfister, Rui Zhang, Sercan Ö. Arik

Figure 1 for Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Figure 2 for Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Figure 3 for Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Figure 4 for Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Abstract:Addressing the challenge of effectively processing long contexts has become a critical issue for Large Language Models (LLMs). Two common strategies have emerged: 1) reducing the input length, such as retrieving relevant chunks by Retrieval-Augmented Generation (RAG), and 2) expanding the context window limit of LLMs. However, both strategies have drawbacks: input reduction has no guarantee of covering the part with needed information, while window extension struggles with focusing on the pertinent information for solving the task. To mitigate these limitations, we propose Chain-of-Agents (CoA), a novel framework that harnesses multi-agent collaboration through natural language to enable information aggregation and context reasoning across various LLMs over long-context tasks. CoA consists of multiple worker agents who sequentially communicate to handle different segmented portions of the text, followed by a manager agent who synthesizes these contributions into a coherent final output. CoA processes the entire input by interleaving reading and reasoning, and it mitigates long context focus issues by assigning each agent a short context. We perform comprehensive evaluation of CoA on a wide range of long-context tasks in question answering, summarization, and code completion, demonstrating significant improvements by up to 10% over strong baselines of RAG, Full-Context, and multi-agent LLMs.

* 19 pages, 6 figures

Via

Access Paper or Ask Questions