Hongzhi Wang

Harbin Institute of Technology

Modern Hopfield Networks meet Encoded Neural Representations -- Addressing Practical Considerations

Sep 24, 2024

Satyananda Kashyap, Niharika S. D'Souza, Luyao Shi, Ken C. L. Wong, Hongzhi Wang, Tanveer Syeda-Mahmood

Abstract: Content-addressable memories such as Modern Hopfield Networks (MHN) have been studied as mathematical models of auto-association and storage/retrieval in human declarative memory, yet their practical use for large-scale content storage faces challenges. Chief among them is the occurrence of meta-stable states, particularly when handling large amounts of high-dimensional content. This paper introduces Hopfield Encoding Networks (HEN), a framework that integrates encoded neural representations into MHNs to improve pattern separability and reduce meta-stable states. We show that HEN can also be used for retrieval in the context of hetero-association of images with natural language queries, thus removing the limitation of requiring access to partial content in the same domain. Experimental results demonstrate a substantial reduction in meta-stable states and increased storage capacity while still enabling perfect recall of a significantly larger number of inputs, advancing the practical utility of associative memory networks for real-world tasks.

* 17 pages, 8 figures, workshop submission to NeurIPS
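
For readers unfamiliar with content-addressable retrieval, the following is a minimal sketch of the widely used softmax update rule for a Modern Hopfield Network. It is not the HEN method from the paper: no encoder is involved, and the function names, the toy patterns, and the `beta` and `steps` values are assumptions chosen for illustration. It shows clean recall from a partial query at a high inverse temperature, and a blurred mixture of stored patterns (one way a meta-stable state can appear) at a very low one.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def hopfield_retrieve(patterns, query, beta=8.0, steps=3):
    """Iterate state <- sum_i softmax(beta * <p_i, state>)_i * p_i.

    patterns: list of stored vectors (hypothetical toy memory)
    query:    partial or noisy vector of the same length
    beta:     inverse temperature (assumed value, not from the paper)
    """
    state = list(query)
    for _ in range(steps):
        # Similarity of the current state to every stored pattern.
        scores = [beta * sum(p_d * s_d for p_d, s_d in zip(p, state))
                  for p in patterns]
        weights = softmax(scores)
        # New state is the attention-weighted average of stored patterns.
        state = [sum(w * p[d] for w, p in zip(weights, patterns))
                 for d in range(len(state))]
    return state


patterns = [
    [1.0, 1.0, -1.0, -1.0],
    [-1.0, 1.0, 1.0, -1.0],
    [1.0, -1.0, 1.0, 1.0],
]
noisy = [1.0, 1.0, -1.0, 0.0]  # first pattern with its last entry erased

recalled = hopfield_retrieve(patterns, noisy)            # sharp recall
blurred = hopfield_retrieve(patterns, noisy, beta=0.01)  # mixture of patterns
```

With the high `beta`, `recalled` rounds back to the first stored pattern; with the very low `beta`, `blurred` stays close to the average of all three patterns and matches none of them.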

An Unsupervised Learning Framework Combined with Heuristics for the Maximum Minimal Cut Problem

Aug 16, 2024

Huaiyuan Liu, Xianzhang Liu, Donghua Yang, Hongzhi Wang, Yingchi Long, Mengtong Ji, Dongjing Miao, Zhiyu Liang

Abstract: The Maximum Minimal Cut Problem (MMCP), an NP-hard combinatorial optimization (CO) problem, has not received much attention due to the demanding and challenging bi-connectivity constraint. Moreover, as a CO problem, it is also a daunting task for machine learning, especially without labeled instances. To deal with these problems, this work proposes an unsupervised learning framework combined with heuristics for MMCP that can provide valid and high-quality solutions. As far as we know, this is the first work that explores machine learning and heuristics to solve MMCP. The unsupervised solver is inspired by a relaxation-plus-rounding approach: the relaxed solution is parameterized by graph neural networks, and the cost and penalty of MMCP are written out explicitly, which allows the model to be trained end-to-end. A crucial observation is that each solution corresponds to at least one spanning tree. Based on this finding, a heuristic solver that implements tree transformations by adding vertices is utilized to repair and improve the solution quality of the unsupervised solver. Alternatively, the graph is simplified while guaranteeing solution consistency, which reduces the running time. We conduct extensive experiments to evaluate our framework and give a specific application. The results demonstrate the superiority of our method against two techniques designed.

RTFormer: Re-parameter TSBN Spiking Transformer

Jun 20, 2024

Hongzhi Wang, Xiubo Liang, Mengjian Li, Tao Zhang

Abstract: Spiking Neural Networks (SNNs), renowned for their bio-inspired operational mechanism and energy efficiency, mirror the human brain's neural activity. Yet SNNs face challenges in balancing energy efficiency with the computational demands of advanced tasks. Our research introduces the RTFormer, a novel architecture that embeds Re-parameterized Temporal Sliding Batch Normalization (TSBN) within the Spiking Transformer framework. This innovation optimizes energy usage during inference while ensuring robust computational performance. The crux of RTFormer lies in its integration of reparameterized convolutions and TSBN, achieving an equilibrium between computational prowess and energy conservation.

Self-Regulated Data-Free Knowledge Amalgamation for Text Classification

Jun 16, 2024

Prashanth Vijayaraghavan, Hongzhi Wang, Luyao Shi, Tyler Baldwin, David Beymer, Ehsan Degan

Abstract: Recently, there has been a growing availability of pre-trained text models on various model repositories. These models greatly reduce the cost of training new models from scratch as they can be fine-tuned for specific tasks or trained on large datasets. However, these datasets may not be publicly accessible due to privacy, security, or intellectual property issues. In this paper, we aim to develop a lightweight student network that can learn from multiple teacher models without accessing their original training data. Hence, we investigate Data-Free Knowledge Amalgamation (DFKA), a knowledge-transfer task that combines insights from multiple pre-trained teacher models and transfers them effectively to a compact student network. To accomplish this, we propose STRATANET, a modeling framework comprising: (a) a steerable data generator that produces text data tailored to each teacher and (b) an amalgamation module that implements a self-regulative strategy using confidence estimates from the teachers' different layers to selectively integrate their knowledge and train a versatile student. We evaluate our method on three benchmark text classification datasets with varying labels or domains. Empirically, we demonstrate that the student model learned using our STRATANET significantly outperforms several baselines under data-driven and data-free constraints.

* 12 pages, 5 Figures, Proceedings of NAACL 2024

IntraMix: Intra-Class Mixup Generation for Accurate Labels and Neighbors

May 02, 2024

Shenghe Zheng, Hongzhi Wang, Xianglong Liu

Abstract: Graph Neural Networks (GNNs) demonstrate excellent performance on graphs, with their core idea being to aggregate neighborhood information and learn from labels. However, the prevailing challenges in most graph datasets are twofold: insufficient high-quality labels and a lack of neighborhoods, resulting in weak GNNs. Existing data augmentation methods designed to address these two issues often tackle only one. They may either require extensive training of generators, rely on overly simplistic strategies, or demand substantial prior knowledge, leading to suboptimal generalization abilities. To address both challenges simultaneously, we propose an elegant method called IntraMix. IntraMix innovatively employs Mixup among low-quality labeled data of the same class, generating high-quality labeled data at minimal cost. Additionally, it establishes neighborhoods for the generated data by connecting them with high-confidence data from the same class, thereby enriching the neighborhoods of graphs. IntraMix efficiently tackles both challenges faced by graphs and challenges the prior notion of the limited effectiveness of Mixup in node classification. IntraMix serves as a universal framework that can be readily applied to all GNNs. Extensive experiments demonstrate the effectiveness of IntraMix across various GNNs and datasets.

* 18 pages
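
As a rough illustration of the Mixup step mentioned in the abstract, the sketch below blends feature vectors only between samples that share a label, so the generated sample keeps that label. It is a generic intra-class Mixup under assumed names and values (`intra_class_mixup`, `alpha`, the toy features), not the IntraMix implementation; pseudo-labeling and the neighborhood construction described in the abstract are left out.

```python
import random


def intra_class_mixup(features, labels, alpha=2.0, seed=0):
    """Generate one mixed sample per input by blending it with a random
    sample of the same class. All names and defaults are illustrative."""
    rng = random.Random(seed)

    # Group feature vectors by their (possibly noisy) class label.
    by_class = {}
    for x, y in zip(features, labels):
        by_class.setdefault(y, []).append(x)

    generated = []
    for y, members in by_class.items():
        if len(members) < 2:
            continue  # nothing to mix within a singleton class
        for x_a in members:
            x_b = rng.choice(members)
            lam = rng.betavariate(alpha, alpha)  # mixing ratio in (0, 1)
            mixed = [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]
            generated.append((mixed, y))  # label is unchanged by design
    return generated


features = [[0.0, 0.0], [2.0, 2.0], [10.0, 10.0]]
labels = [0, 0, 1]
generated = intra_class_mixup(features, labels)
```

Because both parents share a class, the mixed sample is a convex combination inside that class, which is why no soft label is needed in this simplified version.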

TimeCSL: Unsupervised Contrastive Learning of General Shapelets for Explorable Time Series Analysis

Apr 07, 2024

Zhiyu Liang, Chen Liang, Zheng Liang, Hongzhi Wang, Bo Zheng

Abstract: Unsupervised (a.k.a. self-supervised) representation learning (URL) has emerged as a new paradigm for time series analysis because it can learn generalizable time series representations that benefit many downstream tasks without using labels, which are usually difficult to obtain. Considering that existing approaches have limitations in the design of the representation encoder and the learning objective, we have proposed Contrastive Shapelet Learning (CSL), the first URL method that learns a general-purpose shapelet-based representation through unsupervised contrastive learning, and shown its superior performance in several analysis tasks, such as time series classification, clustering, and anomaly detection. In this paper, we develop TimeCSL, an end-to-end system that makes full use of the general and interpretable shapelets learned by CSL to achieve explorable time series analysis in a unified pipeline. We introduce the system components and demonstrate how users interact with TimeCSL to solve different analysis tasks in the unified pipeline and gain insight into their time series by exploring the learned shapelets and representation.

Unsupervised Multi-modal Feature Alignment for Time Series Representation Learning

Dec 09, 2023

Chen Liang, Donghua Yang, Zhiyu Liang, Hongzhi Wang, Zheng Liang, Xiyang Zhang, Jianfeng Huang

Abstract: In recent times, the field of unsupervised representation learning (URL) for time series data has garnered significant interest due to its remarkable adaptability across diverse downstream applications. Unsupervised learning goals differ from downstream tasks, making it tricky to ensure downstream task utility by focusing only on temporal feature characterization. Researchers have proposed multiple transformations to extract discriminative patterns implied in informative time series, trying to fill the gap. Despite the introduction of a variety of feature engineering techniques (e.g., spectral-domain features, wavelet-transformed features, features in image form, and symbolic features), the use of intricate feature fusion methods and the dependence on heterogeneous features during inference hamper the scalability of the solutions. To address this, our study introduces an innovative approach that focuses on aligning and binding time series representations encoded from different modalities, inspired by spectral graph theory, thereby guiding the neural encoder to uncover latent pattern associations among these multi-modal features. In contrast to conventional methods that fuse features from multiple modalities, our proposed approach simplifies the neural architecture by retaining a single time series encoder, thereby preserving scalability. We further demonstrate and prove mechanisms for the encoder to maintain better inductive bias. In our experimental evaluation, we validated the proposed method on a diverse set of time series datasets from various domains. Our approach outperforms existing state-of-the-art URL methods across diverse downstream tasks.

Image-Based Soil Organic Carbon Remote Sensing from Satellite Images with Fourier Neural Operator and Structural Similarity

Nov 21, 2023

Ken C. L. Wong, Levente Klein, Ademir Ferreira da Silva, Hongzhi Wang, Jitendra Singh, Tanveer Syeda-Mahmood

Abstract: Soil organic carbon (SOC) sequestration is the transfer and storage of atmospheric carbon dioxide in soils, which plays an important role in climate change mitigation. SOC concentration can be improved by proper land use, so it is beneficial if SOC can be estimated at a regional or global scale. As multispectral satellite data can provide SOC-related information such as vegetation and soil properties at a global scale, estimation of SOC through satellite data has been explored as an alternative to manual soil sampling. Although existing studies show promising results, they are mainly based on pixel-based approaches with traditional machine learning methods, and convolutional neural networks (CNNs) are uncommon. To study the use of CNNs on SOC remote sensing, here we propose the FNO-DenseNet based on the Fourier neural operator (FNO). By combining the advantages of the FNO and DenseNet, the FNO-DenseNet outperformed the FNO in our experiments with hundreds of times fewer parameters. The FNO-DenseNet also outperformed a pixel-based random forest by 18% in the mean absolute percentage error.

* This paper was accepted by the 2023 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2023)

FNOSeg3D: Resolution-Robust 3D Image Segmentation with Fourier Neural Operator

Oct 05, 2023

Ken C. L. Wong, Hongzhi Wang, Tanveer Syeda-Mahmood

Abstract: Due to the computational complexity of 3D medical image segmentation, training with downsampled images is a common remedy for out-of-memory errors in deep learning. Nevertheless, as standard spatial convolution is sensitive to variations in image resolution, the accuracy of a convolutional neural network trained with downsampled images can be suboptimal when applied to the original resolution. To address this limitation, we introduce FNOSeg3D, a 3D segmentation model robust to training image resolution based on the Fourier neural operator (FNO). The FNO is a deep learning framework for learning mappings between functions in partial differential equations, which has the appealing properties of zero-shot super-resolution and global receptive field. We improve the FNO by reducing its parameter requirement and enhancing its learning capability through residual connections and deep supervision, resulting in our FNOSeg3D model, which is parameter-efficient and resolution-robust. When tested on the BraTS'19 dataset, it was more robust to training image resolution than the other tested models, with less than 1% of their model parameters.

* This paper was accepted by the IEEE International Symposium on Biomedical Imaging (ISBI) 2023

HartleyMHA: Self-Attention in Frequency Domain for Resolution-Robust and Parameter-Efficient 3D Image Segmentation

Oct 05, 2023

Ken C. L. Wong, Hongzhi Wang, Tanveer Syeda-Mahmood

Abstract: With the introduction of Transformers, different attention-based models have been proposed for image segmentation with promising results. Although self-attention allows capturing of long-range dependencies, it suffers from quadratic complexity in the image size, especially in 3D. To avoid out-of-memory errors during training, input size reduction is usually required for 3D segmentation, but the accuracy can be suboptimal when the trained models are applied to the original image size. To address this limitation, inspired by the Fourier neural operator (FNO), we introduce the HartleyMHA model, which is robust to training image resolution with efficient self-attention. FNO is a deep learning framework for learning mappings between functions in partial differential equations, which has the appealing properties of zero-shot super-resolution and global receptive field. We modify the FNO by using the Hartley transform with shared parameters to reduce the model size by orders of magnitude, and this allows us to further apply self-attention in the frequency domain for more expressive high-order feature combination with improved efficiency. When tested on the BraTS'19 dataset, it was more robust to training image resolution than the other tested models, with less than 1% of their model parameters.

* This paper was accepted by the International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2023). arXiv admin note: text overlap with arXiv:2310.03872
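
The abstract relies on the Hartley transform as a stand-in for the Fourier transform. As a small aid, the sketch below computes a direct 1D discrete Hartley transform, which maps real inputs to real outputs and is its own inverse up to a factor of 1/N. The function names and the toy signal are assumptions for illustration; this is a plain O(N²) definition, not the 3D transform, shared-parameter scheme, or attention layers of the HartleyMHA model.

```python
import math


def dht(x):
    """Discrete Hartley transform: H[k] = sum_t x[t] * cas(2*pi*k*t/N),
    where cas(a) = cos(a) + sin(a). Direct O(N^2) evaluation."""
    n = len(x)
    out = []
    for k in range(n):
        acc = 0.0
        for t in range(n):
            angle = 2.0 * math.pi * k * t / n
            acc += x[t] * (math.cos(angle) + math.sin(angle))
        out.append(acc)
    return out


def idht(h):
    """Inverse DHT: the same transform, scaled by 1/N."""
    n = len(h)
    return [v / n for v in dht(h)]


signal = [1.0, 2.0, 3.0, 4.0]
spectrum = dht(signal)     # real-valued frequency coefficients
recovered = idht(spectrum)  # matches `signal` up to rounding error
```

Unlike the discrete Fourier transform, no complex numbers appear at any point, which is one reason a real-valued transform can be convenient when the frequency coefficients feed further real-valued layers.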
