Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Fei He

Retrieving Filter Spectra in CNN for Explainable Sleep Stage Classification

Feb 10, 2025

Stephan Goerttler, Yucheng Wang, Emadeldeen Eldele, Fei He, Min Wu

Abstract:Despite significant advances in deep learning-based sleep stage classification, the clinical adoption of automatic classification models remains slow. One key challenge is the lack of explainability, as many models function as black boxes with millions of parameters. In response, recent work has increasingly focussed on enhancing model explainability. This study contributes to these efforts by globally explaining spectral processing of individual EEG channels. Specifically, we introduce a method to retrieve the filter spectrum of low-level convolutional feature extraction and compare it with the classification-relevant spectral information in the data. We evaluate our approach on the MSA-CNN model using the ISRUC-S3 and Sleep-EDF-20 datasets. Our findings show that spectral processing plays a significant role in the lower frequency bands. In addition, comparing the correlation between filter spectrum and data-based spectral information with univariate performance indicates that the model naturally prioritises the most informative channels in a multimodal setting. We specify how these insights can be leveraged to enhance model performance. The code for the filter spectrum retrieval and its analysis is available at https://github.com/sgoerttler/MSA-CNN.

* 5 pages, 3 figures, conference paper

Via

Access Paper or Ask Questions

MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification

Jan 06, 2025

Stephan Goerttler, Yucheng Wang, Emadeldeen Eldele, Min Wu, Fei He

Figure 1 for MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification

Figure 2 for MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification

Figure 3 for MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification

Figure 4 for MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification

Abstract:Recent advancements in machine learning-based signal analysis, coupled with open data initiatives, have fuelled efforts in automatic sleep stage classification. Despite the proliferation of classification models, few have prioritised reducing model complexity, which is a crucial factor for practical applications. In this work, we introduce Multi-Scale and Attention Convolutional Neural Network (MSA-CNN), a lightweight architecture featuring as few as ~10,000 parameters. MSA-CNN leverages a novel multi-scale module employing complementary pooling to eliminate redundant filter parameters and dense convolutions. Model complexity is further reduced by separating temporal and spatial feature extraction and using cost-effective global spatial convolutions. This separation of tasks not only reduces model complexity but also mirrors the approach used by human experts in sleep stage scoring. We evaluated both small and large configurations of MSA-CNN against nine state-of-the-art baseline models across three public datasets, treating univariate and multivariate models separately. Our evaluation, based on repeated cross-validation and re-evaluation of all baseline models, demonstrated that the large MSA-CNN outperformed all baseline models on all three datasets in terms of accuracy and Cohen's kappa, despite its significantly reduced parameter count. Lastly, we explored various model variants and conducted an in-depth analysis of the key modules and techniques, providing deeper insights into the underlying mechanisms. The code for our models, baselines, and evaluation procedures is available at https://github.com/sgoerttler/MSA-CNN.

* 10 pages, 6 figures, journal paper

Via

Access Paper or Ask Questions

EEG-GMACN: Interpretable EEG Graph Mutual Attention Convolutional Network

Dec 15, 2024

Haili Ye, Stephan Goerttler, Fei He

Figure 1 for EEG-GMACN: Interpretable EEG Graph Mutual Attention Convolutional Network

Figure 2 for EEG-GMACN: Interpretable EEG Graph Mutual Attention Convolutional Network

Figure 3 for EEG-GMACN: Interpretable EEG Graph Mutual Attention Convolutional Network

Figure 4 for EEG-GMACN: Interpretable EEG Graph Mutual Attention Convolutional Network

Abstract:Electroencephalogram (EEG) is a valuable technique to record brain electrical activity through electrodes placed on the scalp. Analyzing EEG signals contributes to the understanding of neurological conditions and developing brain-computer interface. Graph Signal Processing (GSP) has emerged as a promising method for EEG spatial-temporal analysis, by further considering the topological relationships between electrodes. However, existing GSP studies lack interpretability of electrode importance and the credibility of prediction confidence. This work proposes an EEG Graph Mutual Attention Convolutional Network (EEG-GMACN), by introducing an 'Inverse Graph Weight Module' to output interpretable electrode graph weights, enhancing the clinical credibility and interpretability of EEG classification results. Additionally, we incorporate a mutual attention mechanism module into the model to improve its capability to distinguish critical electrodes and introduce credibility calibration to assess the uncertainty of prediction results. This study enhances the transparency and effectiveness of EEG analysis, paving the way for its widespread use in clinical and neuroscience research.

Via

Access Paper or Ask Questions

NonSysId: A nonlinear system identification package with improved model term selection for NARMAX models

Nov 25, 2024

Rajintha Gunawardena, Zi-Qiang Lang, Fei He

Figure 1 for NonSysId: A nonlinear system identification package with improved model term selection for NARMAX models

Figure 2 for NonSysId: A nonlinear system identification package with improved model term selection for NARMAX models

Figure 3 for NonSysId: A nonlinear system identification package with improved model term selection for NARMAX models

Figure 4 for NonSysId: A nonlinear system identification package with improved model term selection for NARMAX models

Abstract:System identification involves constructing mathematical models of dynamic systems using input-output data, enabling analysis and prediction of system behaviour in both time and frequency domains. This approach can model the entire system or capture specific dynamics within it. For meaningful analysis, it is essential for the model to accurately reflect the underlying system's behaviour. This paper introduces NonSysId, an open-sourced MATLAB software package designed for nonlinear system identification, specifically focusing on NARMAX models. The software incorporates an advanced term selection methodology that prioritises on simulation (free-run) accuracy while preserving model parsimony. A key feature is the integration of iterative Orthogonal Forward Regression (iOFR) with Predicted Residual Sum of Squares (PRESS) statistic-based term selection, facilitating robust model generalisation without the need for a separate validation dataset. Furthermore, techniques for reducing computational overheads are implemented. These features make NonSysId particularly suitable for real-time applications such as structural health monitoring, fault diagnosis, and biomedical signal processing, where it is a challenge to capture the signals under consistent conditions, resulting in limited or no validation data.

* 14 pages, 7 figures

Via

Access Paper or Ask Questions

Balancing Spectral, Temporal and Spatial Information for EEG-based Alzheimer's Disease Classification

Feb 21, 2024

Stephan Goerttler, Fei He, Min Wu

Figure 1 for Balancing Spectral, Temporal and Spatial Information for EEG-based Alzheimer's Disease Classification

Figure 2 for Balancing Spectral, Temporal and Spatial Information for EEG-based Alzheimer's Disease Classification

Figure 3 for Balancing Spectral, Temporal and Spatial Information for EEG-based Alzheimer's Disease Classification

Abstract:The prospect of future treatment warrants the development of cost-effective screening for Alzheimer's disease (AD). A promising candidate in this regard is electroencephalography (EEG), as it is one of the most economic imaging modalities. Recent efforts in EEG analysis have shifted towards leveraging spatial information, employing novel frameworks such as graph signal processing or graph neural networks. Here, we systematically investigate the importance of spatial information relative to spectral or temporal information by varying the proportion of each dimension for AD classification. To do so, we test various dimension resolution configurations on two routine EEG datasets. We find that spatial information is consistently more relevant than temporal information and equally relevant as spectral information. These results emphasise the necessity to consider spatial information for EEG-based AD classification. On our second dataset, we further find that well-balanced feature resolutions boost classification accuracy by up to 1.6%. Our resolution-based feature extraction has the potential to improve AD classification specifically, and multivariate signal classification generally.

* 4 pages, 3 figures, conference paper

Via

Access Paper or Ask Questions

Stochastic Graph Heat Modelling for Diffusion-based Connectivity Retrieval

Feb 20, 2024

Stephan Goerttler, Fei He, Min Wu

Abstract:Heat diffusion describes the process by which heat flows from areas with higher temperatures to ones with lower temperatures. This concept was previously adapted to graph structures, whereby heat flows between nodes of a graph depending on the graph topology. Here, we combine the graph heat equation with the stochastic heat equation, which ultimately yields a model for multivariate time signals on a graph. We show theoretically how the model can be used to directly compute the diffusion-based connectivity structure from multivariate signals. Unlike other connectivity measures, our heat model-based approach is inherently multivariate and yields an absolute scaling factor, namely the graph thermal diffusivity, which captures the extent of heat-like graph propagation in the data. On two datasets, we show how the graph thermal diffusivity can be used to characterise Alzheimer's disease. We find that the graph thermal diffusivity is lower for Alzheimer's patients than healthy controls and correlates with dementia scores, suggesting structural impairment in patients in line with previous findings.

* 4 pages, 1 figure, conference paper

Via

Access Paper or Ask Questions

Deep Automated Mechanism Design for Integrating Ad Auction and Allocation in Feed

Jan 03, 2024

Xuejian Li, Ze Wang, Bingqi Zhu, Fei He, Yongkang Wang, Xingxing Wang

Figure 1 for Deep Automated Mechanism Design for Integrating Ad Auction and Allocation in Feed

Figure 2 for Deep Automated Mechanism Design for Integrating Ad Auction and Allocation in Feed

Figure 3 for Deep Automated Mechanism Design for Integrating Ad Auction and Allocation in Feed

Figure 4 for Deep Automated Mechanism Design for Integrating Ad Auction and Allocation in Feed

Abstract:E-commerce platforms usually present an ordered list, mixed with several organic items and an advertisement, in response to each user's page view request. This list, the outcome of ad auction and allocation processes, directly impacts the platform's ad revenue and gross merchandise volume (GMV). Specifically, the ad auction determines which ad is displayed and the corresponding payment, while the ad allocation decides the display positions of the advertisement and organic items. The prevalent methods of segregating the ad auction and allocation into two distinct stages face two problems: 1) Ad auction does not consider externalities, such as the influence of actual display position and context on ad Click-Through Rate (CTR); 2) The ad allocation, which utilizes the auction-winning ad's payment to determine the display position dynamically, fails to maintain incentive compatibility (IC) for the advertisement. For instance, in the auction stage employing the traditional Generalized Second Price (GSP) , even if the winning ad increases its bid, its payment remains unchanged. This implies that the advertisement cannot secure a better position and thus loses the opportunity to achieve higher utility in the subsequent ad allocation stage. Previous research often focused on one of the two stages, neglecting the two-stage problem, which may result in suboptimal outcomes...

* 9 pages, 2 figures, Posting

Via

Access Paper or Ask Questions

Understanding Concepts in Graph Signal Processing for Neurophysiological Signal Analysis

Dec 06, 2023

Stephan Goerttler, Fei He, Min Wu

Abstract:Multivariate signals, which are measured simultaneously over time and acquired by sensor networks, are becoming increasingly common. The emerging field of graph signal processing (GSP) promises to analyse spectral characteristics of these multivariate signals, while at the same time taking the spatial structure between the time signals into account. A central idea in GSP is the graph Fourier transform, which projects a multivariate signal onto frequency-ordered graph Fourier modes, and can therefore be regarded as a spatial analog of the temporal Fourier transform. This chapter derives and discusses key concepts in GSP, with a specific focus on how the various concepts relate to one another. The experimental section focuses on the role of graph frequency in data classification, with applications to neuroimaging. To address the limited sample size of neurophysiological datasets, we introduce a minimalist simulation framework that can generate arbitrary amounts of data. Using this artificial data, we find that lower graph frequency signals are less suitable for classifying neurophysiological data as compared to higher graph frequency signals. Finally, we introduce a baseline testing framework for GSP. Employing this framework, our results suggest that GSP applications may attenuate spectral characteristics in the signals, highlighting current limitations of GSP for neuroimaging.

* 18 pages, 7 figures, book chapter

Via

Access Paper or Ask Questions

Graph Neural Network-based EEG Classification: A Survey

Oct 03, 2023

Dominik Klepl, Min Wu, Fei He

Figure 1 for Graph Neural Network-based EEG Classification: A Survey

Figure 2 for Graph Neural Network-based EEG Classification: A Survey

Figure 3 for Graph Neural Network-based EEG Classification: A Survey

Figure 4 for Graph Neural Network-based EEG Classification: A Survey

Abstract:Graph neural networks (GNN) are increasingly used to classify EEG for tasks such as emotion recognition, motor imagery and neurological diseases and disorders. A wide range of methods have been proposed to design GNN-based classifiers. Therefore, there is a need for a systematic review and categorisation of these approaches. We exhaustively search the published literature on this topic and derive several categories for comparison. These categories highlight the similarities and differences among the methods. The results suggest a prevalence of spectral graph convolutional layers over spatial. Additionally, we identify standard forms of node features, with the most popular being the raw EEG signal and differential entropy. Our results summarise the emerging trends in GNN-based approaches for EEG classification. Finally, we discuss several promising research directions, such as exploring the potential of transfer learning methods and appropriate modelling of cross-frequency interactions.

* 14 pages, 3 figures

Via

Access Paper or Ask Questions

Model-Free Market Risk Hedging Using Crowding Networks

Jun 13, 2023

Vadim Zlotnikov, Jiayu Liu, Igor Halperin, Fei He, Lisa Huang

Figure 1 for Model-Free Market Risk Hedging Using Crowding Networks

Figure 2 for Model-Free Market Risk Hedging Using Crowding Networks

Figure 3 for Model-Free Market Risk Hedging Using Crowding Networks

Figure 4 for Model-Free Market Risk Hedging Using Crowding Networks

Abstract:Crowding is widely regarded as one of the most important risk factors in designing portfolio strategies. In this paper, we analyze stock crowding using network analysis of fund holdings, which is used to compute crowding scores for stocks. These scores are used to construct costless long-short portfolios, computed in a distribution-free (model-free) way and without using any numerical optimization, with desirable properties of hedge portfolios. More specifically, these long-short portfolios provide protection for both small and large market price fluctuations, due to their negative correlation with the market and positive convexity as a function of market returns. By adding our long-short portfolio to a baseline portfolio such as a traditional 60/40 portfolio, our method provides an alternative way to hedge portfolio risk including tail risk, which does not require costly option-based strategies or complex numerical optimization. The total cost of such hedging amounts to the total cost of rebalancing the hedge portfolio.

* 8 pages, 6 figures

Via

Access Paper or Ask Questions