Hau-Tieng Wu

Accelerate Vector Diffusion Maps by Landmarks

Mar 22, 2026

Sing-Yuan Yeh, Yi-An Wu, Hau-Tieng Wu, Mao-Pei Tsui

Abstract: We propose a landmark-constrained algorithm, LA-VDM (Landmark Accelerated Vector Diffusion Maps), to accelerate the Vector Diffusion Maps (VDM) framework built upon the Graph Connection Laplacian (GCL), which captures pairwise connection relationships within complex datasets. LA-VDM introduces a novel two-stage normalization that effectively addresses nonuniform sampling densities in both the data and the landmark sets. Under a manifold model with the frame bundle structure, we show that the parallel transport can be accurately recovered from a point cloud with landmark-constrained diffusion, and hence that LA-VDM converges asymptotically to the connection Laplacian. The performance and accuracy of LA-VDM are demonstrated through experiments on simulated datasets and an application to nonlocal image denoising.

Generalized Robust Adaptive-Bandwidth Multi-View Manifold Learning in High Dimensions with Noise

Feb 11, 2026

Xiucai Ding, Chao Shen, Hau-Tieng Wu

Abstract: Multiview datasets are common in scientific and engineering applications, yet existing fusion methods offer limited theoretical guarantees, particularly in the presence of heterogeneous and high-dimensional noise. We propose Generalized Robust Adaptive-Bandwidth Multiview Diffusion Maps (GRAB-MDM), a new kernel-based diffusion geometry framework for integrating multiple noisy data sources. The key innovation of GRAB-MDM is a view-dependent bandwidth selection strategy that adapts to the geometry and noise level of each view, enabling a stable and principled construction of multiview diffusion operators. Under a common-manifold model, we establish asymptotic convergence results and show that the adaptive bandwidths lead to provably robust recovery of the shared intrinsic structure, even when noise levels and sensor dimensions differ across views. Numerical experiments demonstrate that GRAB-MDM significantly improves robustness and embedding quality compared with fixed-bandwidth and equal-bandwidth baselines, and usually outperforms existing algorithms. The proposed framework offers a practical and theoretically grounded solution for multiview sensor fusion in high-dimensional noisy environments.

* 4 figures

Sleep Staging from Airflow Signals Using Fourier Approximations of Persistence Curves

Nov 12, 2024

Shashank Manjunath, Hau-Tieng Wu, Aarti Sathyanarayana

Abstract: Sleep staging is a challenging task, typically performed manually by sleep technologists based on electroencephalogram and other biosignals of patients taken during overnight sleep studies. Recent work aims to leverage automated algorithms to perform sleep staging based not on electroencephalogram signals but on the airflow signals of subjects. Prior work uses ideas from topological data analysis (TDA), specifically Hermite function expansions of persistence curves (HEPC), to featurize airflow signals. However, finite-order HEPC captures only partial information. In this work, we propose Fourier approximations of persistence curves (FAPC), and use this technique to perform sleep staging based on airflow signals. We analyze performance using an XGBoost model on 1155 pediatric sleep studies taken from the Nationwide Children's Hospital Sleep DataBank (NCHSDB), and find that FAPC methods provide complementary information to HEPC methods alone, leading to a 4.9% increase in performance over baseline methods.

Landmark Alternating Diffusion

Apr 29, 2024

Sing-Yuan Yeh, Hau-Tieng Wu, Ronen Talmon, Mao-Pei Tsui

Abstract: Alternating Diffusion (AD) is a commonly applied diffusion-based sensor fusion algorithm. While it has been successfully applied to various problems, its computational burden remains a limitation. Inspired by the landmark diffusion idea considered in the Robust and Scalable Embedding via Landmark Diffusion (ROSELAND), we propose a variation of AD, called Landmark AD (LAD), which captures the essence of AD while offering superior computational efficiency. We provide a series of theoretical analyses of LAD under the manifold setup and apply it to the automatic sleep stage annotation problem with two electroencephalogram channels to demonstrate its application.
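
To illustrate why routing diffusion through a small landmark set can save computation, here is a minimal single-sensor sketch of a generic landmark diffusion embedding. It is not the LAD algorithm from the paper (which fuses two sensors), and the Gaussian kernel, the bandwidth `eps`, and the function name `landmark_diffusion_embedding` are assumptions made for illustration only. The point is that the n x n affinity W = K K^T is never formed; its spectrum is read off from an SVD of the n x m data-to-landmark kernel.

```python
import numpy as np

def landmark_diffusion_embedding(X, Z, eps=1.0, dim=2, t=1):
    """Hypothetical sketch: embed n points X using m landmarks Z (m << n).

    The affinity between data points is defined through the landmarks,
    W = K K^T, where K[i, j] = exp(-|x_i - z_j|^2 / eps) (assumed kernel).
    """
    # n x m kernel between data points and landmarks
    sq = np.sum((X[:, None, :] - Z[None, :, :]) ** 2, axis=-1)
    K = np.exp(-sq / eps)
    # Row sums of W = K K^T, computed without forming the n x n matrix
    d = K @ (K.T @ np.ones(X.shape[0]))
    # D^{-1/2} K; its left singular vectors diagonalize D^{-1/2} W D^{-1/2}
    K_norm = K / np.sqrt(d)[:, None]
    U, s, _ = np.linalg.svd(K_norm, full_matrices=False)
    # Eigenvectors of the random-walk matrix D^{-1} W are D^{-1/2} U,
    # with eigenvalues s**2; the first pair is the trivial one (eigenvalue 1)
    phi = U / np.sqrt(d)[:, None]
    embedding = phi[:, 1:dim + 1] * (s[1:dim + 1] ** (2 * t))
    return embedding, s ** 2
```

With n data points and m landmarks, the SVD above costs on the order of n m^2 operations, compared with roughly n^3 for a dense eigendecomposition of an n x n kernel matrix.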

Convergence analysis of t-SNE as a gradient flow for point cloud on a manifold

Jan 31, 2024

Seonghyeon Jeong, Hau-Tieng Wu

Abstract: We present a theoretical foundation regarding the boundedness of the t-SNE algorithm. t-SNE employs gradient descent iteration with Kullback-Leibler (KL) divergence as the objective function, aiming to identify a set of points that closely resemble the original data points in a high-dimensional space, minimizing KL divergence. Investigating t-SNE properties such as perplexity and affinity under a weak convergence assumption on the sampled dataset, we examine the behavior of points generated by t-SNE under continuous gradient flow. Demonstrating that points generated by t-SNE remain bounded, we leverage this insight to establish the existence of a minimizer for KL divergence.
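
For readers unfamiliar with the objective mentioned above, the following is a minimal sketch of the standard t-SNE KL divergence and one explicit gradient-descent step, a discrete stand-in for the continuous gradient flow the paper studies. It makes a simplifying assumption: a single global Gaussian bandwidth `sigma` replaces the per-point, perplexity-tuned bandwidths of actual t-SNE. All function names are hypothetical.

```python
import numpy as np

def gaussian_affinities(X, sigma=1.0):
    """Joint affinities P in the high-dimensional space.

    Assumption: one global bandwidth sigma, not perplexity-tuned bandwidths.
    """
    sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    P = np.exp(-sq / (2.0 * sigma ** 2))
    np.fill_diagonal(P, 0.0)
    return P / P.sum()

def student_t_affinities(Y):
    """Joint affinities Q of the embedded points (Student-t, one degree of freedom)."""
    sq = np.sum((Y[:, None, :] - Y[None, :, :]) ** 2, axis=-1)
    Q = 1.0 / (1.0 + sq)
    np.fill_diagonal(Q, 0.0)
    return Q / Q.sum()

def kl_divergence(P, Q, eps=1e-12):
    """KL(P || Q) summed over pairs with P > 0."""
    mask = P > 0
    return float(np.sum(P[mask] * np.log(P[mask] / np.maximum(Q[mask], eps))))

def gradient_step(P, Y, step=0.01):
    """One explicit Euler step of the flow dY/dt = -grad KL(P || Q(Y))."""
    diff = Y[:, None, :] - Y[None, :, :]
    inv = 1.0 / (1.0 + np.sum(diff ** 2, axis=-1))
    np.fill_diagonal(inv, 0.0)
    Q = inv / inv.sum()
    # grad_i = 4 * sum_j (p_ij - q_ij) * (y_i - y_j) / (1 + |y_i - y_j|^2)
    grad = 4.0 * np.sum(((P - Q) * inv)[:, :, None] * diff, axis=1)
    return Y - step * grad
```

Iterating `gradient_step` with a small step size approximates the gradient flow; the boundedness result in the abstract concerns the trajectories of that flow, which this sketch does not prove or check.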

Design a Metric Robust to Complicated High Dimensional Noise for Efficient Manifold Denoising

Jan 08, 2024

Hau-Tieng Wu

Abstract: In this manuscript, we propose an efficient manifold denoiser based on landmark diffusion and optimal shrinkage under the complicated high dimensional noise and compact manifold setup. It is flexible enough to handle several setups, including a high-dimensional ambient space with a manifold embedding that occupies a subspace of high or low dimension, and noise that may be colored and dependent. A systematic comparison with other existing algorithms on both simulated and real datasets is provided. This manuscript is mainly algorithmic and we report several existing tools and numerical results. Theoretical guarantees and more comparisons will be reported in the official paper of this manuscript.

Unveil Sleep Spindles with Concentration of Frequency and Time

Oct 27, 2023

Riki Shimizu, Hau-Tieng Wu

Abstract: Objective: Sleep spindles contain crucial brain dynamics information. We introduce the novel non-linear time-frequency analysis tool 'Concentration of Frequency and Time' (ConceFT) to create an interpretable automated algorithm for sleep spindle annotation in EEG data and to measure spindle instantaneous frequencies (IFs). Methods: ConceFT effectively reduces stochastic EEG influence, enhancing spindle visibility in the time-frequency representation. Our automated spindle detection algorithm, ConceFT-Spindle (ConceFT-S), is compared to A7 (non-deep learning) and SUMO (deep learning) using Dream and MASS benchmark databases. We also quantify spindle IF dynamics. Results: ConceFT-S achieves F1 scores of 0.749 in Dream and 0.786 in MASS, which are equivalent to or surpass those of A7 and SUMO with statistical significance. We reveal that spindle IF is generally nonlinear. Conclusion: ConceFT offers an accurate, interpretable EEG-based sleep spindle detection algorithm and enables spindle IF quantification.

* 22 pages, 6 figures, and 1 table

Geometric Scattering on Measure Spaces

Aug 17, 2022

Joyce Chew, Matthew Hirn, Smita Krishnaswamy, Deanna Needell, Michael Perlmutter, Holly Steach, Siddharth Viswanath, Hau-Tieng Wu

Abstract: The scattering transform is a multilayered, wavelet-based transform initially introduced as a model of convolutional neural networks (CNNs) that has played a foundational role in our understanding of these networks' stability and invariance properties. Subsequently, there has been widespread interest in extending the success of CNNs to data sets with non-Euclidean structure, such as graphs and manifolds, leading to the emerging field of geometric deep learning. In order to improve our understanding of the architectures used in this new field, several papers have proposed generalizations of the scattering transform for non-Euclidean data structures such as undirected graphs and compact Riemannian manifolds without boundary. In this paper, we introduce a general, unified model for geometric scattering on measure spaces. Our proposed framework includes previous work on geometric scattering as special cases but also applies to more general settings such as directed graphs, signed graphs, and manifolds with boundary. We propose a new criterion that identifies to which groups a useful representation should be invariant and show that this criterion is sufficient to guarantee that the scattering transform has desirable stability and invariance properties. Additionally, we consider finite measure spaces that are obtained from randomly sampling an unknown manifold. We propose two methods for constructing a data-driven graph on which the associated graph scattering transform approximates the scattering transform on the underlying manifold. Moreover, we use a diffusion-maps based approach to prove quantitative estimates on the rate of convergence of one of these approximations as the number of sample points tends to infinity. Lastly, we showcase the utility of our method on spherical images, directed graphs, and on high-dimensional single-cell data.

An iterative warping and clustering algorithm to estimate multiple wave-shape functions from a nonstationary oscillatory signal

Aug 12, 2022

Marcelo A. Colominas, Hau-Tieng Wu

Abstract: We present an algorithm to estimate multiple wave-shape functions (WSF) from a nonstationary oscillatory signal with time-varying amplitude and frequency. Suppose there are finitely many different $1$-periodic functions, $s_1,\ldots,s_K$, as WSFs that model different oscillatory patterns in an oscillatory signal, where the WSF might jump from one to another suddenly. The proposed algorithm detects change points and estimates $s_1,\ldots,s_K$ from the signal by a novel iterative warping and clustering algorithm, which is a combination of time-frequency analysis, singular value decomposition entropy and vector spectral clustering. We demonstrate the efficiency of the proposed algorithm with simulated and real signals, including the voice signal, arterial blood pressure, electrocardiogram and accelerometer signal. Moreover, we provide a mathematical justification of the algorithm under the assumption that the amplitude and frequency of the signal are slowly time-varying and there are finitely many change points that model sudden changes from one wave-shape function to another.

* 33 pages, 11 figures

The Manifold Scattering Transform for High-Dimensional Point Cloud Data

Jun 21, 2022

Joyce Chew, Holly R. Steach, Siddharth Viswanath, Hau-Tieng Wu, Matthew Hirn, Deanna Needell, Smita Krishnaswamy, Michael Perlmutter

Abstract: The manifold scattering transform is a deep feature extractor for data defined on a Riemannian manifold. It is one of the first examples of extending convolutional neural network-like operators to general manifolds. The initial work on this model focused primarily on its theoretical stability and invariance properties but did not provide methods for its numerical implementation except in the case of two-dimensional surfaces with predefined meshes. In this work, we present practical schemes, based on the theory of diffusion maps, for implementing the manifold scattering transform on datasets arising in naturalistic systems, such as single cell genetics, where the data is a high-dimensional point cloud modeled as lying on a low-dimensional manifold. We show that our methods are effective for signal classification and manifold classification tasks.

* Accepted for publication in the TAG in DS Workshop at ICML
