Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Roshan Godaliyadda

Mitigating Long-Tail Bias via Prompt-Controlled Diffusion Augmentation

Feb 04, 2026

Buddhi Wijenayake, Nichula Wasalathilake, Roshan Godaliyadda, Vijitha Herath, Parakrama Ekanayake, Vishal M. Patel

Abstract:Semantic segmentation of high-resolution remote-sensing imagery is critical for urban mapping and land-cover monitoring, yet training data typically exhibits severe long-tailed pixel imbalance. In the dataset LoveDA, this challenge is compounded by an explicit Urban/Rural split with distinct appearance and inconsistent class-frequency statistics across domains. We present a prompt-controlled diffusion augmentation framework that synthesizes paired label--image samples with explicit control of both domain and semantic composition. Stage~A uses a domain-aware, masked ratio-conditioned discrete diffusion model to generate layouts that satisfy user-specified class-ratio targets while respecting learned co-occurrence structure. Stage~B translates layouts into photorealistic, domain-consistent images using Stable Diffusion with ControlNet guidance. Mixing the resulting ratio and domain-controlled synthetic pairs with real data yields consistent improvements across multiple segmentation backbones, with gains concentrated on minority classes and improved Urban and Rural generalization, demonstrating controllable augmentation as a practical mechanism to mitigate long-tail bias in remote-sensing segmentation. Source codes, pretrained models, and synthetic datasets are available at \href{https://github.com/Buddhi19/SyntheticGen.git}{Github}

Via

Access Paper or Ask Questions

Spectrotemporal Feature Extraction in EHG Signals and Tocograms for Enhanced Preterm Birth Prediction

Sep 09, 2025

Senith Jayakody, Kalana Jayasooriya, Sashini Liyanage, Roshan Godaliyadda, Parakrama Ekanayake, Chathura Rathnayake

Abstract:Preterm birth (PTB), defined as delivery before 37 weeks of gestation, is a leading cause of neonatal mortality and long term health complications. Early detection is essential for enabling timely medical interventions. Electrohysterography (EHG) and tocography (TOCO) are promising non invasive tools for PTB prediction, but prior studies often suffer from class imbalance, improper oversampling, and reliance on features with limited physiological relevance. This work presents a machine learning pipeline incorporating robust preprocessing, physiologically grounded feature extraction, and rigorous evaluation. Features were extracted from EHG (and TOCO) signals using Mel frequency cepstral coefficients, statistical descriptors of wavelet coefficients, and peaks of the normalized power spectrum. Signal quality was enhanced via Karhunen Lo\`eve Transform (KLT) denoising through eigenvalue based subspace decomposition. Multiple classifiers, including Logistic Regression, Support Vector Machines, Random Forest, Gradient Boosting, Multilayer Perceptron, and CatBoost, were evaluated on the TPEHGT dataset. The CatBoost classifier with KLT denoising achieved the highest performance on fixed interval segments of the TPEHGT dataset, reaching 97.28% accuracy and an AUC of 0.9988. Ablation studies confirmed the critical role of both KLT denoising and physiologically informed features. Comparative analysis showed that including TOCO signals did not substantially improve prediction over EHG alone, highlighting the sufficiency of EHG for PTB detection. These results demonstrate that combining denoising with domain relevant features can yield highly accurate, robust, and clinically interpretable models, supporting the development of cost effective and accessible PTB prediction tools, particularly in low resource healthcare settings.

* 12 pages, 4 figures, 5 tables, manuscript under review

Via

Access Paper or Ask Questions

Mamba-FCS: Joint Spatio- Frequency Feature Fusion, Change-Guided Attention, and SeK Loss for Enhanced Semantic Change Detection in Remote Sensing

Aug 11, 2025

Buddhi Wijenayake, Athulya Ratnayake, Praveen Sumanasekara, Roshan Godaliyadda, Parakrama Ekanayake, Vijitha Herath, Nichula Wasalathilaka

Figure 1 for Mamba-FCS: Joint Spatio- Frequency Feature Fusion, Change-Guided Attention, and SeK Loss for Enhanced Semantic Change Detection in Remote Sensing

Figure 2 for Mamba-FCS: Joint Spatio- Frequency Feature Fusion, Change-Guided Attention, and SeK Loss for Enhanced Semantic Change Detection in Remote Sensing

Figure 3 for Mamba-FCS: Joint Spatio- Frequency Feature Fusion, Change-Guided Attention, and SeK Loss for Enhanced Semantic Change Detection in Remote Sensing

Figure 4 for Mamba-FCS: Joint Spatio- Frequency Feature Fusion, Change-Guided Attention, and SeK Loss for Enhanced Semantic Change Detection in Remote Sensing

Abstract:Semantic Change Detection (SCD) from remote sensing imagery requires models balancing extensive spatial context, computational efficiency, and sensitivity to class-imbalanced land-cover transitions. While Convolutional Neural Networks excel at local feature extraction but lack global context, Transformers provide global modeling at high computational costs. Recent Mamba architectures based on state-space models offer compelling solutions through linear complexity and efficient long-range modeling. In this study, we introduce Mamba-FCS, a SCD framework built upon Visual State Space Model backbone incorporating, a Joint Spatio-Frequency Fusion block incorporating log-amplitude frequency domain features to enhance edge clarity and suppress illumination artifacts, a Change-Guided Attention (CGA) module that explicitly links the naturally intertwined BCD and SCD tasks, and a Separated Kappa (SeK) loss tailored for class-imbalanced performance optimization. Extensive evaluation on SECOND and Landsat-SCD datasets shows that Mamba-FCS achieves state-of-the-art metrics, 88.62% Overall Accuracy, 65.78% F_scd, and 25.50% SeK on SECOND, 96.25% Overall Accuracy, 89.27% F_scd, and 60.26% SeK on Landsat-SCD. Ablation analyses confirm distinct contributions of each novel component, with qualitative assessments highlighting significant improvements in SCD. Our results underline the substantial potential of Mamba architectures, enhanced by proposed techniques, setting a new benchmark for effective and scalable semantic change detection in remote sensing applications. The complete source code, configuration files, and pre-trained models will be publicly available upon publication.

* 15 pages, 11 Figures

Via

Access Paper or Ask Questions

Preprocessing Algorithm Leveraging Geometric Modeling for Scale Correction in Hyperspectral Images for Improved Unmixing Performance

Aug 11, 2025

Praveen Sumanasekara, Athulya Ratnayake, Buddhi Wijenayake, Keshawa Ratnayake, Roshan Godaliyadda, Parakrama Ekanayake, Vijitha Herath

Figure 1 for Preprocessing Algorithm Leveraging Geometric Modeling for Scale Correction in Hyperspectral Images for Improved Unmixing Performance

Figure 2 for Preprocessing Algorithm Leveraging Geometric Modeling for Scale Correction in Hyperspectral Images for Improved Unmixing Performance

Figure 3 for Preprocessing Algorithm Leveraging Geometric Modeling for Scale Correction in Hyperspectral Images for Improved Unmixing Performance

Figure 4 for Preprocessing Algorithm Leveraging Geometric Modeling for Scale Correction in Hyperspectral Images for Improved Unmixing Performance

Abstract:Spectral variability significantly impacts the accuracy and convergence of hyperspectral unmixing algorithms. While many methods address complex spectral variability, large-scale variations in spectral signature scale caused by factors such as topography, illumination, and shadowing remain a major challenge. These variations often degrade unmixing performance and complicate model fitting. In this paper, we propose a novel preprocessing algorithm that corrects scale-induced spectral variability prior to unmixing. By isolating and compensating for these large-scale multiplicative effects, the algorithm provides a cleaner input, enabling unmixing methods to focus more effectively on modeling nonlinear spectral variability and abundance estimation. We present a rigorous mathematical framework to describe scale variability and extensive experimental validation of the proposed algorithm. Furthermore, the algorithm's impact is evaluated across a broad spectrum of state-of-the-art unmixing algorithms on two synthetic and two real hyperspectral datasets. The proposed preprocessing step consistently improves the performance of these algorithms, including those specifically designed to handle spectral variability, with error reductions close to 50% in many cases. This demonstrates that scale correction acts as a complementary step, facilitating more accurate unmixing by existing methods. The algorithm's generality and significant impact highlight its potential as a key component in practical hyperspectral unmixing pipelines. The implementation code will be made publicly available upon publication.

* 20 pages, 17 figures

Via

Access Paper or Ask Questions

Precision Spatio-Temporal Feature Fusion for Robust Remote Sensing Change Detection

Jul 15, 2025

Buddhi Wijenayake, Athulya Ratnayake, Praveen Sumanasekara, Nichula Wasalathilaka, Mathivathanan Piratheepan, Roshan Godaliyadda, Mervyn Ekanayake, Vijitha Herath

Figure 1 for Precision Spatio-Temporal Feature Fusion for Robust Remote Sensing Change Detection

Figure 2 for Precision Spatio-Temporal Feature Fusion for Robust Remote Sensing Change Detection

Figure 3 for Precision Spatio-Temporal Feature Fusion for Robust Remote Sensing Change Detection

Figure 4 for Precision Spatio-Temporal Feature Fusion for Robust Remote Sensing Change Detection

Abstract:Remote sensing change detection is vital for monitoring environmental and urban transformations but faces challenges like manual feature extraction and sensitivity to noise. Traditional methods and early deep learning models, such as convolutional neural networks (CNNs), struggle to capture long-range dependencies and global context essential for accurate change detection in complex scenes. While Transformer-based models mitigate these issues, their computational complexity limits their applicability in high-resolution remote sensing. Building upon ChangeMamba architecture, which leverages state space models for efficient global context modeling, this paper proposes precision fusion blocks to capture channel-wise temporal variations and per-pixel differences for fine-grained change detection. An enhanced decoder pipeline, incorporating lightweight channel reduction mechanisms, preserves local details with minimal computational cost. Additionally, an optimized loss function combining Cross Entropy, Dice and Lovasz objectives addresses class imbalance and boosts Intersection-over-Union (IoU). Evaluations on SYSU-CD, LEVIR-CD+, and WHU-CD datasets demonstrate superior precision, recall, F1 score, IoU, and overall accuracy compared to state-of-the-art methods, highlighting the approach's robustness for remote sensing change detection. For complete transparency, the codes and pretrained models are accessible at https://github.com/Buddhi19/MambaCD.git

* 6 pages, 4 figures, 2 pages, under review(conference paper)

Via

Access Paper or Ask Questions

BandRC: Band Shifted Raised Cosine Activated Implicit Neural Representations

May 16, 2025

Pandula Thennakoon, Avishka Ranasinghe, Mario De Silva, Buwaneka Epakanda, Roshan Godaliyadda, Parakrama Ekanayake, Vijitha Herath

Figure 1 for BandRC: Band Shifted Raised Cosine Activated Implicit Neural Representations

Figure 2 for BandRC: Band Shifted Raised Cosine Activated Implicit Neural Representations

Figure 3 for BandRC: Band Shifted Raised Cosine Activated Implicit Neural Representations

Figure 4 for BandRC: Band Shifted Raised Cosine Activated Implicit Neural Representations

Abstract:In recent years, implicit neural representations(INRs) have gained popularity in the computer vision community. This is mainly due to the strong performance of INRs in many computer vision tasks. These networks can extract a continuous signal representation given a discrete signal representation. In previous studies, it has been repeatedly shown that INR performance has a strong correlation with the activation functions used in its multilayer perceptrons. Although numerous activation functions have been proposed that are competitive with one another, they share some common set of challenges such as spectral bias(Lack of sensitivity to high-frequency content in signals), limited robustness to signal noise and difficulties in simultaneous capturing both local and global features. and furthermore, the requirement for manual parameter tuning. To address these issues, we introduce a novel activation function, Band Shifted Raised Cosine Activated Implicit Neural Networks \textbf{(BandRC)} tailored to enhance signal representation capacity further. We also incorporate deep prior knowledge extracted from the signal to adjust the activation functions through a task-specific model. Through a mathematical analysis and a series of experiments which include image reconstruction (with a +8.93 dB PSNR improvement over the nearest counterpart), denoising (with a +0.46 dB increase in PSNR), super-resolution (with a +1.03 dB improvement over the nearest State-Of-The-Art (SOTA) method for 6X super-resolution), inpainting, and 3D shape reconstruction we demonstrate the dominance of BandRC over existing state of the art activation functions.

* Submitted as a conference paper to ICCV 2025

Via

Access Paper or Ask Questions

Enhanced SCanNet with CBAM and Dice Loss for Semantic Change Detection

May 07, 2025

Athulya Ratnayake, Buddhi Wijenayake, Praveen Sumanasekara, Roshan Godaliyadda, Vijitha Herath, Parakrama Ekanayake

Figure 1 for Enhanced SCanNet with CBAM and Dice Loss for Semantic Change Detection

Figure 2 for Enhanced SCanNet with CBAM and Dice Loss for Semantic Change Detection

Figure 3 for Enhanced SCanNet with CBAM and Dice Loss for Semantic Change Detection

Figure 4 for Enhanced SCanNet with CBAM and Dice Loss for Semantic Change Detection

Abstract:Semantic Change Detection (SCD) in remote sensing imagery requires accurately identifying land-cover changes across multi-temporal image pairs. Despite substantial advancements, including the introduction of transformer-based architectures, current SCD models continue to struggle with challenges such as noisy inputs, subtle class boundaries, and significant class imbalance. In this study, we propose enhancing the Semantic Change Network (SCanNet) by integrating the Convolutional Block Attention Module (CBAM) and employing Dice loss during training. CBAM sequentially applies channel attention to highlight feature maps with the most meaningful content, followed by spatial attention to pinpoint critical regions within these maps. This sequential approach ensures precise suppression of irrelevant features and spatial noise, resulting in more accurate and robust detection performance compared to attention mechanisms that apply both processes simultaneously or independently. Dice loss, designed explicitly for handling class imbalance, further boosts sensitivity to minority change classes. Quantitative experiments conducted on the SECOND dataset demonstrate consistent improvements. Qualitative analysis confirms these improvements, showing clearer segmentation boundaries and more accurate recovery of small-change regions. These findings highlight the effectiveness of attention mechanisms and Dice loss in improving feature representation and addressing class imbalance in semantic change detection tasks.

* 7 pages, 3 figures, conference

Via

Access Paper or Ask Questions

Performance Benchmarking of Psychomotor Skills Using Wearable Devices: An Application in Sport

Nov 25, 2024

Mahela Pandukabhaya, Tharaka Fonseka, Madhumini Kulathunge, Roshan Godaliyadda, Parakrama Ekanayake, Chanaka Senanayake, Vijitha Herath

Figure 1 for Performance Benchmarking of Psychomotor Skills Using Wearable Devices: An Application in Sport

Figure 2 for Performance Benchmarking of Psychomotor Skills Using Wearable Devices: An Application in Sport

Figure 3 for Performance Benchmarking of Psychomotor Skills Using Wearable Devices: An Application in Sport

Figure 4 for Performance Benchmarking of Psychomotor Skills Using Wearable Devices: An Application in Sport

Abstract:This study proposes a versatile framework for optimizing psychomotor learning through human motion analysis. Utilizing a wearable IMU sensor system, the motion trajectories of a given psychomotor task are acquired and then linked to points in a performance space using a predefined set of quality metrics specific to the psychomotor skill. This enables the identification of a benchmark cluster in the performance space, allowing correspondences to be established between the performance clusters and sets of trajectories in the motion space. As a result, common or specific deviations in the performance space can be identified, enabling remedial actions in the motion space to optimize performance. A thorough validation of the proposed framework is done in this paper using a Table Tennis forehand stroke as a case study. The resulting quantitative and visual representation of performance empowers individuals to optimize their skills and achieve peak performance.

* 15 pages, 14 figures, 5 tables, currently under review at IEEE Access

Via

Access Paper or Ask Questions

Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise

Mar 25, 2024

Dilum Fernando, Dhananjaya jayasundara, Roshan Godaliyadda, Chaminda Bandara, Parakrama Ekanayake, Vijitha Herath

Abstract:Denoising Diffusion Probabilistic Models (DDPMs) have accomplished much in the realm of generative AI. Despite their high performance, there is room for improvement, especially in terms of sample fidelity by utilizing statistical properties that impose structural integrity, such as isotropy. Minimizing the mean squared error between the additive and predicted noise alone does not impose constraints on the predicted noise to be isotropic. Thus, we were motivated to utilize the isotropy of the additive noise as a constraint on the objective function to enhance the fidelity of DDPMs. Our approach is simple and can be applied to any DDPM variant. We validate our approach by presenting experiments conducted on four synthetic 2D datasets as well as on unconditional image generation. As demonstrated by the results, the incorporation of this constraint improves the fidelity metrics, Precision and Density for the 2D datasets as well as for the unconditional image generation.

Via

Access Paper or Ask Questions

Dual mode multispectral imaging system for food and agricultural product quality estimation

Oct 04, 2023

Darsha Udayanga, Ashan Serasinghe, Supun Dassanayake, Roshan Godaliyadda, H. M. V. R. Herath, M. P. B. Ekanayake, H. L. P. Malshan

Figure 1 for Dual mode multispectral imaging system for food and agricultural product quality estimation

Figure 2 for Dual mode multispectral imaging system for food and agricultural product quality estimation

Figure 3 for Dual mode multispectral imaging system for food and agricultural product quality estimation

Figure 4 for Dual mode multispectral imaging system for food and agricultural product quality estimation

Abstract:Multispectral imaging coupled with Artificial Intelligence, Machine Learning and Signal Processing techniques work as a feasible alternative for laboratory testing, especially in food quality control. Most of the recent related research has been focused on reflectance multispectral imaging but a system with both reflectance, transmittance capabilities would be ideal for a wide array of specimen types including solid and liquid samples. In this paper, a device which includes a dedicated reflectance mode and a dedicated transmittance mode is proposed. Dual mode operation where fast switching between two modes is facilitated. An innovative merged mode is introduced in which both reflectance and transmittance information of a specimen are combined to form a higher dimensional dataset with more features. Spatial and temporal variations of measurements are analyzed to ensure the quality of measurements. The concept is validated using a standard color palette and specific case studies are done for standard food samples such as turmeric powder and coconut oil proving the validity of proposed contributions. The classification accuracy of standard color palette testing was over 90% and the accuracy of coconut oil adulteration was over 95%. while the merged mode was able to provide the best accuracy of 99% for the turmeric adulteration. A linear functional mapping was done for coconut oil adulteration with an R2 value of 0.9558.

* This work has been submitted to the IEEE for possible publication

Via

Access Paper or Ask Questions