Sabato Marco Siniscalchi

Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations

May 30, 2025
Via arXiv

Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models

May 28, 2025
Via arXiv

MVP: Multi-source Voice Pathology detection

May 26, 2025
Via arXiv

Exploring Generative Error Correction for Dysarthric Speech Recognition

May 26, 2025
Via arXiv

"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding

May 26, 2025
Via arXiv

Variational Bayesian Adaptive Learning of Deep Latent Variables for Acoustic Knowledge Transfer

Jan 26, 2025
Via arXiv

MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network

Nov 28, 2024
Via arXiv

An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement

Sep 24, 2024
Via arXiv

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

Sep 17, 2024
Via arXiv

Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement

Aug 08, 2024
Via arXiv