Picture for Sabato Marco Siniscalchi

Sabato Marco Siniscalchi

Few-Shot and Pseudo-Label Guided Speech Quality Evaluation with Large Language Models

Add code
Apr 15, 2026
Viaarxiv icon

A Knowledge-Driven Approach to Music Segmentation, Music Source Separation and Cinematic Audio Source Separation

Add code
Feb 25, 2026
Viaarxiv icon

MDM-ASR: Bridging Accuracy and Efficiency in ASR with Diffusion-Based Non-Autoregressive Decoding

Add code
Feb 24, 2026
Viaarxiv icon

A Bottom-up Framework with Language-universal Speech Attribute Modeling for Syllable-based ASR

Add code
Sep 09, 2025
Viaarxiv icon

An Investigation on Combining Geometry and Consistency Constraints into Phase Estimation for Speech Enhancement

Add code
Jul 02, 2025
Figure 1 for An Investigation on Combining Geometry and Consistency Constraints into Phase Estimation for Speech Enhancement
Figure 2 for An Investigation on Combining Geometry and Consistency Constraints into Phase Estimation for Speech Enhancement
Figure 3 for An Investigation on Combining Geometry and Consistency Constraints into Phase Estimation for Speech Enhancement
Figure 4 for An Investigation on Combining Geometry and Consistency Constraints into Phase Estimation for Speech Enhancement
Viaarxiv icon

Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations

Add code
May 30, 2025
Figure 1 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Figure 2 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Figure 3 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Figure 4 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Viaarxiv icon

Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models

Add code
May 28, 2025
Figure 1 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Figure 2 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Figure 3 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Figure 4 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Viaarxiv icon

"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding

Add code
May 26, 2025
Figure 1 for "KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding
Figure 2 for "KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding
Figure 3 for "KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding
Figure 4 for "KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding
Viaarxiv icon

Exploring Generative Error Correction for Dysarthric Speech Recognition

Add code
May 26, 2025
Viaarxiv icon

MVP: Multi-source Voice Pathology detection

Add code
May 26, 2025
Viaarxiv icon