Picture for Paula Andrea Pérez-Toro

Paula Andrea Pérez-Toro

Multilingual Phonological Feature Recognition with Self-Supervised Speech Models

Add code
May 25, 2026
Viaarxiv icon

Speech-Guided Multimodal Learning for Vocal Tract Segmentation in Real-Time MRI

Add code
May 18, 2026
Viaarxiv icon

Bias and Fairness in Self-Supervised Acoustic Representations for Cognitive Impairment Detection

Add code
Mar 03, 2026
Viaarxiv icon

Audio-Vision Contrastive Learning for Phonological Class Recognition

Add code
Jul 23, 2025
Viaarxiv icon

Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages

Add code
May 20, 2025
Viaarxiv icon

A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI

Add code
Mar 15, 2025
Figure 1 for A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI
Figure 2 for A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI
Figure 3 for A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI
Figure 4 for A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI
Viaarxiv icon

Speaker- and Text-Independent Estimation of Articulatory Movements and Phoneme Alignments from Speech

Add code
Jul 03, 2024
Viaarxiv icon

Self-Supervised Speech Representations Preserve Speech Characteristics while Anonymizing Voices

Add code
Apr 04, 2022
Figure 1 for Self-Supervised Speech Representations Preserve Speech Characteristics while Anonymizing Voices
Figure 2 for Self-Supervised Speech Representations Preserve Speech Characteristics while Anonymizing Voices
Figure 3 for Self-Supervised Speech Representations Preserve Speech Characteristics while Anonymizing Voices
Figure 4 for Self-Supervised Speech Representations Preserve Speech Characteristics while Anonymizing Voices
Viaarxiv icon

Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition

Add code
Apr 04, 2022
Figure 1 for Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition
Figure 2 for Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition
Figure 3 for Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition
Figure 4 for Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition
Viaarxiv icon

Common Phone: A Multilingual Dataset for Robust Acoustic Modelling

Add code
Jan 31, 2022
Figure 1 for Common Phone: A Multilingual Dataset for Robust Acoustic Modelling
Figure 2 for Common Phone: A Multilingual Dataset for Robust Acoustic Modelling
Figure 3 for Common Phone: A Multilingual Dataset for Robust Acoustic Modelling
Figure 4 for Common Phone: A Multilingual Dataset for Robust Acoustic Modelling
Viaarxiv icon